Data Engineer Roadmap
Step by step guide to becoming a Data Engineer in 2025
roadmap.sh
Data Engineer
Pre-requisites
Python Roadmap
SQL Roadmap
Data Generation
Understand Different Steps
RELATED ROADMAPS
› Data Analyst Roadmap
› AI & Data Scientist Roadmap
New Relic
Common Tools
Git and GitHub
Introduction
What is Data Engineering?
Data Engineering vs Data Science
Skills and Responsibilities
Data Engineering Lifecycle
Data Structures and Algorithms
Choosing the Right Technologies
Learn the Basics
Programming Skills
Python
Java
Scala
Go
Python is recommended
Linux Basics
Networking Fundamentals
Distributed Systems Basics
Data Generation
Data Storage
Data Ingestion
Data Serving
Data Engineering Lifecycle
Sources of Data
Database
APIs
Logs
Mobile Apps
IoT
Data Collection Considerations
Data Storage
Database Fundamentals
Data Normalization
Data Modelling Techniques
CAP Theorem
OLTP vs OLAP
Sentry
Slowly Changing Dimension - SCD
Horizontal vs Vertical Scaling
Star vs Snowflake Schema
Relational Databases
Learn SQL
Indexing
Transactions
Relational Databases
MySQL
PostgreSQL
MariaDB
Aurora DB
Oracle
MS SQL
Key-Value
NoSQL Databases
Document
MongoDB
ElasticSearch
CosmosDB
CouchDB
Column
Cassandra
BigTable
HBase
Graph
Neo4j
Neptune
Redis
Memcached
DynamoDB
Data Warehousing
What is Data Warehouse?
Data Warehousing Architectures
Data Warehouse
Google BigQuery
Snowflake
Amazon Redshift
Data Mart
Data Lake
Databricks Delta Lake
Snowflake
Onehouse
Data Mesh
Other Data Architectures
Data Fabric
Data Hub
Metadata-first Architecture
Serverless Options
Cloud Computing
Cloud Architectures
Cloud Providers
AWS
Amazon EC2 (Compute)
S3 (Storage)
Amazon RDS (Database)
Azure Virtual Machines
Azure Blob Storage
Azure SQL Database
Cluster Management Tools
Data Factory (ETL)
Azure
Compute Engine (Compute)
Google Cloud Storage
Cloud SQL (Database)
Dataflow
Google Cloud
Data Ingestion
Types of Data Ingestion
Batch
Hybrid
Streaming
Realtime
Data Pipelines
ETL Process
Extract Data
Transform Data
Load Data
Data Pipeline Tools
Apache Airflow
dbt
Luigi
Prefect
Cluster Computing Basics
What is Cluster Computing
Distributed File Systems
Job Scheduling
Kubernetes
Apache Hadoop YARN
HDFS
Big Data Tools
Hadoop Ecosystem
HDFS
MapReduce
YARN
Apache Spark
Containers & Orchestration
Docker
Kubernetes
Google Cloud GKE
AWS EKS
Datadog
CI/CD
GitLab CI
Circle CI
GitHub Actions
ArgoCD
Monitoring
Prometheus
Testing
Integration Testing
Unit Testing
End-to-End Testing
Functional Testing
A/B Testing
Load Testing
Smoke Testing
Messaging Systems
What and why use them?
Async vs Sync Communication
Messages vs Streams
Best Practices
Common Tools
Apache Kafka
RabbitMQ
AWS SQS
AWS SNS
Infrastructure as Code - IaC
Declarative vs Imperative
Idempotency
Reusability
Environmental Management
Terraform
OpenTofu
AWS CDK
Google Deployment Manager
Data Serving
Data Analytics
Visit the Data Analyst Roadmap
Business Intelligence
BI Tools
Microsoft Power BI
Streamlit
Tableau
Looker
Reverse ETL
ETL vs Reverse ETL
Reverse ETL Usecases
Tools
Census
Segment
Hightouch
Security
Authentication vs Authorization
Encryption
Tokenization
Data Masking
Data Obfuscation
Data Governance
Data Quality
Data Lineage
Metadata Management
Data Interoperability
Privacy
GDPR
ECPA
EU AI Act
Find the interactive version of this roadmap and more roadmaps at roadmap.sh
Also visit the following related roadmaps
Python
AI & Data Scientist
SQL
Data and AI Regulations
Data Analyst
MLOps
Machine Learning