What is ETL (Extract, Transform, Load)?
A three-phase process in which data is extracted from source systems, transformed, and loaded into a target data store.
ETL is a process used in data warehousing to pull data from various source systems, change its format to fit operational needs, and finally insert it into a target database or data warehouse. E=Extract (getting raw data), T=Transform (cleaning and reformatting), L=Load (storing in the final destination).
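The three phases can be sketched as three small functions chained together. This is a minimal illustration only, using Python's standard `csv` and `sqlite3` modules; the sample data, the `customers` table, and the function names (`extract`, `transform`, `load`, `run_pipeline`) are assumptions made up for this example, not part of any particular ETL tool.

```python
import csv
import io
import sqlite3

# Hypothetical raw export from a source system (assumed for illustration).
RAW_CSV = """id,name,signup_date,amount
1, Alice ,2024-01-05,10.50
2,BOB,2024-01-06,
3,carol,2024-01-07,7
"""


def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim and normalize names, convert types, drop incomplete rows."""
    cleaned = []
    for row in rows:
        if not row["amount"].strip():
            continue  # skip rows with a missing amount
        cleaned.append((
            int(row["id"]),
            row["name"].strip().title(),
            row["signup_date"],
            float(row["amount"]),
        ))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers ("
        "id INTEGER PRIMARY KEY, name TEXT, signup_date TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?, ?)", rows)
    conn.commit()


def run_pipeline(text, conn):
    """Run extract -> transform -> load, then read back what was stored."""
    load(transform(extract(text)), conn)
    return conn.execute(
        "SELECT id, name, amount FROM customers ORDER BY id"
    ).fetchall()
```

Calling `run_pipeline(RAW_CSV, sqlite3.connect(":memory:"))` stores two cleaned rows; the row for id 2 is dropped during the transform step because its amount is missing.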
Related terms: indexing-search (/terms/indexing-search), consensus-mechanism (/terms/consensus-mechanism), edge-computing (/terms/edge-computing)
🧒 Explain Like I'm 5
ETL is like a recycling factory. First, the trucks bring in all the 'raw' trash (Extract). Then, machines separate the plastic from paper and wash it (Transform). Finally, the clean material is sent to a warehouse to be used for new things (Load).
🤓 Expert Deep Dive
Traditional ETL processes data in batches, often at night. Modern 'Streaming ETL' processes data as it arrives. Key challenges include data quality enforcement, handling schema changes in source systems, and ensuring the process can recover from failures without duplicating data (idempotency).
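One common way to get idempotent loads is to key every row on a stable identifier and write with an upsert, so that retrying a batch overwrites rows instead of appending duplicates. The sketch below assumes a hypothetical `orders` table keyed by `order_id` and uses SQLite's `INSERT OR REPLACE`; real warehouses offer their own equivalents (for example `MERGE` statements), and this is only one of several possible approaches.

```python
import sqlite3


def load_idempotent(conn, rows):
    """Load rows keyed by order_id; re-running a batch leaves one row per key."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders ("
        "order_id INTEGER PRIMARY KEY, total REAL)"
    )
    with conn:  # commits on success, rolls back if an exception is raised
        conn.executemany(
            "INSERT OR REPLACE INTO orders (order_id, total) VALUES (?, ?)",
            rows,
        )


def row_count(conn):
    """Return the number of rows currently in the orders table."""
    return conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]


conn = sqlite3.connect(":memory:")
batch = [(101, 25.0), (102, 40.0)]  # hypothetical batch of orders

load_idempotent(conn, batch)
load_idempotent(conn, batch)  # simulated retry of the same batch after a failure

print(row_count(conn))  # 2, not 4: the retry did not duplicate data
```

A plain `INSERT` would either raise an integrity error on the retry or, without a primary key, silently double the row count, which is the failure mode the idempotency requirement is meant to prevent.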