What is etl-extract-transform-load?

A three-phase process where data is extracted, transformed, and loaded into an output container.

Treść oczekuje na tłumaczenie. Wyświetlana jest wersja angielska.

ETL is a process used in data warehousing to pull data from various source systems, change its format to fit operational needs, and finally insert it into a target database or data warehouse. E=Extract (getting raw data), T=Transform (cleaning and reformatting), L=Load (storing in the final destination).

        graph LR
  Center["What is etl-extract-transform-load?"]:::main
  Rel_indexing_search["indexing-search"]:::related -.-> Center
  click Rel_indexing_search "/terms/indexing-search"
  Rel_consensus_mechanism["consensus-mechanism"]:::related -.-> Center
  click Rel_consensus_mechanism "/terms/consensus-mechanism"
  Rel_edge_computing["edge-computing"]:::related -.-> Center
  click Rel_edge_computing "/terms/edge-computing"
  classDef main fill:#7c3aed,stroke:#8b5cf6,stroke-width:2px,color:white,font-weight:bold,rx:5,ry:5;
  classDef pre fill:#0f172a,stroke:#3b82f6,color:#94a3b8,rx:5,ry:5;
  classDef child fill:#0f172a,stroke:#10b981,color:#94a3b8,rx:5,ry:5;
  classDef related fill:#0f172a,stroke:#8b5cf6,stroke-dasharray: 5 5,color:#94a3b8,rx:5,ry:5;
  linkStyle default stroke:#4b5563,stroke-width:2px;

      

🧠 Sprawdzenie wiedzy

1 / 1

🧒 Wyjaśnij jak 5-latkowi

ETL is like a recycling factory. First, the trucks bring in all the 'raw' trash (Extract). Then, machines separate the plastic from paper and wash it (Transform). Finally, the clean material is sent to a warehouse to be used for new things (Load).

🤓 Expert Deep Dive

Traditional ETL processes data in batches, often at night. Modern 'Streaming ETL' processes data as it arrives. Key challenges include data quality enforcement, handling schema changes in source systems, and ensuring the process can recover from failures without duplicating data (idempotency).

📚 Źródła