Tag: datax

7 articles

Offline Data Warehouse: E-commerce Core Transaction Incre...

Using DataX (MySQLReader + HDFSWriter) to extract daily incremental data from MySQL order tables, order detail tables, and product information tables into...

Offline Data Warehouse Advertising Business Hive ADS Prac...

Complete solution for exporting Hive ADS layer data to MySQL using DataX. Covers ADS loading, DataX configuration, MySQL table creation, Shell script parameterized execution, and common error diagn...

Offline Data Warehouse Member Metrics Verification, DataX...

Offline data warehouse practice based on Hadoop + Hive + HDFS + DataX + MySQL, covering member metrics testing (active/new/retention), HDFS export, DataX sync to MySQL, and advertising business ODS...

Offline Data Warehouse Hive ADS Export MySQL DataX Practi...

The landing path for exporting Hive ADS layer tables to MySQL in offline data warehouse. Gives typical DataX solution: hdfsreader -> mysqlwriter. Focuses on DataX JSON configuration and common erro...

Offline Data Warehouse Hive New Member & Retention: DWS D...

Offline data warehouse calculates 'new members' daily, and provides consistent definition data foundation for subsequent 'member retention'. Use 'full member table (with first day dt)' as deduplica...

Offline Data Warehouse Hive Practice: DWD to DWS Daily/We...

This article introduces using Hive to build an offline data warehouse for calculating active members (daily/weekly/monthly). Covers the complete flow from DWD...

DataX 3.0 Architecture & Practice: Reader/Writer Plugin M...

DataX (DataX 3.0) is an offline data synchronization/data integration tool widely used and open-sourced within Alibaba, for enterprise-level heterogeneous data...