Tag: canal

8 articles

Big Data #268: Real-time Warehouse ODS Layer - Writing Ka...

Writing dimension tables (DIM) from Kafka typically involves reading real-time or batch data from Kafka topics and updating dimension tables based on the data...

Big Data #269: Real-time Warehouse DIM, DW and ADS Layer ...

DW (Data Warehouse layer) is built from DWD, DWS, and DIM layer data, completing data architecture and integration, establishing consistent dimensions, and...

Big Data #266: Canal Integration with Kafka - Real-time D...

This article introduces Alibaba's open-source Canal tool, which implements Change Data Capture (CDC) by parsing MySQL binlog. Demonstrates how to integrate...

Realtime Warehouse - ODS Lambda Architecture Kappa Archit...

In internet companies, common ODS data includes business log data (Log) and business DB data. For business DB data, collecting data from relational databases...

Canal Deployment: Installation, Service Startup and Commo...

Canal is an open-source data synchronization tool from Alibaba for MySQL database incremental log parsing and synchronization. It simulates the MySQL slave...

Canal Working Principle: Workflow and MySQL Binlog Introd...

Canal is an open-source tool for MySQL database binlog incremental subscription and consumption, primarily used for data synchronization and distributed...

Canal Data Sync: Introduction, Background, Principles and...

Alibaba B2B's cross-region business between domestic sellers and overseas buyers drove the need for data synchronization between Hangzhou and US data centers.

Realtime Warehouse - Business Database Table Structure: T...

Realtime data warehouse is a data warehouse system that differs from traditional batch processing data warehouses by emphasizing low latency, high throughput,...