Tag: Kylin
10 articles
Big Data 165 - Apache Kylin Cube7 Practice: Aggregation Group, RowKey and Encoding
Covers the usage and trade-offs of Aggregation Group, Mandatory Dimension, Hierarchy Dimension, and Joint Dimension, and explains the impact of dictionary encoding and RowKey order.
Apache Kylin 1.6 Streaming Cubing Practice: Kafka to Minute-level OLAP
A Kafka→Kylin real-time OLAP pipeline that provides minute-level aggregation queries for common 2025 business scenarios (e-commerce transactions, user behavior...
Apache Kylin Segment Merge Practice: Manual/Auto Merge, Retention Threshold
A hands-on tutorial on Apache Kylin Segment merging, covering the manual MERGE job flow, the requirement that Segments be contiguous, the multi-level Auto Merge threshold strategy...
Big Data 164 - Apache Kylin Cuboid Pruning Practice: Derived Dimensions & Expansion Control
Cuboid pruning optimization: when a Cube has many dimensions, the Cuboid count grows exponentially, causing long build times and storage expansion.
Big Data 161 - Apache Kylin Cube Practice: Modeling, Building and Query Acceleration
Cube modeling and query acceleration in Apache Kylin 4.0: build a complete star schema from fact and dimension tables, then design dimensions and measures.
Apache Kylin Incremental Cube & Segment Practice: Daily Partition Column
Use the date field of a Hive partitioned table as the Partition Date Column, split the Cube into multiple Segments, and build incrementally by range to avoid repeated computation of hist...
Apache Kylin Cube Practice: Hive Load & Pre-computation Acceleration
Apache Kylin is an open-source distributed analysis engine, focused on providing real-time OLAP (Online Analytical Processing) capabilities for big data.
Apache Kylin Cube Practice: From Modeling to Build and Query
Scenario: using an e-commerce sales fact table, pre-compute aggregations by the "date" dimension in Kylin to accelerate queries.
Apache Kylin Comprehensive Guide: MOLAP Architecture, Hive Integration
The background, evolution, and engineering practice of Apache Kylin, focusing on how its MOLAP approach is implemented for massive data analysis.
Big Data 158 - Apache Kylin 3.1.1 Deployment on Hadoop, Hive and HBase
Complete deployment record of Apache Kylin 3.1.1 on Hadoop 2.9.2, Hive 2.3.9, HBase 1.3.1, Spark 2.4.5 (without-hadoop.