Tag: kylin

10 articles

Apache Kylin Cube7 Practice: Aggregation Group/RowKey/Enc...

Covers Aggregation Group, Mandatory Dimension, Hierarchy Dimension, Joint Dimension usage trade-offs, and explains impact of dictionary encoding, RowKey order, ShardBy sharding on build and query p...

Apache Kylin 1.6 Streaming Cubing Practice: Kafka to Minu...

Kafka→Kylin real-time OLAP pipeline, providing minute-level aggregation queries for common 2025 business scenarios (e-commerce transactions, user behavior, IoT monitoring).

Apache Kylin Segment Merge Practice: Manual/Auto Merge, R...

Apache Kylin Segment merge practice tutorial, covering manual MERGE Job flow, continuous Segment requirements, Auto Merge multi-level threshold strategy, Retention Threshold cleanup logic, deletion...

Apache Kylin Cuboid Pruning Practice: Derived Dimensions ...

Cuboid pruning optimization: When there are many dimensions, Cuboid count grows exponentially, causing long build time and storage expansion. Engineering...

Apache Kylin Cube Practice: Complete Guide for Modeling, ...

Apache Kylin 4.0 Cube modeling and query acceleration method: Complete star modeling with fact tables and dimension tables, design dimensions and measures, use...

Apache Kylin Incremental Cube & Segment Practice: Daily P...

Using date field of Hive partitioned table as Partition Date Column, split Cube into multiple Segments, incrementally build by range to avoid repeated computation of historical data; also compare f...

Apache Kylin Cube Practice: Hive Load & Pre-computation A...

OLAP example: Generate dimension and fact data via Python, after Hive (wzk_kylin) load, design Cube in Kylin (dimensions/measures/Cuboids), and provide...

Apache Kylin Cube Practice: From Modeling to Build and Qu...

Apache Kylin (3.x/4.x) Cube setup and optimization: complete flow from DataSource → Model → Cube, covering dimension modeling, measure design, Cuboid...

Apache Kylin Comprehensive Guide: MOLAP Architecture, Hiv...

Background, evolution and engineering practice of Apache Kylin, focusing on MOLAP solution implementation path for massive data analysis. Core keywords: Apache...

Apache Kylin 3.1.1 Deployment on Hadoop 2.9/Hive 2.3/HBas...

Complete deployment record of Apache Kylin 3.1.1 on Hadoop 2.9.2, Hive 2.3.9, HBase 1.3.1, Spark 2.4.5 (without-hadoop, Scala 2.12) and three-node...