Data Strategie

How are you handling pre-aggregation in ClickHouse at scale? AggregatingMergeTree vs ReplacingMergeTree

Reddit r/BusinessIntelligence

Summary

The discussion on pre-aggregation in ClickHouse focuses on the use of AggregatingMergeTree versus ReplacingMergeTree. AggregatingMergeTree combined with materialized views provides fast aggregation for high-throughput data streams, while ReplacingMergeTree ensures idempotency. However, deduplication only occurs at merge time, creating uncertainty in the results.

Read the full article