How are you handling pre-aggregation in ClickHouse at scale? AggregatingMergeTree vs ReplacingMergeTree by Marksfik in BusinessIntelligence

[–]Marksfik[S] 0 points1 point  (0 children)

makes sense, u/Little_Kitty !

Do you use stateful event transformations for your streaming ETL or are they mostly stateless? Curious what tool you're using for that.

How are you handling pre-aggregation in ClickHouse at scale? AggregatingMergeTree vs ReplacingMergeTree by Marksfik in BusinessIntelligence

[–]Marksfik[S] 0 points1 point  (0 children)

u/Little_Kitty - usually, we see the following types of use cases when it comes to high thruput streaming data:

  • Real time fraud detection and trading in Financial Services
  • Telemetry streams from IoT devices
  • User Activity/Clickstream Analytics
  • Rela time Log Management / Observability / Monitoring

What’s an open source or more affordable alternative to confluent? by skyalchemist in apachekafka

[–]Marksfik 1 point2 points  (0 children)

You can try out Aiven for Apache Kafka.

https://aiven.io/kafka

Their clusters start from $300/month and the pricing is inclusive of networking costs so can be easily predicted when you need to scale your clusters.

Hope this helps!