In this episode of the 'Real-Time Analytics' podcast, dive into the world of Pinot capacity planning with Sandeep Dabade, a solutions engineer at StarTree. Discover how to calculate the perfect cluster size for your real-time analytics requirements and explore essential technical KPIs like read throughput, write throughput, and data size. Sandeep shares invaluable insights into optimizing Pinot for seamless data processing and analytics, making this episode essential for anyone tackling real-time data challenges.

Sandeep's Blogs:

Apache Pinot

Star-Tree Indexes in Apache Pinot - Part 1: Understanding the Impact on Query Performance

Star-tree is a specialized index in Apache Pinot. It dynamically builds a tree structure that maintains aggregates for a group of dimensions. With a star-tree index, query latency becomes a function of a single tree traversal, with computational complexity of log(n).

Sandeep Dabade

Apache Pinot

Star-Tree Indexes in Apache Pinot - Part 2: Understanding the Impact during High Concurrency

In this blog, we imitate a real production scenario by firing hundreds of concurrent queries using JMeter and show how a star-tree index helped achieve a >95% drop in p90, p95, and p99 latencies and a 126x increase in throughput.

Sandeep Dabade

Apache Pinot

Star-Tree Indexes in Apache Pinot - Part 3: Understanding the Impact in Real Customer Scenarios

In this part, we cover real customer stories in which the star-tree index improved query performance by 95% to 99%.

Sandeep Dabade

Apache Pinot

Best Practices for Designing Tables in Apache Pinot™

Designing a table and schema is a critical activity with a direct impact on query performance. This blog defines a systematic framework for designing an Apache Pinot table.

Kulbir Nijjer
Sandeep Dabade