Multi-Stage Query Engine

Analyze your data at your speed

Run complex SQL queries without sacrificing query performance, and get the business insights your teams need in real time.


Deliver fast and accurate insights across large datasets

Improve query performance

Allocate resources more effectively without increasing costs

Scale to handle increased data volumes and query complexity

Run complex queries with native query-time joins

Unlock timely and accurate insights from your data in motion and power your real-time analytics application.

Joins Support

Run joins at scale with native support for serving them at subsecond latency. The Multi-Stage Query Engine supports three join strategies: Broadcast Joins, Shuffle Hash Distributed Joins, and Lookup Joins. With this capability, StarTree covers the full spectrum of join workloads, from user-facing analytics to ad hoc analytics.
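To make the difference between two of these strategies concrete, here is a minimal Python sketch. It is a toy illustration under stated assumptions, not StarTree's implementation: the function names (`hash_join`, `broadcast_join`, `shuffle_hash_join`) and the in-memory row format are hypothetical. A broadcast join sends the whole small table to every worker, while a shuffle hash join repartitions both tables by the join key so matching rows meet on the same worker.

```python
from collections import defaultdict

def hash_join(left, right, key):
    """Join two lists of dicts on `key` using an in-memory hash table."""
    table = defaultdict(list)
    for row in right:
        table[row[key]].append(row)
    return [{**l, **r} for l in left for r in table.get(l[key], [])]

def broadcast_join(left_partitions, right_rows, key):
    """Each worker holds one left partition and receives the full right side."""
    out = []
    for part in left_partitions:
        out.extend(hash_join(part, right_rows, key))
    return out

def shuffle_hash_join(left_rows, right_rows, key, workers=2):
    """Repartition both sides by hash(key), then join partition by partition."""
    left_parts = [[] for _ in range(workers)]
    right_parts = [[] for _ in range(workers)]
    for row in left_rows:
        left_parts[hash(row[key]) % workers].append(row)
    for row in right_rows:
        right_parts[hash(row[key]) % workers].append(row)
    out = []
    for lp, rp in zip(left_parts, right_parts):
        out.extend(hash_join(lp, rp, key))
    return out
```

Both functions return the same joined rows. What differs is how much data moves: broadcasting copies the small side to every worker, while shuffling moves both sides once.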

Multi-Stage Execution Model

The multi-stage execution model is designed for queries that need more than one round of processing. It adds an intermediate compute stage (a set of processing servers and a data exchange mechanism) that lets StarTree handle more complex processing requirements by offloading computation from the brokers.
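The flow described above can be sketched in a few lines of Python. This is a simplified model under assumptions, not the engine's actual code: the stage names (`leaf_stage`, `exchange`, `intermediate_stage`) and the `(country, amount)` row shape are hypothetical. Data servers pre-aggregate locally, an exchange routes each key to one intermediate worker, and the broker only has to collect the finished results.

```python
from collections import Counter

def leaf_stage(segment_rows):
    """Runs on a data server: scan local rows and pre-aggregate per key."""
    partial = Counter()
    for country, amount in segment_rows:
        partial[country] += amount
    return partial

def exchange(partials, num_workers):
    """Route every key to exactly one intermediate worker by hashing it."""
    mailboxes = [[] for _ in range(num_workers)]
    for partial in partials:
        for key, value in partial.items():
            mailboxes[hash(key) % num_workers].append((key, value))
    return mailboxes

def intermediate_stage(mailbox):
    """Runs on a processing server: finish the aggregation for its keys."""
    total = Counter()
    for key, value in mailbox:
        total[key] += value
    return total

def run_query(servers, num_workers=2):
    """Broker view: collect disjoint results from the intermediate workers."""
    partials = [leaf_stage(rows) for rows in servers]
    result = {}
    for mailbox in exchange(partials, num_workers):
        result.update(intermediate_stage(mailbox))
    return result
```

Because each key lands on a single intermediate worker, the broker in this sketch merges disjoint results and does no aggregation itself.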

Indexing and Pruning

StarTree offers a rich set of indexing and pruning techniques to speed up query processing on individual tables. These techniques help reduce the overhead of scanning and aggregations to improve query latency and throughput.
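As one example of how pruning cuts scan overhead, the following Python sketch skips whole segments whose time range cannot match the filter. It assumes each segment stores the minimum and maximum timestamp of its rows; the `Segment` class and function names are hypothetical and stand in for whatever metadata the engine actually keeps.

```python
from dataclasses import dataclass

@dataclass
class Segment:
    min_ts: int   # smallest timestamp in the segment (assumed metadata)
    max_ts: int   # largest timestamp in the segment (assumed metadata)
    rows: list    # list of (timestamp, value) pairs

def prune(segments, lo, hi):
    """Keep only segments whose [min_ts, max_ts] range overlaps [lo, hi]."""
    return [s for s in segments if s.max_ts >= lo and s.min_ts <= hi]

def sum_in_range(segments, lo, hi):
    """Sum values with lo <= timestamp <= hi, scanning surviving segments only."""
    survivors = prune(segments, lo, hi)
    total = sum(v for s in survivors for ts, v in s.rows if lo <= ts <= hi)
    return total, len(survivors)
```

The second return value reports how many segments were actually scanned, which makes the saving easy to see on a small dataset.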

Data Layout

Improve join performance by adapting each join to the data layout. StarTree optimizes joins for three cases: data that is partitioned and co-located, data that is partitioned but not co-located, and random layouts (data that is neither partitioned nor co-located).
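A rough way to picture the three layouts is a classifier over table metadata, sketched below in Python. Everything here is an illustrative assumption: the `TableMeta` fields, the `classify` function, and especially the data movement suggested for each case in `MOVEMENT` are hypothetical, not a description of the engine's planner.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class TableMeta:
    partition_key: Optional[str]   # None means rows are placed randomly
    num_partitions: int
    server_of_partition: tuple     # server_of_partition[i] hosts partition i

def classify(left, right, join_key):
    """Return which of the three layouts a join between two tables falls into."""
    partitioned = (left.partition_key == join_key == right.partition_key
                   and left.num_partitions == right.num_partitions)
    if not partitioned:
        return "random"
    if left.server_of_partition == right.server_of_partition:
        return "partitioned_colocated"
    return "partitioned_not_colocated"

# Assumed, simplified data movement for each layout.
MOVEMENT = {
    "partitioned_colocated": "join locally with no exchange",
    "partitioned_not_colocated": "move matching partitions of one side",
    "random": "hash-shuffle both sides on the join key",
}
```

For example, two tables partitioned on `user_id` with the same partition-to-server mapping classify as `partitioned_colocated`, the case that needs the least data movement in this sketch.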

Johan Adami
Software Engineer, Stripe
