Tiered Storage
Affordably scale real-time analytics workloads
Analyze all of your data, new and historical, without slowing down queries or blowing your budget. StarTree automatically stores the data you access most often for super-fast results.
Drastically reduce TCO and increase query efficiency
Simplify your data stack
Reduce your cost to serve data
Configure based on your data needs
Greater storage flexibility and efficiency
Unlock timely and accurate insights from your data in motion and power your real-time analytics application.
Tiered storage
Reduce overall storage costs via Tiered Storage. Users can choose costly high I/O, SSD-based server nodes for fresh data and cheaper server nodes (with spinning disks) for less frequently used data. In addition, StarTree also provides a way to offload historical data into cloud object storage (eg: Amazon S3) while retaining the ability to query it seamlessly along with the local datasets.
Columnar database
StarTree, powered by Apache Pinot, is a columnar database that stores the incoming data in a column-wise fashion rather than row-wise. This means that all values in a column are stored together, making it easier to compress similar values.
Compression algorithms
Compression algorithms like Dictionary encoding and Run-Length encoding go hand in hand with columnar representation, wherein repeated values are represented in a more concise form and only unique values are stored in the underlying storage in the raw form. This greatly reduces the on-disk footprint needed to store user data.