Elevate Apache Pinot with StarTree Cloud

StarTree elevates Apache Pinot into a fully managed, production-grade platform, making real-time analytics easier to scale, faster to deploy, and more cost-efficient to operate

Do More With Data
1721172643727
Expert Support

Support from the creators of Apache Pinot

When you become a StarTree customer, you’re not just adopting a technology — you’re gaining access to a team that consists of many of the founding engineers and committers Apache Pinot.

Our team work hands-on with yours – from onboarding through production, helping with everything from data modeling and pipeline integration to query optimization and system tuning. Whether you need help scaling a critical dashboard, configuring ingestion from Kafka, or navigating schema design, you’ll have direct access to the same experts who helped companies like Stripe, Cisco, and DoorDash build their Pinot-based systems.

Deployment Options

Fully managed, with flexible deployment options

StarTree offers deployment options that give you flexibility to work with preferred cloud vendors, with data stored in your cloud or ours, and single tenant or multiple tenant architectures.

StarTree Cloud

SaaS

Get started quickly with the fully managed, StarTree Cloud service. Ideal for organizations that want zero ops overhead and rapid time-to-value.

Setup is simple, leaving you to focus on your data, your schema, and queries. And your SaaS instance is isolated from noisy neighbors at the infrastructure level.

Bring your own cloud

BYOC

For customers who prefer dedicated infrastructure and deeper integration with enterprise systems, we offer a Bring Your Own Cloud (BYOC) model with VPC peering, custom SLAs, and enhanced observability.

In this case, the entire software stack is deployed in your cloud account and data never leaves your governance boundary

Bring your own KUBERNETES

BYOK

For highly regulated environments, StarTree can be deployed behind your firewall—offering full control while still leveraging the power and capabilities of StarTree Cloud.

With this model you can deploy the entire stack in an existing Kubernetes cluster. This can be done in a given cloud provider (eg: Amazon EKS) or in a self-managed Kubernetes cluster on baremetal.

Extended Capabilities

StarTree enhances Apache Pinot with features that make it more powerful while lowering costs

Precision Fetch

Interactive Queries with Tiered Storage

StarTree’s “Precision Fetching” on tiered storage enables Pinot to do fast, interactive queries on data stored in cloud object stores (like S3). This capability is highly performant as it is able to fetch only the data needed for the query.

Precise fetch makes it viable to move historical data to more affordable cloud storage – all while maintaining interactive query response times.

Learn More

Scalable Upserts & Deduplication

Upserts that scale

Upserts in open-source Apache Pinot are limited by the memory overhead needed to handle the primary key map. This puts a limit on how far you can push upsert functionalities.

StarTree Cloud has addressed this limitation to allow updates to billions of primary keys per server—without compromising speed or concurrency. This makes real-time, mutable datasets possible at scale.

Learn More

Data Manager Perspective
IMPROVED DATA MANAGEMENT & ADMINISTRATION

Data management made simpler and more flexible

StarTree’s Data Portal makes it simpler to ingest, manage, and evolve data in Apache Pinot. With intuitive controls for schema evolution, performance tuning, and backfills, you can make critical data changes easily, and without disrupting operations.

Automate operations on StarTree Cloud

Performance Manager

Replaces manual query optimization with ML-driven recommendations and guided index selection. Performance Manager analyzes query patterns to provide one-click optimizations, delivering up to 4x faster queries while balancing storage costs.

Schema Evolution

Enables zero-downtime schema updates in continuous data flows, allowing users to add or modify columns, fields or indexes while the system runs. Ensures continuous query availability during structural changes.

Data Backfill

Automates the detection and filling of data gaps, enabling seamless reloading of data from past events without impacting real-time operations. Maintains consistency across datasets through an automated workflow. More

Dynamic Autoscaling

StarTree enables you to dynamically scale and optimize resources with Minion Autoscaling to avoid paying for idle infrastructure while maintaining performance under load.

Learn More

Role Based Access Control

Support for custom OIDC-compatible Identity Service Providers (IDP) for user authentication and RBAC (Role-Based Access Control) for fast, fine-grained permission enforcement across the data plane. More

Improved Query Console

Interacting with data in Pinot is more efficient with the StarTree Query Console which introduces multi-tabs, syntax highlighting, save query support and a modern UI.

Amberdata Logo White

“The benefits of migrating to StarTree and Pinot center around three areas: query performance, SLAs, and cost. With Pinot, we are now in the subseconds to milliseconds range, the higher query concurrency means we can serve more customers faster, and we’ve been able to reduce our infrastructure costs.”

Amberdata Joanes Espanol
Joanes Espanol
CTO and Co-Founder
SECURITY & PRIVACY

Your data remains private and secure

StarTree is committed to protecting customer and internal data with robust security measures for devices, networks, and systems. StarTree follows strict adherence to regulatory compliance to ensure data integrity and trust.

Security Page Compliance Logos V1

Robust Authentication & Authorization

Encryption of data at rest and in transit

Networking Security

Audit trails and data loss prevention
Keeping Pace with Change

Ready for AI and modern workloads

With StarTree Cloud, Pinot is keeping pace with the latest development and evolving industry standards—making it a future-proof foundation for the next wave of intelligent data products.

Query data in Iceberg & Parquet

Query directly from your data lakehouse. StarTree’s Iceberg and Parquet support opens up the ability to serve low latency and high concurrency analytics without moving data or stitching together brittle pipelines.

More

MCP Server for Apache Pinot

MCP provides a standardized interface for AI models to interact with your data. StarTree’s MCP Server enables AI applictions to get instantaneous responses to queries on rapidly changing data – critical for maintaining AI agent context and fluid interactions

More

Vector auto-embedding for building real-time RAG.

Apache Pinot added support for vectors in 2024. This enables AI models to reason over live data instead of stale snapshots. And with vector auto embedding, StarTree makes that power easier to harness. Generate, ingest, and index vector embeddings in a fully integrated workflow, bypassing the need for complex, stitched-together pipelines.

More

Grafana integration to support observability

StarTree offers seamless integration with Grafana for data visualization and monitoring. This integration allows users to leverage Grafana’s dashboards to interactively visualize data stored and analyzed by Pinot. 

More

Request a Trial

The best way to discover real-time analytics with Apache Pinot is to try it yourself – and there’s no better way than on StarTree Cloud.

Get started today