StarTree Opens the Iceberg Lakehouse to the Outside World

StarTree today announced support for Apache Iceberg in StarTree Cloud, enabling StarTree Cloud to serve as both the analytics and serving layer on top of Iceberg and to deliver interactive insights to internal and external applications directly from the data lakehouse. With this launch, StarTree redefines what’s possible with Iceberg, transforming it from a passive storage format into a real-time backend that can power customer-facing applications and AI agents at high concurrency, serving thousands of simultaneous users with consistent speed and reliability.

While the Apache Iceberg table format and the Apache Parquet file format have become popular open standards for managing data in the lakehouse, they are not query engines, and most existing query engines built around them struggle to meet the performance SLAs required for external-facing, high-concurrency analytical applications. As a result, companies have historically avoided serving data directly from their lakehouse. Instead, they have relied on reverse ETL pipelines or transformed the data into proprietary formats for separate serving systems, adding latency, complexity, and cost. StarTree removes these constraints by offering real-time query acceleration directly on native Iceberg tables. By combining open formats like Parquet and Iceberg with Pinot’s powerful indexing and high-performance serving capabilities, StarTree enables applications to deliver live, interactive insights directly from the lakehouse without data duplication, format conversion, or operational trade-offs.

A Real-Time Serving Layer for Iceberg

StarTree Cloud integrates directly with Iceberg using open standards (the Parquet file format and the Iceberg table format) and enhances performance with powerful indexing, intelligent materialized views (StarTree Index), and localized caching. Unlike traditional tools such as Presto, Trino, or ClickHouse, which rely on lazy loading and scanning, StarTree is built for low-latency, high-concurrency access, making it well suited to powering interactive dashboards, real-time data products, and operational workloads with strict SLAs.

Key capabilities include:

● Native support for Apache Iceberg and Parquet in StarTree Cloud
● Real-time indexing and aggregations, including support for numerical, text, JSON, and geo indexes
● Intelligent materialized views via the StarTree Index
● Local caching and pruning for low-latency, high-concurrency queries
● No data movement required: serve directly from Iceberg
● Intelligent prefetching from Iceberg, minimizing irrelevant data scans

With StarTree Cloud, companies can now unlock the full potential of their lakehouse investments and deliver modern, intelligent user experiences without architectural sprawl.

Availability

Support for Apache Iceberg in StarTree Cloud is available today in private preview. For more information, visit www.startree.ai.

Supporting Resources

● StarTree Adds Native Iceberg Support: Serve High-Concurrency Queries Directly from Your Lakehouse
