Skip to content
brickster.ai
Startups

Databricks-ecosystem startups

Companies building on, for, or around Databricks. Curated from the Databricks Ventures portfolio, Partner Directory, AI Accelerator cohort, and the 2025 Partner Awards. Independent reference, not affiliated; ask us to add or remove a company.

72 active · 36 built for · 14 built on · 22 adjacent · 12 acquired

Showing 30 of 72 startups
  • Alation

    Series D+ · $340M raised

    Enterprise data intelligence and catalog platform with active metadata and agentic AI capabilities.

    Data catalog and governance platform with Unity Catalog integrations. Listed in the Databricks Ventures portfolio.

    2012Redwood City, USA501+
    Ecosystem-adjacentCatalogDatabricks Ventures portfolio; Technology PartnerSources
  • dbt Labs

    Series D+ · $416M raised

    SQL-based data transformation framework that has become the standard for analytics engineering.

    SQL-based transformation framework that runs as a first-class citizen on Databricks. Both a Databricks Ventures portfolio company and a 2025 Partner Award winner. Now merging with Fivetran.

    2016Philadelphia, USA501+
    Ecosystem-adjacentDev ToolingDatabricks Ventures portfolio; Data Integration Partner of the Year 2025Sources
  • Glean

    Series D+ · $600M raised

    AI-powered enterprise knowledge assistant that searches and reasons across company applications.

    Enterprise AI search and agents platform. Part of the Databricks Ventures portfolio; valued at $7.2B in its 2025 Series F.

    2019Palo Alto, USA501+
    Built on DatabricksVertical AIDatabricks Ventures portfolioSources
  • Hightouch

    Series D+ · $322M raised

    Composable customer data platform and reverse-ETL that activates lakehouse data into business tools.

    Reverse-ETL pioneer that activates Databricks data into 200+ business tools. Both a Databricks Ventures portfolio company and 2025 Partner Award winner.

    2018San Francisco, USA201-500
    Ecosystem-adjacentReverse-ETLDatabricks Ventures portfolio; Retail and CG Data Partner of the Year 2025Sources
  • Immuta

    Series D+ · $267M raised

    Data security platform providing access control, policy enforcement and audit on cloud data platforms.

    Data access control and policy automation that integrates deeply with Databricks Unity Catalog. Part of the Databricks Ventures portfolio.

    2015Boston, USA201-500
    Built for DatabricksSecurityDatabricks Ventures portfolio; Technology PartnerSources
  • Labelbox

    Series D+ · $188M raised

    Training data and human-in-the-loop platform for building ML and frontier-model datasets.

    Training data platform with native Databricks integration. Part of the Databricks Ventures portfolio.

    2018San Francisco, USA201-500
    Built for DatabricksML OpsDatabricks Ventures portfolio; Technology PartnerSources
  • Matillion

    Series D+ · $310M raised

    Data productivity cloud with low-code ETL pipelines and orchestration for cloud data platforms.

    Low-code ETL with native Databricks integration. Part of the Databricks Ventures portfolio.

    2010Manchester, UK201-500
    Ecosystem-adjacentDev ToolingDatabricks Ventures portfolio; Technology PartnerSources
  • Perplexity

    Series D+ · $1.5B raised

    AI-powered conversational search engine that synthesizes web answers with cited sources.

    Consumer AI answer engine. Part of the Databricks Ventures portfolio; valued at $20B in 2025.

    2022San Francisco, USA201-500
    Built on DatabricksVertical AIDatabricks Ventures portfolioSources
  • Sigma

    Series D+ · $742M raised

    Cloud BI and analytics with a spreadsheet-style interface that runs natively on cloud data platforms.

    Spreadsheet-style cloud BI on Databricks. Both Databricks Ventures portfolio and 2025 BI Partner of the Year.

    2014San Francisco, USA501+
    Ecosystem-adjacentBIDatabricks Ventures portfolio; BI Partner of the Year 2025Sources
  • Anvilogic

    Series C · $85M raised

    AI-powered SOC platform for security detection engineering on top of data lakes including Databricks.

    Provides a detection engineering layer that runs natively on Databricks-as-security-data-lake. Won the 2025 Databricks Growth Built on Partner of the Year award.

    2019Palo Alto, USA51-200
    Built for DatabricksSecurityDatabricks Ventures portfolio; 2025 Growth Built Partner of the YearSources
  • Hex

    Series C · $122M raised

    Collaborative analytics and data science notebook platform for teams working with SQL and Python.

    Collaborative analytics workspace with deep Databricks integration. Part of the Databricks Ventures portfolio.

    2019San Francisco, USA51-200
    Ecosystem-adjacentBIDatabricks Ventures portfolioSources
  • Hunters

    Series C · $118M raised

    Modern SOC platform that runs detection and response on a customer's Databricks security data lake.

    Modern SIEM alternative that ingests security data directly into a customer's Databricks lakehouse. First security partner on the open security lakehouse ecosystem.

    2018Tel Aviv, Israel51-200
    Built for DatabricksSecurityDatabricks Ventures portfolio; first SOC platform built on DatabricksSources
  • Mistral AI

    Series C · $2.3B raised

    European generative AI lab building frontier open-weight LLMs and enterprise AI products.

    European frontier LLM lab whose models are served via Databricks Mosaic AI Model Serving. Part of the Databricks Ventures portfolio.

    2023Paris, France201-500
    Built for DatabricksVertical AIDatabricks Ventures portfolio; models hosted on Mosaic AISources
  • Anomalo

    Series B · $82M raised

    Automated data quality monitoring using ML to detect anomalies in data warehouses and lakehouses.

    Builds automated data quality monitoring with native Databricks integration. Part of the Databricks Ventures portfolio.

    2018Palo Alto, USA51-200
    Built for DatabricksData QualityDatabricks Ventures portfolioSources
  • Cube

    Series B

    Universal semantic layer that delivers consistent metrics across BI tools and AI applications.

    Open-source semantic layer that runs on Databricks SQL warehouses. Part of the Databricks Ventures portfolio.

    2019San Francisco, USA51-200
    Built for DatabricksSemantic LayerDatabricks Ventures portfolioSources
  • Galileo

    Series B · $68M raised

    GenAI evaluation and observability platform for measuring LLM and agent quality in production.

    Evaluation intelligence for AI teams. Databricks Ventures led participation in the Series B; later acquired by Cisco.

    2021San Francisco, USA51-200
    Built for DatabricksObservabilityDatabricks Ventures portfolio (acquired by Cisco)Sources
  • LangChain

    Series B · $160M raised

    Open-source agent framework and observability platform (LangSmith) for building LLM applications.

    Agent framework and LLM observability with native Databricks Mosaic AI integration. Part of the Databricks Ventures portfolio; unicorn at $1.25B.

    2022San Francisco, USA51-200
    Built for DatabricksML OpsDatabricks Ventures portfolio; first-class integration with Mosaic AISources
  • Lovable

    Series B · $552M raised

    AI software creation platform that turns natural-language prompts into shippable web applications.

    Vibe-coding platform for building apps from natural language. Part of the Databricks Ventures portfolio; valued at $6.6B in late 2025.

    2023Stockholm, Sweden51-200
    Built on DatabricksDev ToolingDatabricks Ventures portfolioSources
  • Noma Security

    Series B · $132M raised

    AI security platform for governing models, agents and third-party AI applications across the enterprise.

    AI model and agent security platform with native Databricks integration. Part of the Databricks Ventures portfolio.

    2023Palo Alto, USA51-200
    Built for DatabricksSecurityDatabricks Ventures portfolioSources
  • Omni

    Series B · $95M raised

    Business intelligence platform combining a semantic model with self-serve exploration for analysts.

    BI platform from ex-Looker and Stitch leaders. Part of the Databricks Ventures portfolio.

    2022San Francisco, USA51-200
    Ecosystem-adjacentBIDatabricks Ventures portfolioSources
  • Prophecy

    Series B · $78M raised

    Low-code data engineering and self-serve transformation platform optimized for Databricks SQL.

    Visual data engineering and AI-driven data prep, deeply integrated with Databricks Lakeflow and SQL. Part of the Databricks Ventures portfolio.

    2017San Ramon, USA51-200
    Built for DatabricksDev ToolingDatabricks Ventures portfolioSources
  • Replit

    Series B

    Browser-based software creation platform that lets anyone build and deploy apps using natural language.

    AI software creation platform. Part of the Databricks Ventures portfolio.

    2016Foster City, USA51-200
    Built on DatabricksDev ToolingDatabricks Ventures portfolioSources
  • Snowplow

    Series B · $60M raised

    Behavioral data platform that captures rich event streams and lands them in lakehouses.

    Behavioral data pipeline that lands AI-ready customer events into Databricks. Part of the Databricks Ventures portfolio.

    2012London, UK51-200
    Ecosystem-adjacentStreamingDatabricks Ventures portfolio; Technology PartnerSources
  • SuperAnnotate

    Series B · $75M raised

    AI pipeline platform for building high-quality multimodal datasets for ML and LLM training.

    Training data platform that won the 2025 Databricks Customer Impact Partner of the Year award.

    2018San Francisco, USA201-500
    Built for DatabricksML OpsDatabricks Ventures portfolio; 2025 Customer Impact Partner of the YearSources
  • Unstructured

    Series B · $65M raised

    Data transformation tools that turn unstructured documents into LLM-ready structured data.

    Unstructured-data ETL for LLMs and RAG. Part of the Databricks Ventures portfolio.

    2022San Francisco, USA51-200
    Built for DatabricksDev ToolingDatabricks Ventures portfolioSources
  • Cleanlab

    Series A · $30M raised

    Automated data curation and quality platform that fixes label errors and trustworthiness issues for AI.

    Data quality platform for ML and LLM training data. Part of the Databricks Ventures portfolio; later acquired by Handshake.

    2021San Francisco, USA11-50
    Built for DatabricksData QualityDatabricks Ventures portfolio (acquired by Handshake)Sources
  • Gable

    Series A · $20M raised

    Data contracts and governance platform to manage source-data changes before they break downstream pipelines.

    Data contracts platform that powers the 'shift-left' movement. Part of the Databricks Ventures portfolio.

    2023Seattle, USA11-50
    Built for DatabricksGovernanceDatabricks Ventures portfolioSources
  • LanceDB

    Series A · $38M raised

    AI-native multimodal data lakehouse with vector indexing on top of an open columnar format.

    Multimodal lakehouse purpose-built for AI workloads. Part of the Databricks Ventures portfolio.

    2022San Francisco, USA11-50
    Ecosystem-adjacentOtherDatabricks Ventures portfolioSources
  • LlamaIndex

    Series A · $28M raised

    Framework and cloud platform for building data-backed agentic LLM applications over unstructured data.

    Open-source framework and managed cloud for RAG and agentic AI. Part of the Databricks Ventures portfolio.

    2023San Francisco, USA11-50
    Built for DatabricksML OpsDatabricks Ventures portfolioSources
  • OpenRouter

    Series A

    Unified API and marketplace for accessing hundreds of large language models with usage billing.

    Multi-model LLM router and marketplace. Part of the Databricks Ventures portfolio.

    2023New York, USA11-50
    Built for DatabricksML OpsDatabricks Ventures portfolioSources