Data Platform AI Startup Jobs
Explore startups tagged with Data Platform and compare hiring activity, company profiles, and direct job links. This page is indexable only when a tag reaches at least 5 companies to avoid thin content.

Tractian
116 jobsTractian is a Manufacturing AI company providing industrial maintenance and reliability solutions, combining hardware, sensors, software and AI to modernize manufacturing maintenance processes as the Industrial Copilot.

Flagship Pioneering
104 jobsBiotech venture creation firm that invents and builds platform companies transforming human health and sustainability.

AlphaSense
48 jobsMarket intelligence platform using AI to analyze business documents and extract insights for investment professionals.

Peregrine
47 jobsReal-time public safety data platform for law enforcement and first responders.

Addepar
34 jobsWealth management platform providing data aggregation and portfolio analytics for financial advisors.

QuEra Computing
34 jobsNeutral-atom quantum computing systems and software.

Tessell
29 jobsFully managed, multi-cloud DBaaS for enterprise data and AI workloads.

Profound
25 jobsProfound is the leading AI search visibility and optimization platform that helps enterprise brands understand and optimize how they appear across AI-powered search engines like ChatGPT, Perplexity, and other generative AI platforms. Serving over 10% of the Fortune 500 including Target, Walmart, and Figma, the company has pioneered the category of Answer Engine Optimization (AEO).

Mixpanel
24 jobsProduct analytics platform that helps companies understand user behavior, measure product adoption, and make better decisions across web and mobile experiences.

SiftStack
20 jobsUnified observability platform for hardware sensor data

Halter
15 jobsVirtual fencing and livestock management using AI-powered smart collars

Labelbox
14 jobsData platform for creating and managing training data for machine learning applications.

Snorkel AI
14 jobsData-centric AI platform for programmatically labeling and managing training data.

Altana AI
6 jobsSupply chain intelligence platform using AI to provide global value chain transparency for enterprises and governments.

Variance
6 jobsCustomer growth platform empowering all employees to drive business revenue.

Summation
5 jobsSummation delivers a decision-grade AI platform that helps enterprise teams generate insights, automate workflows, and surface strategic opportunities with real-time, verified intelligence for faster and more confident decision-making.

WorkHelix
5 jobsEnterprise AI value quantification platform measuring and optimizing AI investment impact.

Polars
4 jobsPolars is a blazingly fast DataFrames library written in Rust, offering Python, R, Node.js, and SQL bindings for efficient, multi-threaded data manipulation at scale.

Datavations
3 jobsAI analytics platform for building materials and home improvement manufacturers.

Activeloop
Activeloop provides a data platform for AI and unstructured data, helping teams build, manage, and query multimodal datasets for AI search and model training.

AI Clearing
AI Clearing provides AI-powered construction progress monitoring and reporting software. Its platform leverages machine learning and drone-captured data to deliver automated digital field tracking, 3D site reports, and 4D geospatial analytics for energy, utilities, and infrastructure projects.

AIR Platforms
AIR Platforms develops AI-powered credit intelligence and workflow software for public and private credit markets, focused on faster underwriting and monitoring decisions.

AirWorks
AirWorks provides geospatial AI and aerial mapping workflows to convert raw field data into utility and infrastructure-ready design deliverables.

Albert Invent
Albert Invent develops AI software for chemistry and materials R&D teams to accelerate formulation, experiment planning, and commercialization workflows.

Aleph
Aleph is an AI-native FP&A platform that connects data, strategy, and spreadsheets for finance teams. It automates financial models, streamlines budgeting and forecasting with 150+ data source integrations. Customers include Zapier, Webflow, and Notion.

Allium
Allium is an enterprise-grade blockchain data platform that provides accurate, fast, and simple blockchain data across 100+ blockchains and 1000s of schemas, delivering historical and real-time data through interactive dashboards, APIs, and custom integrations for institutions and crypto enterprises.

Alloy
Alloy is a data platform for robotics that helps companies process, organize, and search through the massive volumes of sensor, camera, and telemetry data their robots generate. The Sydney-based startup enables natural language search across robot data and automated issue detection, reducing data processing time by up to 90%.

alphaXiv
alphaXiv is an AI research platform that helps users understand academic papers in minutes by bringing together papers, benchmarks, and implementations in one place. The platform bridges the gap between AI research and practice, enabling engineers to quickly surface relevant papers, compare methods, and interact with implementations. Since launching in 2024, alphaXiv has reached millions of users across academia and industry.

Anaconda
Anaconda provides the world's most popular open-source Python and R distribution for data science and AI development. Serving over 45 million users, its platform enables enterprises to manage packages, environments, and AI workflows at scale with security and governance controls.

Angitia
Angitia is a biotechnology company developing novel therapies for endocrine and metabolic diseases, including rare disease programs.

Anomalo
Anomalo is an AI-powered enterprise data quality monitoring platform that automatically detects data issues across warehouses and lakes without manual rule configuration. The platform uses machine learning to monitor structured and unstructured datasets for enterprises like Block and Discover Financial.

Apheris
Apheris provides governed, privacy-preserving data access and collaboration for AI and analytics across sensitive datasets.

Arize
Arize builds AI observability and evaluation software for machine learning, LLM, and agent systems, helping teams trace, monitor, and improve production model performance.

Armada
Edge computing company providing modular data centers and connectivity platforms to deliver cloud and AI capabilities to remote environments.

Array Labs
Array Labs builds high-power, low-cost space radar systems and a formation-flying radar constellation for large-scale 3D Earth imaging and persistent monitoring.

Artie
Fully managed change data capture (CDC) streaming platform that replicates production databases into data warehouses and lakes in real time. Trusted by Substack, ClickUp, and Alloy, processing over 700 billion rows annually.

Artificial Societies
Artificial Societies builds large-scale AI simulations of human societies to help organizations predict how real-world audiences think, feel, and respond to messaging, strategies, and campaigns. Each simulation deploys networks of 300 to 5,000 interconnected AI personas grounded in real-world behavioral data.

Astromech
Astromech is an AI-and-biology startup recruiting across genomics, evolutionary modeling, synthetic data, and alignment research to decode and simulate complex living systems.

Atlas Data Storage
Atlas Data Storage develops DNA-based data storage systems for ultra-long-term, resilient archival storage.

Atomic Industries
Atomic Industries is an AI-powered tool-and-die manufacturer that applies machine learning to modernize precision manufacturing.

Auger
Supply chain technology platform founded by former Amazon exec Dave Clark, integrating with inventory systems to provide AI-powered insights for revolutionizing global supply chain operations.

Augment (goaugment)
Augment builds AI teammates for logistics operations, mapping skills and workflows to deploy the Augie AI productivity platform across supply chain tasks.

autone
AI-powered inventory management platform that helps retail and fashion brands predict demand, optimize stock levels, reduce waste, and increase sales through intelligent forecasting.

Axiom
Axiom is a cloud-native observability platform providing serverless log management, tracing, and AI telemetry. It eliminates the need to manage complex infrastructure or worry about data limits when storing and querying event data.

Axle Energy
Energy flexibility platform-connects distributed assets with markets to decarbonize the grid.

Basedash
Basedash is an AI-native business intelligence platform that lets teams create dashboards and understand their data using natural language. It connects to 600+ data sources and provides shared dashboards without requiring SQL.

Basis Research
Basis Research Institute is a nonprofit applied AI research organization building a universal reasoning engine by establishing the mathematical principles of intelligence. The organization focuses on probabilistic programming, causal reasoning, and program synthesis to solve intractable scientific and societal problems at unprecedented scale.

bem
bem provides a production layer for unstructured data, turning documents, audio, images, and other messy inputs into deterministic, schema-enforced outputs for operational workflows.

BLDX
BLDX is an AI-powered construction management platform that creates permanent digital building identities using blockchain technology. The platform ingests and analyzes construction documents in real-time to identify risks early, helping stakeholders reduce claim costs by an average of 30% or eliminate them entirely.

Braintrust
Braintrust builds evaluation and observability infrastructure for AI applications, enabling teams to test prompts, monitor quality, and ship LLM features with higher reliability.

Breakthru Medicine
Breakthru Medicine is a biotech company developing novel oncology therapies, with a focus on first-in-class radioconjugate treatment approaches for cancer.

Chalk
Chalk builds a real-time inference data platform that gives ML and AI teams low-latency feature computation, retrieval, and production orchestration for online decisioning workloads.

Chamber Cardio
Chamber Cardio develops technology-enabled cardiovascular care services focused on improving heart health outcomes and expanding access to cardiac care.

Chroma
Chroma builds open-source data infrastructure for AI, giving developers an embeddings database and retrieval layer to add memory, search, and state to LLM applications.

Clay
Sales and marketing data platform that helps teams enrich leads, personalize outreach, and automate GTM workflows with AI and data sources.

Conduct
Conduct builds an AI platform for SAP customization work, helping enterprise teams analyze, migrate, and modernize complex SAP changes faster and with less manual engineering overhead.

Conductor Quantum
A company building quantum computers on silicon chips using AI software to create qubits 1000x faster than current manual methods.

Convoke
AI-native operating system for biopharma companies, unifying internal and external data, codifying decision logic, and generating critical deliverables across the drug development lifecycle.

Count
Count is a collaborative agentic analytics platform that helps data teams move from analysis to decision-making with canvases, SQL, Python, and AI-assisted exploration.

Cradle
Cradle builds an AI-powered protein engineering platform that helps biotech teams design and optimize proteins faster across discovery workflows.

Credal.ai
Credal provides a secure AI agent platform for enterprises, enabling teams to build AI agents and MCP-connected workflows across internal data sources with governance controls.

Crusoe
Crusoe builds renewable-powered AI infrastructure, including cloud, data center, and manufacturing capabilities designed for large-scale AI workloads.

Crux
Marketplace and tooling for transferable clean-energy tax credits, extending into broader climate finance.

Cube
Cube is the AI-powered financial intelligence platform built for FP&A teams, providing a spreadsheet-native solution that integrates with Excel and Google Sheets to streamline financial planning, analysis, and reporting workflows.

Cyera
Cyera is an AI-native data security platform that helps enterprises discover, classify, govern, and protect sensitive data across cloud, SaaS, on-premises, and AI environments. The platform uses machine learning and LLMs for high-precision data classification at scale.

Dagster
Dagster builds open-source and commercial orchestration tooling that helps data teams ship, observe, and scale pipelines with a modern developer experience.

Datacurve
Frontier coding data provider for foundation models, using gamified bounty platform with 14,000+ vetted engineers to create expert-quality training data for leading AI labs.

David AI
David AI is the world's first dedicated audio data research lab, building the data layer for next-generation audio AI. Founded by former Scale AI engineers, serving most FAANG companies and major AI labs.

Debut
Debut is a beauty biotechnology company that uses AI-powered ingredient discovery and biofermentation to create high-performing, sustainable beauty ingredients from molecules that nature produces only in minute quantities. Named one of TIME100 Most Innovative Companies in 2025, the company partners with major brands like L'Oreal to develop next-generation skincare actives with a focus on skin longevity. Headquartered in San Diego, Debut has raised over $89 million and screens billions of candidate molecules through its proprietary platform.

Deepnote
Collaborative cloud data notebook platform for data science and analytics teams.

Definite
Definite combines a cloud data warehouse, metrics layer, notebooks, dashboards, and AI assistant workflows into an all-in-one analytics platform for faster self-serve analysis.

Density
Provides privacy-first sensors and software to measure how workplaces are used and optimize real estate.

Dimension Labs
Language data infrastructure platform transforming unstructured conversational data from chats, calls, surveys, and social media into actionable insights for enterprise digital transformation.

Ditto
Ditto builds an edge synchronization platform and mobile database that keeps apps in sync even when devices are offline or intermittently connected.

Doorstep
Predictive delivery intelligence platform providing centimeter-precise indoor tracking for last-mile delivery. Uses smartphone sensors to solve the 'last 500 feet' problem for logistics and e-commerce.

DualBird
DualBird provides a cloud-native hardware-software data and AI infrastructure engine that delivers 10-100x faster performance and 50-90% lower costs through FPGA-based acceleration.

Edia
Edia is an AI-powered unified data platform designed to boost learning outcomes in K-12 mathematics education by enabling teachers to differentiate instruction and deliver personalized help 10x faster, backed by a guarantee of improved state exam results.

Egra AI
Egra AI builds foundation models for EEG brain-signal data and provides reusable embeddings for real-time brain-computer interface applications in adaptive software and neurological monitoring.

Ellipsis Labs
Crypto exchange infrastructure company behind Phoenix, a decentralized exchange on Solana.

Emerald AI
Grid-aware orchestration software that schedules AI data-center compute based on real-time power signals.

EnFi
EnFi builds an AI-powered financial intelligence platform for lenders and financial institutions, focused on credit underwriting, portfolio monitoring, and risk analysis workflows.

Enode
Enode provides an API that connects energy devices like EV chargers, batteries, and solar systems for energy apps.

Enterpret
The leading customer intelligence platform that unifies all feedback with AI, turning noise into intelligence that drives growth for product and CX teams.

Enveda
AI-powered drug discovery platform advancing nature-inspired therapeutics from plants and natural products into clinical development.

Eon
Eon is the first cloud backup posture management (CBPM) platform, automating and unifying complex cloud backups into a queryable data lake for fast recovery, compliance, and AI analytics. Founded by the team behind AWS Disaster Recovery, Eon converts idle backup data into an accessible secondary storage layer for enterprise AI workloads.

Espresso AI
Espresso AI uses generative AI and machine learning to automatically optimize SQL queries and reduce cloud compute costs by up to 70-80% for Snowflake data warehouse users. The platform integrates with existing data warehouse setups to analyze and optimize queries in real time using NLP, program synthesis, and reinforcement learning.

Exa
Embeddings-based search infrastructure providing a meaning-aware search API for AI applications. Exa trains embedding models to convert web pages into vector representations, enabling semantic search that understands intent rather than just matching keywords.

Expert Intelligence
Expert Intelligence builds an AI decision layer for laboratory workflows in pharma, materials, food, and environmental testing, emphasizing governed and traceable decisions.

Felt
Felt is a cloud-native GIS platform and collaborative mapping software that enables users to create custom geospatial maps using AI, transforming location data into tailored visualizations for asset monitoring, project planning, and energy sector applications.

Firecrawl
Firecrawl is a web data infrastructure platform that converts websites into clean, structured data optimized for AI applications through a simple API, turning entire websites into LLM-ready markdown or structured data.

Flare
Flare helps security teams detect, prioritize, and remediate external cyber threats before they escalate with automated threat exposure management and security intelligence.

Flatfile
AI-assisted data exchange platform that helps teams collect, map, validate, and transform messy customer data before it enters core systems.

Floqer
Floqer is a CRM enrichment platform that uses AI and real-time data to clean and enrich company and contact records for GTM, sales, and marketing teams.

Forerunner (withforerunner)
Forerunner builds AI-enabled software for public-sector emergency management and disaster recovery workflows, helping agencies manage grants, compliance, and program operations.

Fundamental
Fundamental builds large tabular models and enterprise AI infrastructure for prediction and analysis on complex business data, focused on tabular reasoning and decision support.

Generation Lab
Generation Lab is an AI biotechnology company building an agent scientist platform to program the immune system and accelerate antibody and cell therapy discovery.

Genesis AI
Genesis AI is building a universal foundation model for robotics, training on large-scale synthetic physical-world data for research and production systems.

Gensyn
Decentralized protocol that executes and verifies machine learning workloads across any device, unifying global compute into a single open network.

Grafana Labs
Company behind the open-source Grafana observability stack providing monitoring, logging, and tracing solutions, reaching $400M ARR as a fully remote company across 40+ countries.

Harmonic
Harmonic builds an AI-powered startup intelligence platform used by venture and growth teams to discover companies, monitor markets, and run sourcing workflows.

Haus
Haus builds incrementality measurement software that helps consumer brands run geo experiments, quantify causal lift, and optimize media spend across online and offline channels.

Human Behavior
Human Behavior builds AI-powered behavioral analytics that analyzes user-session activity to explain why users convert, churn, or stay engaged.

Instill AI
Instill AI is a persistent AI workspace that transforms complex documents into actionable insights. Users can upload unstructured data such as financial reports, legal contracts, and research papers, ask questions across entire projects, and receive answers traced to specific sources with no coding required.

Internet Backyard
Internet Backyard is building the financial backbone for the global compute economy, helping operators finance and scale infrastructure.

Isometric
Isometric is an AI-native certification and registry platform for carbon removal and broader industrial climate attributes, focused on rigorous measurement, verification, and issuance.

Istari Digital
Digital engineering platform focused on digital twins and secure collaboration for modeling and simulation of complex systems.

Julius
Julius is an AI-powered data analysis platform that acts as a personal data scientist, enabling users to analyze and visualize datasets and perform predictive modeling through natural language prompts. With over 2 million users generating more than 10 million visualizations, Julius makes data science accessible to everyone.

Keychain
Keychain is an AI-powered platform for CPG manufacturing that connects brands and retailers with a global network of over 30,000 vetted manufacturers. Its KeychainOS operating system helps manufacturers manage production cycles with AI-driven compliance, planning, and traceability modules.

Langfuse
Langfuse is an open source LLM engineering platform providing observability, prompt management, evaluations, and datasets for AI application development. Acquired by ClickHouse in January 2026, the platform integrates with OpenTelemetry, LangChain, OpenAI SDK, and other major frameworks.

LGND
LGND is building 'ChatGPT for the Earth,' a geospatial AI infrastructure platform that makes the world's satellite and geographic data as accessible and searchable as text. Using transformer-based geographic embeddings, LGND enables teams to create, adapt, and scale geospatial datasets across time and geography with reduced technical overhead.

Lightdash
Lightdash is an open-source BI platform built on dbt that enables self-serve analytics and metrics for modern data teams.

Lila Sciences
A platform using artificial intelligence to support research in life sciences, chemical sciences, and materials sciences through autonomous AI labs building scientific superintelligence.

LlamaIndex
LlamaIndex is a data framework for LLM applications that enables developers to connect, index, and query custom data sources with large language models through their open-source library and LlamaCloud platform.

Loyal
Loyal is a clinical-stage veterinary medicine company developing longevity drugs to help dogs live longer, healthier lives. They are the first company to receive FDA approval for a longevity drug candidate targeting aging in dogs.

Luzmo
Embedded analytics platform enabling SaaS companies to build and deploy customer-facing dashboards directly into their products with drag-and-drop interfaces and code-based SDKs.

Lyric
Lyric is an AI-powered supply chain platform that provides unified modeling, planning, and frontier intelligence for enterprise supply chains. Serving Fortune 500 companies including Coca-Cola, Estee Lauder, and Nike, it makes advanced supply chain science accessible through composable, integrated, and extensible solutions.

Mage
Mage is an open-source, AI-native data pipeline platform that enables teams to build, run, and manage data pipelines for integrating and transforming data using Python, SQL, and R. Available as both open-source and enterprise versions, it provides real-time and batch pipeline orchestration.

Mainstay
Comprehensive market intelligence platform for single-family rental industry, aggregating data from 50+ sources to provide actionable insights for real estate market participants.

Manifold Bio
The first high-throughput in vivo discovery engine combining massively multiplexed in vivo screening and AI-powered design to create tissue-targeted medicines.

Marqo
Marqo is a vector search platform that enables developers to build smarter search experiences with multimodal AI, improving conversion rates for e-commerce, recommendations, and content discovery.

Micro1
Micro1 is an AI platform for human intelligence that helps AI companies find, vet and manage human contractors for data labeling and training, transforming human expertise into high-quality datasets for frontier AI models.

Micruity
Micruity provides retirement-income data infrastructure connecting recordkeepers, insurers, and asset managers to operationalize lifetime income products.

Middesk
Middesk is a business identity platform that automates business verification, risk assessment, and underwriting for financial institutions and enterprises. Backed by Sequoia, Accel, and Insight Partners, the platform enables companies to verify business identities quickly and accurately.

MotherDuck
MotherDuck is a serverless cloud data warehouse built on the open-source DuckDB engine, enabling fast SQL analytics with no infrastructure to manage. The platform supports hybrid local-cloud execution, allowing analysts to query data seamlessly across laptop and cloud.

Neon
Neon is a serverless Postgres platform that separates storage and compute to enable instant database branching, autoscaling to zero, and bottomless storage. Acquired by Databricks for approximately $1 billion in May 2025, Neon continues as an independent product.

Nexthop AI
Nexthop AI builds networking systems for AI-scale data centers, focusing on high-performance switching infrastructure for hyperscale and cloud environments.

Nile
Nile is a Postgres backend for modern B2B SaaS applications, bundling multi-tenant primitives like auth, authorization, and tenant-aware data infrastructure.

Nomic AI
Open-source AI company building tools to structure, understand, and collaborate with unstructured data including GPT4ALL, Atlas visualization platform, and Nomic Embed text embeddings.

Nuraline
Nuraline is an applied AI company building a forward-deployed AI agent that enables AI systems to continuously self-improve in production by connecting telemetry, traces, and user feedback to generate evaluations and tested improvements across the technology stack. The platform works closely with product teams and enterprises requiring measurable reliability gains in reasoning-heavy applications like enterprise search and web navigation. Currently operating as Introspection (introspection.dev), the company is in its earliest stage of development.

Omnea
Omnea provides an AI-native procurement orchestration platform that unifies intake, approvals, vendor risk, and sourcing workflows for enterprise procurement and finance teams.

Omni
Omni is a modern business intelligence and analytics platform that combines a unified semantic data model with SQL flexibility, enabling AI-powered trustworthy answers in seconds. The platform supports embedded analytics, custom dashboards, and governed data exploration.

Onyx
Onyx is an open-source AI platform that connects to your company's docs, apps, and people, enabling enterprise search and AI-assisted knowledge retrieval across 40+ internal data sources. Used by hundreds of thousands of users including Netflix and Ramp, backed by Khosla Ventures and First Round Capital.

ORO
ORO is a human intelligence protocol that enables individuals to contribute private data to AI model training while preserving privacy through zkTLS and TEE cryptography. Users earn token rewards for their data contributions.

Osium AI
Osium AI builds an AI platform for materials and chemicals R&D, helping industrial teams accelerate discovery, formulation, and optimization workflows.

Osmind
Osmind is a public benefit corporation building technology for breakthrough mental health research and treatment, offering a specialized EHR platform for clinicians and researchers working with emerging therapies like ketamine and psychedelics.

Paradigm
Paradigm is an AI-native spreadsheet platform that puts swarms of intelligent agents at your fingertips. Each cell can be powered by its own AI agent to crawl the web, fill in data, and automate structured data workflows. Backed by General Catalyst and Y Combinator.

Parallel
Parallel builds web infrastructure for AI agents, providing APIs and tooling that let models search, browse, and interact with the live web more reliably.

Patch
Platform scaling unified climate action by helping organizations source, purchase, and manage carbon credits with efficiency, transparency, and rigor. API-first infrastructure for the voluntary carbon market.

Peach Finance
API-first loan management platform for lenders to automate servicing, payments, and compliance.

Peec AI
Platform offering AI search analytics for marketing teams, helping brands monitor and improve their visibility in AI-powered search results across ChatGPT and Perplexity

Pendulum
AI-first planning platform for real-time supply chain planning, helping enterprises predict demand, optimize supply, and improve asset visibility.

Phylo
Phylo is an applied AI research laboratory developing Biomni Lab, an integrated workspace enabling scientists to plan, author, execute, and collaborate on complex biological research tasks with state-of-the-art agentic AI.

Pienso
No-code AI platform enabling non-technical teams to build and deploy machine learning models, turning text data into insights without coding for researchers, marketers, and support teams.

Pinecone
Pinecone is a fully managed vector database that makes it easy to add vector search to production applications, powering AI applications with fast and scalable similarity search at any scale.

PlanetScale
PlanetScale is a managed database platform built on Vitess that helps developers ship MySQL workloads with branching, non-blocking schema changes, and global scale.

Planhat
Next-generation customer platform that centralizes customer data, automates workflows, and drives customer experience across the entire lifecycle. Serves over 1,000 customers globally.

Pogo
Pogo empowers consumers to unlock value from their personal data by providing cashback rewards in exchange for purchase data, helping merchants gain real-time consumer insights while enabling over 3 million users to earn and save money.

Positron
Positron develops AI inference hardware and systems designed to deliver lower-cost, lower-power deployment for large language models and other modern AI workloads.

PostHog
PostHog is an open-source product analytics platform providing analytics, session recording, feature flags, A/B testing, and surveys in a single platform for developer teams building successful products.

Power
Clinical trials access platform that matches patients to studies and streamlines enrollment for healthcare providers and life sciences teams.

Prior Labs
Prior Labs builds tabular foundation models that understand spreadsheets and databases, enabling instant pattern inference across any dataset without task-specific training. Their flagship model TabPFN, trained on 130 million synthetic datasets, ranks #1 on the TabArena benchmark and scales to 10 million rows, serving Fortune 500 companies like Hitachi.

Prisma
Prisma builds an open-source ORM, Prisma Postgres, and related tooling for developers building modern data-intensive applications.

Protege
Protege operates a governed marketplace platform for ethical sourcing of multimodal, real-world AI training data with compliant data exchange capabilities.

Qdrant
Qdrant develops an open source vector database for semantic search and retrieval workloads in production AI systems.

RadiantGraph
RadiantGraph provides an AI-powered platform for healthcare organizations to deliver personalized patient engagement and marketing management at scale. The company helps health plans use data-driven insights to improve member engagement and outcomes.

Rainmaker
Rainmaker is making Earth habitable through pioneering modern precipitation enhancement systems using radar validation weather-resistant drones numerical weather modeling and sustainable cloud seeds for water resource solutions.

Rally
Rally is a research operations platform that helps teams manage participant recruitment and research workflows.

Recall.ai
Universal API for real-time meeting data, providing developers with recordings, transcripts, and metadata from Zoom, Google Meet, Microsoft Teams, and other video conferencing platforms. Powers over 300 enterprise clients including Instacart and Sybill.

Response
YC-backed procurement platform simplifying indirect spend management for operations teams in 3PLs, distributors, and retailers with real-time pricing from 100+ industrial vendors.

Roboflow
Roboflow provides end-to-end computer vision tools that enable developers and enterprises to build, deploy, and manage vision AI models. The platform supports the full lifecycle from data labeling and model training to inference deployment, serving over 250,000 developers and powering applications across industries including manufacturing, retail, and agriculture.

Rows
Rows is an AI-powered spreadsheet platform that lets teams extract, connect, and analyze data from PDFs, databases, and APIs using natural language. Acquired by Superhuman in February 2026, with rows.com winding down by May 2026.

Rutter
Unified API platform that connects business data across commerce, accounting, and payments platforms, enabling B2B SaaS companies to integrate financial data seamlessly.

Sennos
Sennos builds AI software for fluidics and fermentation operations, focusing on process monitoring, optimization, and bioprocess analytics.

Sentra
Sentra provides cloud-native Data Security Posture Management platform, securing enterprise data for AI adoption with 300% YoY revenue growth and multiple Fortune 500 customers.

Shovels
Shovels builds construction intelligence software that turns fragmented building permit data into actionable market and go-to-market signals through APIs and analytics tools.

Smart Bricks
Smart Bricks is an AI proptech platform for real-estate investment intelligence, combining data ingestion, valuation modeling, and transaction workflow support.

Sorcerer
Climate data company deploying high-altitude balloons to gather atmospheric measurements and improve weather forecasting models.

Spiral
Spiral is a data infrastructure company that provides a multimodal data platform for AI, unifying governance and exposing a single API for every data modality including video, audio, geospatial, and text, engineered for machine-scale throughput to keep GPUs fully saturated.

Starcloud
Starcloud is building orbital data-center infrastructure to provide large-scale compute capacity in space, leveraging abundant solar power and passive cooling.

Strella
AI-powered customer research platform that uses AI-moderated interviews and real-time synthesis to turn customer research from weeks into hours, helping companies gather human insights efficiently.

Structify
AI-powered data platform that transforms unstructured web data and documents (websites, PDFs, pitch decks, reports) into structured, enterprise-ready datasets using their proprietary DoRa model that navigates and extracts data like a human, enabling real-time web extraction for business intelligence and data workflows.

Sundial
AI-native analytics platform helping teams centralize data, generate insights, and drive faster smarter decisions with data-informed intelligence

Superlinked
Superlinked builds information retrieval infrastructure for AI applications, combining multimodal and structured signals to improve enterprise search and matching.

Supper
AI-native agentic data platform that integrates with SaaS tools and data warehouses, cleanses and normalizes data, and enables self-serve insights through natural language.

SynthBee
SynthBee is an AI-native technology company building enterprise AI and cloud infrastructure solutions, with hiring currently routed through Gusto-hosted postings.

Tako
Tako builds an AI data analyst that embeds into applications and returns cited answers with shareable visualizations across external and internal data sources.

Teleskope
An agentic data security platform that autonomously scans, catalogs, and classifies data in motion and at rest while automating remediations for the AI era.

TensorWave
TensorWave provides AMD-powered AI infrastructure and cloud GPU clusters for large-scale training and inference workloads.

Third Arc Bio
Third Arc Bio is an immunology and inflammation-focused biotechnology company developing targeted immune therapeutics across oncology and inflammatory disease programs.

Thread AI
Thread AI builds enterprise AI infrastructure with a knowledge-graph-based context engine to improve AI application accuracy and reliability.

TigerEye
TigerEye delivers go-to-market planning and analytics software that helps revenue teams forecast pipeline and optimize sales execution.

Tinybird
Tinybird is a real-time data platform that enables data and engineering teams to build real-time data products and APIs at scale. The platform ingests, transforms, and serves large volumes of data with sub-second latency for analytics and operational intelligence.

Tomorrow.io
Tomorrow.io delivers weather intelligence software and proprietary weather data products for aviation, defense, logistics, and enterprise operations.

Tracer
Tracer is the first pipeline monitoring system purpose-built for high-performance computing in life sciences, providing real-time performance metrics, cost breakdowns, and optimization insights for complex computational pipelines.

Transcend
Transcend is an enterprise-grade data privacy infrastructure platform that serves as the compliance layer for customer data. It enables organizations to automate data subject requests, map data across systems, manage consent, and activate data for AI responsibly at scale.

TRM Labs
TRM Labs builds blockchain intelligence software for financial institutions, crypto businesses, and public-sector investigators to detect illicit activity and support compliance workflows.

Truv
Consumer-permissioned data platform that provides API access to payroll, employment, and financial data for streamlined income and employment verification, asset verification, and direct deposit switching.

Ultrahuman
Ultrahuman is a comprehensive health technology platform offering wearable devices including the Ring AIR for sleep and activity tracking, M1 continuous glucose monitoring, and Home environmental health monitoring. The company combines real-time biometric data with AI-driven insights.

Union.ai
Union.ai provides production infrastructure for AI and ML workflows on top of Flyte, helping engineering teams orchestrate, deploy, and operate reliable model and data pipelines.

Upscale AI
Upscale AI provides high-performance networking solutions for AI infrastructure. The company delivers open-standard connectivity for specialized computing, making AI more accessible through ready-to-use networking platforms designed for next-generation AI workloads.

Urban SDK
Urban SDK delivers geospatial AI and mobility analytics software for cities, transportation agencies, and public safety teams to improve planning and operational decisions.

V7
V7 provides a data engine for computer vision teams to manage datasets, label data, and train AI models at scale.

Valar Atomics
Nuclear energy innovation company developing advanced reactor prototypes using silicon carbide technology for next-generation nuclear power

Vectara
Vectara provides a retrieval-augmented generation platform with enterprise search, grounded generation, and evaluation tooling to help teams build production-grade generative AI assistants.

Vega
Vega offers a lightweight, AI-native security analytics fabric that provides instant access to all data sources without requiring migration, designed to replace traditional SIEM solutions with state-of-the-art security analytics for modern enterprises.

Viam
Viam is a software platform that makes the physical world programmable, enabling engineers to build, deploy, and manage robotics and IoT applications. Founded by MongoDB co-founder Eliot Horowitz, Viam powers AI and automation solutions across robotics, food and beverage, climate tech, marine, and industrial manufacturing.

Vultr
Vultr offers global cloud compute services, providing high-performance, cost-effective virtual servers and GPUs for developers and businesses:contentReference[oaicite:29]{index=29}.

Walrus
Walrus is a decentralized data layer for AI and Web3 builders, designed to store large datasets, media, and application state with programmable, verifiable access on Sui.

Watershed
Watershed provides enterprise sustainability software that helps companies measure emissions, manage climate disclosures, run decarbonization programs, and finance carbon removal.

WebAI
Enterprise-grade platform running powerful AI on local devices without cloud dependency, enabling companies to work with sophisticated AI models while keeping sensitive data in place.

Weka
WEKA builds a cloud and AI data platform that accelerates model training and inference workloads with high-performance, software-defined storage.

WisdomAI
Agentic BI and data-insights platform that unifies structured/unstructured data and answers with verified source retrieval.

Wiza
Wiza is a B2B sales prospecting platform that provides real-time verified emails and phone numbers from LinkedIn, with 99%+ deliverability rates for outbound sales teams.

Zenlytic
AI-powered self-service business intelligence platform featuring a natural language AI data analyst that connects to cloud data warehouses like Snowflake, BigQuery, and Databricks.

ZeroGPT
Free AI content detection tool designed to distinguish between AI-generated and human-written text with advanced detection algorithms
FAQ
What is the Data Platform tag page on Fast AI Startup Jobs?
It is a curated landing page that groups AI startup companies tagged with Data Platform, plus links to their company profiles and available jobs.
How many Data Platform companies are included?
This page currently lists 205 companies tagged with Data Platform.
How many jobs are associated with Data Platform companies?
The companies on this page currently account for 553 listed jobs in our public dataset (subject to regular updates).
What roles are most common at Data Platform companies?
Based on currently listed jobs for Data Platform companies, the most common role groups are Engineering (2033), Other (1057), Sales (467).
What funding stages are most common among Data Platform companies?
Common funding stages on this Data Platform page include Series A (74), Seed (37), Series B (35), Series C (15).
Where do the job links go?
Job links point to official company career pages or public job listings, not re-hosted application forms.
How often is this tag page refreshed?
Data is refreshed on a near-daily cadence as public company and job listings change.