Voice AI Startup Jobs
Explore startups tagged with Voice AI and compare hiring activity, company profiles, and direct job links. This page is indexable only when a tag reaches at least 5 companies to avoid thin content.

Abridge
40 jobs
Medical AI platform that converts patient-clinician conversations into structured clinical notes.

Deepgram
25 jobs
Enterprise voice AI platform offering Speech-to-Text, Text-to-Speech, and Voice Agent APIs for real-time and batch use, deployable in cloud or self-hosted.

ElevenLabs
18 jobs
ElevenLabs is the leading AI voice technology platform, offering ultra-realistic text-to-speech, voice cloning, and audio AI tools used by content creators, enterprises, and developers worldwide. The company has grown to over $330M in annual recurring revenue and serves customers across 45+ countries including Deutsche Telekom and Revolut.

Descript
17 jobs
AI-powered audio and video editing platform that makes editing as easy as editing text.

Rilla
17 jobs
AI-powered virtual ride-along platform for field sales and service teams.

Speak
15 jobs
AI-powered language-learning app focused on spoken fluency.

Siro
12 jobs
AI conversation intelligence and coaching for in-person/field sales teams.

Hume AI
8 jobs
Emotion and prosody AI that powers empathic voice interfaces and multimodal understanding.

BoldVoice
6 jobs
An English pronunciation and accent coaching app combining expert video lessons with real-time AI feedback.

SuperDial
4 jobs
AI-powered phone automation platform for healthcare and insurance revenue cycle management.

SendBird
2 jobs
Messaging and chat API/SDK platform for scalable in-app communication.

Alex AI
An autonomous AI recruiter that automates interviewing and recruiting, conducting thousands of voice interviews to help companies hire the best talent.

Alleviate Health
Alleviate Health builds conversational AI agents that accelerate patient recruitment for clinical research sites. Their AI engages patients 24/7 across SMS and Voice, having supported 500K+ patient interactions across 190+ sites.

Alterego
Alterego is building a near-telepathic neural interface that translates silent speech and biosignals into computing input for more intuitive human-computer interaction.

Aqua Voice
Aqua Voice is AI-powered voice input software that lets users write at 4x typing speed, with sub-50ms startup and a 3.2% word error rate. It works across any app, including Cursor, Gmail, Slack, and the terminal.

AssemblyAI
AssemblyAI develops speech AI APIs and voice models for transcription, speech understanding, and developer workflows that turn audio into structured application data.

Assort Health
Assort Health builds agentic AI voice agents for healthcare call centers, automating appointment management and patient phone calls with specialty-specific generative AI. The platform handles patient access workflows that previously consumed thousands of human hours, improving patient experience while reducing operational costs. With $102 million in total funding, Assort Health serves major healthcare organizations across the United States.

Async
Async (formerly Podcastle) is an AI-powered audio and video content creation platform that helps podcasters, educators, and marketers produce professional-grade multimedia content.

Avoca
Avoca provides AI customer service agents for home service businesses, helping HVAC, plumbing, and electrical contractors answer calls, support customers, and convert leads 24/7.

Beside
AI voice startup building an AI receptionist for small businesses that handles millions of calls, with appointment booking and customer follow-ups.

Bland
Builds programmable AI voice agents to handle customer calls and connect to CRM/support workflows.

Boardy
Boardy is an AI networking product that uses conversational workflows to introduce founders, operators, investors, and other professionals who should know each other.

Camradia
AI meeting companion that captures context, action items, and follow-ups across calls.

Cartesia
Cartesia builds real-time voice and multimodal AI models with low-latency inference for production applications. Its platform combines foundational model research with developer APIs for voice agents, audio generation, and on-device intelligence.

Character.AI
Character.AI is a consumer AI chat platform where users create and interact with AI characters for entertainment, roleplay, and conversational experiences across web and mobile.

David AI
David AI is the world's first dedicated audio data research lab, building the data layer for next-generation audio AI. Founded by former Scale AI engineers, it serves most FAANG companies and major AI labs.

Dimension Labs
Language data infrastructure platform transforming unstructured conversational data from chats, calls, surveys, and social media into actionable insights for enterprise digital transformation.

EliseAI
EliseAI is an AI automation platform for the housing and healthcare industries, building AI workflows that automate the entire consumer journey from leasing to resident management. Valued at $2.2B after a $250M Series E led by a16z, EliseAI serves 28 of the top 30 property owners in the US.

Fathom
Fathom is an AI meeting assistant that records, transcribes, and summarizes meetings, with workflow automations for CRM and team collaboration.

Fireflies
AI meeting assistant that records, transcribes, and summarizes meetings into searchable notes.

Fish Audio
Fish Audio develops multilingual AI voice generation and voice cloning infrastructure for creators and developers, with text-to-speech APIs and audio production tools.

GigaML
AI voice agent platform for customer care that handles complex workflows at scale, with over 90% resolution accuracy in production.

Gradium
Gradium is a Paris-based AI company building ultra-low latency voice foundation models that enable near-instant, expressive, and multilingual voice AI at scale. Spun out of French AI lab Kyutai, the company was founded by former Google DeepMind researchers who pioneered key advances in audio language modeling. Backed by a $70 million seed round from FirstMark Capital, Eurazeo, Eric Schmidt, and Xavier Niel, Gradium aims to make voice the universal interface for AI.

HappyRobot
Voice AI workers that handle calls, emails, and routine communications.

Heidi Health
AI care partner platform for clinicians that automates clinical documentation, streamlines workflows, and improves patient care. Used across Australia, the US, UK, and Canada with partnerships including the NHS.

Hippocratic AI
Hippocratic AI builds safety-focused healthcare LLMs and voice agents for non-diagnostic patient engagement, navigation, education, and administrative workflows.

Hyro
Hyro provides a responsible AI agents platform for healthcare to automate patient communications across voice and digital channels.

Interhuman AI
Interhuman AI builds the social intelligence layer for AI systems, enabling them to read, interpret, and respond to human behavior in real time. Their technology combines body language, facial expression, and tone-of-voice analysis with contextual awareness to create more natural human-AI interactions.

iyo
Building revolutionary agentic AI audio computers without screens, spun out of Alphabet X. The flagship iyo One is a wearable ear-computer enabling interaction with apps entirely through voice.

Joi AI
Platform for building AI-lationships, offering real-time AI-powered chat, photos, and videos for self-exploration and connection with digital characters.

JuicyChat
AI chat platform for NSFW character interactions, supporting customizable and multimodal conversations with virtual characters.

Kouper
AI platform for care navigation that uses context-rich voice/SMS agents and EHR integrations to close care gaps after discharge.

Layercode
Layercode provides voice AI infrastructure for developers, enabling production-ready, low-latency voice AI agents using TypeScript. The platform handles real-time audio processing across 330+ global edge locations with sub-50ms latency, supporting multiple LLM and voice providers.

Listen Labs
An AI-powered platform that conducts thousands of voice interviews simultaneously for customer research, delivering actionable insights in hours instead of weeks.

LiveKit
LiveKit is an open-source infrastructure platform for building realtime voice, video, and AI agent applications. Their platform powers products like OpenAI's ChatGPT Voice Mode, serving over 200,000 developers with billions of calls per year.

Lorikeet
Lorikeet is an AI customer support platform that provides a universal concierge for complex issue resolution.

Nirva
Nirva is building an AI-powered wearable companion focused on journaling, emotional reflection, and guided daily wellness support.

Nooks
AI sales assistant platform that automates cold calling, prospecting, and coaching for sales reps, using AI dialers, prospecting assistants, and real-time coaching to increase rep productivity.

Nowadays
Nowadays is the first AI-native platform for corporate event planning, automating everything from venue sourcing to vendor negotiations. Built by MIT engineers, it leverages a database of 400,000 global venues and AI agents that contact venues by email and phone to manage end-to-end event logistics.

Omi
Omi builds a wearable and app-based AI memory assistant for capture, transcription, search, and automated follow-up tasks.

Orum
Orum is an AI-powered live conversation platform that supercharges outbound sales activity by automating dialing, detecting voicemails, and connecting reps with live prospects faster. The platform includes AI coaching tools, a virtual salesfloor, and analytics to improve conversation quality.

OurDream.ai
AI companion platform with 2 million monthly users, featuring personalized, uncensored interactions through text chats, voice calls, image generation, and emotional feedback systems.

Parloa
Parloa builds an enterprise AI agent platform for customer service teams to design, test, and scale voice and chat agents across millions of customer conversations.

Payman
Payman AI is an agentic AI platform for banking institutions that enables autonomous execution of financial transactions through conversational interfaces. The platform deploys AI agents that process payments, transfers, and account analysis via voice or text on existing banking rails, with built-in policy enforcement, compliance logging, and audit trails.

PolyAI
Enterprise-grade AI voice assistants handling complex customer interactions across hospitality, healthcare, and financial services, spun out of Cambridge University.

Posh
Unified AI platform for financial institutions that enables banks and credit unions to offer banking services through AI-powered chat and voice conversations.

Quo
Quo, formerly OpenPhone, offers an AI-powered business communications platform for calls, messaging, and customer relationship workflows.

Retell AI
AI voice platform enabling businesses to build and deploy AI-powered voice agents for phone operations, contact centers, and customer service. API-driven solution with natural, low-latency conversations and multilingual support.

Sauron
Sauron is the world's first perceptual home security platform that continuously scans the perimeter of your home and reliably identifies and deters all threats. It uses multi-modal sensor fusion, combining high-resolution cameras with LiDAR, radar, and thermal sensors for real-time 360-degree perception.

Screenpipe
Screenpipe is an open source desktop AI memory platform that continuously captures screen and audio data for local search and automation.

Sierra
Sierra is a conversational AI platform for businesses, co-founded by former Salesforce co-CEO Bret Taylor and ex-Google executive Clay Bavor. It is valued at $10B with $100M+ ARR, and voice is now its primary channel.

Sitch
Sitch is an AI-powered dating and matchmaking platform that combines concierge-style setup workflows with voice and messaging experiences to help users find intentional relationships.

Slingshot AI
AI research & products focused on emotional understanding and mental-health use cases (e.g., empathetic voice/agents).

Subtle Computing
Subtle Computing builds Voicebuds, AI-powered earbuds that isolate a chosen speaker in noisy environments to make in-person conversations clearer and more accessible.

Synthflow
Synthflow provides a no-code voice AI platform for building and deploying production phone agents for customer support, sales, and operations workflows.

Tavus
Tavus is the human computing company building emotionally intelligent AI humans (PALs - Personal Affective Links) that communicate through text, voice, and face-to-face video with agentic capabilities and real-time perception. Powered by proprietary models including Phoenix-4, Sparrow-1, and Raven-1, Tavus enables AI agents that can see, hear, understand emotions, and act autonomously. Over 100,000 developers and enterprises use Tavus technology for recruiting, sales, education, and customer service.

Tin Can
Wi-Fi landline phone for kids and families that reinvents the home phone experience without screens, apps, or texting. Features parental controls via a companion app to approve contacts, set quiet hours, and enable Do Not Disturb. Priced at $75, with free calling between Tin Cans or $10/month for regular phone numbers. Addresses smartphone addiction concerns and went viral, selling out its first two production runs.

Toma
AI voice agents for automotive dealerships that bring intelligent communication automation to dealership operations.

Trace
Trace builds voice AI infrastructure for financial-service customer interactions, focusing on call automation and support workflow execution.

Tucuvi
Tucuvi builds clinically validated voice AI agents for healthcare outreach and follow-up, helping care teams automate patient communication and chronic care workflows.

TurboScribe
AI-powered transcription service using OpenAI Whisper technology to convert audio and video into searchable, editable transcripts with 99.8% accuracy across 98+ languages, offering automated summaries and exports.

Uniphore
Uniphore provides an enterprise AI platform for customer engagement and operations, combining conversational AI, automation, and analytics in its Business AI Cloud.

Vapi
Vapi is building the leading developer platform for conversational voice AI agents, abstracting away the complexity of building these agents and managing real-time infrastructure to enable enterprise-ready voice AI solutions across finance, healthcare, travel, and other industries.

voize
voize builds a voice AI assistant for nursing documentation, helping care teams capture notes hands-free and in real time.

Willow
Willow develops AI voice dictation software designed to dramatically speed up professional writing and communication workflows across email, documents, and messaging tools.

Wispr Flow
AI-powered voice dictation software for seamless text creation, enabling natural speech-to-text conversion with high accuracy.

FAQ
What is the Voice AI tag page on Fast AI Startup Jobs?
It is a curated landing page that groups AI startup companies tagged with Voice AI, plus links to their company profiles and available jobs.
How many Voice AI companies are included?
This page currently lists 76 companies tagged with Voice AI.
How many jobs are associated with Voice AI companies?
The companies on this page currently account for 164 listed jobs in our public dataset (subject to regular updates).
What roles are most common at Voice AI companies?
Based on currently listed jobs for Voice AI companies, the most common role groups are Engineering (760), Other (459), and Sales (246).
What funding stages are most common among Voice AI companies?
Common funding stages on this Voice AI page include Series A (22), Series B (13), Seed (13), and Unknown (8).
Where do the job links go?
Job links point to official company career pages or public job listings, not re-hosted application forms.
How often is this tag page refreshed?
Data is refreshed on a near-daily cadence as public company and job listings change.