Uberduck — Clone for Speech & Singing Maker-friendly cloning for TTS, singing, and rapping; supports voice conversion and API access. 0360 Voice Cloning# API# rapping# singing
Play.ht — Voice Cloning & Cross-Language Cloning plus cross-language synthesis and style transfer for creators, apps, and localization workflows. 0390 Voice Cloning# API# creators# cross-language
Resemble AI — Custom & Rapid Voice Cloning High-quality cloning from short samples, fast ‘Rapid’ cloning option, and enterprise controls for brands and studios. 0380 Voice Cloning# API# brands# enterprise
ElevenLabs — AI Voice Cloning Consumer-to-enterprise voice cloning with instant and professional modes, multilingual support, and a large public voice library. 0340 Voice Cloning# API# instant# multilingual
Gladia Speech-to-Text Multilingual ASR API (async and live) with add-on audio intelligence; developer-focused implementation. 0380 Speech-to-Text# API# ASR# async
Rev AI Speech-to-text APIs for streaming and async transcription, plus language ID and topic/sentiment insights. 0530 Speech-to-Text# API# ASR# batch
Deepgram Speech-to-Text Real-time and batch API with low latency, enterprise scaling, and model choices for accuracy or speed. 0420 Speech-to-Text# API# batch# Deepgram
AssemblyAI Speech-to-Text Developer-friendly ASR API offering streaming, async, and audio intelligence (diarization, topics, sentiment). 0360 Speech-to-Text# API# ASR# AssemblyAI
IBM Watson Speech to Text Speech recognition for applications and contact centers with security and deployment flexibility. 0380 Speech-to-Text# API# ASR# contact center
Google Cloud Speech-to-Text Cloud API for real-time and batch transcription with wide language coverage and enterprise integrations. 0440 Speech-to-Text# API# ASR# batch
Amazon Transcribe Managed ASR service on AWS with streaming and async transcription, custom vocabulary, and domain tunes. 0330 Speech-to-Text# API# ASR# AWS
iSpeech — TTS Platform Cloud TTS and SDKs for web/mobile; natural-sounding voices and developer APIs. 0390 Text-to-Speech# API# iSpeech# mobile
Murf AI — Text-to-Speech Web studio and API with 150–200+ voices, fine control of pitch, pace, and styles. 0300 Text-to-Speech# API# Murf# Studio
WellSaid Labs — Enterprise TTS Human-quality voices modeled on licensed actors; studio workflows and strong brand controls. 0420 Text-to-Speech# API# enterprise# licensed voices
Play.ht — AI Voice Generator Low-latency TTS and voice cloning with 200+ realistic voices and a developer-friendly API. 0300 Text-to-Speech# API# low latency# Play.ht