Picovoice Leopard Private, on-device speech-to-text SDK delivering cloud-level accuracy without sending data out. 0340 Speech-to-Text# Leopard# offline# on-device
Vosk Lightweight open-source offline ASR toolkit supporting many languages and on-device use. 01020 Speech-to-Text# ASR# multilingual# offline
Temi Affordable automated transcription service with quick turnaround and easy exports. 0440 Speech-to-Text# affordable# automatic# captions
Trint AI transcription and collaborative editor with search, highlights, and team sharing. 0430 Speech-to-Text# ASR# collaboration# editor
Sonix Automated transcription with in-browser editing, translation, and export for 50+ languages. 0300 Speech-to-TextTranslation# automatic transcription# captions# editor
Descript Transcription Fast AI transcription integrated into a powerful audio/video editor for creators and teams. 0390 Speech-to-Text# captions# creator tools# Descript
Otter.ai AI meeting assistant providing live transcripts, summaries, and action items for Zoom/Teams/Meet. 0440 Speech-to-Text# Google Meet# meeting notes# Otter
Gladia Speech-to-Text Multilingual ASR API (async and live) with add-on audio intelligence; developer-focused implementation. 0380 Speech-to-Text# API# ASR# async
Rev AI Speech-to-text APIs for streaming and async transcription, plus language ID and topic/sentiment insights. 0520 Speech-to-Text# API# ASR# batch
Speechmatics Speech-to-Text High-accuracy, low-latency enterprise ASR with multilingual/code-switching and real-time or batch modes. 0400 Speech-to-Text# accuracy# batch# enterprise
AssemblyAI Speech-to-Text Developer-friendly ASR API offering streaming, async, and audio intelligence (diarization, topics, sentiment). 0360 Speech-to-Text# API# ASR# AssemblyAI
Deepgram Speech-to-Text Real-time and batch API with low latency, enterprise scaling, and model choices for accuracy or speed. 0420 Speech-to-Text# API# batch# Deepgram
NVIDIA NeMo ASR Open toolkit with SOTA ASR models (Conformer, CTC/Transducer) for training and deployment. 0430 Speech-to-Text# ASR models# Conformer# NeMo
Meta SeamlessM4T Research-grade foundation for ASR and speech translation across ~100 languages. 0360 Speech-to-Text# ASR# Meta# multilingual
NVIDIA Riva GPU-accelerated ASR/TTS SDK for low-latency, on-prem or cloud voice AI deployments. 0350 Speech-to-Text# ASR# GPU# low latency