Vosk: Lightweight open-source offline ASR toolkit supporting many languages and on-device use. Tags: Speech-to-Text, ASR, multilingual, offline
Rev AI: Speech-to-text APIs for streaming and async transcription, plus language ID and topic/sentiment insights. Tags: Speech-to-Text, API, ASR, batch
Otter.ai: AI meeting assistant providing live transcripts, summaries, and action items for Zoom/Teams/Meet. Tags: Speech-to-Text, Google Meet, meeting notes, Otter
Trint: AI transcription and collaborative editor with search, highlights, and team sharing. Tags: Speech-to-Text, ASR, collaboration, editor
Temi: Affordable automated transcription service with quick turnaround and easy exports. Tags: Speech-to-Text, affordable, automatic, captions
OpenAI Whisper: Open-source multilingual ASR model known for robustness on diverse audio and accents. Tags: Speech-to-Text, multilingual, offline, open source
Google Cloud Speech-to-Text: Cloud API for real-time and batch transcription with wide language coverage and enterprise integrations. Tags: Speech-to-Text, API, ASR, batch
Deepgram Speech-to-Text: Real-time and batch API with low latency, enterprise scaling, and model choices for accuracy or speed. Tags: Speech-to-Text, API, batch, Deepgram
NVIDIA NeMo ASR: Open toolkit with SOTA ASR models (Conformer, CTC/Transducer) for training and deployment. Tags: Speech-to-Text, ASR models, Conformer, NeMo
Microsoft Azure Speech to Text: Enterprise speech-to-text as part of Azure AI Speech, supporting streaming, batch, and customization. Tags: Speech-to-Text, ASR, Azure, batch
Descript Transcription: Fast AI transcription integrated into a powerful audio/video editor for creators and teams. Tags: Speech-to-Text, captions, creator tools, Descript
Speechmatics Speech-to-Text: High-accuracy, low-latency enterprise ASR with multilingual/code-switching and real-time or batch modes. Tags: Speech-to-Text, accuracy, batch, enterprise
Gladia Speech-to-Text: Multilingual ASR API (async and live) with add-on audio intelligence; developer-focused implementation. Tags: Speech-to-Text, API, ASR, async
IBM Watson Speech to Text: Speech recognition for applications and contact centers with security and deployment flexibility. Tags: Speech-to-Text, API, ASR, contact center
AssemblyAI Speech-to-Text: Developer-friendly ASR API offering streaming, async, and audio intelligence (diarization, topics, sentiment). Tags: Speech-to-Text, API, ASR, AssemblyAI