Vosk Lightweight open-source offline ASR toolkit supporting many languages and on-device use. 01020 Speech-to-Text# ASR# multilingual# offline
Deepgram Speech-to-Text Real-time and batch API with low latency, enterprise scaling, and model choices for accuracy or speed. 0420 Speech-to-Text# API# batch# Deepgram
Picovoice Leopard Private, on-device speech-to-text SDK delivering cloud-level accuracy without sending data out. 0350 Speech-to-Text# Leopard# offline# on-device
AssemblyAI Speech-to-Text Developer-friendly ASR API offering streaming, async, and audio intelligence (diarization, topics, sentiment). 0360 Speech-to-Text# API# ASR# AssemblyAI
Speechmatics Speech-to-Text High-accuracy, low-latency enterprise ASR with multilingual/code-switching and real-time or batch modes. 0400 Speech-to-Text# accuracy# batch# enterprise
Google Cloud Speech-to-Text Cloud API for real-time and batch transcription with wide language coverage and enterprise integrations. 0440 Speech-to-Text# API# ASR# batch
Rev AI Speech-to-text APIs for streaming and async transcription, plus language ID and topic/sentiment insights. 0530 Speech-to-Text# API# ASR# batch
Amazon Transcribe Managed ASR service on AWS with streaming and async transcription, custom vocabulary, and domain tunes. 0330 Speech-to-Text# API# ASR# AWS
Gladia Speech-to-Text Multilingual ASR API (async and live) with add-on audio intelligence; developer-focused implementation. 0380 Speech-to-Text# API# ASR# async
Microsoft Azure Speech to Text Enterprise speech-to-text as part of Azure AI Speech, supporting streaming, batch, and customization. 0420 Speech-to-Text# ASR# Azure# batch
Otter.ai AI meeting assistant providing live transcripts, summaries, and action items for Zoom/Teams/Meet. 0450 Speech-to-Text# Google Meet# meeting notes# Otter
IBM Watson Speech to Text Speech recognition for applications and contact centers with security and deployment flexibility. 0380 Speech-to-Text# API# ASR# contact center
Descript Transcription Fast AI transcription integrated into a powerful audio/video editor for creators and teams. 0400 Speech-to-Text# captions# creator tools# Descript
OpenAI Whisper Open-source multilingual ASR model known for robustness on diverse audio and accents. 0440 Speech-to-Text# multilingual# offline# open source
Sonix Automated transcription with in-browser editing, translation, and export for 50+ languages. 0310 Speech-to-TextTranslation# automatic transcription# captions# editor