Speech-to-Text

Total 20 articles sites

Accurate speech transcription and diarization

Writing & Documents Images & Design Video & Avatars Audio & Voice Productivity & Office Coding & Dev Search & Research Agents & Automation Marketing & Growth Customer Support Open Source & Models Prompts & Templates Entertainment

Sorting

release update Views Like

Vosk

Lightweight open-source offline ASR toolkit supporting many languages and on-device use.

01020

Speech-to-Text # ASR # multilingual # offline

Deepgram Speech-to-Text

Real-time and batch API with low latency, enterprise scaling, and model choices for accuracy or speed.

0420

Speech-to-Text # API # batch # Deepgram

Picovoice Leopard

Private, on-device speech-to-text SDK delivering cloud-level accuracy without sending data out.

0350

Speech-to-Text # Leopard # offline # on-device

AssemblyAI Speech-to-Text

Developer-friendly ASR API offering streaming, async, and audio intelligence (diarization, topics, sentiment).

0360

Speech-to-Text # API # ASR # AssemblyAI

Speechmatics Speech-to-Text

High-accuracy, low-latency enterprise ASR with multilingual/code-switching and real-time or batch modes.

0400

Speech-to-Text # accuracy # batch # enterprise

Google Cloud Speech-to-Text

Cloud API for real-time and batch transcription with wide language coverage and enterprise integrations.

0440

Speech-to-Text # API # ASR # batch

Rev AI

Speech-to-text APIs for streaming and async transcription, plus language ID and topic/sentiment insights.

0530

Speech-to-Text # API # ASR # batch

Amazon Transcribe

Managed ASR service on AWS with streaming and async transcription, custom vocabulary, and domain tunes.

0330

Speech-to-Text # API # ASR # AWS

Gladia Speech-to-Text

Multilingual ASR API (async and live) with add-on audio intelligence; developer-focused implementation.

0380

Speech-to-Text # API # ASR # async

Microsoft Azure Speech to Text

Enterprise speech-to-text as part of Azure AI Speech, supporting streaming, batch, and customization.

0420

Speech-to-Text # ASR # Azure # batch

Otter.ai

AI meeting assistant providing live transcripts, summaries, and action items for Zoom/Teams/Meet.

0450

Speech-to-Text # Google Meet # meeting notes # Otter

IBM Watson Speech to Text

Speech recognition for applications and contact centers with security and deployment flexibility.

0380

Speech-to-Text # API # ASR # contact center

Descript Transcription

Fast AI transcription integrated into a powerful audio/video editor for creators and teams.

0400

Speech-to-Text # captions # creator tools # Descript

OpenAI Whisper

Open-source multilingual ASR model known for robustness on diverse audio and accents.

0440

Speech-to-Text # multilingual # offline # open source

Sonix

Automated transcription with in-browser editing, translation, and export for 50+ languages.

0310

Speech-to-Text Translation # automatic transcription # captions # editor