ModelScope Alibaba’s open model community hosting multilingual models for vision, speech, and NLP. 0340 Model Hubs# Alibaba# model community# ModelScope
PolyAI Enterprise-grade, human-sounding voice AI agents that resolve customer calls across 45+ languages. 0520 Voicebots & Call Center# multilingual# phone automation# PolyAI
Zowie — AI for Customer Service AI agent that automates conversations, integrates knowledge, and keeps replies on-brand across channels. 0520 Helpdesk Chatbots# AI agent# automation# multilingual
CodeGeeX Multilingual coding assistant with IDE plugins and LLM-based code generation and completion. 0470 Code Assistants# completion# generation# IDE plugin
Sembly AI Joins meetings to create precise notes, tasks, and multilingual transcripts with powerful sharing/export. 0400 Meeting Assistants# AI notes# meeting minutes# multilingual
HeyGen — Voice Clone for Video Fast voice cloning for video narration and multilingual dubbing; pairs with avatars and lip-sync. 0390 Voice Cloning# avatars# dubbing# multilingual
ElevenLabs — AI Voice Cloning Consumer-to-enterprise voice cloning with instant and professional modes, multilingual support, and a large public voice library. 0330 Voice Cloning# API# instant# multilingual
Vosk Lightweight open-source offline ASR toolkit supporting many languages and on-device use. 01020 Speech-to-Text# ASR# multilingual# offline
Gladia Speech-to-Text Multilingual ASR API (async and live) with add-on audio intelligence; developer-focused implementation. 0380 Speech-to-Text# API# ASR# async
Speechmatics Speech-to-Text High-accuracy, low-latency enterprise ASR with multilingual/code-switching and real-time or batch modes. 0400 Speech-to-Text# accuracy# batch# enterprise
Deepgram Speech-to-Text Real-time and batch API with low latency, enterprise scaling, and model choices for accuracy or speed. 0420 Speech-to-Text# API# batch# Deepgram
AssemblyAI Speech-to-Text Developer-friendly ASR API offering streaming, async, and audio intelligence (diarization, topics, sentiment). 0350 Speech-to-Text# API# ASR# AssemblyAI
Meta SeamlessM4T Research-grade foundation for ASR and speech translation across ~100 languages. 0350 Speech-to-Text# ASR# Meta# multilingual
OpenAI Whisper Open-source multilingual ASR model known for robustness on diverse audio and accents. 0430 Speech-to-Text# multilingual# offline# open source
Google Cloud Speech-to-Text Cloud API for real-time and batch transcription with wide language coverage and enterprise integrations. 0430 Speech-to-Text# API# ASR# batch