RunPod Serverless Endpoints
Always-on, pre-warmed GPU endpoints for low-latency model inference at scale.
Category: Inference/Hosting & APIs. Tags: endpoints, GPU, inference
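RunPod serverless endpoints are invoked over plain HTTP, so a call can be sketched with only the standard library. This is a minimal sketch, assuming the usual `{"input": ...}` job envelope and the synchronous `/runsync` route; the endpoint ID and payload below are placeholders, not real values.

```python
import json
import os
import urllib.request

# Sketch of a synchronous RunPod serverless call. The endpoint ID and
# payload are placeholders; requests carry a JSON {"input": ...} envelope,
# and /runsync blocks until a (pre-warmed) worker responds, while /run
# would queue the job asynchronously instead.

def build_runsync_request(endpoint_id: str, payload: dict) -> tuple[str, dict]:
    """Return the /runsync URL and request body for a serverless endpoint."""
    url = f"https://api.runpod.ai/v2/{endpoint_id}/runsync"
    return url, {"input": payload}

def invoke(endpoint_id: str, payload: dict, api_key: str) -> dict:
    """POST the job and return the worker's JSON response."""
    url, body = build_runsync_request(endpoint_id, payload)
    req = urllib.request.Request(
        url,
        data=json.dumps(body).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

if __name__ == "__main__":
    url, body = build_runsync_request("YOUR_ENDPOINT_ID", {"prompt": "Hello"})
    key = os.environ.get("RUNPOD_API_KEY")
    if key:  # only hit the network when a key is configured
        print(invoke("YOUR_ENDPOINT_ID", {"prompt": "Hello"}, key))
    else:
        print(url, json.dumps(body))
```

The blocking `/runsync` path is what the pre-warmed, low-latency setup is for: the worker is already resident on the GPU, so the request does not pay a cold-start penalty.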
GroqCloud
Ultra-low latency LPU-powered inference for text, speech, and vision models.
Category: Inference/Hosting & APIs. Tags: API, Groq, inference
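GroqCloud exposes an OpenAI-compatible REST API, so a text chat completion can be sketched with the standard library alone. A minimal sketch, assuming the OpenAI-style chat-completions request shape; the model ID is an assumption, so substitute one available on your account.

```python
import json
import os
import urllib.request

# Groq's endpoint mirrors the OpenAI chat-completions format, so the
# payload below is the standard {"model", "messages", ...} shape.
GROQ_URL = "https://api.groq.com/openai/v1/chat/completions"

def build_chat_request(prompt: str, model: str = "llama-3.1-8b-instant") -> dict:
    """Assemble an OpenAI-style chat-completions payload for GroqCloud."""
    return {
        "model": model,  # assumed model ID; pick one your account offers
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }

def send(payload: dict, api_key: str) -> dict:
    """POST the payload; requires a valid Groq API key."""
    req = urllib.request.Request(
        GROQ_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

if __name__ == "__main__":
    payload = build_chat_request("Say hello in one word.")
    key = os.environ.get("GROQ_API_KEY")
    if key:  # only hit the network when a key is configured
        print(send(payload, key)["choices"][0]["message"]["content"])
    else:
        print(json.dumps(payload, indent=2))
```

Because the format is OpenAI-compatible, existing OpenAI client libraries can usually be pointed at Groq by overriding the base URL rather than hand-rolling requests as above.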
Speechmatics Speech-to-Text
High-accuracy, low-latency enterprise ASR with multilingual/code-switching support and real-time or batch modes.
Category: Speech-to-Text. Tags: accuracy, batch, enterprise
NVIDIA Riva
GPU-accelerated ASR/TTS SDK for low-latency, on-prem or cloud voice AI deployments.
Category: Speech-to-Text. Tags: ASR, GPU, low latency
Play.ht — AI Voice Generator
Low-latency TTS and voice cloning with 200+ realistic voices and a developer-friendly API.
Category: Text-to-Speech. Tags: API, low latency, Play.ht