NVIDIA NIM Prebuilt, optimized inference microservices for leading models on any NVIDIA-accelerated stack. 0320 Inference/Hosting & APIs# GPU# inference# microservices
NVIDIA NeMo Guardrails Programmable guardrails (topic control, PII, jailbreak prevention) for LLM apps. 0380 Guardrails & Moderation# agents# Colang# guardrails
NVIDIA NGC Models Optimized model catalog for NVIDIA GPUs—LLMs, vision, speech—with containers and inference recipes. 0320 Model Hubs# containers# GPU# inference
NVIDIA NeMo Modular, enterprise suite to build, monitor, and optimize AI agents; deploy fast with NIM microservices. 0320 Workflow Builders# agent lifecycle# deployment# microservices
NVIDIA Riva GPU-accelerated ASR/TTS SDK for low-latency, on-prem or cloud voice AI deployments. 0360 Speech-to-Text# ASR# GPU# low latency