autoscaling

Total 2 articles sites

Sorting

release update Views Like

Baseten

Production inference platform—dedicated deployments, autoscaling, and GPU options.

0330

Inference/Hosting & APIs # autoscaling # Baseten # dedicated

Anyscale Endpoints (Ray Serve)

OpenAI-compatible serving on Ray with autoscaling and many-model deployments.

0380

Inference/Hosting & APIs # Anyscale # autoscaling # endpoints