404

Page Not Found

We couldn't find the page you were looking for. Were you looking for one of these pages?

Deepgram
OpenAI compatible vLLM endpoint
Outbound Agent with LiveKit