Skip to main content
Partner Services are available from CLI version 1.39.0 and greater
Cerebrium offers specialized services in partnership with leading AI companies, featuring simplified configuration, independent scaling, and quick deployment. Available Partner Services:
  • Deepgram - Speech-to-text (STT) services
  • Rime - Text-to-speech (TTS) services

Benefits of Partner Services

Partner Services provide:
  • Quick and easy deployment
  • Independent scaling of each service
  • Reduced costs by running models on Cerebrium’s optimized runtime
  • Reduced latency by running models on the same network as the app
  • Deploy to specific regions for data compliance and latency requirements

Getting Started

Configure service-specific requirements through the Cerebrium platform. Refer to individual service pages linked above for detailed requirements, which may include:
  1. API keys and authentication details
  2. Service-specific configuration parameters
  3. Resource requirements and limitations

Scaling and Concurrency

Partner Services support independent scaling configurations:
  • Use the min_replicas and max_replicas parameters to control the number of instances
  • The replica_concurrency parameter determines how many concurrent requests each instance can handle
  • Adjust the cooldown parameter to control the time window that must pass at reduced concurrency before scaling down
  • Adjust the hardware section to control the instance type which affects performance and/or cost
For more information on specific Partner Services, see: