Rime
Deploy Rime text-to-speech services on Cerebrium
Rime Partner Service is available from CLI version 1.39.0 and greater
Cerebrium’s partnership with Rime helps teams deliver text-to-speech (TTS) services with efficient deployment, minimized latency, and region selection for data privacy compliance needs.
Setup
-
Create a Rime account and get an API key. In order to use Rime on Cerebrium, you will need to create a Rime account and get an API key. You must then create a secret in Cerebrium with the specific name “RIME_API_KEY”.
-
Create a simple cerebrium app with the CLI:
- Rime services use a simplified TOML configuration with the
[cerebrium.runtime.rime]
section. Create acerebrium.toml
file with the following:
You need to disable auth in the above since you need to use your Rime API key in the header. API authentication is handle by the Rime Server using your API key
- Run
cerebrium deploy
to deploy the Rime service - the output of which should appear as follows:
- Use the Deployment url from the output to send requests to the HTTP Rime service via curl request:
For Websockets, send the following
Scaling and Concurrency
Rime services support independent scaling configurations:
- min_replicas: Minimum instances to maintain (0 for scale-to-zero). Recommended: 1.
- max_replicas: Maximum instances during high load.
- replica_concurrency: Concurrent requests per instance. Recommended: 3.
- cooldown: Seconds an instance remains active after last request. Recommended: 50.
- compute: Instance type. Recommended:
AMPERE_A10
.
Adjust these parameters based on traffic patterns and latency requirements. Best would be to consult the Rime team about concurrency and scalability
For further documentation on Rime, see the Rime documentation.