Custom Python Web Servers
Run ASGI/WSGI Python apps on Cerebrium
While Cerebrium’s default runtime works well for most app needs, teams sometimes need more control over their web server implementation. Using ASGI or WSGI servers through Cerebrium’s custom runtime feature enables capabilities like custom authentication, dynamic batching, frontend dashboards, public endpoints, and websockets.
Setting Up Custom Servers
Here’s a simple FastAPI server implementation that shows how custom servers work in Cerebrium:
Configure this server in cerebrium.toml
by adding a custom runtime section:
The configuration requires three key parameters:
entrypoint
: The command that starts your serverport
: The port your server listens onhealthcheck_endpoint
: The endpoint that confirms server health. If empty, will default to a TCP ping of your specified port
For ASGI applications like FastAPI, include the appropriate server package
(like uvicorn
) in your dependencies. After deployment, your endpoints become
available at https://api.cortex.cerebrium.ai/v4/{project - id}/{app - name} /your/endpoint
.
Our FastAPI Server Example provides a complete implementation.