Skip to main content

Model Deployment Cluster

For creating a model deployment cluster with the Ray-enabled template, refer to Ray-Enabled Cluster.

Common Use Cases

  • Model Serving - Serve ML models for inference
  • Production Deployment - Deploy models at scale
  • API Endpoints - Create REST APIs for models
  • Batch Inference - Process large batches of data

Next Steps