Inference Details

Generated on 3 Jul 2026

Inference provides a single control plane for managing inference workflows. Its Model Catalog lists the available foundation models, both DigitalOcean-hosted and third-party commercial, and lets you compare their capabilities and pricing. You can also use routing to match inference requests to the best-fit model, and run inference using serverless or dedicated deployments.

Inference Features

An overview of the features Inference provides for managing inference workflows.

Inference Pricing

Inference itself has no cost. Pricing depends on the features you use within Inference, such as serverless inference or dedicated deployments, and the models and resources used for those requests.

Inference Availability

Regional datacenter availability for Inference.

Inference Limits

Limits and known issues for Inference.

More Details

Available Models for Inference

A list of the foundation, embeddings, and reranking models available for Inference.

Model Support Policy

Release process and deprecation policy for supported foundation and embeddings models.

DigitalOcean AI Data Privacy

Information about the data we collect when you use Inference.

DigitalOcean AI Website Crawler

Information about the web crawler DigitalOcean provides to crawl websites as data sources for knowledge bases.
