Replicate Pricing, Plans and Cost Breakdown for 2024

Run and fine-tune machine learning models in the cloud.

Replicate logo

Run open-source and private models

Fine-tune language models

Supports multiple programming languages

Replicate Pricing

Replicate is a tool that's all about making it easy for developers to run their machine learning models in the cloud at scale. This is a pay-as-you-go AI tool. You can start using it for free, but after a while, you'll be asked to enter your credit card details. Here's what you need to know about the pricing details:

  • The cost is calculated per second for the predictions you run. The price per second varies based on the hardware the model is run on. For instance, running a public model on a CPU costs $0.000100 per second, while a private model costs $0.000200 per second.
  • If you're using Nvidia GPUs, the cost increases. For example, running a public model on an Nvidia T4 GPU costs $0.000225 per second, while a private model costs $0.000550 per second. The most expensive option is the 8x Nvidia A40 (Large) GPU, which costs $0.005800 per second for both public and private models.
  • Soon, Replicate plans to lower the price of private models and start charging for setup and idle time.
  • If you cancel your prediction before it starts, there's no charge. If you cancel it after it's started, you'll be billed for the time used so far.
  • Billing is done once per month, and the minimum billable time for any prediction is 1 second.

In conclusion, Replicate offers straightforward and predictable pricing with its pay-as-you-go model. They let you get started for free, making it a cost-effective way to try running machine learning models in the cloud even before you make a financial commitment.

Starting price

0.0001

  • Free plan
  • Paid
  • Free trial