Documentation – Replicate

Get started

Replicate makes it easy to run machine learning models in the cloud from your own code.

You can run open-source models, or deploy your own models.

Want to see examples of what you can build with Replicate? Check out our showcase.

Run models

Run a model from Node.js

Run a model from Node.js - Get started with a few lines of JavaScript

Run a model from Python

Run a model from Python - The lingua franca of the machine learning world

Run a model from Google Colab

Run a model from Google Colab - Cloud-hosted Jupyter notebooks

Build a website with Next.js

Build a website with Next.js - React + Node.js for rapid development

Build an app with SwiftUI

Build an app with SwiftUI - Develop for macOS, iOS, and (soon) visionOS

Build an app with Elixir

Build a Discord bot - Chat-based image generation

Build an app with Elixir

Build an app with Elixir - A high-performance successor to Ruby

Push models

Push a model to Replicate

Push a model to Replicate - Use Cog to build and push your own models

Deploy a custom model

Deploy a custom model - Collaborate privately with your team

Fine-tune a language model

Fine-tune an image model - Train a new model on faces or styles

Fine-tune a language model

Fine-tune a language model - Train a new model using your private data

Get a GPU machine

Get a GPU machine - Access powerful cloud compute on Lambda Labs

Push a Diffusers model

Push a Diffusers model - Stable Diffusion and countless others

Push a Transformers model

Push a Transformers model - Attention is all you need

Push a model using GitHub Actions

Push a model using GitHub Actions - Continuous automated model deployment

Learn more

How does Replicate work?

How does Replicate work? - A guide to core concepts

Showcase

Showcase - Imagine what you can build

Using Webhooks

Using Webhooks - Get realtime updates about your predictions

Using Webhooks

Client libraries - JavaScript, Python, Ruby, Swift, Elixir

Using Webhooks

HTTP API reference