Explore
iordcalin/material-transfer
Transfer a material from an image to a subject
cjwbw/openvoice
Updated to OpenVoice v2: Versatile Instant Voice Cloning
snowflake/snowflake-arctic-instruct
An efficient, intelligent, and truly open-source language model
meta/meta-llama-3-70b-instruct
A 70 billion parameter language model from Meta, fine tuned for chat completions
bytedance/sdxl-lightning-4step
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
vaibhavs10/incredibly-fast-whisper
whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗
I want to…
Generate images
Models that generate images from text prompts
Edit images
Tools for manipulating images.
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Caption images
Models that generate text from images
Get embeddings
Models that generate embeddings from inputs
Upscale images
Upscaling models that create high-quality images from low-quality images
Use a language model
Models that can understand and generate text
Extract text from images
Optical character recognition (OCR) and text extraction
Train a language model
Language models that you can fine-tune using Replicate's training API.
Use a face to make images
Make realistic images of people instantly
Chat with images
Ask language models about images
Transcribe speech
Models that convert speech to text
Use handy tools
Toolbelt-type models for videos and images.
Generate music
Models to generate and modify music
Generate videos
Models that create and edit videos
Generate speech
Convert text to speech
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Get structured data
Language models that support grammar-based decoding as well as jsonschema constraints.
Latest models
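yuan2.0-2b-mars is the March 2024 version of the Yuan 2.0-2B model. Yuan 2.0 is a new-generation foundation large language model released by Inspur Information. We have open-sourced all three models: Yuan 2.0-102B, Yuan 2.0-51B, and Yuan 2.0-2B. We also provide scripts for pretraining, fine-tuning, and inference services so that developers can build on them. Building on Yuan 1.0, Yuan 2.0 uses more diverse, high-quality pretraining data and instruction fine-tuning datasets to give the model stronger understanding in semantics, mathematics, reasoning, code, knowledge, and other areas.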
Idefics2 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Qwen1.5 32B Chat variant. A transformer-based decoder-only language model. Good with Chinese and English.
Real-ESRGAN with optional face correction and adjustable upscale
text2img model trained on LAION HighRes and fine-tuned on internal datasets
snowflake-arctic-embed is a suite of text embedding models that focuses on creating high-quality retrieval models optimized for performance
Input your name, and this model will print the most handsome man
Base version of Llama 3, a 70 billion parameter language model from Meta.
A 70 billion parameter language model from Meta, fine tuned for chat completions
Base version of Llama 3, an 8 billion parameter language model from Meta.
OpenBMB MiniCPM-V 2.8B is a strong multimodal large language model for efficient end-side deployment
A powerful and competitive model like Midjourney v6 and DALL-E 3, but open and decentralized
HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data
Accelerated transcription, word-level timestamps and diarization with whisperX large-v3
Midjourney v6 text-to-image quality model but Open and Decentralized
GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.
Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
A large, stereo MusicGen that acts as a useful tool for music producers
Nous Hermes 2 Mixtral 8x7B DPO is a Nous Research model trained over the Mixtral 8x7B MoE LLM
Uses a subset of https://github.com/barun-saha/slide-deck-ai to create PowerPoint slides from a JSON description, using python-pptx (https://github.com/scanny/python-pptx)