Explore AI Models

Discover, compare, and integrate cutting-edge AI models for your projects

Text GenerationImage GenerationCode ModelsChat ModelsFreeLatest

84 AI models found

Speech & Audio
Meta AI

MusicGen

Open SourcePyTorch

MusicGen is a cutting-edge, single-stage autoregressive transformer AI from Meta AI via the AudioCraft library, designed for high-quality music generation.

Views
1.0K
Favorites
0
Released
2023
Official URL
https://github.com/facebookresearch/audiocraft
audiotext-to-music
Computer Vision
Meta AI (Facebook AI Research)

Detectron2

Open SourcePyTorch

Detectron2 is a powerful open-source computer vision library developed by Meta AI (Facebook AI Research) that excels in object detection, instance segmentation, and keypoint detection tasks.

Views
638
Favorites
0
Released
2019
Official URL
https://github.com/facebookresearch/detectron2
object detection AIsegmentation model
Speech & Audio
myshell.ai

OpenVoice

Open SourcePyTorch

OpenVoice V2 is a cutting-edge open-source voice cloning and speech synthesis model focused on delivering high-fidelity voice outputs with emotional and stylistic flexibility.

Views
590
Favorites
0
Released
2023
Official URL
https://research.myshell.ai/open-voice
voice cloning
Generative Models
Stability AI

Stable Video Diffusion

Open SourcePyTorch

Stable Video Diffusion enables the generation of short video clips from static images through advanced diffusion techniques.

Views
484
Favorites
0
Released
2023
Official URL
https://stability.ai/stable-video
video
Speech & Audio
Meta AI

SeamlessM4T v2

Open SourcePyTorch

SeamlessM4T v2 is Meta AI’s advanced multilingual speech and text translation model, designed for real-time translation across over 100 languages.

Views
414
Favorites
0
Released
2025
Official URL
https://ai.meta.com/research/seamless-communication/
translationspeechai-models
Code Generation
BigCode

StarCoder2

Open SourcePyTorch

StarCoder2 is a large-scale open-source AI model developed by BigCode for code generation and comprehension tasks.

Views
348
Favorites
0
Released
2024
Official URL
https://huggingface.co/bigcode/starcoder2-15b
code-generationdeveloper
Natural Language Processing
EleutherAI

GPT-Neo

Open SourcePyTorch

GPT-Neo is an open-source large language model developed by EleutherAI, designed as an alternative to OpenAI’s GPT-3.

Views
320
Favorites
0
Released
2021
Official URL
https://www.eleuther.ai/
nlp
Natural Language Processing
Mistral AI

Mistral 8x22B

Open SourcePyTorch

Mixtral 8x22B is a cutting‑edge open‑source Mixture‑of‑Experts LLM by Mistral AI, featuring 141B total parameters and 39B active parameters, optimized for multilingual reasoning, math, and coding tasks.

Views
274
Favorites
0
Released
2024
Official URL
https://mistral.ai/news/mixtral-of-experts/
nlp
Multimodal
Baaivision

Emu2-Chat

Open SourcePyTorch

Emu2-Chat is a conversational AI model designed for engaging and context-aware chat interactions, optimized for natural language understanding and generating human-like responses across various domains.

Views
256
Favorites
0
Released
2023
Official URL
https://baaivision.github.io/emu2/
conversational
Natural Language Processing
Google

T5

Open SourceTensorFlow

T5 (Text-to-Text Transfer Transformer) is Google’s powerful open-source model that converts all NLP problems into a text-to-text format, enabling flexible language understanding and generation.

Views
252
Favorites
0
Released
2019
Official URL
https://github.com/google-research/text-to-text-transfer-transformer
nlp
Machine Learning
Kavout Inc.

Kavout (Kai)

FreemiumTensorFlow

Kavout (Kai) is an AI-powered stock analysis platform harnessing the power of deep learning to provide actionable trading insights and predictive analytics.

Views
245
Favorites
0
Official URL
https://www.kavout.com/
aistock ai
Speech & Audio
Microsoft

FastSpeech 2

Open SourcePyTorch

FastSpeech 2 is an improved neural text-to-speech model from Microsoft that generates natural-sounding speech quickly and efficiently.

Views
238
Favorites
0
Released
2020
Official URL
https://arxiv.org/abs/2006.04558
audiotext-to-speech
Generative Models
Craiyon LLC

DALL·E Mini

Open SourceTensorFlow

DALL·E Mini is an open-source text-to-image Generative Adversarial Network that creatively synthesizes high-quality images from textual prompts.

Views
237
Favorites
0
Released
2022
Official URL
https://www.craiyon.com/
image-generation
Multimodal
DeepSeek AI

DeepSeek-VL

Open SourcePyTorch

DeepSeek-VL is a cutting-edge open-source multimodal AI model that integrates vision and language processing to enable tasks like image captioning, semantic search, and cross-modal retrieval.

Views
228
Favorites
0
Released
2024
Official URL
https://github.com/deepseek-ai/DeepSeek-VL
Multimodal AI
Speech & Audio
Stability AI

Stable Audio 2.0

Open SourcePyTorch

Stable Audio 2.0 is an advanced open-source AI model developed by Stability AI for generating music and audio from textual descriptions.

Views
223
Favorites
0
Released
2024
Official URL
https://stability.ai/stable-audio
audiomusic
Speech & Audio
NVIDIA

VITS

Open SourcePyTorch

VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) is an advanced speech synthesis model developed by NVIDIA. It combines variational autoencoders and GANs to generate high-quality, natural-sounding speech directly from text.

Views
222
Favorites
0
Released
2021
Official URL
https://arxiv.org/abs/2106.06103
audiotext-to-speech
Natural Language Processing
BigScience

Bloom

Open SourcePyTorch

Bloom is an open-source multilingual transformer model developed by BigScience, designed for a variety of natural language processing tasks across multiple languages.

Views
211
Favorites
0
Released
2022
Official URL
https://bigscience.huggingface.co/
llm
Speech & Audio
Meta AI

wav2vec 2.0

Open SourcePyTorch

wav2vec 2.0 is a self-supervised speech representation learning model developed by Meta AI, revolutionizing automatic speech recognition (ASR) by significantly decreasing the need for labeled data.

Views
197
Favorites
0
Released
2020
Official URL
https://github.com/facebookresearch/fairseq/tree/main/examples/wav2vec
speech-recognition
84
Total Models
60
Providers
13
Categories
10
Frameworks