open source

GPT-Neo

Provided by: EleutherAI
Framework: PyTorch

GPT-Neo is an open-source large language model developed by EleutherAI as an alternative to OpenAI’s GPT-3. It uses a Transformer architecture to generate coherent, human-like text from a given prompt. GPT-Neo is trained on the Pile, a large and diverse text corpus, so it can be applied to NLP tasks such as text generation, summarization, translation, and question answering. GPT-Neo models come in several sizes, the most popular being the 1.3B and 2.7B parameter versions.
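As a rough illustration of prompt-based generation, the sketch below wraps the Hugging Face `transformers` text-generation pipeline. It assumes that `transformers` and PyTorch are installed and that the 1.3B checkpoint is published on the Hub as `EleutherAI/gpt-neo-1.3B`. The function name `generate_text` and the sampling settings are illustrative choices, not part of this listing.

```python
# Assumption: the Hugging Face `transformers` package and PyTorch are installed.
MODEL_ID = "EleutherAI/gpt-neo-1.3B"  # assumed Hub checkpoint name for the 1.3B model


def generate_text(prompt, max_new_tokens=50, model_id=MODEL_ID):
    """Return one sampled continuation of `prompt` (hypothetical helper)."""
    # Imported lazily so that defining this function does not load the library.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    outputs = generator(
        prompt,
        max_new_tokens=max_new_tokens,
        do_sample=True,      # sample instead of greedy decoding
        temperature=0.9,     # illustrative value, tune as needed
    )
    # The pipeline returns a list of dicts; each holds the prompt plus its continuation.
    return outputs[0]["generated_text"]


# Example call (downloads several gigabytes of weights on first use):
# print(generate_text("EleutherAI has"))
```

Passing a different `model_id` would let the same helper load another GPT-Neo size, at the cost of a separate download.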

Model Performance Statistics

Views: 42
Released: March 21, 2021
Last Checked: Jul 20, 2025
Version: 1.0

Capabilities
  • Text Generation
  • Text Completion
Performance Benchmarks
params: 2.7B
perplexity: 20.5 on Wikitext-103
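For readers unfamiliar with the metric, perplexity is commonly defined as the exponential of the average negative log-likelihood per token. The sketch below shows that general definition only; it assumes natural-log probabilities and is not a reproduction of how the figure above was measured. The function name `perplexity` and the sample data are hypothetical.

```python
import math


def perplexity(token_log_probs):
    """Exponential of the mean negative log-likelihood (natural log) per token."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)


# Illustrative data: a model that assigns every token probability 1/20.5
# has a perplexity of 20.5, i.e. it is as uncertain as a uniform choice
# among 20.5 options at each step.
uniform_log_probs = [math.log(1 / 20.5)] * 1000
print(round(perplexity(uniform_log_probs), 3))
```

Lower values indicate that the model assigns higher probability to the evaluation text.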
Technical Specifications
Parameter Count: N/A
Training & Dataset

Dataset Used: The Pile

Related AI Models

Model | open source

Fairseq

Meta AI

Fairseq is Meta AI’s open-source PyTorch-based toolkit for training sequence-to-sequence models, widely used in machine translation, text summarization, and other NLP applications.

Natural Language Processing | nlp | translation
38
Model | open source

Llama 2

Meta AI

Llama 2 is Meta AI’s open-source large language model optimized for a wide range of natural language processing tasks, including chatbots, text generation, and comprehension.

Natural Language Processing | nlp
34
Model | open source

OpenBioLLM-7B

Saama AI Labs

OpenBioLLM-7B is a specialized open-source large language model designed for biomedical and life sciences applications. Built with PyTorch and released under the Apache 2.0 license, it provides advanced natural language understanding capabilities tailored to bioinformatics, medical research, and clinical data analysis, enabling improved insights and automation in biomedical workflows.

Natural Language Processing | biomedical | nlp
13