open source

GPT-Neo

Provided by:

EleutherAI

• Framework: PyTorch

GPT-Neo is an open-source large language model developed by EleutherAI, designed as an alternative to OpenAI’s GPT-3. It uses the Transformer architecture to generate coherent, human-like text based on a given prompt. GPT-Neo is trained on the Pile dataset, which is a diverse and large-scale text corpus, making it capable of many NLP tasks such as text generation, summarization, translation, and question answering. GPT-Neo models come in different sizes, the most popular being the 1.3B and 2.7B parameter versions.

GPT-Neo AI Model

Views

March 21, 2021

Released

Jul 20, 2025

Last Checked

1.0

Version

Capabilities

Text Generation
Text Completion

Performance Benchmarks

params2.7B

perplexity20.5 on Wikitext-103

Technical Specifications

Parameter Count: N/A

Training & Dataset

Dataset Used

The Pile

Related AI Models

Discover similar AI models that might interest you

More AI Models

Modelopen source

Fairseq

Meta AI

Fairseq is Meta AI’s open-source PyTorch-based toolkit for training sequence-to-sequence models, widely used in machine translation, text summarization, and other NLP applications.

Natural Language Processingnlptranslation

Modelopen source

Llama 2

Meta AI

Llama 2 is Meta AI’s open-source large language model optimized for a wide range of natural language processing tasks, including chatbots, text generation, and comprehension.

Natural Language Processingnlp

Modelopen source

OpenBioLLM-7B

Saama AI Labs

OpenBioLLM-7B is a specialized open-source large language model designed for biomedical and life sciences applications. Built with PyTorch and released under the Apache 2.0 license, it provides advanced natural language understanding capabilities tailored to bioinformatics, medical research, and clinical data analysis, enabling improved insights and automation in biomedical workflows.

Natural Language Processingbiomedicalnlp

Model Performance Statistics

Dataset Used

Related AI Models

Fairseq

Fairseq

Llama 2

Llama 2

OpenBioLLM-7B

OpenBioLLM-7B