XLNet
Google Brain, CMU
• Framework: TensorFlow
XLNet is an Apache-2.0 open-source pretrained language model from Google Brain and CMU. It combines a permutation-based pretraining objective with a Transformer-XL backbone. Its authors report that it outperforms BERT on 20 NLP tasks, including question answering, natural language inference, and sentiment analysis. It is supported in the Hugging Face Transformers library.
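The permutation-based pretraining mentioned above can be illustrated with a small attention-mask sketch. This is a minimal illustration under stated assumptions, not the TensorFlow implementation: the function name `permutation_masks`, the `order` argument, and the example sentence are hypothetical. The idea is that the model samples a factorization order over positions, and each token is predicted only from the tokens that come earlier in that order. The sketch builds two masks, loosely following the content and query streams of XLNet's two-stream attention.

```python
import random


def permutation_masks(order):
    """Build attention masks for one factorization order (illustrative only).

    order[k] is the position predicted at step k. Position i may use
    position j as context only if j comes earlier than i in `order`.
    """
    n = len(order)
    rank = [0] * n
    for step, pos in enumerate(order):
        rank[pos] = step  # rank[pos] = when this position is predicted

    # Query mask: strictly earlier positions only, so a target never sees itself.
    query = [[rank[j] < rank[i] for j in range(n)] for i in range(n)]
    # Content mask: earlier positions plus the position's own token.
    content = [[rank[j] <= rank[i] for j in range(n)] for i in range(n)]
    return content, query


tokens = ["New", "York", "is", "a", "city"]
order = list(range(len(tokens)))
random.Random(0).shuffle(order)  # sample one factorization order

content_mask, query_mask = permutation_masks(order)
for i, tok in enumerate(tokens):
    visible = [tokens[j] for j in range(len(tokens)) if query_mask[i][j]]
    print(f"predict {tok!r} from {visible}")
```

Because the order is resampled during training, each token ends up being predicted from many different subsets of the other tokens, which is how the objective captures context on both sides while staying autoregressive.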

Model Performance Statistics
- Text Classification
- Question Answering
- Parameter Count: N/A
Datasets Used
BooksCorpus, Wikipedia, Giga5, ClueWeb, Common Crawl
Related AI Models
Discover similar AI models that might interest you
GPT-Neo
EleutherAI
GPT-Neo is an open-source large language model developed by EleutherAI as an alternative to OpenAI’s GPT-3. It uses the Transformer architecture to generate coherent, human-like text from a prompt. It is trained on the Pile, a large and diverse text corpus, and can be applied to NLP tasks such as text generation, summarization, translation, and question answering. GPT-Neo comes in several sizes, the most popular being the 1.3B and 2.7B parameter versions.