open source
MLC-LLM
Provided by:
CMU, SAMOA, OctoML
• Framework: Apache TVMMLC-LLM is a universal and open-source framework developed by the Apache TVM community, CMU, and SAMOAI. It allows deployment of large language models on a wide range of edge devices such as iPhones, Android, WebAssembly, and GPUs, enabling efficient and fast inference anywhere.
MLC-LLM AI Model

Model Performance Statistics
13
Views
July 18, 2023
Released
Jul 20, 2025
Last Checked
0.9
Version
Capabilities
- Cross-platform Deployment
- Hardware Acceleration
Performance Benchmarks
iOS Latency12 tokens/sec
WebGPU SupportYes
Technical Specifications
- Parameter Count
- N/A
Training & Dataset
Dataset Used
N/A
Related AI Models
Discover similar AI models that might interest you
Modelopen source
TensorRT-LLM

TensorRT-LLM
NVIDIA
TensorRT-LLM is an open-source library by NVIDIA that delivers highly optimized inference for large language models. It leverages TensorRT and CUDA to accelerate transformer-based models, enabling efficient deployment across GPUs with minimal latency. Built for developers aiming to scale LLMs efficiently.
Scientific AIaillm
13