vLLM
Portfolio concentration
53%
Top three share
Shows whether the organization is driven by one breakout repo or several visible projects.
Breadth
30 repos
Visible snapshot
24 repositories updated in the last 90 days.
Leading language
Python
Portfolio mix
Python (17), Unknown (5), Go (2)
Average size
830
Stars per repository
Useful for distinguishing one flagship-heavy publisher from a repeatable portfolio.
53%
of the visible star count comes from this organization's top three repositories.
830
stars per repository in this same snapshot.
Python
is the most common language here, with 24 repositories updated in the last 90 days.
Why this rank
This organization stands out because its public portfolio is relatively balanced across 30 repositories.
Organization pages work best when you separate portfolio breadth from flagship concentration. In vLLM's case, the visible top three repositories account for about 53% of total stars in this snapshot, which helps explain whether the organization is known for one breakout project or for a broader repeatable portfolio.
The dominant language mix here is Python (17), Unknown (5), Go (2). That makes this page useful not just for popularity checks, but also for seeing what technical shape an organization's public ecosystem actually has.
Top Repositories
| # | Repository | Language | ⭐ Stars | 🍴 Forks | Updated |
|---|---|---|---|---|---|
| 1 | vllm-project/aibrix Cost-efficient and pluggable Infrastructure components for GenAI inference | Go | 4.8K | 567 | Today |
| 2 | vllm-project/vllm-omni A framework for efficient model inference with omni-modality models | Python | 4.5K | 854 | Today |
| 3 | vllm-project/semantic-router System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge | Go | 3.9K | 647 | Today |
| 4 | vllm-project/llm-compressor Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM | Python | 3.2K | 493 | Today |
| 5 | vllm-project/production-stack vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization | Python | 2.3K | 395 | Yesterday |
| 6 | vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on Ascend | Python | 2K | 1.1K | Today |
| 7 | vllm-project/guidellm Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs | Python | 1.1K | 145 | Today |
| 8 | vllm-project/vllm-metal Community maintained hardware plugin for vLLM on Apple Silicon | Python | 1K | 111 | Today |
| 9 | vllm-project/recipes Common recipes to run vLLM | JavaScript | 762 | 246 | Today |
| 10 | vllm-project/speculators A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM | Python | 381 | 77 | Today |
| 11 | vllm-project/tpu-inference TPU inference for vLLM, with unified JAX and PyTorch support. | Python | 306 | 171 | Today |
| 12 | vllm-project/router A high-performance and light-weight router for vLLM large scale deployment | Rust | 210 | 73 | Today |
| 13 | vllm-project/vllm-skills Agent skills for vLLM | Shell | 67 | 19 | 3 weeks ago |
| 14 | vllm-project/vllm-daily vLLM Daily Summarization of Merged PRs | 50 | 4 | Yesterday | |
| 15 | vllm-project/vllm-openvino | Python | 48 | 12 | 4 months ago |
| 16 | vllm-project/vllm-xpu-kernels The vLLM XPU kernels for Intel GPU | C++ | 38 | 52 | Today |
| 17 | vllm-project/vllm-gaudi Community maintained hardware plugin for vLLM on Intel Gaudi | Python | 38 | 127 | Today |
| 18 | vllm-project/vllm-neuron Community maintained hardware plugin for vLLM on AWS Neuron | Python | 29 | 11 | 1 months ago |
| 19 | vllm-project/agentic-api Stateful API logic for agentic applications using vLLM | Python | 24 | 9 | 1 weeks ago |
| 20 | vllm-project/vllm-nccl Manages vllm-nccl dependency | Python | 18 | 3 | 1 years ago |
| 21 | vllm-project/dllm-plugin vLLM plugin for block-based diffusion language model (dLLM) support | Python | 13 | 5 | Today |
| 22 | vllm-project/vLLM-in-PyTorch-Conference-2025 | 12 | 1 | 4 months ago | |
| 23 | vllm-project/FlashMLA | C++ | 12 | 17 | 1 weeks ago |
| 24 | vllm-project/bart-plugin vLLM Model plugin for the encoder-decoder BART model | Python | 11 | 7 | 2 weeks ago |
| 25 | vllm-project/media-kit vLLM Logo Assets | 8 | 4 | 3 months ago | |
| 26 | vllm-project/perf-dashboard Performance dashboard for vLLM | Python | 1 | 2 | 1 months ago |
| 27 | vllm-project/rfcs | 1 | 0 | 11 months ago | |
| 28 | vllm-project/vllm-dashboard | TypeScript | 0 | 0 | Yesterday |
| 29 | vllm-project/perf-eval | Python | 0 | 0 | Today |
| 30 | vllm-project/DeepGEMM DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling | 0 | 0 | 7 months ago |
Next step after the organization read
Learn and methodology
How to Read This Snapshot
Total stars are useful as a discovery signal, but they do not tell you whether a team maintains every repository equally. Pair this page with release cadence, maintainer activity, and the flagship concentration shown above before making adoption decisions.
For broader background on GitStar's ranking logic and editorial guidance, see Methodology & Editorial Standards.