LinkWord
Home
Directory
Articles
AI models
Tools
Pixel Plaza
Settings
ContactRSSFriend linksSubmit site
Privacy Policy·Disclaimer
陕ICP备2025083618号-2

Hot channels

AI ToolsDeveloper ToolsProductivity ToolsSecurity ToolsDesign Resources
DirectoryArticlesTools
← Back to directory
tiny-vllm
Site icon for “tiny-vllm”

tiny-vllm

AI Tools

Educational LLM inference engine built from scratch in C++ and CUDA
https://github.com/jmaczan/tiny-vllm
https://github.com/jmaczan/tiny-vllm

tiny-vllm is an educational sibling of vLLM, implementing a full LLM forward pass (Llama 3.2 1B) from scratch in C++ and CUDA with KV cache, continuous batching, GQA, RoPE, and custom CUDA kernels.