Supercharge Your LLM with the Fastest KV Cache Layer
A KV cache layer for LLM inference: share KV caches across multiple LLM engine instances to reduce time-to-first-token (TTFT) and GPU memory use.
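The sharing idea can be sketched as a prefix-keyed store that multiple engine instances read and write: when one engine has already computed the KV state for a prompt prefix, another engine can fetch it and skip the prefill pass that dominates TTFT. This is a minimal toy sketch of that pattern; the class and method names (`SharedKVStore`, `Engine`) are illustrative placeholders, not the tool's actual API.

```python
# Toy sketch of prefix-keyed KV cache sharing across engine instances.
# Names are hypothetical; real systems store actual attention tensors.
import hashlib
from typing import Optional


class SharedKVStore:
    """Shared store mapping a token-prefix hash to its serialized KV state."""

    def __init__(self) -> None:
        self._store: dict[str, bytes] = {}

    @staticmethod
    def _key(tokens: list[int]) -> str:
        return hashlib.sha256(str(tokens).encode("utf-8")).hexdigest()

    def get(self, tokens: list[int]) -> Optional[bytes]:
        return self._store.get(self._key(tokens))

    def put(self, tokens: list[int], kv: bytes) -> None:
        self._store[self._key(tokens)] = kv


class Engine:
    """Toy engine: the expensive prefill is skipped on a cache hit."""

    def __init__(self, store: SharedKVStore) -> None:
        self.store = store
        self.prefills = 0  # counts full prefill passes actually run

    def generate(self, prompt_tokens: list[int]) -> bytes:
        kv = self.store.get(prompt_tokens)
        if kv is None:
            self.prefills += 1  # cache miss: run prefill, then publish the KV
            kv = b"kv-for-" + str(prompt_tokens).encode("utf-8")
            self.store.put(prompt_tokens, kv)
        return kv  # decoding would continue from this KV state


store = SharedKVStore()
a, b = Engine(store), Engine(store)
prompt = [1, 2, 3, 4]
a.generate(prompt)  # miss: engine A prefills and publishes the KV
b.generate(prompt)  # hit: engine B reuses it and skips prefill
print(a.prefills, b.prefills)  # → 1 0
```

Engine B pays only the lookup cost for the shared prefix, which is the mechanism behind the TTFT and GPU-memory savings the description claims.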