Nov 12, 2025

5-minute read

Making data transfer in LLM systems faster, leaner, and more scalable

Introducing Shared Memory IPC Caching — a high-performance caching mechanism contributed by Cohere to the vLLM project.
