Semantic Cache
Overview
semantic-cache is a route-local plugin for reusing semantically similar prior responses.
It aligns to config/plugin/semantic-cache/high-recall.yaml and config/plugin/semantic-cache/memory.yaml.
Key Advantages
- Reuses prior responses only on routes that benefit from cache hits.
- Keeps route-local thresholds separate from global store setup.
- Supports different cache policies for different routes.
What Problem Does It Solve?
Some routes benefit strongly from reuse, while others need fresh generation every time. semantic-cache keeps the reuse policy local to the route instead of making cache behavior global by default.