kgeist 40 days ago | on: Qwen3.6-35B-A3B: Agentic coding power, now open to...

Llama.cpp already uses an idea from it internally for the KV cache [0], so a quantized KV cache should now see less degradation.

[0] https://github.com/ggml-org/llama.cpp/pull/21038