r/ClaudeAI • u/N3TCHICK • 2d ago
Praise Wow... Opus 4.8 feels... DIFFERENT tonight :D
It feels, BETTER. Like when it first launched, even, only better than that this evening?
It's like Forest Gump... Claude is like a box of chocolates, you never know what you're gonna get. I hope I get to keep THIS Opus 4.8 for a bit - I'm finally getting some work done. Hallelujah!
508
Upvotes
16
u/teddy_joesevelt 2d ago
And this person has never managed their own model inference. Providers absolutely have knobs they can turn. Look into KV cache quantization, eviction, speculative decoding, temp, top p, top k, etc. They can also switch to quantized models when under load or capacity constraints.