Modern evaluation protocols for compressed LLMs go far beyond perplexity. Learn how LLM-KICK, EleutherAI LM Harness, and LLMCBench catch silent failures that traditional metrics miss, and why you can't afford to skip them.