No surprise here - most benchmarks are marketing tools in disguise, and it's shocking that more people don't call out their flaws. https://news.mit.edu/2026/study-platforms-rank-latest-llms-can-be-unreliable-0209