GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2

“Bigger is not better.”

Bigger turned out to be worse. GPT-5.5 and DeepSeek V4 Pro hallucinate at brutal rates while a smaller open model correctly says when something is impossible. The whole industry bet on scale as the answer. The biggest models are the most confident and the most wrong, which is the worst possible combination.