reddit.com via Reddit

r/ClaudeAI: GPT-5.5 Rates Claude Opus 4.8 as the Better AI Model in a 3-Task Head-to-Head Test Using Developer's Own Knowledge Base

anthropic openai benchmarks claude gpt

Summary

A developer ran a controlled 3-task head-to-head between GPT-5.5 and Claude Opus 4.8 against his own knowledge base in the Recall app and reported that GPT-5.5 itself rated Opus 4.8 the better model — an outcome he described as surprising, with significant caveats about the methodology. The post originally appeared in r/ChatGPT where it received pushback, then was cross-posted to r/ClaudeAI where reactions were more receptive. The test illustrates ongoing community appetite for informal head-to-head comparisons between the two current frontier leaders.