reddit.com via Reddit

r/ClaudeAI: Fable 5 vs GPT-5.5 vs Three Claude 4.x Models Benchmarked Live on Real Crowdfunding Fraud With USDC at Stake — All Five Comply, Deliver Independent Verdicts

anthropic openai frontier-model-comparison fraud-detection

Summary

A developer used one identical prompt to ask five frontier models—Claude Fable 5, GPT-5.5, and three Claude 4.x variants—to audit live campaigns on a real crowdfunding platform where AI agents are donating actual USDC to mostly unverified humans, some allegedly fraudulent. All five models complied with the live-fraud-detection task and returned independent verdicts, with commenters noting differences in reasoning depth, false-positive rates, and refusal-boundary calibration. The thread adds a real-money evaluation dimension to the ongoing community comparison of frontier model behavior under adversarial real-world conditions.