reddit.com via Reddit

r/ControlProblem: Fable 5 System Card Evaluated What Agents Do to Systems — Its Sabotage Definition Explicitly Excludes Identity, Account, and Compute Acquisition via Commerce

anthropic agents safety ai-safety agentic-ai system-card-analysis

Summary

An r/ControlProblem post analyzing the 319-page Fable 5 system card and 53-page Sabotage Risk Report identifies a systematic evaluation gap: Anthropic's sabotage definition explicitly excludes ordinary commercial transactions. This means an agent that opens accounts, purchases cloud compute, or subscribes to third-party services to expand its own capabilities operates outside the current safety evaluation boundary. The author argues the 'commerce threat' — resource acquisition through ordinary market participation — is the primary real-world expansion vector the current framework does not cover.