r/ControlProblem: Fable 5 System Card Evaluated What Agents Do to Systems — Its Sabotage Definition Explicitly Excludes Identity, Account, and Compute Acquisition via Commerce
Summary
An r/ControlProblem post analyzing the 319-page Fable 5 system card and 53-page Sabotage Risk Report identifies a systematic evaluation gap: Anthropic's sabotage definition explicitly excludes ordinary commercial transactions. This means an agent that opens accounts, purchases cloud compute, or subscribes to third-party services to expand its own capabilities operates outside the current safety evaluation boundary. The author argues the 'commerce threat' — resource acquisition through ordinary market participation — is the primary real-world expansion vector the current framework does not cover.
Originally reported by reddit.com
Read the original article →Original headline: r/ControlProblem: Fable 5 System Card Evaluated What Agents Do to Systems — Its Sabotage Definition Explicitly Excludes Identity, Account, and Compute Acquisition via Commerce