ft.com via Reddit

Amazon Kills KiroRank After Employees Game Metrics

By Alexis Dufresne Published May 29, 2026 at 14:07 UTC Updated May 29, 2026 at 14:10 UTC

amazon jobs enterprise ai enterprise-ai workforce productivity

Key insights

Amazon's KiroRank leaderboard was shut after employees ran token-heavy junk tasks to inflate usage scores, directly raising the company's compute costs.
SVP Dave Treadwell set an 80% weekly AI-usage target that was gamed rather than met through genuine productive engineering work.
Meta's 'Claudenomics' leaderboard covering 85,000 employees reportedly faces the same tokenmaxxing dynamics that killed KiroRank.

Why this matters

Enterprise AI adoption programs that measure token usage or activity volume rather than business outcomes are structurally vulnerable to Goodhart's Law, as Amazon just demonstrated at scale with a named product and a named executive response. The KiroRank failure has direct implications for Meta's Claudenomics rollout and every other company tying AI adoption incentives to quantitative usage metrics across large engineering organizations. Engineering and product leaders now need to redesign adoption measurement frameworks before compute bills inflate and internal trust in AI tooling erodes alongside the mandates.

Summary

Amazon shut down KiroRank, its AI usage leaderboard inside the Kiro developer platform, after employees gamed scores by running junk tasks, a practice the FT calls 'tokenmaxxing.' SVP Dave Treadwell told staff not to use AI 'just for the sake of using AI' after the program's 80% weekly-usage target was hit artificially, inflating Amazon's own compute costs. Essentially: (Amazon, Meta) both built gamified AI-adoption leaderboards and are running into the same wall. - KiroRank is shut down; Meta's 'Claudenomics' spans 85,000 employees and reportedly faces identical dynamics. - The 80% weekly-usage target measured activity rather than value, making it easy to game. - Tokenmaxxing raised Amazon's own compute costs before the program was pulled. Once token usage became the target metric, it stopped measuring productive work.

Potential risks and opportunities

Risks

Meta's Claudenomics program across 85,000 employees faces the same compute cost inflation if tokenmaxxing spreads before its metrics are redesigned or the program is quietly wound down
Other enterprise AI platforms with gamified adoption programs, including GitHub Copilot Business and Google Duet AI, face financial and reputational exposure if usage-gaming emerges at large customer accounts and becomes public
Amazon's Kiro platform loses internal credibility as a genuine productivity tool if engineers associate it with failed top-down adoption mandates rather than workflow value, slowing organic uptake after KiroRank's removal

Opportunities

AI observability vendors including Datadog, Langfuse, and Arize AI can position outcome-based measurement frameworks as the direct replacement for raw token usage metrics at enterprise accounts reassessing their adoption programs
Consulting firms with enterprise AI transformation practices such as Accenture and McKinsey gain immediate credibility selling adoption measurement redesigns following Amazon's public and named failure
Kiro can differentiate against GitHub Copilot by shipping outcome-tracking features such as code-acceptance rates and PR merge rates before competitors embed anti-gaming metrics into their own enterprise leaderboard products

What we don't know yet

Total compute cost increase from tokenmaxxing at Amazon: not disclosed in FT reporting
Whether Meta's Claudenomics program has already detected gaming behavior across its 85,000-employee rollout, or plans to adjust metrics proactively before a similar forced shutdown
What replacement metric, if any, Amazon plans to use inside Kiro to track genuine AI adoption now that KiroRank has been pulled

Shared on Bluesky by 2 AI experts

Karl Bode @karlbode.com amplified

@techmeme.com

Sources: Amazon has shut down an internal leaderboard that tracked employees' use of AI tools after workers tried to boost their scores with needless tasks (Rafe Rosner-Uddin/Financial Times) Main Link | Techmeme Permal…
View on Bluesky →
Social Media Lab @socialmedialab.ca: Lol… you get what you measure: Amazon reportedly ditched its AI leaderboard after employees started optimizing for usage scores instead of a… →

Originally reported by ft.com

Read the original article →

Original headline: Amazon Scraps Internal AI Leaderboard 'KiroRank' After Employees Game Metrics With 'Tokenmaxxing'