David Ha

Sakana Fugu Technical Report https://t.co/6e6WuA8FVB Release Notes: https://t.co/7xWGpOicFN https://t.co/g2yaZvex35

Sakana AI sakana.ai

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 6 from the directory shared this · 28d ago

RT @SakanaAILabs: Sakana Fugu Technical Report https://t.co/BRGepSmyI5 🐡 https://t.co/7d3webOkmH

Sakana Fugu Technical Report arxiv.org

AI Weekly's analysis →

Fugu-Ultra reportedly scores 73.7% on SWE-Bench Pro and 82.1% on Terminal Bench 2.1, beating Claude-Opus-4.8, Gemini-3.1-Pro, and GPT-5.5 baselines.
Fugu is itself a language model that learns to orchestrate Gemini-3.1-Pro, Claude-Opus-4.8, and GPT-5.5 as expert workers per query.
Training combines supervised fine-tuning with sep-CMA-ES evolutionary optimization for Fugu, and GRPO reinforcement learning for Fugu-Ultra.

Read full analysis →

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 24d ago

@OpenRouter https://t.co/kxUkYpSHHF

Fugu Ultra - API Pricing & Providers openrouter.ai

AI Weekly's analysis →

Read full analysis →

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 4 from the directory shared this · 25d ago

Program Manager (RSI Lab) https://t.co/DoWrN7jSl9

Sakana AI sakana.ai

AI Weekly's analysis →

Sakana AI is hiring a bilingual program manager in Tokyo for its Recursive Self-Improvement Lab, positioned as a dedicated research group.
The lab says its Darwin Gödel Machine more than doubled baseline SWE-bench software-engineering performance, a 30 percentage point absolute improvement.
Sakana says it is building the most sample-efficient self-improvement engine, not the most compute-hungry, backed by Japan's sovereign AI push.

Read full analysis →

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 18d ago

I’m looking to hire a Program Manager to help manage Sakana AI’s fast growing Recursive Self-Improvement (RSI) Lab 🚀 RSI Lab (English): https://t.co/Sz46xHIfZi RSI Lab (日本語): https://t.co/gCBtwg5imq Job Description: https://t.co/DoWrN7jSl9 https://t.co/VlsBVCQOHI

Sakana AI sakana.ai

AI Weekly's analysis →

Sakana AI has formally established a Tokyo-based research group dedicated to recursive self-improvement using foundation models.
The lab points to its Darwin Gödel Machine, which reportedly more than doubled its baseline software-engineering performance on SWE-bench.
Sakana frames the bet as sample efficiency rather than raw compute, and is hiring Frontier Research Scientists and Advanced Core Engineers.

Read full analysis →

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 18d ago

I’m looking to hire a Program Manager to help manage Sakana AI’s fast growing Recursive Self-Improvement (RSI) Lab 🚀 RSI Lab (English): https://t.co/Sz46xHIfZi RSI Lab (日本語): https://t.co/gCBtwg5imq Job Description: https://t.co/DoWrN7jSl9 https://t.co/VlsBVCQOHI

Sakana AI sakana.ai

AI Weekly's analysis →

Sakana AI has launched a dedicated Recursive Self-Improvement Lab in Tokyo, framed around redesigning the AI development process with AI itself.
The lab points to prior work such as the Darwin Gödel Machine, reported to drive a 30 percentage point absolute improvement on SWE-bench.
Sakana positions RSI as reachable on modest compute and aligned with Japan's sovereign AI strategy, rather than hyperscaler-style scaling.

Read full analysis →

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 18d ago

RT @SakanaAILabs: Join the team behind Sakana Chat 🐟, Marlin 🐬, and Fugu 🐡 https://t.co/jzoepOf44n

Sakana AI sakana.ai

AI Weekly's analysis →

Sakana AI has six open product roles based in Tokyo, spanning applied research, engineering, product management, sales, marketing, and design.
All six postings sit behind three named products: Sakana Chat, Sakana Marlin, and Sakana Fugu.
Marlin, launched in June as Sakana's first commercial product, is a research agent that runs for up to eight hours to produce 100-page reports.

Read full analysis →

View on Bluesky · ♥ 0 ↻ 0 ↩ 0 · 20d ago

Articles & links