Ofir Press

I develop tough benchmarks for LMs and then I build agents to try to beat those benchmarks. Postdoc @ Princeton University. https://ofir.io/about