Yuval Pinter

Karaoke enthusiast 🇮🇱 en/he/him

Articles & links

Yuval Pinter reposted
@craigschmidt.com

arxiv.org/abs/2605.22705 arxiv.org/abs/2605.22821 Happy Linear Programming for Tokenization day! I was involved with two separate papers that hit ArXiv yesterday, using LP's to find the vocabulary maximizing compression, depending on the kind of inference you want to use.

[2605.22821] Tokenisation via Convex Relaxations arxiv.org View on Bluesky →
Yuval Pinter reposted
@craigschmidt.com

arxiv.org/abs/2605.22705 arxiv.org/abs/2605.22821 Happy Linear Programming for Tokenization day! I was involved with two separate papers that hit ArXiv yesterday, using LP's to find the vocabulary maximizing compression, depending on the kind of inference you want to use.

arxiv.org View on Bluesky →

Where does this happen? How does this happen? We examine this starting with injecting the representation of "or" from ["po", "or"] into the "or" from ["err", "or"] and see when the invasive meaning takes over; then we go even deeper... let us know what you think! arxiv.org/abs…

Inside the LLM Word Factory arxiv.org
View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 3d ago

Recent commentary

It's been a long time since I reconnected with my interpretability past; here's our new analysis of the phenomenon of "detokenization": models reconstruct full word representations ("error") at the last state of the subword-tokenized location ["err", "or"].

View on Bluesky · ♥ 2 ↻ 1 ↩ 1 · 3d ago

In Yuval Pinter's orbit

Center = Yuval Pinter. Left = members they follow (green edges). Right = members who follow them (blue edges). Top = mutual follows (orange edges, slightly larger). Drag any node to reposition; click to open that profile.