tweety fish

at different times:
haxx0r in cDc
PhD in using people for ML and vice versa
"theory of mind for autonomous cars" startup guy
would you believe, went kablooie
at present:
newsletter -- buttondown.email/apperceptive &c
music, politics, nonsense

Articles & links

tweety fish reposted
@ryanlcooper.com

"As I see it, Prince has badly misread an author whose understanding of business (at least in the book he cites; I haven’t read everything by Drucker) provides a radically different alternative to the standard Silicon Valley CEO narrative." www.programmablemutter.com/p/ai-isnt…

programmablemutter.com View on Bluesky →

well, lo, people have built a benchmark using a task called multi-armed bandit, where success depends on iteratively grasping the relative odds of payouts from different choices, and LLMs are (generally) shit at it arxiv.org/pdf/2403.15371 /

View on Bluesky · ♥ 48 ↻ 4 ↩ 1 · 2 from the directory shared this · 26d ago

as promised here is the paper comparing LLMs to human results; I haven't gone through it carefully enough to vouch for the methodology and for various reasons I have my suspicions about how robust it'll be but it's certainly conceptually interesting: arxiv.org/pdf/2505.09901

View on Bluesky · ♥ 1 ↻ 0 ↩ 0 · 24d ago

quoted sentence paraphrased from today's Matt Levine which links www.bloomberg.com/news/article... and robinhood.com/us/en/newsro...

View on Bluesky · ♥ 5 ↻ 1 ↩ 2 · 22d ago
tweety fish reposted
@peark.es

Whether you’re a denialist or a booster or just someone trying to be objective, Salesforce is going to give us some pretty clear answers as to whether radically shifting resources from human coders to AI will work.

techloy.com View on Bluesky →

Recent commentary

Last night it occurred to me to wonder if LLMs were any good at gambling tasks. This is important not because it'd be funny for LLMs to gamble but because gambling tasks get used to measure human decision-making under risk /

View on Bluesky · ♥ 75 ↻ 9 ↩ 4 · 26d ago

my new opinion is we should ban the sale of anything that currently gets called 'AI' and also ban use of the term specifically so I shut up about it

View on Bluesky · ♥ 53 ↻ 6 ↩ 7 · 16d ago

All the "will 'AI' replace the humanities" shit is so boring, why aren't we trying to supplant random academic fields with other useful coding tools? Metasploit is going to replace comp lit. We'll be doing historical musicology with linters. Let's go

View on Bluesky · ♥ 34 ↻ 7 ↩ 5 · 10d ago

the story about data center investment vastly outstripping investment in public transportation is a story about capital markets and government and doesn't actually have anything to do with 'AI' as such, people get that, yeah?

View on Bluesky · ♥ 38 ↻ 2 ↩ 4 · 16d ago

You know what would be a fun thing to look at is the correlation between “AI will take all the jobs GET READY”-style marketing and “TAKE jobs no no it’s a great productivity tool” as it became clear to all that it was hopeless at female-coded empathetic/creative work and good at male-coded sw eng

View on Bluesky · ♥ 32 ↻ 7 ↩ 2 · 17d ago

listen, I try not to prejudge this stuff too hard because I think it's important for me, at least, to have my skepticism rooted in up-to-date knowledge but "Robinhood offers in-app agentic trading to its users" is one hell of an alarming sentence

View on Bluesky · ♥ 32 ↻ 3 ↩ 5 · 22d ago

Man, the ridiculous bullshit that people will believe about the ‘AI’ industry bums me out, it’s like the left-coded version of the right’s “all cities are dangerous hellholes”. Resist, resist by all means, but don’t let yourself be grifted insensate by charlatans and comforting lies.

View on Bluesky · ♥ 28 ↻ 3 ↩ 3 · 19d ago
