Jefouree

The discoveries worth talking about each week.


Harvard DSR Language Models

What Transformers Are Actually "Looking At" When They Think

Imagine reading a sentence without being able to say which words you focused on. Researchers have now opened up transformers to uncover the hidden geometry of attention: a map of which parts of the input the model weighs most heavily.

This means we're finally getting a clearer picture of how language models actually work underneath, not just what they produce.

