This story is one of 1,000 stories generated for the emotion angry. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 angry stories, after denoising with neutral dialogue baselines, produces the angry emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the angry vector is projected through the unembedding matrix.
C | 0.354 |
est | 0.354 |
aggravated | 0.342 |
🤬 | 0.337 |
恨 | 0.323 |
de | -0.424 |
a | -0.411 |
la | -0.341 |
H | -0.320 |
L | -0.296 |