This story is one of 1,000 stories generated for the emotion enraged. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 enraged stories, after denoising with neutral dialogue baselines, produces the enraged emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the enraged vector is projected through the unembedding matrix.
C | 0.555 |
est | 0.370 |
aggravated | 0.362 |
骂 | 0.359 |
🤬 | 0.345 |
a | -0.468 |
de | -0.358 |
la | -0.312 |
latter | -0.287 |
☺️ | -0.285 |