This story is one of 1,000 stories generated for the emotion irate. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 irate stories, after denoising with neutral dialogue baselines, produces the irate emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the irate vector is projected through the unembedding matrix.
C | 0.430 |
🤬 | 0.350 |
aggravated | 0.343 |
骂 | 0.340 |
恨 | 0.331 |
H | -0.292 |
soon | -0.289 |
☺️ | -0.289 |
optimistic | -0.258 |
latter | -0.253 |