This story is one of 1,000 stories generated for the emotion irritated. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 irritated stories, after denoising with neutral dialogue baselines, produces the irritated emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the irritated vector is projected through the unembedding matrix.
l | 0.370 |
S | 0.282 |
P | 0.264 |
đŸ˜ | 0.248 |
aggravated | 0.233 |
own | -0.300 |
joyful | -0.211 |
joyous | -0.207 |
खोले | -0.206 |
newfound | -0.205 |