This story is one of 1,000 stories generated for the emotion unhappy. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 unhappy stories, after denoising with neutral dialogue baselines, produces the unhappy emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the unhappy vector is projected through the unembedding matrix.
S | 0.339 |
๐ | 0.317 |
๐ | 0.252 |
๐ | 0.249 |
๐ข | 0.241 |
de | -0.436 |
la | -0.266 |
l | -0.224 |
/ | -0.216 |
- | -0.199 |