This story is one of 1,000 generated for the emotion "disdainful". During extraction, it was fed through Gemma4-31B and its hidden-state activations were captured at 11 layers.
The mean activation across all 1,000 disdainful stories, after denoising with neutral dialogue baselines, produces the disdainful emotion vector -- a direction in the model's 5,376-dimensional representation space.
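The averaging step can be illustrated with a minimal sketch. The source does not say how the neutral dialogue baselines are used for denoising, so this example assumes the simplest option, subtracting the mean neutral activation. The function name `emotion_vector`, the toy dimensionality, and the synthetic data are all hypothetical.

```python
import numpy as np

def emotion_vector(emotion_acts, neutral_acts):
    """Mean emotion activation minus mean neutral activation.

    ASSUMPTION: "denoising" is modelled here as a plain mean subtraction;
    the actual procedure is not specified in the source.
    Both inputs have shape (n_samples, d_model).
    """
    emotion_acts = np.asarray(emotion_acts, dtype=np.float64)
    neutral_acts = np.asarray(neutral_acts, dtype=np.float64)
    return emotion_acts.mean(axis=0) - neutral_acts.mean(axis=0)

# Synthetic stand-in data (hypothetical): a shared component present in all
# activations, plus a unit direction that only the emotion stories carry.
rng = np.random.default_rng(0)
d_model = 8  # toy size; the source reports 5,376 dimensions
shared = rng.normal(size=d_model)
direction = np.zeros(d_model)
direction[0] = 1.0

neutral = shared + 0.01 * rng.normal(size=(50, d_model))
emotional = shared + direction + 0.01 * rng.normal(size=(100, d_model))

vec = emotion_vector(emotional, neutral)
```

In this toy setup the shared component cancels and `vec` lands close to the planted direction. With real activations the same function would be applied per layer, once for each of the captured layers.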
The tokens below are those most promoted (positive scores) and most suppressed (negative scores) when the disdainful vector is projected through the unembedding matrix.
| Token | Score |
|---|---|
| C | 0.419 |
| l | 0.404 |
| S | 0.387 |
| T | 0.282 |
| I | 0.257 |
| own | -0.335 |
| unexpectedly | -0.285 |
| Suddenly | -0.246 |
| previously | -0.245 |
| until | -0.242 |
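A ranking like the one above can be sketched as a matrix-vector product followed by a sort. This is an illustration under stated assumptions, not the pipeline that produced these numbers: it assumes an unembedding matrix of shape (vocab_size, d_model) and skips any final normalization layer the real model may apply before unembedding. The function `top_tokens`, the toy matrix, and the token names are hypothetical.

```python
import numpy as np

def top_tokens(vector, unembed, vocab, k=5):
    """Return the k most promoted and k most suppressed tokens.

    ASSUMPTION: unembed has shape (vocab_size, d_model) and the score of a
    token is the dot product of its unembedding row with the vector.
    """
    scores = np.asarray(unembed, dtype=np.float64) @ np.asarray(vector, dtype=np.float64)
    order = np.argsort(scores)  # ascending: most suppressed first
    promoted = [(vocab[i], float(scores[i])) for i in order[::-1][:k]]
    suppressed = [(vocab[i], float(scores[i])) for i in order[:k]]
    return promoted, suppressed

# Toy example with a 4-token vocabulary and a 3-dimensional space.
vocab = ["tok_a", "tok_b", "tok_c", "tok_d"]
unembed = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [-1.0, 0.0, 0.0],
    [0.0, 0.0, 0.5],
])
vector = np.array([0.4, 0.1, 0.0])

promoted, suppressed = top_tokens(vector, unembed, vocab, k=2)
```

Here `tok_a` is the most promoted token because its row is most aligned with the vector, and `tok_c` is the most suppressed because its row points the opposite way.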