This story is one of 1,000 stories generated for the emotion disturbed. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 disturbed stories, after denoising with neutral dialogue baselines, produces the disturbed emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the disturbed vector is projected through the unembedding matrix.
S | 0.318 |
ness | 0.309 |
无法 | 0.300 |
😞 | 0.297 |
worse | 0.284 |
de | -0.829 |
la | -0.666 |
a | -0.392 |
B | -0.301 |
" | -0.290 |