This story is one of 1,000 stories generated for the emotion terrified. During extraction, the story was fed through Gemma4-31B and the model's hidden-state activations were captured at 11 layers.
The mean activation across all 1,000 terrified stories, after denoising with neutral dialogue baselines, produces the terrified emotion vector -- a direction in the model's 5,376-dimensional representation space.
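The averaging step described above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the text does not say how the denoising with neutral baselines is done, so the sketch assumes it amounts to subtracting the mean neutral-dialogue activation. The function name `emotion_vector`, the unit-length normalization, and the synthetic data are all hypothetical.

```python
import numpy as np

def emotion_vector(emotion_acts, neutral_acts):
    """Estimate an emotion direction from per-story activations at one layer.

    emotion_acts: (n_stories, d_model) activations for the emotion stories.
    neutral_acts: (n_neutral, d_model) activations for neutral dialogues.
    """
    emotion_mean = emotion_acts.mean(axis=0)
    neutral_mean = neutral_acts.mean(axis=0)
    # ASSUMPTION: "denoising" is modeled here as subtracting the neutral mean;
    # the actual procedure is not specified in the text.
    direction = emotion_mean - neutral_mean
    # ASSUMPTION: normalizing to unit length is a convenience, not a stated step.
    return direction / np.linalg.norm(direction)

# Synthetic stand-in data (not real model activations).
rng = np.random.default_rng(0)
d_model = 5376
planted = np.zeros(d_model)
planted[0] = 1.0                       # direction we hide in the emotion set
shared = rng.normal(size=d_model)      # component common to both sets
emotion_acts = shared + 2.0 * planted + 0.1 * rng.normal(size=(1000, d_model))
neutral_acts = shared + 0.1 * rng.normal(size=(200, d_model))

vec = emotion_vector(emotion_acts, neutral_acts)
```

In this toy setup the shared component cancels in the subtraction, so `vec` points mostly along the planted direction. With activations captured at 11 layers, the same computation would presumably be repeated once per layer.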
The tokens most promoted (positive scores) and most suppressed (negative scores) when the terrified vector is projected through the unembedding matrix:
😣 | 0.357 |
worse | 0.350 |
无法 | 0.334 |
😰 | 0.333 |
sickening | 0.322 |
la | -0.676 |
a | -0.392 |
de | -0.371 |
" | -0.344 |
happy | -0.309 |
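The projection behind the listing above can be sketched as a single matrix-vector product followed by a sort. This is a minimal illustration, not the pipeline that produced these scores: the helper `top_tokens`, the toy vocabulary, and the toy unembedding matrix are hypothetical, and the sketch assumes the scores are plain dot products with no normalization layer or rescaling applied first.

```python
import numpy as np

def top_tokens(vector, unembed, vocab, k=5):
    """Rank tokens by their dot product with a direction.

    vector:  (d_model,) direction in representation space.
    unembed: (vocab_size, d_model) unembedding matrix, one row per token.
    vocab:   list of vocab_size token strings.
    Returns (promoted, suppressed), each a list of (token, score) pairs.
    """
    scores = unembed @ vector            # one score per vocabulary token
    order = np.argsort(scores)           # ascending by score
    promoted = [(vocab[i], float(scores[i])) for i in order[::-1][:k]]
    suppressed = [(vocab[i], float(scores[i])) for i in order[:k]]
    return promoted, suppressed

# Toy example with made-up numbers (not the model's real weights).
vocab = ["worse", "sickening", "happy", "la"]
unembed = np.array([
    [1.0, 0.0, 0.0],
    [0.5, 0.0, 0.0],
    [-0.5, 0.0, 0.0],
    [-1.0, 0.0, 0.0],
])
vector = np.array([1.0, 0.0, 0.0])

promoted, suppressed = top_tokens(vector, unembed, vocab, k=2)
```

A positive score means that moving along the vector raises that token's logit, and a negative score means it lowers it. The size of the scores depends on how the vector and the matrix are scaled.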