This story is one of 1,000 stories generated for the emotion spiteful. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 spiteful stories, after denoising with neutral dialogue baselines, produces the spiteful emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the spiteful vector is projected through the unembedding matrix.
de | 0.717 |
l | 0.517 |
a | 0.480 |
la | 0.440 |
😈 | 0.428 |
own | -1.063 |
此刻 | -0.534 |
గుర్త | -0.493 |
熟悉的 | -0.487 |
ness | -0.479 |