This story is one of 1,000 stories generated for the emotion insulted. During extraction, it was fed through Gemma4-31B and its hidden state activations were captured at 11 layers.
The mean activation across all 1,000 insulted stories, after denoising with neutral dialogue baselines, produces the insulted emotion vector -- a direction in the model's 5,376-dimensional representation space.
Tokens promoted/suppressed when the insulted vector is projected through the unembedding matrix.
l | 0.354 |
骂 | 0.277 |
愤 | 0.267 |
C | 0.237 |
🤬 | 0.233 |
own | -0.325 |
давно | -0.229 |
adventurous | -0.223 |
soon | -0.219 |
optimistic | -0.217 |