This story is one of 1,000 generated for the emotion "self-critical". During extraction, it was fed through Gemma4-31B, and its hidden-state activations were captured at 11 layers.
The mean activation across all 1,000 self-critical stories, after denoising with neutral dialogue baselines, yields the self-critical emotion vector, a direction in the model's 5,376-dimensional representation space.
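The averaging step can be sketched as follows. The exact denoising procedure is not stated here, so this sketch assumes the simplest option: subtracting the mean activation of the neutral dialogue baselines from the mean activation of the emotion stories. The function name `emotion_vector`, the toy dimension of 8 (instead of 5,376), and the synthetic data are all illustrative assumptions, not the pipeline's actual code.

```python
import numpy as np

def emotion_vector(emotion_acts, neutral_acts):
    """Mean-difference direction for one layer.

    emotion_acts: (n_stories, d_model) activations from emotion stories.
    neutral_acts: (n_neutral, d_model) activations from neutral dialogues.
    ASSUMPTION: "denoising" is modeled as subtracting the neutral mean;
    the real procedure may differ.
    """
    emotion_acts = np.asarray(emotion_acts, dtype=np.float64)
    neutral_acts = np.asarray(neutral_acts, dtype=np.float64)
    return emotion_acts.mean(axis=0) - neutral_acts.mean(axis=0)

# Synthetic stand-in data: a component shared by all text, plus an
# emotion-specific signal on dimension 0 that only the stories carry.
rng = np.random.default_rng(0)
d_model = 8
shared = rng.normal(size=d_model)
signal = np.zeros(d_model)
signal[0] = 2.0

neutral = shared + 0.1 * rng.normal(size=(200, d_model))
emotion = shared + signal + 0.1 * rng.normal(size=(200, d_model))

vec = emotion_vector(emotion, neutral)
```

In this toy setup the shared component cancels and `vec` points almost entirely along dimension 0, which is the behavior the baseline subtraction is meant to achieve.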
Tokens most promoted (positive scores) and most suppressed (negative scores) when the self-critical vector is projected through the unembedding matrix:
Token | Score |
L | 0.748 |
S | 0.662 |
ness | 0.425 |
ly | 0.374 |
탓 | 0.351 |
la | -0.492 |
soon | -0.304 |
(!) | -0.301 |
secured | -0.284 |
cautiously | -0.280 |
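A ranking like the one above can be produced, in sketch form, by taking the dot product of the vector with each token's row of the unembedding matrix and sorting the results. The helper `top_tokens`, the four-token vocabulary, and the 2-dimensional matrix are hypothetical; whether the real scores involve a final normalization layer or any rescaling is not stated here and is left out.

```python
import numpy as np

def top_tokens(vector, unembed, vocab, k=5):
    """Return (promoted, suppressed) lists of (token, score) pairs.

    unembed: (vocab_size, d_model) unembedding matrix (assumed layout).
    The score of a token is the dot product of its row with the vector.
    """
    scores = np.asarray(unembed, dtype=np.float64) @ np.asarray(vector, dtype=np.float64)
    order = np.argsort(scores)  # ascending: most negative first
    promoted = [(vocab[i], float(scores[i])) for i in order[::-1][:k]]
    suppressed = [(vocab[i], float(scores[i])) for i in order[:k]]
    return promoted, suppressed

# Toy vocabulary and unembedding matrix, for illustration only.
vocab = ["a", "b", "c", "d"]
unembed = np.array([
    [1.0, 0.0],
    [0.0, 1.0],
    [-1.0, 0.0],
    [0.5, 0.5],
])
vector = np.array([1.0, 0.2])

promoted, suppressed = top_tokens(vector, unembed, vocab, k=2)
```

Positive scores mark tokens whose logits would rise if the vector were added to the residual stream, and negative scores mark tokens whose logits would fall.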