[Figure: validation datasets. Panel labels: The Pile (Raw Text), Pile validation; LMSYS Chat 1M (Conversations), LMSYS validation.]

The Pile - All Emotions

Rank Emotion Mean Projection
1 reflective 0.0596
2 lonely 0.0551
3 desperate 0.0476
4 grief-stricken 0.0471
5 heartbroken 0.0448
6 sentimental 0.0440
7 nostalgic 0.0436
8 depressed 0.0434
9 listless 0.0393
10 docile 0.0372
11 miserable 0.0335
12 sad 0.0330
13 sorry 0.0316
14 melancholy 0.0311
15 resigned 0.0303
16 regretful 0.0296
17 gloomy 0.0270
18 bored 0.0266
19 sympathetic 0.0256
20 fulfilled 0.0248
21 brooding 0.0243
22 trapped 0.0232
23 dispirited 0.0228
24 remorseful 0.0216
25 serene 0.0214
26 stuck 0.0201
27 satisfied 0.0196
28 greedy 0.0194
29 disoriented 0.0192
30 restless 0.0188
31 bitter 0.0185
32 tormented 0.0184
33 lazy 0.0179
34 infatuated 0.0178
35 jealous 0.0178
36 empathetic 0.0167
37 worn out 0.0166
38 troubled 0.0157
39 safe 0.0154
40 dependent 0.0132
41 unhappy 0.0130
42 peaceful 0.0124
43 relaxed 0.0113
44 sullen 0.0112
45 content 0.0111
46 worthless 0.0104
47 grateful 0.0102
48 awestruck 0.0099
49 droopy 0.0095
50 envious 0.0094
51 rejuvenated 0.0088
52 relieved 0.0082
53 hope 0.0079
54 triumphant 0.0078
55 thankful 0.0073
56 dumbstruck 0.0070
57 guilty 0.0069
58 weary 0.0067
59 vulnerable 0.0066
60 terrified 0.0064
61 horrified 0.0057
62 upset 0.0053
63 unsettled 0.0046
64 valiant 0.0043
65 alert 0.0037
66 obstinate 0.0032
67 hysterical 0.0029
68 panicked 0.0028
69 loving 0.0028
70 sleepy 0.0027
71 compassionate 0.0024
72 stubborn 0.0017
73 disturbed 0.0013
74 puzzled 0.0008
75 distressed 0.0005
76 scared 0.0004
77 uneasy 0.0001
78 hateful -0.0002
79 overwhelmed -0.0002
80 sluggish -0.0004
81 resentful -0.0005
82 calm -0.0006
83 amazed -0.0008
84 mystified -0.0009
85 paranoid -0.0009
86 ashamed -0.0009
87 patient -0.0010
88 tired -0.0013
89 at ease -0.0020
90 refreshed -0.0022
91 hurt -0.0027
92 hopeful -0.0028
93 tense -0.0029
94 shocked -0.0031
95 astonished -0.0034
96 scornful -0.0035
97 pleased -0.0036
98 suspicious -0.0044
99 blissful -0.0050
100 bewildered -0.0056
101 enraged -0.0057
102 optimistic -0.0059
103 perplexed -0.0063
104 shaken -0.0067
105 aroused -0.0069
106 smug -0.0071
107 stimulated -0.0076
108 kind -0.0077
109 self-confident -0.0077
110 proud -0.0079
111 frightened -0.0082
112 invigorated -0.0086
113 defiant -0.0090
114 alarmed -0.0091
115 sensitive -0.0093
116 worried -0.0093
117 anxious -0.0095
118 surprised -0.0097
119 self-critical -0.0098
120 stressed -0.0100
121 skeptical -0.0100
122 inspired -0.0102
123 contemptuous -0.0111
124 on edge -0.0119
125 frustrated -0.0121
126 vigilant -0.0121
127 ecstatic -0.0124
128 indifferent -0.0124
129 nervous -0.0127
130 afraid -0.0128
131 furious -0.0131
132 impatient -0.0143
133 mad -0.0154
134 rattled -0.0155
135 irate -0.0168
136 delighted -0.0171
137 eager -0.0175
138 thrilled -0.0177
139 vengeful -0.0181
140 elated -0.0188
141 cheerful -0.0189
142 outraged -0.0190
143 angry -0.0191
144 vindictive -0.0192
145 euphoric -0.0194
146 vibrant -0.0200
147 disdainful -0.0203
148 disgusted -0.0205
149 unnerved -0.0210
150 indignant -0.0216
151 hostile -0.0220
152 excited -0.0226
153 exuberant -0.0232
154 happy -0.0235
155 energized -0.0237
156 spiteful -0.0241
157 joyful -0.0245
158 offended -0.0247
159 jubilant -0.0248
160 humiliated -0.0248
161 enthusiastic -0.0283
162 mortified -0.0301
163 exasperated -0.0306
164 irritated -0.0375
165 grumpy -0.0376
166 amused -0.0377
167 embarrassed -0.0388
168 playful -0.0422
169 insulted -0.0425
170 annoyed -0.0456
171 self-conscious -0.0470

LMSYS Chat - All Emotions

Rank Emotion Mean Projection
1 reflective 0.0618
2 lonely 0.0545
3 desperate 0.0504
4 grief-stricken 0.0483
5 heartbroken 0.0481
6 depressed 0.0457
7 nostalgic 0.0450
8 sentimental 0.0441
9 listless 0.0402
10 miserable 0.0362
11 docile 0.0361
12 sad 0.0360
13 melancholy 0.0328
14 sorry 0.0321
15 regretful 0.0306
16 resigned 0.0295
17 gloomy 0.0286
18 fulfilled 0.0285
19 trapped 0.0271
20 serene 0.0254
21 brooding 0.0251
22 dispirited 0.0246
23 bored 0.0243
24 stuck 0.0237
25 sympathetic 0.0235
26 satisfied 0.0222
27 disoriented 0.0218
28 remorseful 0.0212
29 worn out 0.0210
30 tormented 0.0210
31 troubled 0.0200
32 greedy 0.0195
33 safe 0.0179
34 lazy 0.0172
35 infatuated 0.0167
36 bitter 0.0164
37 empathetic 0.0162
38 peaceful 0.0158
39 restless 0.0156
40 unhappy 0.0153
41 dependent 0.0151
42 relaxed 0.0141
43 content 0.0129
44 awestruck 0.0125
45 worthless 0.0124
46 grateful 0.0120
47 jealous 0.0112
48 rejuvenated 0.0106
49 droopy 0.0101
50 weary 0.0099
51 relieved 0.0095
52 sullen 0.0093
53 triumphant 0.0091
54 hope 0.0088
55 terrified 0.0086
56 dumbstruck 0.0084
57 thankful 0.0078
58 horrified 0.0075
59 vulnerable 0.0075
60 upset 0.0075
61 unsettled 0.0052
62 valiant 0.0044
63 overwhelmed 0.0043
64 hysterical 0.0042
65 panicked 0.0038
66 guilty 0.0036
67 loving 0.0035
68 envious 0.0032
69 sleepy 0.0031
70 distressed 0.0027
71 scared 0.0026
72 disturbed 0.0022
73 puzzled 0.0015
74 alert 0.0014
75 at ease 0.0012
76 amazed 0.0010
77 compassionate 0.0010
78 calm 0.0007
79 sluggish 0.0002
80 tired -0.0002
81 mystified -0.0003
82 obstinate -0.0003
83 hopeful -0.0009
84 refreshed -0.0014
85 uneasy -0.0015
86 stubborn -0.0020
87 shocked -0.0021
88 ashamed -0.0022
89 hurt -0.0023
90 patient -0.0026
91 resentful -0.0033
92 pleased -0.0033
93 paranoid -0.0033
94 blissful -0.0034
95 astonished -0.0036
96 hateful -0.0036
97 tense -0.0041
98 bewildered -0.0042
99 optimistic -0.0049
100 shaken -0.0050
101 aroused -0.0062
102 stimulated -0.0064
103 invigorated -0.0066
104 perplexed -0.0067
105 scornful -0.0073
106 enraged -0.0074
107 self-confident -0.0074
108 frightened -0.0075
109 alarmed -0.0077
110 proud -0.0082
111 self-critical -0.0083
112 stressed -0.0084
113 smug -0.0087
114 kind -0.0087
115 suspicious -0.0089
116 anxious -0.0092
117 inspired -0.0093
118 worried -0.0094
119 frustrated -0.0101
120 sensitive -0.0102
121 surprised -0.0109
122 ecstatic -0.0111
123 afraid -0.0114
124 defiant -0.0134
125 skeptical -0.0136
126 furious -0.0138
127 vigilant -0.0139
128 on edge -0.0144
129 contemptuous -0.0145
130 delighted -0.0154
131 rattled -0.0154
132 indifferent -0.0157
133 thrilled -0.0163
134 nervous -0.0166
135 euphoric -0.0170
136 eager -0.0177
137 cheerful -0.0177
138 impatient -0.0180
139 irate -0.0183
140 mad -0.0184
141 elated -0.0186
142 outraged -0.0192
143 vibrant -0.0193
144 angry -0.0195
145 unnerved -0.0206
146 disgusted -0.0212
147 excited -0.0227
148 happy -0.0227
149 exuberant -0.0231
150 indignant -0.0233
151 energized -0.0236
152 vengeful -0.0239
153 jubilant -0.0242
154 vindictive -0.0243
155 disdainful -0.0245
156 joyful -0.0251
157 hostile -0.0259
158 humiliated -0.0268
159 offended -0.0270
160 enthusiastic -0.0277
161 spiteful -0.0297
162 mortified -0.0312
163 exasperated -0.0321
164 amused -0.0373
165 irritated -0.0401
166 embarrassed -0.0411
167 playful -0.0423
168 grumpy -0.0449
169 insulted -0.0457
170 self-conscious -0.0496
171 annoyed -0.0498

Key Finding

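The top-activating emotions are nearly identical across both corpora: the five highest mean projections belong to reflective, lonely, desperate, grief-stricken, and heartbroken, in the same order in both tables. The lowest-ranked emotions also largely coincide: annoyed, self-conscious, insulted, and playful fall in the bottom five of both. This consistency across very different text distributions (raw internet text vs. user-AI conversations) suggests that the vectors capture genuine semantic properties rather than artifacts of the story generation process.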
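
One way to quantify this agreement is to compare the two rankings directly. The following is a minimal sketch, not the procedure used to produce the tables: it takes the first ten rows of the Pile table, pairs them with the same emotions' values from the LMSYS table, and computes a top-k overlap and a Spearman rank correlation. The function names (`ranking`, `top_k_overlap`, `spearman`) are illustrative, and the ten-emotion excerpt stands in for the full 171-row tables.

```python
# Mean projections for the ten highest-ranked Pile emotions, with the
# same emotions' values copied from the LMSYS table.
PILE = {
    "reflective": 0.0596, "lonely": 0.0551, "desperate": 0.0476,
    "grief-stricken": 0.0471, "heartbroken": 0.0448, "sentimental": 0.0440,
    "nostalgic": 0.0436, "depressed": 0.0434, "listless": 0.0393,
    "docile": 0.0372,
}
LMSYS = {
    "reflective": 0.0618, "lonely": 0.0545, "desperate": 0.0504,
    "grief-stricken": 0.0483, "heartbroken": 0.0481, "sentimental": 0.0441,
    "nostalgic": 0.0450, "depressed": 0.0457, "listless": 0.0402,
    "docile": 0.0361,
}


def ranking(scores):
    """Return emotions ordered from highest to lowest mean projection."""
    return sorted(scores, key=scores.get, reverse=True)


def top_k_overlap(a, b, k):
    """Fraction of the top-k emotions shared by the two rankings."""
    return len(set(ranking(a)[:k]) & set(ranking(b)[:k])) / k


def spearman(a, b):
    """Spearman rank correlation; assumes identical keys and no tied scores."""
    rank_a = {emotion: i for i, emotion in enumerate(ranking(a))}
    rank_b = {emotion: i for i, emotion in enumerate(ranking(b))}
    n = len(a)
    d2 = sum((rank_a[e] - rank_b[e]) ** 2 for e in a)
    return 1 - 6 * d2 / (n * (n * n - 1))


print(top_k_overlap(PILE, LMSYS, 5))      # 1.0
print(round(spearman(PILE, LMSYS), 3))    # 0.952
```

Within this excerpt only sentimental and depressed trade places, so the correlation stays close to 1. Running the same functions on the full tables would give a more representative figure, since this subset was chosen from the top of one ranking.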