senstella committed on
Commit 75ea698 · verified · 1 Parent(s): 969b1d8

Upload folder using huggingface_hub

Files changed (6)
  1. README.md +30 -0
  2. config.json +2277 -0
  3. model.safetensors +3 -0
  4. tokenizer.model +3 -0
  5. tokenizer.vocab +1024 -0
  6. vocab.txt +1023 -0
README.md ADDED
@@ -0,0 +1,30 @@
+ ---
+ library_name: mlx
+ tags:
+ - mlx
+ - automatic-speech-recognition
+ - speech
+ - audio
+ - FastConformer
+ - Conformer
+ - Parakeet
+ license: cc-by-4.0
+ pipeline_tag: automatic-speech-recognition
+ base_model: nvidia/parakeet-tdt-1.1b
+ ---
+
+ # mlx-community/parakeet-tdt-1.1b
+
+ This model was converted to MLX format from [nvidia/parakeet-tdt-1.1b](https://huggingface.co/nvidia/parakeet-tdt-1.1b) using [the conversion script](https://gist.github.com/senstella/77178bb5d6ec67bf8c54705a5f490bed). Please refer to the [original model card](https://huggingface.co/nvidia/parakeet-tdt-1.1b) for more details on the model.
+
+ ## Use with mlx
+
+ ### parakeet-mlx
+
+ ```bash
+ pip install -U parakeet-mlx
+ ```
+
+ ```bash
+ parakeet-mlx audio.wav --model mlx-community/parakeet-tdt-1.1b
+ ```
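The README above shows a single CLI invocation. As a rough sketch of how that same command could be scripted over a folder of recordings, the Python below builds the exact argument list from the README and runs it once per `.wav` file. The helper names `build_command` and `transcribe_all` are hypothetical and not part of `parakeet-mlx`; only the `parakeet-mlx <audio> --model <repo>` form is taken from the README, and where the CLI writes its output is left to its own defaults.

```python
import shutil
import subprocess
from pathlib import Path

# Repository name as given in the README above.
MODEL_ID = "mlx-community/parakeet-tdt-1.1b"


def build_command(audio_path, model_id=MODEL_ID):
    """Return the argv list for the CLI call shown in the README (hypothetical helper)."""
    return ["parakeet-mlx", str(audio_path), "--model", model_id]


def transcribe_all(folder):
    """Run the CLI once for every .wav file in `folder` (hypothetical helper)."""
    # Fail early with a clear message if the CLI has not been installed yet.
    if shutil.which("parakeet-mlx") is None:
        raise RuntimeError(
            "parakeet-mlx not found on PATH; run `pip install -U parakeet-mlx` first"
        )
    for wav in sorted(Path(folder).glob("*.wav")):
        # check=True raises CalledProcessError if the CLI exits with an error.
        subprocess.run(build_command(wav), check=True)
```

Calling `transcribe_all("recordings")` would then invoke the CLI for each file in a `recordings` directory; the directory name is only an example.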
config.json ADDED
@@ -0,0 +1,2277 @@
+ {
+ "sample_rate": 16000,
+ "compute_eval_loss": false,
+ "log_prediction": true,
+ "rnnt_reduction": "mean_volume",
+ "skip_nan_grad": true,
+ "model_defaults": {
+ "enc_hidden": 1024,
+ "pred_hidden": 640,
+ "joint_hidden": 640,
+ "tdt_durations": [
+ 0,
+ 1,
+ 2,
+ 3,
+ 4
+ ],
+ "num_tdt_durations": 5
+ },
+ "train_ds": {
+ "manifest_filepath": null,
+ "sample_rate": 16000,
+ "batch_size": 1,
+ "shuffle": true,
+ "num_workers": 8,
+ "pin_memory": true,
+ "max_duration": 20,
+ "min_duration": 0.1,
+ "is_tarred": true,
+ "tarred_audio_filepaths": null,
+ "shuffle_n": 2048,
+ "bucketing_strategy": "fully_randomized",
+ "bucketing_batch_size": [
+ 64,
+ 64,
+ 32,
+ 32,
+ 32,
+ 32,
+ 16,
+ 16,
+ 64,
+ 64,
+ 32,
+ 32,
+ 32,
+ 32,
+ 16,
+ 16,
+ 64,
+ 64,
+ 32,
+ 32,
+ 32,
+ 32,
+ 16,
+ 16
+ ],
+ "defer_setup": true
+ },
+ "validation_ds": {
+ "manifest_filepath": null,
+ "sample_rate": 16000,
+ "batch_size": 16,
+ "shuffle": false,
+ "use_start_end_token": false,
+ "num_workers": 8,
+ "pin_memory": true
+ },
+ "test_ds": {
+ "manifest_filepath": null,
+ "sample_rate": 16000,
+ "batch_size": 16,
+ "shuffle": false,
+ "use_start_end_token": false,
+ "num_workers": 8,
+ "pin_memory": true
+ },
+ "tokenizer": {
+ "dir": ".",
+ "type": "bpe",
+ "model_path": "nemo:1a8cc82b8cbd49fbb55f75cba2f4aacb_tokenizer.model",
+ "vocab_path": "nemo:784ee6c83b4b4e718a4710c878469ef8_vocab.txt",
+ "spe_tokenizer_vocab": "nemo:686236ca3c9142349d59286057b0e3ca_tokenizer.vocab"
+ },
+ "preprocessor": {
+ "_target_": "nemo.collections.asr.modules.AudioToMelSpectrogramPreprocessor",
+ "sample_rate": 16000,
+ "normalize": "per_feature",
+ "window_size": 0.025,
+ "window_stride": 0.01,
+ "window": "hann",
+ "features": 80,
+ "n_fft": 512,
+ "frame_splicing": 1,
+ "dither": 1e-05,
+ "pad_to": 0
+ },
+ "spec_augment": {
+ "_target_": "nemo.collections.asr.modules.SpectrogramAugmentation",
+ "freq_masks": 2,
+ "time_masks": 10,
+ "freq_width": 27,
+ "time_width": 0.05
+ },
+ "encoder": {
+ "_target_": "nemo.collections.asr.modules.ConformerEncoder",
+ "feat_in": 80,
+ "feat_out": -1,
+ "n_layers": 42,
+ "d_model": 1024,
+ "subsampling": "dw_striding",
+ "subsampling_factor": 8,
+ "subsampling_conv_channels": 256,
+ "causal_downsampling": false,
+ "reduction": null,
+ "reduction_position": null,
+ "reduction_factor": 1,
+ "ff_expansion_factor": 4,
+ "self_attention_model": "rel_pos",
+ "n_heads": 8,
+ "att_context_size": [
+ -1,
+ -1
+ ],
+ "att_context_style": "regular",
+ "xscaling": false,
+ "untie_biases": true,
+ "pos_emb_max_len": 5000,
+ "conv_kernel_size": 9,
+ "conv_norm_type": "batch_norm",
+ "conv_context_size": null,
+ "dropout": 0.1,
+ "dropout_pre_encoder": 0.1,
+ "dropout_emb": 0.0,
+ "dropout_att": 0.1,
+ "stochastic_depth_drop_prob": 0.0,
+ "stochastic_depth_mode": "linear",
+ "stochastic_depth_start_layer": 1
+ },
+ "decoder": {
+ "_target_": "nemo.collections.asr.modules.RNNTDecoder",
+ "normalization_mode": null,
+ "random_state_sampling": false,
+ "blank_as_pad": true,
+ "prednet": {
+ "pred_hidden": 640,
+ "pred_rnn_layers": 2,
+ "t_max": null,
+ "dropout": 0.2
+ },
+ "vocab_size": 1024
+ },
+ "joint": {
+ "_target_": "nemo.collections.asr.modules.RNNTJoint",
+ "log_softmax": null,
+ "preserve_memory": false,
+ "fuse_loss_wer": true,
+ "fused_batch_size": 2,
+ "jointnet": {
+ "joint_hidden": 640,
+ "activation": "relu",
+ "dropout": 0.2,
+ "encoder_hidden": 1024,
+ "pred_hidden": 640
+ },
+ "num_extra_outputs": 5,
+ "num_classes": 1024,
+ "vocabulary": [
170
+ "<unk>",
171
+ "\u2581t",
172
+ "\u2581th",
173
+ "\u2581a",
174
+ "\u2581i",
175
+ "\u2581the",
176
+ "re",
177
+ "\u2581w",
178
+ "\u2581s",
179
+ "\u2581o",
180
+ "in",
181
+ "at",
182
+ "er",
183
+ "ou",
184
+ "nd",
185
+ "\u2581c",
186
+ "\u2581b",
187
+ "\u2581h",
188
+ "on",
189
+ "\u2581m",
190
+ "\u2581f",
191
+ "ing",
192
+ "\u2581to",
193
+ "en",
194
+ "\u2581p",
195
+ "\u2581and",
196
+ "\u2581d",
197
+ "es",
198
+ "or",
199
+ "an",
200
+ "ll",
201
+ "\u2581y",
202
+ "\u2581l",
203
+ "ed",
204
+ "\u2581of",
205
+ "\u2581in",
206
+ "it",
207
+ "is",
208
+ "\u2581you",
209
+ "\u2581that",
210
+ "ar",
211
+ "\u2581g",
212
+ "\u2581n",
213
+ "as",
214
+ "om",
215
+ "\u2581it",
216
+ "ic",
217
+ "ve",
218
+ "\u2581e",
219
+ "\u2581wh",
220
+ "\u2581be",
221
+ "us",
222
+ "le",
223
+ "al",
224
+ "ion",
225
+ "ow",
226
+ "\u2581we",
227
+ "\u2581re",
228
+ "\u2581is",
229
+ "ut",
230
+ "ot",
231
+ "ent",
232
+ "\u2581on",
233
+ "et",
234
+ "\u2581ha",
235
+ "ay",
236
+ "ct",
237
+ "\u2581he",
238
+ "id",
239
+ "\u2581for",
240
+ "\u2581st",
241
+ "ver",
242
+ "ly",
243
+ "ro",
244
+ "ig",
245
+ "\u2581so",
246
+ "ld",
247
+ "\u2581this",
248
+ "ke",
249
+ "\u2581u",
250
+ "se",
251
+ "all",
252
+ "st",
253
+ "ur",
254
+ "ce",
255
+ "ch",
256
+ "im",
257
+ "ith",
258
+ "\u2581as",
259
+ "\u2581k",
260
+ "\u2581an",
261
+ "\u2581was",
262
+ "\u2581j",
263
+ "\u2581with",
264
+ "ir",
265
+ "\u2581go",
266
+ "ra",
267
+ "\u2581do",
268
+ "\u2581have",
269
+ "\u2581li",
270
+ "\u2581sh",
271
+ "\u2581se",
272
+ "\u2581they",
273
+ "\u2581are",
274
+ "am",
275
+ "ht",
276
+ "\u2581but",
277
+ "ation",
278
+ "\u2581not",
279
+ "th",
280
+ "\u2581r",
281
+ "ally",
282
+ "ad",
283
+ "ust",
284
+ "\u2581or",
285
+ "\u2581com",
286
+ "ould",
287
+ "\u2581can",
288
+ "ill",
289
+ "\u2581ne",
290
+ "ight",
291
+ "\u2581ch",
292
+ "\u2581de",
293
+ "\u2581con",
294
+ "\u2581at",
295
+ "\u2581mo",
296
+ "ant",
297
+ "oo",
298
+ "il",
299
+ "\u2581me",
300
+ "\u2581what",
301
+ "\u2581there",
302
+ "ter",
303
+ "pe",
304
+ "\u2581ab",
305
+ "\u2581su",
306
+ "ere",
307
+ "ck",
308
+ "\u2581pro",
309
+ "\u2581al",
310
+ "\u2581fr",
311
+ "\u2581kn",
312
+ "\u2581all",
313
+ "ers",
314
+ "\u2581like",
315
+ "ge",
316
+ "\u2581ex",
317
+ "\u2581som",
318
+ "ul",
319
+ "\u2581your",
320
+ "\u2581v",
321
+ "pp",
322
+ "use",
323
+ "\u2581if",
324
+ "ess",
325
+ "ate",
326
+ "est",
327
+ "\u2581know",
328
+ "out",
329
+ "if",
330
+ "\u2581just",
331
+ "ment",
332
+ "qu",
333
+ "op",
334
+ "ain",
335
+ "\u2581one",
336
+ "ol",
337
+ "ri",
338
+ "art",
339
+ "very",
340
+ "\u2581wor",
341
+ "ive",
342
+ "ist",
343
+ "\u2581my",
344
+ "nt",
345
+ "ab",
346
+ "\u2581from",
347
+ "ort",
348
+ "\u2581ma",
349
+ "\u2581about",
350
+ "res",
351
+ "ity",
352
+ "\u2581out",
353
+ "\u2581bec",
354
+ "\u2581le",
355
+ "our",
356
+ "od",
357
+ "and",
358
+ "ink",
359
+ "ie",
360
+ "\u2581up",
361
+ "ind",
362
+ "os",
363
+ "un",
364
+ "ause",
365
+ "oug",
366
+ "um",
367
+ "\u2581some",
368
+ "\u2581int",
369
+ "\u2581by",
370
+ "\u2581pl",
371
+ "\u2581get",
372
+ "el",
373
+ "ard",
374
+ "\u2581when",
375
+ "\u2581don",
376
+ "her",
377
+ "\u2581will",
378
+ "\u2581us",
379
+ "\u2581would",
380
+ "ook",
381
+ "ies",
382
+ "ich",
383
+ "\u2581because",
384
+ "\u2581think",
385
+ "em",
386
+ "\u2581pe",
387
+ "\u2581his",
388
+ "ack",
389
+ "\u2581then",
390
+ "\u2581our",
391
+ "ide",
392
+ "\u2581tim",
393
+ "\u2581how",
394
+ "ven",
395
+ "\u2581tr",
396
+ "\u2581who",
397
+ "\u2581them",
398
+ "ure",
399
+ "\u2581ar",
400
+ "\u2581ye",
401
+ "\u2581more",
402
+ "\u2581going",
403
+ "ect",
404
+ "\u2581sa",
405
+ "\u2581cl",
406
+ "\u2581had",
407
+ "\u2581now",
408
+ "\u2581which",
409
+ "\u2581here",
410
+ "ous",
411
+ "\u2581their",
412
+ "\u2581tw",
413
+ "so",
414
+ "\u2581has",
415
+ "ud",
416
+ "\u2581co",
417
+ "\u2581ta",
418
+ "ound",
419
+ "\u2581were",
420
+ "ast",
421
+ "\u2581peop",
422
+ "ough",
423
+ "\u2581no",
424
+ "\u2581really",
425
+ "\u2581any",
426
+ "\u2581people",
427
+ "\u2581want",
428
+ "\u2581she",
429
+ "\u2581en",
430
+ "\u2581fa",
431
+ "\u2581te",
432
+ "ame",
433
+ "ine",
434
+ "\u2581qu",
435
+ "red",
436
+ "\u2581im",
437
+ "\u2581right",
438
+ "ther",
439
+ "\u2581act",
440
+ "\u2581thing",
441
+ "king",
442
+ "ose",
443
+ "\u2581ad",
444
+ "\u2581see",
445
+ "\u2581time",
446
+ "\u2581these",
447
+ "ci",
448
+ "one",
449
+ "\u2581say",
450
+ "\u2581also",
451
+ "\u2581fe",
452
+ "per",
453
+ "\u2581ag",
454
+ "\u2581man",
455
+ "ore",
456
+ "\u2581un",
457
+ "pt",
458
+ "\u2581her",
459
+ "\u2581look",
460
+ "ong",
461
+ "ice",
462
+ "\u2581very",
463
+ "ff",
464
+ "ions",
465
+ "\u2581comp",
466
+ "\u2581did",
467
+ "itt",
468
+ "\u2581well",
469
+ "\u2581other",
470
+ "iv",
471
+ "ase",
472
+ "ree",
473
+ "hing",
474
+ "\u2581lo",
475
+ "reat",
476
+ "\u2581cont",
477
+ "\u2581part",
478
+ "\u2581into",
479
+ "nder",
480
+ "\u2581been",
481
+ "are",
482
+ "\u2581am",
483
+ "ans",
484
+ "\u2581sp",
485
+ "\u2581two",
486
+ "ue",
487
+ "\u2581way",
488
+ "age",
489
+ "\u2581where",
490
+ "ite",
491
+ "\u2581dis",
492
+ "\u2581than",
493
+ "\u2581every",
494
+ "\u2581pr",
495
+ "\u2581po",
496
+ "ag",
497
+ "\u2581need",
498
+ "ach",
499
+ "iff",
500
+ "ence",
501
+ "pl",
502
+ "own",
503
+ "\u2581ac",
504
+ "ble",
505
+ "\u2581over",
506
+ "iz",
507
+ "\u2581work",
508
+ "\u2581res",
509
+ "\u2581make",
510
+ "\u2581could",
511
+ "\u2581off",
512
+ "ually",
513
+ "\u2581ro",
514
+ "\u2581back",
515
+ "able",
516
+ "ip",
517
+ "ry",
518
+ "\u2581him",
519
+ "\u2581cour",
520
+ "ber",
521
+ "\u2581pre",
522
+ "\u2581fir",
523
+ "\u2581spe",
524
+ "ap",
525
+ "ars",
526
+ "\u2581diff",
527
+ "ire",
528
+ "\u2581somet",
529
+ "\u2581imp",
530
+ "\u2581those",
531
+ "\u2581comm",
532
+ "ance",
533
+ "ick",
534
+ "\u2581even",
535
+ "ated",
536
+ "way",
537
+ "sel",
538
+ "\u2581let",
539
+ "\u2581br",
540
+ "ty",
541
+ "\u2581per",
542
+ "int",
543
+ "\u2581first",
544
+ "\u2581thr",
545
+ "\u2581under",
546
+ "ah",
547
+ "\u2581may",
548
+ "\u2581cou",
549
+ "\u2581new",
550
+ "ress",
551
+ "act",
552
+ "\u2581gr",
553
+ "ep",
554
+ "\u2581said",
555
+ "ations",
556
+ "\u2581good",
557
+ "ace",
558
+ "ass",
559
+ "\u2581does",
560
+ "orm",
561
+ "ish",
562
+ "\u2581af",
563
+ "ving",
564
+ "co",
565
+ "\u2581app",
566
+ "\u2581lot",
567
+ "\u2581things",
568
+ "\u2581tra",
569
+ "ittle",
570
+ "\u2581bl",
571
+ "\u2581little",
572
+ "\u2581mu",
573
+ "cess",
574
+ "fe",
575
+ "ome",
576
+ "\u2581inc",
577
+ "\u2581differe",
578
+ "ary",
579
+ "ical",
580
+ "\u2581only",
581
+ "ult",
582
+ "\u2581again",
583
+ "\u2581got",
584
+ "ens",
585
+ "\u2581gu",
586
+ "\u2581kind",
587
+ "\u2581much",
588
+ "ord",
589
+ "\u2581through",
590
+ "ition",
591
+ "ild",
592
+ "\u2581down",
593
+ "\u2581actually",
594
+ "\u2581something",
595
+ "ang",
596
+ "ru",
597
+ "ces",
598
+ "\u2581fl",
599
+ "ile",
600
+ "ater",
601
+ "\u2581ra",
602
+ "\u2581take",
603
+ "ict",
604
+ "ign",
605
+ "\u2581sc",
606
+ "vel",
607
+ "\u2581bet",
608
+ "\u2581tal",
609
+ "\u2581yeah",
610
+ "\u2581use",
611
+ "fore",
612
+ "\u2581bu",
613
+ "\u2581start",
614
+ "ory",
615
+ "be",
616
+ "\u2581day",
617
+ "wn",
618
+ "xt",
619
+ "ia",
620
+ "ak",
621
+ "\u2581after",
622
+ "\u2581should",
623
+ "\u2581fo",
624
+ "\u2581ho",
625
+ "\u2581hel",
626
+ "\u2581ind",
627
+ "\u2581uh",
628
+ "na",
629
+ "ial",
630
+ "other",
631
+ "\u2581ke",
632
+ "\u2581call",
633
+ "\u2581most",
634
+ "\u2581ok",
635
+ "\u2581different",
636
+ "\u2581em",
637
+ "ting",
638
+ "ple",
639
+ "\u2581being",
640
+ "\u2581bo",
641
+ "ning",
642
+ "\u2581too",
643
+ "ors",
644
+ "\u2581happ",
645
+ "ark",
646
+ "og",
647
+ "\u2581help",
648
+ "\u2581rem",
649
+ "du",
650
+ "ction",
651
+ "ood",
652
+ "\u2581ser",
653
+ "ether",
654
+ "ious",
655
+ "\u2581mean",
656
+ "\u2581many",
657
+ "\u2581court",
658
+ "\u2581bel",
659
+ "ade",
660
+ "\u2581la",
661
+ "ved",
662
+ "\u2581des",
663
+ "\u2581rec",
664
+ "\u2581jo",
665
+ "\u2581dec",
666
+ "ves",
667
+ "\u2581before",
668
+ "\u2581put",
669
+ "self",
670
+ "\u2581point",
671
+ "te",
672
+ "\u2581ev",
673
+ "form",
674
+ "ents",
675
+ "\u2581add",
676
+ "ody",
677
+ "thing",
678
+ "\u2581case",
679
+ "\u2581pers",
680
+ "\u2581cons",
681
+ "iss",
682
+ "\u2581three",
683
+ "oth",
684
+ "\u2581ph",
685
+ "\u2581come",
686
+ "\u2581find",
687
+ "\u2581why",
688
+ "ull",
689
+ "\u2581show",
690
+ "\u2581bas",
691
+ "\u2581great",
692
+ "ily",
693
+ "\u2581rel",
694
+ "\u2581sm",
695
+ "\u2581its",
696
+ "\u2581fact",
697
+ "\u2581pos",
698
+ "ool",
699
+ "ments",
700
+ "ise",
701
+ "nds",
702
+ "ys",
703
+ "\u2581try",
704
+ "ual",
705
+ "ful",
706
+ "erm",
707
+ "\u2581inter",
708
+ "ons",
709
+ "\u2581quest",
710
+ "\u2581sub",
711
+ "we",
712
+ "vers",
713
+ "\u2581supp",
714
+ "\u2581feel",
715
+ "\u2581same",
716
+ "ub",
717
+ "ates",
718
+ "urn",
719
+ "ert",
720
+ "\u2581inv",
721
+ "day",
722
+ "\u2581rep",
723
+ "igh",
724
+ "\u2581sy",
725
+ "\u2581inst",
726
+ "\u2581long",
727
+ "\u2581still",
728
+ "\u2581okay",
729
+ "ft",
730
+ "ific",
731
+ "atch",
732
+ "ought",
733
+ "ath",
734
+ "\u2581own",
735
+ "\u2581made",
736
+ "ix",
737
+ "ced",
738
+ "ks",
739
+ "lic",
740
+ "\u2581wr",
741
+ "de",
742
+ "\u2581cr",
743
+ "\u2581att",
744
+ "\u2581ob",
745
+ "\u2581world",
746
+ "\u2581sure",
747
+ "ward",
748
+ "\u2581bit",
749
+ "\u2581life",
750
+ "\u2581person",
751
+ "\u2581pres",
752
+ "ph",
753
+ "\u2581vide",
754
+ "\u2581reg",
755
+ "\u2581end",
756
+ "ject",
757
+ "ange",
758
+ "\u2581fin",
759
+ "ied",
760
+ "pect",
761
+ "\u2581didn",
762
+ "\u2581around",
763
+ "ian",
764
+ "\u2581car",
765
+ "ible",
766
+ "\u2581sim",
767
+ "ever",
768
+ "\u2581sch",
769
+ "ating",
770
+ "\u2581pol",
771
+ "\u2581set",
772
+ "\u2581oh",
773
+ "cy",
774
+ "\u2581real",
775
+ "\u2581import",
776
+ "\u2581count",
777
+ "\u2581um",
778
+ "\u2581next",
779
+ "cial",
780
+ "les",
781
+ "\u2581hu",
782
+ "\u2581acc",
783
+ "\u2581might",
784
+ "\u2581ent",
785
+ "\u2581doing",
786
+ "\u2581ins",
787
+ "\u2581gen",
788
+ "\u2581play",
789
+ "\u2581cle",
790
+ "\u2581another",
791
+ "ady",
792
+ "ular",
793
+ "ib",
794
+ "ways",
795
+ "ered",
796
+ "ility",
797
+ "ities",
798
+ "\u2581op",
799
+ "\u2581def",
800
+ "\u2581years",
801
+ "\u2581never",
802
+ "ower",
803
+ "ram",
804
+ "\u2581tell",
805
+ "\u2581sl",
806
+ "onna",
807
+ "ail",
808
+ "ren",
809
+ "ute",
810
+ "\u2581gonna",
811
+ "\u2581big",
812
+ "\u2581give",
813
+ "der",
814
+ "ount",
815
+ "\u2581ap",
816
+ "kes",
817
+ "\u2581state",
818
+ "\u2581cor",
819
+ "\u2581min",
820
+ "ically",
821
+ "\u2581mon",
822
+ "\u2581fam",
823
+ "\u2581important",
824
+ "\u2581always",
825
+ "\u2581high",
826
+ "\u2581four",
827
+ "\u2581gra",
828
+ "\u2581ca",
829
+ "\u2581stud",
830
+ "\u2581dist",
831
+ "\u2581talk",
832
+ "\u2581num",
833
+ "\u2581str",
834
+ "\u2581today",
835
+ "ract",
836
+ "\u2581while",
837
+ "ason",
838
+ "\u2581iss",
839
+ "\u2581sur",
840
+ "\u2581char",
841
+ "\u2581last",
842
+ "oy",
843
+ "ited",
844
+ "\u2581exper",
845
+ "\u2581place",
846
+ "\u2581tri",
847
+ "\u2581ear",
848
+ "\u2581belie",
849
+ "\u2581able",
850
+ "\u2581underst",
851
+ "\u2581che",
852
+ "\u2581both",
853
+ "ug",
854
+ "\u2581doesn",
855
+ "\u2581keep",
856
+ "\u2581happen",
857
+ "ings",
858
+ "iew",
859
+ "ather",
860
+ "\u2581ass",
861
+ "\u2581love",
862
+ "ative",
863
+ "av",
864
+ "\u2581yes",
865
+ "\u2581ele",
866
+ "\u2581year",
867
+ "\u2581such",
868
+ "\u2581video",
869
+ "ness",
870
+ "\u2581el",
871
+ "\u2581trans",
872
+ "\u2581five",
873
+ "\u2581produ",
874
+ "ave",
875
+ "erest",
876
+ "als",
877
+ "body",
878
+ "cus",
879
+ "\u2581found",
880
+ "atter",
881
+ "\u2581eff",
882
+ "\u2581god",
883
+ "\u2581used",
884
+ "llow",
885
+ "\u2581interest",
886
+ "\u2581question",
887
+ "hip",
888
+ "\u2581bus",
889
+ "\u2581ask",
890
+ "\u2581exam",
891
+ "\u2581prov",
892
+ "lud",
893
+ "\u2581form",
894
+ "\u2581law",
895
+ "ense",
896
+ "\u2581child",
897
+ "\u2581gl",
898
+ "ne",
899
+ "\u2581each",
900
+ "\u2581understand",
901
+ "\u2581care",
902
+ "stem",
903
+ "\u2581med",
904
+ "\u2581maybe",
905
+ "ably",
906
+ "\u2581det",
907
+ "\u2581coll",
908
+ "its",
909
+ "\u2581commun",
910
+ "\u2581hand",
911
+ "\u2581'",
912
+ "\u2581ref",
913
+ "\u2581lear",
914
+ "\u2581done",
915
+ "\u2581gener",
916
+ "vern",
917
+ "\u2581mr",
918
+ "ween",
919
+ "\u2581better",
920
+ "\u2581between",
921
+ "li",
922
+ "blem",
923
+ "\u2581system",
924
+ "ertain",
925
+ "\u2581school",
926
+ "\u2581eas",
927
+ "\u2581exp",
928
+ "\u2581war",
929
+ "ention",
930
+ "\u2581ty",
931
+ "\u2581govern",
932
+ "ues",
933
+ "\u2581problem",
934
+ "\u2581plan",
935
+ "ac",
936
+ "\u2581conf",
937
+ "\u2581course",
938
+ "ouse",
939
+ "\u2581mar",
940
+ "\u2581stand",
941
+ "\u2581sk",
942
+ "\u2581seco",
943
+ "uring",
944
+ "\u2581ed",
945
+ "\u2581mem",
946
+ "ros",
947
+ "cri",
948
+ "\u2581thought",
949
+ "cept",
950
+ "\u2581partic",
951
+ "\u2581test",
952
+ "olog",
953
+ "iness",
954
+ "\u2581far",
955
+ "led",
956
+ "\u2581col",
957
+ "\u2581looking",
958
+ "\u2581read",
959
+ "\u2581whether",
960
+ "\u2581word",
961
+ "me",
962
+ "\u2581once",
963
+ "ize",
964
+ "\u2581home",
965
+ "\u2581requ",
966
+ "gg",
967
+ "\u2581ide",
968
+ "\u2581thank",
969
+ "ures",
970
+ "\u2581called",
971
+ "\u2581cur",
972
+ "\u2581water",
973
+ "\u2581frie",
974
+ "\u2581side",
975
+ "\u2581best",
976
+ "\u2581number",
977
+ "oney",
978
+ "\u2581turn",
979
+ "ock",
980
+ "\u2581eng",
981
+ "\u2581top",
982
+ "\u2581open",
983
+ "ead",
984
+ "\u2581everything",
985
+ "\u2581term",
986
+ "\u2581prob",
987
+ "\u2581hard",
988
+ "\u2581fun",
989
+ "\u2581spec",
990
+ "\u2581dire",
991
+ "\u2581second",
992
+ "\u2581pa",
993
+ "\u2581build",
994
+ "\u2581run",
995
+ "\u2581sign",
996
+ "\u2581reason",
997
+ "\u2581inform",
998
+ "\u2581watch",
999
+ "ution",
1000
+ "\u2581few",
1001
+ "mo",
1002
+ "\u2581hum",
1003
+ "ision",
1004
+ "\u2581ext",
1005
+ "\u2581tog",
1006
+ "\u2581conc",
1007
+ "\u2581thous",
1008
+ "\u2581thousand",
1009
+ "\u2581support",
1010
+ "\u2581together",
1011
+ "\u2581six",
1012
+ "ps",
1013
+ "\u2581mark",
1014
+ "ics",
1015
+ "\u2581includ",
1016
+ "ef",
1017
+ "\u2581opp",
1018
+ "ident",
1019
+ "\u2581anything",
1020
+ "\u2581met",
1021
+ "\u2581bre",
1022
+ "\u2581jud",
1023
+ "\u2581away",
1024
+ "\u2581old",
1025
+ "\u2581prog",
1026
+ "ten",
1027
+ "\u2581book",
1028
+ "\u2581says",
1029
+ "\u2581seem",
1030
+ "\u2581contin",
1031
+ "\u2581process",
1032
+ "\u2581sing",
1033
+ "\u2581money",
1034
+ "\u2581having",
1035
+ "\u2581beg",
1036
+ "\u2581comple",
1037
+ "\u2581thir",
1038
+ "\u2581using",
1039
+ "\u2581ret",
1040
+ "ger",
1041
+ "\u2581head",
1042
+ "\u2581cre",
1043
+ "\u2581poss",
1044
+ "enty",
1045
+ "\u2581certain",
1046
+ "\u2581clear",
1047
+ "ines",
1048
+ "\u2581wee",
1049
+ "arch",
1050
+ "\u2581inf",
1051
+ "ont",
1052
+ "\u2581sit",
1053
+ "\u2581lead",
1054
+ "alth",
1055
+ "\u2581art",
1056
+ "ross",
1057
+ "\u2581pub",
1058
+ "\u2581without",
1059
+ "\u2581pret",
1060
+ "\u2581getting",
1061
+ "ient",
1062
+ "\u2581z",
1063
+ "\u2581wom",
1064
+ "\u2581power",
1065
+ "ational",
1066
+ "ner",
1067
+ "\u2581rest",
1068
+ "\u2581believe",
1069
+ "\u2581wa",
1070
+ "\u2581aut",
1071
+ "\u2581move",
1072
+ "aim",
1073
+ "\u2581sort",
1074
+ "idence",
1075
+ "\u2581creat",
1076
+ "\u2581expl",
1077
+ "\u2581name",
1078
+ "\u2581went",
1079
+ "\u2581eu",
1080
+ "\u2581change",
1081
+ "\u2581came",
1082
+ "\u2581pay",
1083
+ "ices",
1084
+ "\u2581sin",
1085
+ "\u2581pur",
1086
+ "\u2581pass",
1087
+ "\u2581whole",
1088
+ "\u2581house",
1089
+ "\u2581hund",
1090
+ "\u2581hundred",
1091
+ "\u2581pretty",
1092
+ "\u2581trying",
1093
+ "\u2581ple",
1094
+ "\u2581allow",
1095
+ "\u2581compan",
1096
+ "\u2581government",
1097
+ "\u2581small",
1098
+ "\u2581light",
1099
+ "\u2581bra",
1100
+ "\u2581stu",
1101
+ "aint",
1102
+ "\u2581ah",
1103
+ "\u2581prot",
1104
+ "ets",
1105
+ "\u2581cent",
1106
+ "velop",
1107
+ "\u2581family",
1108
+ "\u2581business",
1109
+ "ety",
1110
+ "\u2581making",
1111
+ "\u2581list",
1112
+ "\u2581experi",
1113
+ "eric",
1114
+ "\u2581follow",
1115
+ "ately",
1116
+ "\u2581probably",
1117
+ "\u2581appe",
1118
+ "\u2581serv",
1119
+ "\u2581val",
1120
+ "\u2581leg",
1121
+ "\u2581resp",
1122
+ "\u2581develop",
1123
+ "ready",
1124
+ "\u2581already",
1125
+ "\u2581sec",
1126
+ "ell",
1127
+ "\u2581saying",
1128
+ "ash",
1129
+ "\u2581hear",
1130
+ "\u2581loc",
1131
+ "\u2581adv",
1132
+ "\u2581pri",
1133
+ "ret",
1134
+ "\u2581lar",
1135
+ "\u2581beh",
1136
+ "\u2581must",
1137
+ "\u2581hon",
1138
+ "\u2581means",
1139
+ "ew",
1140
+ "\u2581par",
1141
+ "\u2581order",
1142
+ "\u2581mom",
1143
+ "gn",
1144
+ "\u2581though",
1145
+ "\u2581record",
1146
+ "\u2581miss",
1147
+ "\u2581dr",
1148
+ "\u2581es",
1149
+ "\u2581eight",
1150
+ "\u2581ever",
1151
+ "\u2581left",
1152
+ "\u2581example",
1153
+ "\u2581enough",
1154
+ "osed",
1155
+ "\u2581claim",
1156
+ "ank",
1157
+ "con",
1158
+ "\u2581americ",
1159
+ "\u2581information",
1160
+ "\u2581arg",
1161
+ "\u2581full",
1162
+ "nce",
1163
+ "\u2581consid",
1164
+ "\u2581working",
1165
+ "ature",
1166
+ "\u2581",
1167
+ "e",
1168
+ "t",
1169
+ "a",
1170
+ "o",
1171
+ "i",
1172
+ "n",
1173
+ "s",
1174
+ "r",
1175
+ "h",
1176
+ "l",
1177
+ "d",
1178
+ "u",
1179
+ "c",
1180
+ "m",
1181
+ "y",
1182
+ "w",
1183
+ "g",
1184
+ "f",
1185
+ "p",
1186
+ "b",
1187
+ "v",
1188
+ "k",
1189
+ "'",
1190
+ "j",
1191
+ "x",
1192
+ "q",
1193
+ "z"
+ ]
+ },
+ "decoding": {
+ "strategy": "greedy",
+ "model_type": "tdt",
+ "durations": [
+ 0,
+ 1,
+ 2,
+ 3,
+ 4
+ ],
+ "greedy": {
+ "max_symbols": 10
+ },
+ "beam": {
+ "beam_size": 2,
+ "return_best_hypothesis": false,
+ "score_norm": true,
+ "tsd_max_sym_exp": 50,
+ "alsd_max_target_len": 2.0
+ }
+ },
+ "loss": {
+ "loss_name": "tdt",
+ "tdt_kwargs": {
+ "fastemit_lambda": 0.0,
+ "clamp": -1.0,
+ "durations": [
+ 0,
+ 1,
+ 2,
+ 3,
+ 4
+ ],
+ "sigma": 0.02,
+ "omega": 0.1
+ }
+ },
+ "optim": {
+ "name": "adamw",
+ "lr": 3.0,
+ "betas": [
+ 0.9,
+ 0.98
+ ],
+ "weight_decay": 0.001,
+ "sched": {
+ "name": "NoamAnnealing",
+ "warmup_steps": 25000,
+ "warmup_ratio": null,
+ "min_lr": 1e-06,
+ "d_model": 1024
+ }
+ },
+ "labels": [
1250
+ "<unk>",
1251
+ "\u2581t",
1252
+ "\u2581th",
1253
+ "\u2581a",
1254
+ "\u2581i",
1255
+ "\u2581the",
1256
+ "re",
1257
+ "\u2581w",
1258
+ "\u2581s",
1259
+ "\u2581o",
1260
+ "in",
1261
+ "at",
1262
+ "er",
1263
+ "ou",
1264
+ "nd",
1265
+ "\u2581c",
1266
+ "\u2581b",
1267
+ "\u2581h",
1268
+ "on",
1269
+ "\u2581m",
1270
+ "\u2581f",
1271
+ "ing",
1272
+ "\u2581to",
1273
+ "en",
1274
+ "\u2581p",
1275
+ "\u2581and",
1276
+ "\u2581d",
1277
+ "es",
1278
+ "or",
1279
+ "an",
1280
+ "ll",
1281
+ "\u2581y",
1282
+ "\u2581l",
1283
+ "ed",
1284
+ "\u2581of",
1285
+ "\u2581in",
1286
+ "it",
1287
+ "is",
1288
+ "\u2581you",
1289
+ "\u2581that",
1290
+ "ar",
1291
+ "\u2581g",
1292
+ "\u2581n",
1293
+ "as",
1294
+ "om",
1295
+ "\u2581it",
1296
+ "ic",
1297
+ "ve",
1298
+ "\u2581e",
1299
+ "\u2581wh",
1300
+ "\u2581be",
1301
+ "us",
1302
+ "le",
1303
+ "al",
1304
+ "ion",
1305
+ "ow",
1306
+ "\u2581we",
1307
+ "\u2581re",
1308
+ "\u2581is",
1309
+ "ut",
1310
+ "ot",
1311
+ "ent",
1312
+ "\u2581on",
1313
+ "et",
1314
+ "\u2581ha",
1315
+ "ay",
1316
+ "ct",
1317
+ "\u2581he",
1318
+ "id",
1319
+ "\u2581for",
1320
+ "\u2581st",
1321
+ "ver",
1322
+ "ly",
1323
+ "ro",
1324
+ "ig",
1325
+ "\u2581so",
1326
+ "ld",
1327
+ "\u2581this",
1328
+ "ke",
1329
+ "\u2581u",
1330
+ "se",
1331
+ "all",
1332
+ "st",
1333
+ "ur",
1334
+ "ce",
1335
+ "ch",
1336
+ "im",
1337
+ "ith",
1338
+ "\u2581as",
1339
+ "\u2581k",
1340
+ "\u2581an",
1341
+ "\u2581was",
1342
+ "\u2581j",
1343
+ "\u2581with",
1344
+ "ir",
1345
+ "\u2581go",
1346
+ "ra",
1347
+ "\u2581do",
1348
+ "\u2581have",
1349
+ "\u2581li",
1350
+ "\u2581sh",
1351
+ "\u2581se",
1352
+ "\u2581they",
1353
+ "\u2581are",
1354
+ "am",
1355
+ "ht",
1356
+ "\u2581but",
1357
+ "ation",
1358
+ "\u2581not",
1359
+ "th",
1360
+ "\u2581r",
1361
+ "ally",
1362
+ "ad",
1363
+ "ust",
1364
+ "\u2581or",
1365
+ "\u2581com",
1366
+ "ould",
1367
+ "\u2581can",
1368
+ "ill",
1369
+ "\u2581ne",
1370
+ "ight",
1371
+ "\u2581ch",
1372
+ "\u2581de",
1373
+ "\u2581con",
1374
+ "\u2581at",
1375
+ "\u2581mo",
1376
+ "ant",
1377
+ "oo",
1378
+ "il",
1379
+ "\u2581me",
1380
+ "\u2581what",
1381
+ "\u2581there",
1382
+ "ter",
1383
+ "pe",
1384
+ "\u2581ab",
1385
+ "\u2581su",
1386
+ "ere",
1387
+ "ck",
1388
+ "\u2581pro",
1389
+ "\u2581al",
1390
+ "\u2581fr",
1391
+ "\u2581kn",
1392
+ "\u2581all",
1393
+ "ers",
1394
+ "\u2581like",
1395
+ "ge",
1396
+ "\u2581ex",
1397
+ "\u2581som",
1398
+ "ul",
1399
+ "\u2581your",
1400
+ "\u2581v",
1401
+ "pp",
1402
+ "use",
1403
+ "\u2581if",
1404
+ "ess",
1405
+ "ate",
1406
+ "est",
1407
+ "\u2581know",
1408
+ "out",
1409
+ "if",
1410
+ "\u2581just",
1411
+ "ment",
1412
+ "qu",
1413
+ "op",
1414
+ "ain",
1415
+ "\u2581one",
1416
+ "ol",
1417
+ "ri",
1418
+ "art",
1419
+ "very",
1420
+ "\u2581wor",
1421
+ "ive",
1422
+ "ist",
1423
+ "\u2581my",
1424
+ "nt",
1425
+ "ab",
1426
+ "\u2581from",
1427
+ "ort",
1428
+ "\u2581ma",
1429
+ "\u2581about",
1430
+ "res",
1431
+ "ity",
1432
+ "\u2581out",
1433
+ "\u2581bec",
1434
+ "\u2581le",
1435
+ "our",
1436
+ "od",
1437
+ "and",
1438
+ "ink",
1439
+ "ie",
1440
+ "\u2581up",
1441
+ "ind",
1442
+ "os",
1443
+ "un",
1444
+ "ause",
1445
+ "oug",
1446
+ "um",
1447
+ "\u2581some",
1448
+ "\u2581int",
1449
+ "\u2581by",
1450
+ "\u2581pl",
1451
+ "\u2581get",
1452
+ "el",
1453
+ "ard",
1454
+ "\u2581when",
1455
+ "\u2581don",
1456
+ "her",
1457
+ "\u2581will",
1458
+ "\u2581us",
1459
+ "\u2581would",
1460
+ "ook",
1461
+ "ies",
1462
+ "ich",
1463
+ "\u2581because",
1464
+ "\u2581think",
1465
+ "em",
1466
+ "\u2581pe",
1467
+ "\u2581his",
1468
+ "ack",
1469
+ "\u2581then",
1470
+ "\u2581our",
1471
+ "ide",
1472
+ "\u2581tim",
1473
+ "\u2581how",
1474
+ "ven",
1475
+ "\u2581tr",
1476
+ "\u2581who",
1477
+ "\u2581them",
1478
+ "ure",
1479
+ "\u2581ar",
1480
+ "\u2581ye",
1481
+ "\u2581more",
1482
+ "\u2581going",
1483
+ "ect",
1484
+ "\u2581sa",
1485
+ "\u2581cl",
1486
+ "\u2581had",
1487
+ "\u2581now",
1488
+ "\u2581which",
1489
+ "\u2581here",
1490
+ "ous",
1491
+ "\u2581their",
1492
+ "\u2581tw",
1493
+ "so",
1494
+ "\u2581has",
1495
+ "ud",
1496
+ "\u2581co",
1497
+ "\u2581ta",
1498
+ "ound",
1499
+ "\u2581were",
1500
+ "ast",
1501
+ "\u2581peop",
1502
+ "ough",
1503
+ "\u2581no",
1504
+ "\u2581really",
1505
+ "\u2581any",
1506
+ "\u2581people",
1507
+ "\u2581want",
1508
+ "\u2581she",
1509
+ "\u2581en",
1510
+ "\u2581fa",
1511
+ "\u2581te",
1512
+ "ame",
1513
+ "ine",
1514
+ "\u2581qu",
1515
+ "red",
1516
+ "\u2581im",
1517
+ "\u2581right",
1518
+ "ther",
1519
+ "\u2581act",
1520
+ "\u2581thing",
1521
+ "king",
1522
+ "ose",
1523
+ "\u2581ad",
1524
+ "\u2581see",
1525
+ "\u2581time",
1526
+ "\u2581these",
1527
+ "ci",
1528
+ "one",
1529
+ "\u2581say",
1530
+ "\u2581also",
1531
+ "\u2581fe",
1532
+ "per",
1533
+ "\u2581ag",
1534
+ "\u2581man",
1535
+ "ore",
1536
+ "\u2581un",
1537
+ "pt",
1538
+ "\u2581her",
1539
+ "\u2581look",
1540
+ "ong",
1541
+ "ice",
1542
+ "\u2581very",
1543
+ "ff",
1544
+ "ions",
1545
+ "\u2581comp",
1546
+ "\u2581did",
1547
+ "itt",
1548
+ "\u2581well",
1549
+ "\u2581other",
1550
+ "iv",
1551
+ "ase",
1552
+ "ree",
1553
+ "hing",
1554
+ "\u2581lo",
1555
+ "reat",
1556
+ "\u2581cont",
1557
+ "\u2581part",
1558
+ "\u2581into",
1559
+ "nder",
1560
+ "\u2581been",
1561
+ "are",
1562
+ "\u2581am",
1563
+ "ans",
1564
+ "\u2581sp",
1565
+ "\u2581two",
1566
+ "ue",
1567
+ "\u2581way",
1568
+ "age",
1569
+ "\u2581where",
1570
+ "ite",
1571
+ "\u2581dis",
1572
+ "\u2581than",
1573
+ "\u2581every",
1574
+ "\u2581pr",
1575
+ "\u2581po",
1576
+ "ag",
1577
+ "\u2581need",
1578
+ "ach",
1579
+ "iff",
1580
+ "ence",
1581
+ "pl",
1582
+ "own",
1583
+ "\u2581ac",
1584
+ "ble",
1585
+ "\u2581over",
1586
+ "iz",
1587
+ "\u2581work",
1588
+ "\u2581res",
1589
+ "\u2581make",
1590
+ "\u2581could",
1591
+ "\u2581off",
1592
+ "ually",
1593
+ "\u2581ro",
1594
+ "\u2581back",
1595
+ "able",
1596
+ "ip",
1597
+ "ry",
1598
+ "\u2581him",
1599
+ "\u2581cour",
1600
+ "ber",
1601
+ "\u2581pre",
1602
+ "\u2581fir",
1603
+ "\u2581spe",
1604
+ "ap",
1605
+ "ars",
1606
+ "\u2581diff",
1607
+ "ire",
1608
+ "\u2581somet",
1609
+ "\u2581imp",
1610
+ "\u2581those",
1611
+ "\u2581comm",
1612
+ "ance",
1613
+ "ick",
1614
+ "\u2581even",
1615
+ "ated",
1616
+ "way",
1617
+ "sel",
1618
+ "\u2581let",
1619
+ "\u2581br",
1620
+ "ty",
1621
+ "\u2581per",
1622
+ "int",
1623
+ "\u2581first",
1624
+ "\u2581thr",
1625
+ "\u2581under",
1626
+ "ah",
1627
+ "\u2581may",
1628
+ "\u2581cou",
1629
+ "\u2581new",
1630
+ "ress",
1631
+ "act",
1632
+ "\u2581gr",
1633
+ "ep",
1634
+ "\u2581said",
1635
+ "ations",
1636
+ "\u2581good",
1637
+ "ace",
1638
+ "ass",
1639
+ "\u2581does",
1640
+ "orm",
1641
+ "ish",
1642
+ "\u2581af",
1643
+ "ving",
1644
+ "co",
1645
+ "\u2581app",
1646
+ "\u2581lot",
1647
+ "\u2581things",
1648
+ "\u2581tra",
1649
+ "ittle",
1650
+ "\u2581bl",
1651
+ "\u2581little",
1652
+ "\u2581mu",
1653
+ "cess",
1654
+ "fe",
1655
+ "ome",
1656
+ "\u2581inc",
1657
+ "\u2581differe",
1658
+ "ary",
1659
+ "ical",
1660
+ "\u2581only",
1661
+ "ult",
1662
+ "\u2581again",
1663
+ "\u2581got",
1664
+ "ens",
1665
+ "\u2581gu",
1666
+ "\u2581kind",
1667
+ "\u2581much",
1668
+ "ord",
1669
+ "\u2581through",
1670
+ "ition",
1671
+ "ild",
1672
+ "\u2581down",
1673
+ "\u2581actually",
1674
+ "\u2581something",
1675
+ "ang",
1676
+ "ru",
1677
+ "ces",
1678
+ "\u2581fl",
1679
+ "ile",
1680
+ "ater",
1681
+ "\u2581ra",
1682
+ "\u2581take",
1683
+ "ict",
1684
+ "ign",
1685
+ "\u2581sc",
1686
+ "vel",
1687
+ "\u2581bet",
1688
+ "\u2581tal",
1689
+ "\u2581yeah",
1690
+ "\u2581use",
1691
+ "fore",
1692
+ "\u2581bu",
1693
+ "\u2581start",
1694
+ "ory",
1695
+ "be",
1696
+ "\u2581day",
1697
+ "wn",
1698
+ "xt",
1699
+ "ia",
1700
+ "ak",
1701
+ "\u2581after",
1702
+ "\u2581should",
1703
+ "\u2581fo",
1704
+ "\u2581ho",
1705
+ "\u2581hel",
1706
+ "\u2581ind",
1707
+ "\u2581uh",
1708
+ "na",
1709
+ "ial",
1710
+ "other",
1711
+ "\u2581ke",
1712
+ "\u2581call",
1713
+ "\u2581most",
1714
+ "\u2581ok",
1715
+ "\u2581different",
1716
+ "\u2581em",
1717
+ "ting",
1718
+ "ple",
1719
+ "\u2581being",
1720
+ "\u2581bo",
1721
+ "ning",
1722
+ "\u2581too",
1723
+ "ors",
1724
+ "\u2581happ",
1725
+ "ark",
1726
+ "og",
1727
+ "\u2581help",
1728
+ "\u2581rem",
1729
+ "du",
1730
+ "ction",
1731
+ "ood",
1732
+ "\u2581ser",
1733
+ "ether",
1734
+ "ious",
1735
+ "\u2581mean",
1736
+ "\u2581many",
1737
+ "\u2581court",
1738
+ "\u2581bel",
1739
+ "ade",
1740
+ "\u2581la",
1741
+ "ved",
1742
+ "\u2581des",
1743
+ "\u2581rec",
1744
+ "\u2581jo",
1745
+ "\u2581dec",
1746
+ "ves",
1747
+ "\u2581before",
1748
+ "\u2581put",
1749
+ "self",
1750
+ "\u2581point",
1751
+ "te",
1752
+ "\u2581ev",
1753
+ "form",
1754
+ "ents",
1755
+ "\u2581add",
1756
+ "ody",
1757
+ "thing",
1758
+ "\u2581case",
1759
+ "\u2581pers",
1760
+ "\u2581cons",
1761
+ "iss",
1762
+ "\u2581three",
1763
+ "oth",
1764
+ "\u2581ph",
1765
+ "\u2581come",
1766
+ "\u2581find",
1767
+ "\u2581why",
1768
+ "ull",
1769
+ "\u2581show",
1770
+ "\u2581bas",
1771
+ "\u2581great",
1772
+ "ily",
1773
+ "\u2581rel",
1774
+ "\u2581sm",
1775
+ "\u2581its",
1776
+ "\u2581fact",
1777
+ "\u2581pos",
1778
+ "ool",
1779
+ "ments",
1780
+ "ise",
1781
+ "nds",
1782
+ "ys",
1783
+ "\u2581try",
1784
+ "ual",
1785
+ "ful",
1786
+ "erm",
1787
+ "\u2581inter",
1788
+ "ons",
1789
+ "\u2581quest",
1790
+ "\u2581sub",
1791
+ "we",
1792
+ "vers",
1793
+ "\u2581supp",
1794
+ "\u2581feel",
1795
+ "\u2581same",
1796
+ "ub",
1797
+ "ates",
1798
+ "urn",
1799
+ "ert",
1800
+ "\u2581inv",
1801
+ "day",
1802
+ "\u2581rep",
1803
+ "igh",
1804
+ "\u2581sy",
1805
+ "\u2581inst",
1806
+ "\u2581long",
1807
+ "\u2581still",
1808
+ "\u2581okay",
1809
+ "ft",
1810
+ "ific",
1811
+ "atch",
1812
+ "ought",
1813
+ "ath",
1814
+ "\u2581own",
1815
+ "\u2581made",
1816
+ "ix",
1817
+ "ced",
1818
+ "ks",
1819
+ "lic",
1820
+ "\u2581wr",
1821
+ "de",
1822
+ "\u2581cr",
1823
+ "\u2581att",
1824
+ "\u2581ob",
1825
+ "\u2581world",
1826
+ "\u2581sure",
1827
+ "ward",
1828
+ "\u2581bit",
1829
+ "\u2581life",
1830
+ "\u2581person",
1831
+ "\u2581pres",
1832
+ "ph",
1833
+ "\u2581vide",
1834
+ "\u2581reg",
1835
+ "\u2581end",
1836
+ "ject",
1837
+ "ange",
1838
+ "\u2581fin",
1839
+ "ied",
1840
+ "pect",
1841
+ "\u2581didn",
1842
+ "\u2581around",
1843
+ "ian",
1844
+ "\u2581car",
1845
+ "ible",
1846
+ "\u2581sim",
1847
+ "ever",
1848
+ "\u2581sch",
1849
+ "ating",
1850
+ "\u2581pol",
1851
+ "\u2581set",
1852
+ "\u2581oh",
1853
+ "cy",
1854
+ "\u2581real",
1855
+ "\u2581import",
1856
+ "\u2581count",
1857
+ "\u2581um",
1858
+ "\u2581next",
1859
+ "cial",
1860
+ "les",
1861
+ "\u2581hu",
1862
+ "\u2581acc",
1863
+ "\u2581might",
1864
+ "\u2581ent",
1865
+ "\u2581doing",
1866
+ "\u2581ins",
1867
+ "\u2581gen",
1868
+ "\u2581play",
1869
+ "\u2581cle",
1870
+ "\u2581another",
1871
+ "ady",
1872
+ "ular",
1873
+ "ib",
1874
+ "ways",
1875
+ "ered",
1876
+ "ility",
1877
+ "ities",
1878
+ "\u2581op",
1879
+ "\u2581def",
1880
+ "\u2581years",
1881
+ "\u2581never",
1882
+ "ower",
1883
+ "ram",
1884
+ "\u2581tell",
1885
+ "\u2581sl",
1886
+ "onna",
1887
+ "ail",
1888
+ "ren",
1889
+ "ute",
1890
+ "\u2581gonna",
1891
+ "\u2581big",
1892
+ "\u2581give",
1893
+ "der",
1894
+ "ount",
1895
+ "\u2581ap",
1896
+ "kes",
1897
+ "\u2581state",
1898
+ "\u2581cor",
1899
+ "\u2581min",
1900
+ "ically",
1901
+ "\u2581mon",
1902
+ "\u2581fam",
1903
+ "\u2581important",
1904
+ "\u2581always",
1905
+ "\u2581high",
1906
+ "\u2581four",
1907
+ "\u2581gra",
1908
+ "\u2581ca",
1909
+ "\u2581stud",
1910
+ "\u2581dist",
1911
+ "\u2581talk",
1912
+ "\u2581num",
1913
+ "\u2581str",
1914
+ "\u2581today",
1915
+ "ract",
1916
+ "\u2581while",
1917
+ "ason",
1918
+ "\u2581iss",
1919
+ "\u2581sur",
1920
+ "\u2581char",
1921
+ "\u2581last",
1922
+ "oy",
1923
+ "ited",
1924
+ "\u2581exper",
1925
+ "\u2581place",
1926
+ "\u2581tri",
1927
+ "\u2581ear",
1928
+ "\u2581belie",
1929
+ "\u2581able",
1930
+ "\u2581underst",
1931
+ "\u2581che",
1932
+ "\u2581both",
1933
+ "ug",
1934
+ "\u2581doesn",
1935
+ "\u2581keep",
1936
+ "\u2581happen",
1937
+ "ings",
1938
+ "iew",
1939
+ "ather",
1940
+ "\u2581ass",
1941
+ "\u2581love",
1942
+ "ative",
1943
+ "av",
1944
+ "\u2581yes",
1945
+ "\u2581ele",
1946
+ "\u2581year",
1947
+ "\u2581such",
1948
+ "\u2581video",
1949
+ "ness",
1950
+ "\u2581el",
1951
+ "\u2581trans",
1952
+ "\u2581five",
1953
+ "\u2581produ",
1954
+ "ave",
1955
+ "erest",
1956
+ "als",
1957
+ "body",
1958
+ "cus",
1959
+ "\u2581found",
1960
+ "atter",
1961
+ "\u2581eff",
1962
+ "\u2581god",
1963
+ "\u2581used",
1964
+ "llow",
1965
+ "\u2581interest",
1966
+ "\u2581question",
1967
+ "hip",
1968
+ "\u2581bus",
1969
+ "\u2581ask",
1970
+ "\u2581exam",
1971
+ "\u2581prov",
1972
+ "lud",
1973
+ "\u2581form",
1974
+ "\u2581law",
1975
+ "ense",
1976
+ "\u2581child",
1977
+ "\u2581gl",
1978
+ "ne",
1979
+ "\u2581each",
1980
+ "\u2581understand",
1981
+ "\u2581care",
1982
+ "stem",
1983
+ "\u2581med",
1984
+ "\u2581maybe",
1985
+ "ably",
1986
+ "\u2581det",
1987
+ "\u2581coll",
1988
+ "its",
1989
+ "\u2581commun",
1990
+ "\u2581hand",
1991
+ "\u2581'",
1992
+ "\u2581ref",
1993
+ "\u2581lear",
1994
+ "\u2581done",
1995
+ "\u2581gener",
1996
+ "vern",
1997
+ "\u2581mr",
1998
+ "ween",
1999
+ "\u2581better",
2000
+ "\u2581between",
2001
+ "li",
2002
+ "blem",
2003
+ "\u2581system",
2004
+ "ertain",
2005
+ "\u2581school",
2006
+ "\u2581eas",
2007
+ "\u2581exp",
2008
+ "\u2581war",
2009
+ "ention",
2010
+ "\u2581ty",
2011
+ "\u2581govern",
2012
+ "ues",
2013
+ "\u2581problem",
2014
+ "\u2581plan",
2015
+ "ac",
2016
+ "\u2581conf",
2017
+ "\u2581course",
2018
+ "ouse",
2019
+ "\u2581mar",
2020
+ "\u2581stand",
2021
+ "\u2581sk",
2022
+ "\u2581seco",
2023
+ "uring",
2024
+ "\u2581ed",
2025
+ "\u2581mem",
2026
+ "ros",
2027
+ "cri",
2028
+ "\u2581thought",
2029
+ "cept",
2030
+ "\u2581partic",
2031
+ "\u2581test",
2032
+ "olog",
2033
+ "iness",
2034
+ "\u2581far",
2035
+ "led",
2036
+ "\u2581col",
2037
+ "\u2581looking",
2038
+ "\u2581read",
2039
+ "\u2581whether",
2040
+ "\u2581word",
2041
+ "me",
2042
+ "\u2581once",
2043
+ "ize",
2044
+ "\u2581home",
2045
+ "\u2581requ",
2046
+ "gg",
2047
+ "\u2581ide",
2048
+ "\u2581thank",
2049
+ "ures",
2050
+ "\u2581called",
2051
+ "\u2581cur",
2052
+ "\u2581water",
2053
+ "\u2581frie",
2054
+ "\u2581side",
2055
+ "\u2581best",
2056
+ "\u2581number",
2057
+ "oney",
2058
+ "\u2581turn",
2059
+ "ock",
2060
+ "\u2581eng",
2061
+ "\u2581top",
2062
+ "\u2581open",
2063
+ "ead",
2064
+ "\u2581everything",
2065
+ "\u2581term",
2066
+ "\u2581prob",
2067
+ "\u2581hard",
2068
+ "\u2581fun",
2069
+ "\u2581spec",
2070
+ "\u2581dire",
2071
+ "\u2581second",
2072
+ "\u2581pa",
2073
+ "\u2581build",
2074
+ "\u2581run",
2075
+ "\u2581sign",
2076
+ "\u2581reason",
2077
+ "\u2581inform",
2078
+ "\u2581watch",
2079
+ "ution",
2080
+ "\u2581few",
2081
+ "mo",
2082
+ "\u2581hum",
2083
+ "ision",
2084
+ "\u2581ext",
2085
+ "\u2581tog",
2086
+ "\u2581conc",
2087
+ "\u2581thous",
2088
+ "\u2581thousand",
2089
+ "\u2581support",
2090
+ "\u2581together",
2091
+ "\u2581six",
2092
+ "ps",
2093
+ "\u2581mark",
2094
+ "ics",
2095
+ "\u2581includ",
2096
+ "ef",
2097
+ "\u2581opp",
2098
+ "ident",
2099
+ "\u2581anything",
2100
+ "\u2581met",
2101
+ "\u2581bre",
2102
+ "\u2581jud",
2103
+ "\u2581away",
2104
+ "\u2581old",
2105
+ "\u2581prog",
2106
+ "ten",
2107
+ "\u2581book",
2108
+ "\u2581says",
2109
+ "\u2581seem",
2110
+ "\u2581contin",
2111
+ "\u2581process",
2112
+ "\u2581sing",
2113
+ "\u2581money",
2114
+ "\u2581having",
2115
+ "\u2581beg",
2116
+ "\u2581comple",
2117
+ "\u2581thir",
2118
+ "\u2581using",
2119
+ "\u2581ret",
2120
+ "ger",
2121
+ "\u2581head",
2122
+ "\u2581cre",
2123
+ "\u2581poss",
2124
+ "enty",
2125
+ "\u2581certain",
2126
+ "\u2581clear",
2127
+ "ines",
2128
+ "\u2581wee",
2129
+ "arch",
2130
+ "\u2581inf",
2131
+ "ont",
2132
+ "\u2581sit",
2133
+ "\u2581lead",
2134
+ "alth",
2135
+ "\u2581art",
2136
+ "ross",
2137
+ "\u2581pub",
2138
+ "\u2581without",
2139
+ "\u2581pret",
2140
+ "\u2581getting",
2141
+ "ient",
2142
+ "\u2581z",
2143
+ "\u2581wom",
2144
+ "\u2581power",
2145
+ "ational",
2146
+ "ner",
2147
+ "\u2581rest",
2148
+ "\u2581believe",
2149
+ "\u2581wa",
2150
+ "\u2581aut",
2151
+ "\u2581move",
2152
+ "aim",
2153
+ "\u2581sort",
2154
+ "idence",
2155
+ "\u2581creat",
2156
+ "\u2581expl",
2157
+ "\u2581name",
2158
+ "\u2581went",
2159
+ "\u2581eu",
2160
+ "\u2581change",
2161
+ "\u2581came",
2162
+ "\u2581pay",
2163
+ "ices",
2164
+ "\u2581sin",
2165
+ "\u2581pur",
2166
+ "\u2581pass",
2167
+ "\u2581whole",
2168
+ "\u2581house",
2169
+ "\u2581hund",
2170
+ "\u2581hundred",
2171
+ "\u2581pretty",
2172
+ "\u2581trying",
2173
+ "\u2581ple",
2174
+ "\u2581allow",
2175
+ "\u2581compan",
2176
+ "\u2581government",
2177
+ "\u2581small",
2178
+ "\u2581light",
2179
+ "\u2581bra",
2180
+ "\u2581stu",
2181
+ "aint",
2182
+ "\u2581ah",
2183
+ "\u2581prot",
2184
+ "ets",
2185
+ "\u2581cent",
2186
+ "velop",
2187
+ "\u2581family",
2188
+ "\u2581business",
2189
+ "ety",
2190
+ "\u2581making",
2191
+ "\u2581list",
2192
+ "\u2581experi",
2193
+ "eric",
2194
+ "\u2581follow",
2195
+ "ately",
2196
+ "\u2581probably",
2197
+ "\u2581appe",
2198
+ "\u2581serv",
2199
+ "\u2581val",
2200
+ "\u2581leg",
2201
+ "\u2581resp",
2202
+ "\u2581develop",
2203
+ "ready",
2204
+ "\u2581already",
2205
+ "\u2581sec",
2206
+ "ell",
2207
+ "\u2581saying",
2208
+ "ash",
2209
+ "\u2581hear",
2210
+ "\u2581loc",
2211
+ "\u2581adv",
2212
+ "\u2581pri",
2213
+ "ret",
2214
+ "\u2581lar",
2215
+ "\u2581beh",
2216
+ "\u2581must",
2217
+ "\u2581hon",
2218
+ "\u2581means",
2219
+ "ew",
2220
+ "\u2581par",
2221
+ "\u2581order",
2222
+ "\u2581mom",
2223
+ "gn",
2224
+ "\u2581though",
2225
+ "\u2581record",
2226
+ "\u2581miss",
2227
+ "\u2581dr",
2228
+ "\u2581es",
2229
+ "\u2581eight",
2230
+ "\u2581ever",
2231
+ "\u2581left",
2232
+ "\u2581example",
2233
+ "\u2581enough",
2234
+ "osed",
2235
+ "\u2581claim",
2236
+ "ank",
2237
+ "con",
2238
+ "\u2581americ",
2239
+ "\u2581information",
2240
+ "\u2581arg",
2241
+ "\u2581full",
2242
+ "nce",
2243
+ "\u2581consid",
2244
+ "\u2581working",
2245
+ "ature",
2246
+ "\u2581",
2247
+ "e",
2248
+ "t",
2249
+ "a",
2250
+ "o",
2251
+ "i",
2252
+ "n",
2253
+ "s",
2254
+ "r",
2255
+ "h",
2256
+ "l",
2257
+ "d",
2258
+ "u",
2259
+ "c",
2260
+ "m",
2261
+ "y",
2262
+ "w",
2263
+ "g",
2264
+ "f",
2265
+ "p",
2266
+ "b",
2267
+ "v",
2268
+ "k",
2269
+ "'",
2270
+ "j",
2271
+ "x",
2272
+ "q",
2273
+ "z"
2274
+ ],
2275
+ "target": "nemo.collections.asr.models.rnnt_bpe_models.EncDecRNNTBPEModel",
2276
+ "nemo_version": "1.20.0rc0"
2277
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa45c7d7c32e649a348cd4281781c9a833bd068638ce7bec599fab85e7d0bdb9
3
+ size 4282259416
tokenizer.model ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f279c3951a1c280a99a108d07b429bd98c03c94ecd0270748bd7affca9c0817
3
+ size 259162
tokenizer.vocab ADDED
@@ -0,0 +1,1024 @@
1
+ <unk> 0
2
+ ▁t -0
3
+ ▁th -1
4
+ ▁a -2
5
+ ▁i -3
6
+ ▁the -4
7
+ re -5
8
+ ▁w -6
9
+ ▁s -7
10
+ ▁o -8
11
+ in -9
12
+ at -10
13
+ er -11
14
+ ou -12
15
+ nd -13
16
+ ▁c -14
17
+ ▁b -15
18
+ ▁h -16
19
+ on -17
20
+ ▁m -18
21
+ ▁f -19
22
+ ing -20
23
+ ▁to -21
24
+ en -22
25
+ ▁p -23
26
+ ▁and -24
27
+ ▁d -25
28
+ es -26
29
+ or -27
30
+ an -28
31
+ ll -29
32
+ ▁y -30
33
+ ▁l -31
34
+ ed -32
35
+ ▁of -33
36
+ ▁in -34
37
+ it -35
38
+ is -36
39
+ ▁you -37
40
+ ▁that -38
41
+ ar -39
42
+ ▁g -40
43
+ ▁n -41
44
+ as -42
45
+ om -43
46
+ ▁it -44
47
+ ic -45
48
+ ve -46
49
+ ▁e -47
50
+ ▁wh -48
51
+ ▁be -49
52
+ us -50
53
+ le -51
54
+ al -52
55
+ ion -53
56
+ ow -54
57
+ ▁we -55
58
+ ▁re -56
59
+ ▁is -57
60
+ ut -58
61
+ ot -59
62
+ ent -60
63
+ ▁on -61
64
+ et -62
65
+ ▁ha -63
66
+ ay -64
67
+ ct -65
68
+ ▁he -66
69
+ id -67
70
+ ▁for -68
71
+ ▁st -69
72
+ ver -70
73
+ ly -71
74
+ ro -72
75
+ ig -73
76
+ ▁so -74
77
+ ld -75
78
+ ▁this -76
79
+ ke -77
80
+ ▁u -78
81
+ se -79
82
+ all -80
83
+ st -81
84
+ ur -82
85
+ ce -83
86
+ ch -84
87
+ im -85
88
+ ith -86
89
+ ▁as -87
90
+ ▁k -88
91
+ ▁an -89
92
+ ▁was -90
93
+ ▁j -91
94
+ ▁with -92
95
+ ir -93
96
+ ▁go -94
97
+ ra -95
98
+ ▁do -96
99
+ ▁have -97
100
+ ▁li -98
101
+ ▁sh -99
102
+ ▁se -100
103
+ ▁they -101
104
+ ▁are -102
105
+ am -103
106
+ ht -104
107
+ ▁but -105
108
+ ation -106
109
+ ▁not -107
110
+ th -108
111
+ ▁r -109
112
+ ally -110
113
+ ad -111
114
+ ust -112
115
+ ▁or -113
116
+ ▁com -114
117
+ ould -115
118
+ ▁can -116
119
+ ill -117
120
+ ▁ne -118
121
+ ight -119
122
+ ▁ch -120
123
+ ▁de -121
124
+ ▁con -122
125
+ ▁at -123
126
+ ▁mo -124
127
+ ant -125
128
+ oo -126
129
+ il -127
130
+ ▁me -128
131
+ ▁what -129
132
+ ▁there -130
133
+ ter -131
134
+ pe -132
135
+ ▁ab -133
136
+ ▁su -134
137
+ ere -135
138
+ ck -136
139
+ ▁pro -137
140
+ ▁al -138
141
+ ▁fr -139
142
+ ▁kn -140
143
+ ▁all -141
144
+ ers -142
145
+ ▁like -143
146
+ ge -144
147
+ ▁ex -145
148
+ ▁som -146
149
+ ul -147
150
+ ▁your -148
151
+ ▁v -149
152
+ pp -150
153
+ use -151
154
+ ▁if -152
155
+ ess -153
156
+ ate -154
157
+ est -155
158
+ ▁know -156
159
+ out -157
160
+ if -158
161
+ ▁just -159
162
+ ment -160
163
+ qu -161
164
+ op -162
165
+ ain -163
166
+ ▁one -164
167
+ ol -165
168
+ ri -166
169
+ art -167
170
+ very -168
171
+ ▁wor -169
172
+ ive -170
173
+ ist -171
174
+ ▁my -172
175
+ nt -173
176
+ ab -174
177
+ ▁from -175
178
+ ort -176
179
+ ▁ma -177
180
+ ▁about -178
181
+ res -179
182
+ ity -180
183
+ ▁out -181
184
+ ▁bec -182
185
+ ▁le -183
186
+ our -184
187
+ od -185
188
+ and -186
189
+ ink -187
190
+ ie -188
191
+ ▁up -189
192
+ ind -190
193
+ os -191
194
+ un -192
195
+ ause -193
196
+ oug -194
197
+ um -195
198
+ ▁some -196
199
+ ▁int -197
200
+ ▁by -198
201
+ ▁pl -199
202
+ ▁get -200
203
+ el -201
204
+ ard -202
205
+ ▁when -203
206
+ ▁don -204
207
+ her -205
208
+ ▁will -206
209
+ ▁us -207
210
+ ▁would -208
211
+ ook -209
212
+ ies -210
213
+ ich -211
214
+ ▁because -212
215
+ ▁think -213
216
+ em -214
217
+ ▁pe -215
218
+ ▁his -216
219
+ ack -217
220
+ ▁then -218
221
+ ▁our -219
222
+ ide -220
223
+ ▁tim -221
224
+ ▁how -222
225
+ ven -223
226
+ ▁tr -224
227
+ ▁who -225
228
+ ▁them -226
229
+ ure -227
230
+ ▁ar -228
231
+ ▁ye -229
232
+ ▁more -230
233
+ ▁going -231
234
+ ect -232
235
+ ▁sa -233
236
+ ▁cl -234
237
+ ▁had -235
238
+ ▁now -236
239
+ ▁which -237
240
+ ▁here -238
241
+ ous -239
242
+ ▁their -240
243
+ ▁tw -241
244
+ so -242
245
+ ▁has -243
246
+ ud -244
247
+ ▁co -245
248
+ ▁ta -246
249
+ ound -247
250
+ ▁were -248
251
+ ast -249
252
+ ▁peop -250
253
+ ough -251
254
+ ▁no -252
255
+ ▁really -253
256
+ ▁any -254
257
+ ▁people -255
258
+ ▁want -256
259
+ ▁she -257
260
+ ▁en -258
261
+ ▁fa -259
262
+ ▁te -260
263
+ ame -261
264
+ ine -262
265
+ ▁qu -263
266
+ red -264
267
+ ▁im -265
268
+ ▁right -266
269
+ ther -267
270
+ ▁act -268
271
+ ▁thing -269
272
+ king -270
273
+ ose -271
274
+ ▁ad -272
275
+ ▁see -273
276
+ ▁time -274
277
+ ▁these -275
278
+ ci -276
279
+ one -277
280
+ ▁say -278
281
+ ▁also -279
282
+ ▁fe -280
283
+ per -281
284
+ ▁ag -282
285
+ ▁man -283
286
+ ore -284
287
+ ▁un -285
288
+ pt -286
289
+ ▁her -287
290
+ ▁look -288
291
+ ong -289
292
+ ice -290
293
+ ▁very -291
294
+ ff -292
295
+ ions -293
296
+ ▁comp -294
297
+ ▁did -295
298
+ itt -296
299
+ ▁well -297
300
+ ▁other -298
301
+ iv -299
302
+ ase -300
303
+ ree -301
304
+ hing -302
305
+ ▁lo -303
306
+ reat -304
307
+ ▁cont -305
308
+ ▁part -306
309
+ ▁into -307
310
+ nder -308
311
+ ▁been -309
312
+ are -310
313
+ ▁am -311
314
+ ans -312
315
+ ▁sp -313
316
+ ▁two -314
317
+ ue -315
318
+ ▁way -316
319
+ age -317
320
+ ▁where -318
321
+ ite -319
322
+ ▁dis -320
323
+ ▁than -321
324
+ ▁every -322
325
+ ▁pr -323
326
+ ▁po -324
327
+ ag -325
328
+ ▁need -326
329
+ ach -327
330
+ iff -328
331
+ ence -329
332
+ pl -330
333
+ own -331
334
+ ▁ac -332
335
+ ble -333
336
+ ▁over -334
337
+ iz -335
338
+ ▁work -336
339
+ ▁res -337
340
+ ▁make -338
341
+ ▁could -339
342
+ ▁off -340
343
+ ually -341
344
+ ▁ro -342
345
+ ▁back -343
346
+ able -344
347
+ ip -345
348
+ ry -346
349
+ ▁him -347
350
+ ▁cour -348
351
+ ber -349
352
+ ▁pre -350
353
+ ▁fir -351
354
+ ▁spe -352
355
+ ap -353
356
+ ars -354
357
+ ▁diff -355
358
+ ire -356
359
+ ▁somet -357
360
+ ▁imp -358
361
+ ▁those -359
362
+ ▁comm -360
363
+ ance -361
364
+ ick -362
365
+ ▁even -363
366
+ ated -364
367
+ way -365
368
+ sel -366
369
+ ▁let -367
370
+ ▁br -368
371
+ ty -369
372
+ ▁per -370
373
+ int -371
374
+ ▁first -372
375
+ ▁thr -373
376
+ ▁under -374
377
+ ah -375
378
+ ▁may -376
379
+ ▁cou -377
380
+ ▁new -378
381
+ ress -379
382
+ act -380
383
+ ▁gr -381
384
+ ep -382
385
+ ▁said -383
386
+ ations -384
387
+ ▁good -385
388
+ ace -386
389
+ ass -387
390
+ ▁does -388
391
+ orm -389
392
+ ish -390
393
+ ▁af -391
394
+ ving -392
395
+ co -393
396
+ ▁app -394
397
+ ▁lot -395
398
+ ▁things -396
399
+ ▁tra -397
400
+ ittle -398
401
+ ▁bl -399
402
+ ▁little -400
403
+ ▁mu -401
404
+ cess -402
405
+ fe -403
406
+ ome -404
407
+ ▁inc -405
408
+ ▁differe -406
409
+ ary -407
410
+ ical -408
411
+ ▁only -409
412
+ ult -410
413
+ ▁again -411
414
+ ▁got -412
415
+ ens -413
416
+ ▁gu -414
417
+ ▁kind -415
418
+ ▁much -416
419
+ ord -417
420
+ ▁through -418
421
+ ition -419
422
+ ild -420
423
+ ▁down -421
424
+ ▁actually -422
425
+ ▁something -423
426
+ ang -424
427
+ ru -425
428
+ ces -426
429
+ ▁fl -427
430
+ ile -428
431
+ ater -429
432
+ ▁ra -430
433
+ ▁take -431
434
+ ict -432
435
+ ign -433
436
+ ▁sc -434
437
+ vel -435
438
+ ▁bet -436
439
+ ▁tal -437
440
+ ▁yeah -438
441
+ ▁use -439
442
+ fore -440
443
+ ▁bu -441
444
+ ▁start -442
445
+ ory -443
446
+ be -444
447
+ ▁day -445
448
+ wn -446
449
+ xt -447
450
+ ia -448
451
+ ak -449
452
+ ▁after -450
453
+ ▁should -451
454
+ ▁fo -452
455
+ ▁ho -453
456
+ ▁hel -454
457
+ ▁ind -455
458
+ ▁uh -456
459
+ na -457
460
+ ial -458
461
+ other -459
462
+ ▁ke -460
463
+ ▁call -461
464
+ ▁most -462
465
+ ▁ok -463
466
+ ▁different -464
467
+ ▁em -465
468
+ ting -466
469
+ ple -467
470
+ ▁being -468
471
+ ▁bo -469
472
+ ning -470
473
+ ▁too -471
474
+ ors -472
475
+ ▁happ -473
476
+ ark -474
477
+ og -475
478
+ ▁help -476
479
+ ▁rem -477
480
+ du -478
481
+ ction -479
482
+ ood -480
483
+ ▁ser -481
484
+ ether -482
485
+ ious -483
486
+ ▁mean -484
487
+ ▁many -485
488
+ ▁court -486
489
+ ▁bel -487
490
+ ade -488
491
+ ▁la -489
492
+ ved -490
493
+ ▁des -491
494
+ ▁rec -492
495
+ ▁jo -493
496
+ ▁dec -494
497
+ ves -495
498
+ ▁before -496
499
+ ▁put -497
500
+ self -498
501
+ ▁point -499
502
+ te -500
503
+ ▁ev -501
504
+ form -502
505
+ ents -503
506
+ ▁add -504
507
+ ody -505
508
+ thing -506
509
+ ▁case -507
510
+ ▁pers -508
511
+ ▁cons -509
512
+ iss -510
513
+ ▁three -511
514
+ oth -512
515
+ ▁ph -513
516
+ ▁come -514
517
+ ▁find -515
518
+ ▁why -516
519
+ ull -517
520
+ ▁show -518
521
+ ▁bas -519
522
+ ▁great -520
523
+ ily -521
524
+ ▁rel -522
525
+ ▁sm -523
526
+ ▁its -524
527
+ ▁fact -525
528
+ ▁pos -526
529
+ ool -527
530
+ ments -528
531
+ ise -529
532
+ nds -530
533
+ ys -531
534
+ ▁try -532
535
+ ual -533
536
+ ful -534
537
+ erm -535
538
+ ▁inter -536
539
+ ons -537
540
+ ▁quest -538
541
+ ▁sub -539
542
+ we -540
543
+ vers -541
544
+ ▁supp -542
545
+ ▁feel -543
546
+ ▁same -544
547
+ ub -545
548
+ ates -546
549
+ urn -547
550
+ ert -548
551
+ ▁inv -549
552
+ day -550
553
+ ▁rep -551
554
+ igh -552
555
+ ▁sy -553
556
+ ▁inst -554
557
+ ▁long -555
558
+ ▁still -556
559
+ ▁okay -557
560
+ ft -558
561
+ ific -559
562
+ atch -560
563
+ ought -561
564
+ ath -562
565
+ ▁own -563
566
+ ▁made -564
567
+ ix -565
568
+ ced -566
569
+ ks -567
570
+ lic -568
571
+ ▁wr -569
572
+ de -570
573
+ ▁cr -571
574
+ ▁att -572
575
+ ▁ob -573
576
+ ▁world -574
577
+ ▁sure -575
578
+ ward -576
579
+ ▁bit -577
580
+ ▁life -578
581
+ ▁person -579
582
+ ▁pres -580
583
+ ph -581
584
+ ▁vide -582
585
+ ▁reg -583
586
+ ▁end -584
587
+ ject -585
588
+ ange -586
589
+ ▁fin -587
590
+ ied -588
591
+ pect -589
592
+ ▁didn -590
593
+ ▁around -591
594
+ ian -592
595
+ ▁car -593
596
+ ible -594
597
+ ▁sim -595
598
+ ever -596
599
+ ▁sch -597
600
+ ating -598
601
+ ▁pol -599
602
+ ▁set -600
603
+ ▁oh -601
604
+ cy -602
605
+ ▁real -603
606
+ ▁import -604
607
+ ▁count -605
608
+ ▁um -606
609
+ ▁next -607
610
+ cial -608
611
+ les -609
612
+ ▁hu -610
613
+ ▁acc -611
614
+ ▁might -612
615
+ ▁ent -613
616
+ ▁doing -614
617
+ ▁ins -615
618
+ ▁gen -616
619
+ ▁play -617
620
+ ▁cle -618
621
+ ▁another -619
622
+ ady -620
623
+ ular -621
624
+ ib -622
625
+ ways -623
626
+ ered -624
627
+ ility -625
628
+ ities -626
629
+ ▁op -627
630
+ ▁def -628
631
+ ▁years -629
632
+ ▁never -630
633
+ ower -631
634
+ ram -632
635
+ ▁tell -633
636
+ ▁sl -634
637
+ onna -635
638
+ ail -636
639
+ ren -637
640
+ ute -638
641
+ ▁gonna -639
642
+ ▁big -640
643
+ ▁give -641
644
+ der -642
645
+ ount -643
646
+ ▁ap -644
647
+ kes -645
648
+ ▁state -646
649
+ ▁cor -647
650
+ ▁min -648
651
+ ically -649
652
+ ▁mon -650
653
+ ▁fam -651
654
+ ▁important -652
655
+ ▁always -653
656
+ ▁high -654
657
+ ▁four -655
658
+ ▁gra -656
659
+ ▁ca -657
660
+ ▁stud -658
661
+ ▁dist -659
662
+ ▁talk -660
663
+ ▁num -661
664
+ ▁str -662
665
+ ▁today -663
666
+ ract -664
667
+ ▁while -665
668
+ ason -666
669
+ ▁iss -667
670
+ ▁sur -668
671
+ ▁char -669
672
+ ▁last -670
673
+ oy -671
674
+ ited -672
675
+ ▁exper -673
676
+ ▁place -674
677
+ ▁tri -675
678
+ ▁ear -676
679
+ ▁belie -677
680
+ ▁able -678
681
+ ▁underst -679
682
+ ▁che -680
683
+ ▁both -681
684
+ ug -682
685
+ ▁doesn -683
686
+ ▁keep -684
687
+ ▁happen -685
688
+ ings -686
689
+ iew -687
690
+ ather -688
691
+ ▁ass -689
692
+ ▁love -690
693
+ ative -691
694
+ av -692
695
+ ▁yes -693
696
+ ▁ele -694
697
+ ▁year -695
698
+ ▁such -696
699
+ ▁video -697
700
+ ness -698
701
+ ▁el -699
702
+ ▁trans -700
703
+ ▁five -701
704
+ ▁produ -702
705
+ ave -703
706
+ erest -704
707
+ als -705
708
+ body -706
709
+ cus -707
710
+ ▁found -708
711
+ atter -709
712
+ ▁eff -710
713
+ ▁god -711
714
+ ▁used -712
715
+ llow -713
716
+ ▁interest -714
717
+ ▁question -715
718
+ hip -716
719
+ ▁bus -717
720
+ ▁ask -718
721
+ ▁exam -719
722
+ ▁prov -720
723
+ lud -721
724
+ ▁form -722
725
+ ▁law -723
726
+ ense -724
727
+ ▁child -725
728
+ ▁gl -726
729
+ ne -727
730
+ ▁each -728
731
+ ▁understand -729
732
+ ▁care -730
733
+ stem -731
734
+ ▁med -732
735
+ ▁maybe -733
736
+ ably -734
737
+ ▁det -735
738
+ ▁coll -736
739
+ its -737
740
+ ▁commun -738
741
+ ▁hand -739
742
+ ▁' -740
743
+ ▁ref -741
744
+ ▁lear -742
745
+ ▁done -743
746
+ ▁gener -744
747
+ vern -745
748
+ ▁mr -746
749
+ ween -747
750
+ ▁better -748
751
+ ▁between -749
752
+ li -750
753
+ blem -751
754
+ ▁system -752
755
+ ertain -753
756
+ ▁school -754
757
+ ▁eas -755
758
+ ▁exp -756
759
+ ▁war -757
760
+ ention -758
761
+ ▁ty -759
762
+ ▁govern -760
763
+ ues -761
764
+ ▁problem -762
765
+ ▁plan -763
766
+ ac -764
767
+ ▁conf -765
768
+ ▁course -766
769
+ ouse -767
770
+ ▁mar -768
771
+ ▁stand -769
772
+ ▁sk -770
773
+ ▁seco -771
774
+ uring -772
775
+ ▁ed -773
776
+ ▁mem -774
777
+ ros -775
778
+ cri -776
779
+ ▁thought -777
780
+ cept -778
781
+ ▁partic -779
782
+ ▁test -780
783
+ olog -781
784
+ iness -782
785
+ ▁far -783
786
+ led -784
787
+ ▁col -785
788
+ ▁looking -786
789
+ ▁read -787
790
+ ▁whether -788
791
+ ▁word -789
792
+ me -790
793
+ ▁once -791
794
+ ize -792
795
+ ▁home -793
796
+ ▁requ -794
797
+ gg -795
798
+ ▁ide -796
799
+ ▁thank -797
800
+ ures -798
801
+ ▁called -799
802
+ ▁cur -800
803
+ ▁water -801
804
+ ▁frie -802
805
+ ▁side -803
806
+ ▁best -804
807
+ ▁number -805
808
+ oney -806
809
+ ▁turn -807
810
+ ock -808
811
+ ▁eng -809
812
+ ▁top -810
813
+ ▁open -811
814
+ ead -812
815
+ ▁everything -813
816
+ ▁term -814
817
+ ▁prob -815
818
+ ▁hard -816
819
+ ▁fun -817
820
+ ▁spec -818
821
+ ▁dire -819
822
+ ▁second -820
823
+ ▁pa -821
824
+ ▁build -822
825
+ ▁run -823
826
+ ▁sign -824
827
+ ▁reason -825
828
+ ▁inform -826
829
+ ▁watch -827
830
+ ution -828
831
+ ▁few -829
832
+ mo -830
833
+ ▁hum -831
834
+ ision -832
835
+ ▁ext -833
836
+ ▁tog -834
837
+ ▁conc -835
838
+ ▁thous -836
839
+ ▁thousand -837
840
+ ▁support -838
841
+ ▁together -839
842
+ ▁six -840
843
+ ps -841
844
+ ▁mark -842
845
+ ics -843
846
+ ▁includ -844
847
+ ef -845
848
+ ▁opp -846
849
+ ident -847
850
+ ▁anything -848
851
+ ▁met -849
852
+ ▁bre -850
853
+ ▁jud -851
854
+ ▁away -852
855
+ ▁old -853
856
+ ▁prog -854
857
+ ten -855
858
+ ▁book -856
859
+ ▁says -857
860
+ ▁seem -858
861
+ ▁contin -859
862
+ ▁process -860
863
+ ▁sing -861
864
+ ▁money -862
865
+ ▁having -863
866
+ ▁beg -864
867
+ ▁comple -865
868
+ ▁thir -866
869
+ ▁using -867
870
+ ▁ret -868
871
+ ger -869
872
+ ▁head -870
873
+ ▁cre -871
874
+ ▁poss -872
875
+ enty -873
876
+ ▁certain -874
877
+ ▁clear -875
878
+ ines -876
879
+ ▁wee -877
880
+ arch -878
881
+ ▁inf -879
882
+ ont -880
883
+ ▁sit -881
884
+ ▁lead -882
885
+ alth -883
886
+ ▁art -884
887
+ ross -885
888
+ ▁pub -886
889
+ ▁without -887
890
+ ▁pret -888
891
+ ▁getting -889
892
+ ient -890
893
+ ▁z -891
894
+ ▁wom -892
895
+ ▁power -893
896
+ ational -894
897
+ ner -895
898
+ ▁rest -896
899
+ ▁believe -897
900
+ ▁wa -898
901
+ ▁aut -899
902
+ ▁move -900
903
+ aim -901
904
+ ▁sort -902
905
+ idence -903
906
+ ▁creat -904
907
+ ▁expl -905
908
+ ▁name -906
909
+ ▁went -907
910
+ ▁eu -908
911
+ ▁change -909
912
+ ▁came -910
913
+ ▁pay -911
914
+ ices -912
915
+ ▁sin -913
916
+ ▁pur -914
917
+ ▁pass -915
918
+ ▁whole -916
919
+ ▁house -917
920
+ ▁hund -918
921
+ ▁hundred -919
922
+ ▁pretty -920
923
+ ▁trying -921
924
+ ▁ple -922
925
+ ▁allow -923
926
+ ▁compan -924
927
+ ▁government -925
928
+ ▁small -926
929
+ ▁light -927
930
+ ▁bra -928
931
+ ▁stu -929
932
+ aint -930
933
+ ▁ah -931
934
+ ▁prot -932
935
+ ets -933
936
+ ▁cent -934
937
+ velop -935
938
+ ▁family -936
939
+ ▁business -937
940
+ ety -938
941
+ ▁making -939
942
+ ▁list -940
943
+ ▁experi -941
944
+ eric -942
945
+ ▁follow -943
946
+ ately -944
947
+ ▁probably -945
948
+ ▁appe -946
949
+ ▁serv -947
950
+ ▁val -948
951
+ ▁leg -949
952
+ ▁resp -950
953
+ ▁develop -951
954
+ ready -952
955
+ ▁already -953
956
+ ▁sec -954
957
+ ell -955
958
+ ▁saying -956
959
+ ash -957
960
+ ▁hear -958
961
+ ▁loc -959
962
+ ▁adv -960
963
+ ▁pri -961
964
+ ret -962
965
+ ▁lar -963
966
+ ▁beh -964
967
+ ▁must -965
968
+ ▁hon -966
969
+ ▁means -967
970
+ ew -968
971
+ ▁par -969
972
+ ▁order -970
973
+ ▁mom -971
974
+ gn -972
975
+ ▁though -973
976
+ ▁record -974
977
+ ▁miss -975
978
+ ▁dr -976
979
+ ▁es -977
980
+ ▁eight -978
981
+ ▁ever -979
982
+ ▁left -980
983
+ ▁example -981
984
+ ▁enough -982
985
+ osed -983
986
+ ▁claim -984
987
+ ank -985
988
+ con -986
989
+ ▁americ -987
990
+ ▁information -988
991
+ ▁arg -989
992
+ ▁full -990
993
+ nce -991
994
+ ▁consid -992
995
+ ▁working -993
996
+ ature -994
997
+ ▁ -995
998
+ e -996
999
+ t -997
1000
+ a -998
1001
+ o -999
1002
+ i -1000
1003
+ n -1001
1004
+ s -1002
1005
+ r -1003
1006
+ h -1004
1007
+ l -1005
1008
+ d -1006
1009
+ u -1007
1010
+ c -1008
1011
+ m -1009
1012
+ y -1010
1013
+ w -1011
1014
+ g -1012
1015
+ f -1013
1016
+ p -1014
1017
+ b -1015
1018
+ v -1016
1019
+ k -1017
1020
+ ' -1018
1021
+ j -1019
1022
+ x -1020
1023
+ q -1021
1024
+ z -1022
vocab.txt ADDED
@@ -0,0 +1,1023 @@
1
+ t
2
+ th
3
+ a
4
+ i
5
+ the
6
+ ##re
7
+ w
8
+ s
9
+ o
10
+ ##in
11
+ ##at
12
+ ##er
13
+ ##ou
14
+ ##nd
15
+ c
16
+ b
17
+ h
18
+ ##on
19
+ m
20
+ f
21
+ ##ing
22
+ to
23
+ ##en
24
+ p
25
+ and
26
+ d
27
+ ##es
28
+ ##or
29
+ ##an
30
+ ##ll
31
+ y
32
+ l
33
+ ##ed
34
+ of
35
+ in
36
+ ##it
37
+ ##is
38
+ you
39
+ that
40
+ ##ar
41
+ g
42
+ n
43
+ ##as
44
+ ##om
45
+ it
46
+ ##ic
47
+ ##ve
48
+ e
49
+ wh
50
+ be
51
+ ##us
52
+ ##le
53
+ ##al
54
+ ##ion
55
+ ##ow
56
+ we
57
+ re
58
+ is
59
+ ##ut
60
+ ##ot
61
+ ##ent
62
+ on
63
+ ##et
64
+ ha
65
+ ##ay
66
+ ##ct
67
+ he
68
+ ##id
69
+ for
70
+ st
71
+ ##ver
72
+ ##ly
73
+ ##ro
74
+ ##ig
75
+ so
76
+ ##ld
77
+ this
78
+ ##ke
79
+ u
80
+ ##se
81
+ ##all
82
+ ##st
83
+ ##ur
84
+ ##ce
85
+ ##ch
86
+ ##im
87
+ ##ith
88
+ as
89
+ k
90
+ an
91
+ was
92
+ j
93
+ with
94
+ ##ir
95
+ go
96
+ ##ra
97
+ do
98
+ have
99
+ li
100
+ sh
101
+ se
102
+ they
103
+ are
104
+ ##am
105
+ ##ht
106
+ but
107
+ ##ation
108
+ not
109
+ ##th
110
+ r
111
+ ##ally
112
+ ##ad
113
+ ##ust
114
+ or
115
+ com
116
+ ##ould
117
+ can
118
+ ##ill
119
+ ne
120
+ ##ight
121
+ ch
122
+ de
123
+ con
124
+ at
125
+ mo
126
+ ##ant
127
+ ##oo
128
+ ##il
129
+ me
130
+ what
131
+ there
132
+ ##ter
133
+ ##pe
134
+ ab
135
+ su
136
+ ##ere
137
+ ##ck
138
+ pro
139
+ al
140
+ fr
141
+ kn
142
+ all
143
+ ##ers
144
+ like
145
+ ##ge
146
+ ex
147
+ som
148
+ ##ul
149
+ your
150
+ v
151
+ ##pp
152
+ ##use
153
+ if
154
+ ##ess
155
+ ##ate
156
+ ##est
157
+ know
158
+ ##out
159
+ ##if
160
+ just
161
+ ##ment
162
+ ##qu
163
+ ##op
164
+ ##ain
165
+ one
166
+ ##ol
167
+ ##ri
168
+ ##art
169
+ ##very
170
+ wor
171
+ ##ive
172
+ ##ist
173
+ my
174
+ ##nt
175
+ ##ab
176
+ from
177
+ ##ort
178
+ ma
179
+ about
180
+ ##res
181
+ ##ity
182
+ out
183
+ bec
184
+ le
185
+ ##our
186
+ ##od
187
+ ##and
188
+ ##ink
189
+ ##ie
190
+ up
191
+ ##ind
192
+ ##os
193
+ ##un
194
+ ##ause
195
+ ##oug
196
+ ##um
197
+ some
198
+ int
199
+ by
200
+ pl
201
+ get
202
+ ##el
203
+ ##ard
204
+ when
205
+ don
206
+ ##her
207
+ will
208
+ us
209
+ would
210
+ ##ook
211
+ ##ies
212
+ ##ich
213
+ because
214
+ think
215
+ ##em
216
+ pe
217
+ his
218
+ ##ack
219
+ then
220
+ our
221
+ ##ide
222
+ tim
223
+ how
224
+ ##ven
225
+ tr
226
+ who
227
+ them
228
+ ##ure
229
+ ar
230
+ ye
231
+ more
232
+ going
233
+ ##ect
234
+ sa
235
+ cl
236
+ had
237
+ now
238
+ which
239
+ here
240
+ ##ous
241
+ their
242
+ tw
243
+ ##so
244
+ has
245
+ ##ud
246
+ co
247
+ ta
248
+ ##ound
249
+ were
250
+ ##ast
251
+ peop
252
+ ##ough
253
+ no
254
+ really
255
+ any
256
+ people
257
+ want
258
+ she
259
+ en
260
+ fa
261
+ te
262
+ ##ame
263
+ ##ine
264
+ qu
265
+ ##red
266
+ im
267
+ right
268
+ ##ther
269
+ act
270
+ thing
271
+ ##king
272
+ ##ose
273
+ ad
274
+ see
275
+ time
276
+ these
277
+ ##ci
278
+ ##one
279
+ say
280
+ also
281
+ fe
282
+ ##per
283
+ ag
284
+ man
285
+ ##ore
286
+ un
287
+ ##pt
288
+ her
289
+ look
290
+ ##ong
291
+ ##ice
292
+ very
293
+ ##ff
294
+ ##ions
295
+ comp
296
+ did
297
+ ##itt
298
+ well
299
+ other
300
+ ##iv
301
+ ##ase
302
+ ##ree
303
+ ##hing
304
+ lo
305
+ ##reat
306
+ cont
307
+ part
308
+ into
309
+ ##nder
310
+ been
311
+ ##are
312
+ am
313
+ ##ans
314
+ sp
315
+ two
316
+ ##ue
317
+ way
318
+ ##age
319
+ where
320
+ ##ite
321
+ dis
322
+ than
323
+ every
324
+ pr
325
+ po
326
+ ##ag
327
+ need
328
+ ##ach
329
+ ##iff
330
+ ##ence
331
+ ##pl
332
+ ##own
333
+ ac
334
+ ##ble
335
+ over
336
+ ##iz
337
+ work
338
+ res
339
+ make
340
+ could
341
+ off
342
+ ##ually
343
+ ro
344
+ back
345
+ ##able
346
+ ##ip
347
+ ##ry
348
+ him
349
+ cour
350
+ ##ber
351
+ pre
352
+ fir
353
+ spe
354
+ ##ap
355
+ ##ars
356
+ diff
357
+ ##ire
358
+ somet
359
+ imp
360
+ those
361
+ comm
362
+ ##ance
363
+ ##ick
364
+ even
365
+ ##ated
366
+ ##way
367
+ ##sel
368
+ let
369
+ br
370
+ ##ty
371
+ per
372
+ ##int
373
+ first
374
+ thr
375
+ under
376
+ ##ah
377
+ may
378
+ cou
379
+ new
380
+ ##ress
381
+ ##act
382
+ gr
383
+ ##ep
384
+ said
385
+ ##ations
386
+ good
387
+ ##ace
388
+ ##ass
389
+ does
390
+ ##orm
391
+ ##ish
392
+ af
393
+ ##ving
394
+ ##co
395
+ app
396
+ lot
397
+ things
398
+ tra
399
+ ##ittle
400
+ bl
401
+ little
402
+ mu
403
+ ##cess
404
+ ##fe
405
+ ##ome
406
+ inc
407
+ differe
408
+ ##ary
409
+ ##ical
410
+ only
411
+ ##ult
412
+ again
413
+ got
414
+ ##ens
415
+ gu
416
+ kind
417
+ much
418
+ ##ord
419
+ through
420
+ ##ition
421
+ ##ild
422
+ down
423
+ actually
424
+ something
425
+ ##ang
426
+ ##ru
427
+ ##ces
428
+ fl
429
+ ##ile
430
+ ##ater
431
+ ra
432
+ take
433
+ ##ict
434
+ ##ign
435
+ sc
436
+ ##vel
437
+ bet
438
+ tal
439
+ yeah
440
+ use
441
+ ##fore
442
+ bu
443
+ start
444
+ ##ory
445
+ ##be
446
+ day
447
+ ##wn
448
+ ##xt
449
+ ##ia
450
+ ##ak
451
+ after
452
+ should
453
+ fo
454
+ ho
455
+ hel
456
+ ind
457
+ uh
458
+ ##na
459
+ ##ial
460
+ ##other
461
+ ke
462
+ call
463
+ most
464
+ ok
465
+ different
466
+ em
467
+ ##ting
468
+ ##ple
469
+ being
470
+ bo
471
+ ##ning
472
+ too
473
+ ##ors
474
+ happ
475
+ ##ark
476
+ ##og
477
+ help
478
+ rem
479
+ ##du
480
+ ##ction
481
+ ##ood
482
+ ser
483
+ ##ether
484
+ ##ious
485
+ mean
486
+ many
487
+ court
488
+ bel
489
+ ##ade
490
+ la
491
+ ##ved
492
+ des
493
+ rec
494
+ jo
495
+ dec
496
+ ##ves
497
+ before
498
+ put
499
+ ##self
500
+ point
501
+ ##te
502
+ ev
503
+ ##form
504
+ ##ents
505
+ add
506
+ ##ody
507
+ ##thing
508
+ case
509
+ pers
510
+ cons
511
+ ##iss
512
+ three
513
+ ##oth
514
+ ph
515
+ come
516
+ find
517
+ why
518
+ ##ull
519
+ show
520
+ bas
521
+ great
522
+ ##ily
523
+ rel
524
+ sm
525
+ its
526
+ fact
527
+ pos
528
+ ##ool
529
+ ##ments
530
+ ##ise
531
+ ##nds
532
+ ##ys
533
+ try
534
+ ##ual
535
+ ##ful
536
+ ##erm
537
+ inter
538
+ ##ons
539
+ quest
540
+ sub
541
+ ##we
542
+ ##vers
543
+ supp
544
+ feel
545
+ same
546
+ ##ub
547
+ ##ates
548
+ ##urn
549
+ ##ert
550
+ inv
551
+ ##day
552
+ rep
553
+ ##igh
554
+ sy
555
+ inst
556
+ long
557
+ still
558
+ okay
559
+ ##ft
560
+ ##ific
561
+ ##atch
562
+ ##ought
563
+ ##ath
564
+ own
565
+ made
566
+ ##ix
567
+ ##ced
568
+ ##ks
569
+ ##lic
570
+ wr
571
+ ##de
572
+ cr
573
+ att
574
+ ob
575
+ world
576
+ sure
577
+ ##ward
578
+ bit
579
+ life
580
+ person
581
+ pres
582
+ ##ph
583
+ vide
584
+ reg
585
+ end
586
+ ##ject
587
+ ##ange
588
+ fin
589
+ ##ied
590
+ ##pect
591
+ didn
592
+ around
593
+ ##ian
594
+ car
595
+ ##ible
596
+ sim
597
+ ##ever
598
+ sch
599
+ ##ating
600
+ pol
601
+ set
602
+ oh
603
+ ##cy
604
+ real
605
+ import
606
+ count
607
+ um
608
+ next
609
+ ##cial
610
+ ##les
611
+ hu
612
+ acc
613
+ might
614
+ ent
615
+ doing
616
+ ins
617
+ gen
618
+ play
619
+ cle
620
+ another
621
+ ##ady
622
+ ##ular
623
+ ##ib
624
+ ##ways
625
+ ##ered
626
+ ##ility
627
+ ##ities
628
+ op
629
+ def
630
+ years
631
+ never
632
+ ##ower
633
+ ##ram
634
+ tell
635
+ sl
636
+ ##onna
637
+ ##ail
638
+ ##ren
639
+ ##ute
640
+ gonna
641
+ big
642
+ give
643
+ ##der
644
+ ##ount
645
+ ap
646
+ ##kes
647
+ state
648
+ cor
649
+ min
650
+ ##ically
651
+ mon
652
+ fam
653
+ important
654
+ always
655
+ high
656
+ four
657
+ gra
658
+ ca
659
+ stud
660
+ dist
661
+ talk
662
+ num
663
+ str
664
+ today
665
+ ##ract
666
+ while
667
+ ##ason
668
+ iss
669
+ sur
670
+ char
671
+ last
672
+ ##oy
673
+ ##ited
674
+ exper
675
+ place
676
+ tri
677
+ ear
678
+ belie
679
+ able
680
+ underst
681
+ che
682
+ both
683
+ ##ug
684
+ doesn
685
+ keep
686
+ happen
687
+ ##ings
688
+ ##iew
689
+ ##ather
690
+ ass
691
+ love
692
+ ##ative
693
+ ##av
694
+ yes
695
+ ele
696
+ year
697
+ such
698
+ video
699
+ ##ness
700
+ el
701
+ trans
702
+ five
703
+ produ
704
+ ##ave
705
+ ##erest
706
+ ##als
707
+ ##body
708
+ ##cus
709
+ found
710
+ ##atter
711
+ eff
712
+ god
713
+ used
714
+ ##llow
715
+ interest
716
+ question
717
+ ##hip
718
+ bus
719
+ ask
720
+ exam
721
+ prov
722
+ ##lud
723
+ form
724
+ law
725
+ ##ense
726
+ child
727
+ gl
728
+ ##ne
729
+ each
730
+ understand
731
+ care
732
+ ##stem
733
+ med
734
+ maybe
735
+ ##ably
736
+ det
737
+ coll
738
+ ##its
739
+ commun
740
+ hand
741
+ '
742
+ ref
743
+ lear
744
+ done
745
+ gener
746
+ ##vern
747
+ mr
748
+ ##ween
749
+ better
750
+ between
751
+ ##li
752
+ ##blem
753
+ system
754
+ ##ertain
755
+ school
756
+ eas
757
+ exp
758
+ war
759
+ ##ention
760
+ ty
761
+ govern
762
+ ##ues
763
+ problem
764
+ plan
765
+ ##ac
766
+ conf
767
+ course
768
+ ##ouse
769
+ mar
770
+ stand
771
+ sk
772
+ seco
773
+ ##uring
774
+ ed
775
+ mem
776
+ ##ros
777
+ ##cri
778
+ thought
779
+ ##cept
780
+ partic
781
+ test
782
+ ##olog
783
+ ##iness
784
+ far
785
+ ##led
786
+ col
787
+ looking
788
+ read
789
+ whether
790
+ word
791
+ ##me
792
+ once
793
+ ##ize
794
+ home
795
+ requ
796
+ ##gg
797
+ ide
798
+ thank
799
+ ##ures
800
+ called
801
+ cur
802
+ water
803
+ frie
804
+ side
805
+ best
806
+ number
807
+ ##oney
808
+ turn
809
+ ##ock
810
+ eng
811
+ top
812
+ open
813
+ ##ead
814
+ everything
815
+ term
816
+ prob
817
+ hard
818
+ fun
819
+ spec
820
+ dire
821
+ second
822
+ pa
823
+ build
824
+ run
825
+ sign
826
+ reason
827
+ inform
828
+ watch
829
+ ##ution
830
+ few
831
+ ##mo
832
+ hum
833
+ ##ision
834
+ ext
835
+ tog
836
+ conc
837
+ thous
838
+ thousand
839
+ support
840
+ together
841
+ six
842
+ ##ps
843
+ mark
844
+ ##ics
845
+ includ
846
+ ##ef
847
+ opp
848
+ ##ident
849
+ anything
850
+ met
851
+ bre
852
+ jud
853
+ away
854
+ old
855
+ prog
856
+ ##ten
857
+ book
858
+ says
859
+ seem
860
+ contin
861
+ process
862
+ sing
863
+ money
864
+ having
865
+ beg
866
+ comple
867
+ thir
868
+ using
869
+ ret
870
+ ##ger
871
+ head
872
+ cre
873
+ poss
874
+ ##enty
875
+ certain
876
+ clear
877
+ ##ines
878
+ wee
879
+ ##arch
880
+ inf
881
+ ##ont
882
+ sit
883
+ lead
884
+ ##alth
885
+ art
886
+ ##ross
887
+ pub
888
+ without
889
+ pret
890
+ getting
891
+ ##ient
892
+ z
893
+ wom
894
+ power
895
+ ##ational
896
+ ##ner
897
+ rest
898
+ believe
899
+ wa
900
+ aut
901
+ move
902
+ ##aim
903
+ sort
904
+ ##idence
905
+ creat
906
+ expl
907
+ name
908
+ went
909
+ eu
910
+ change
911
+ came
912
+ pay
913
+ ##ices
914
+ sin
915
+ pur
916
+ pass
917
+ whole
918
+ house
919
+ hund
920
+ hundred
921
+ pretty
922
+ trying
923
+ ple
924
+ allow
925
+ compan
926
+ government
927
+ small
928
+ light
929
+ bra
930
+ stu
931
+ ##aint
932
+ ah
933
+ prot
934
+ ##ets
935
+ cent
936
+ ##velop
937
+ family
938
+ business
939
+ ##ety
940
+ making
941
+ list
942
+ experi
943
+ ##eric
944
+ follow
945
+ ##ately
946
+ probably
947
+ appe
948
+ serv
949
+ val
950
+ leg
951
+ resp
952
+ develop
953
+ ##ready
954
+ already
955
+ sec
956
+ ##ell
957
+ saying
958
+ ##ash
959
+ hear
960
+ loc
961
+ adv
962
+ pri
963
+ ##ret
964
+ lar
965
+ beh
966
+ must
967
+ hon
968
+ means
969
+ ##ew
970
+ par
971
+ order
972
+ mom
973
+ ##gn
974
+ though
975
+ record
976
+ miss
977
+ dr
978
+ es
979
+ eight
980
+ ever
981
+ left
982
+ example
983
+ enough
984
+ ##osed
985
+ claim
986
+ ##ank
987
+ ##con
988
+ americ
989
+ information
990
+ arg
991
+ full
992
+ ##nce
993
+ consid
994
+ working
995
+ ##ature
996
+
997
+ ##e
998
+ ##t
999
+ ##a
1000
+ ##o
1001
+ ##i
1002
+ ##n
1003
+ ##s
1004
+ ##r
1005
+ ##h
1006
+ ##l
1007
+ ##d
1008
+ ##u
1009
+ ##c
1010
+ ##m
1011
+ ##y
1012
+ ##w
1013
+ ##g
1014
+ ##f
1015
+ ##p
1016
+ ##b
1017
+ ##v
1018
+ ##k
1019
+ ##'
1020
+ ##j
1021
+ ##x
1022
+ ##q
1023
+ ##z