lochhonest commited on
Commit
fa4b2bb
·
verified ·
1 Parent(s): 4e711d0

modernbert-finetuned-for-sas-dot-product

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,773 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:3625
8
+ - loss:CachedMultipleNegativesRankingLoss
9
+ base_model: nomic-ai/modernbert-embed-base
10
+ widget:
11
+ - source_sentence: What is the purpose of the analysis steps outlined in the document?
12
+ sentences:
13
+ - "Structure of the document\n\nThe structure of the present document is as follows:\n\
14
+ \n- Chapter [sasguide:par:analysis] introduces the investigator to the\n \
15
+ \ analysis of XMM-Newton\n (http://www.cosmos.esa.int/web/xmm-newton/technical-details)\
16
+ \ data.\n It provides a brief description of XMM-Newton\n (http://www.cosmos.esa.int/web/xmm-newton/technical-details)\n\
17
+ \ observation and calibration files and outlines the analysis steps\n required\
18
+ \ to produce calibrated event files and to extract scientific\n products.\n\
19
+ \n- Chapter [sasguide:par:gui] describes the SAS graphical user\n interface\
20
+ \ (GUI), a user friendly tool which enables SAS interactive\n analysis tasks\
21
+ \ to be run without using the command line.\n\n- Chapters [sasguide:par:epic],\
22
+ \ [sasguide:par:rgs] and\n [sasguide:par:om] describe the SAS analysis steps\
23
+ \ required to obtain\n EPIC\n (http://www.cosmos.esa.int/web/xmm-newton/technical-details-epic),\n\
24
+ \ RGS (http://www.cosmos.esa.int/web/xmm-newton/technical-details-rgs)\n \
25
+ \ and OM\n (http://www.cosmos.esa.int/web/xmm-newton/technical-details-om)\
26
+ \ data\n products, respectively, which can be used afterwards by standard\n\
27
+ \ astronomical software packages.\n\n- Chapter [sasguide:xmmextractor] gives\
28
+ \ an overview of a SAS procedure\n to produce high-level science products for\
29
+ \ XMM-Newton\n (http://www.cosmos.esa.int/web/xmm-newton/technical-details) cameras\n\
30
+ \ from the raw, uncalibrated, data files contained in the Observation\n \
31
+ \ Data File (ODF). The procedure allows for some interactivity which\n lets\
32
+ \ the user take decisions concerning the analysis process.\n"
33
+ - "ommosaic\n- Whilst tests have so far shown that scattered-light features do\
34
+ \ not\n effect the cross-correlation algorithm, further testing is underway\n\
35
+ \ and the method should be used with caution.\n\n- Tests using the cross-algorithm\
36
+ \ on aspect-corrected sky-images have\n so-far shown the computed offsets to\
37
+ \ be small (less than 0.2\n pixels). If both the aspect-correction and the\
38
+ \ cross-correlation\n algorithm were perfect, they should be zero. However\
39
+ \ the error in\n the aspect-correction can be up to 1 arc sec, and further\
40
+ \ testing\n needs to be done to evaluate any differences.\n\n- It would be\
41
+ \ desireable to perform a further aspect correction on the\n sky-images- this\
42
+ \ would require either a new OM task or, possibly, a\n modification to omsrclistcomb.\
43
+ \ Note that even if the sky-images have\n not been aspect-corrected, the coordinates\
44
+ \ of the sources in the\n observation source-list file may have.\n"
45
+ - 'emchain
46
+
47
+ The chain will adapt to the evolution of its constituents and to the
48
+
49
+ organisation of the pipeline.
50
+
51
+
52
+ The current implementation is a Perl script.
53
+
54
+ '
55
+ - source_sentence: What is the purpose of transforming the sky coordinates?
56
+ sentences:
57
+ - "PRODUCT: SSC logo 1 (SSCLG1) \n\n- This file contains a schematic image of\
58
+ \ the XMM-Newton telescope\n front end\n\n- This is a product of class PPSOBS\n\
59
+ \n- The product is delivered in PNG format\n\n- There is one file per observation.\
60
+ \ File size is 3KB.\n"
61
+ - "dpsssrc\n## Parameters\n\n**set** (Mandatory): EPIC Maximum-Likelihood source\
62
+ \ list, identifier\nTTTTTT = EMSRLI (Type: string, Default: P0123700101PNS003EMSRLI0000.FIT,\n\
63
+ Range: applies only for EPIC Maximum-Likelihood detection lists)\n\n**maxlikthresh**\
64
+ \ (Optional): threshold to set the source flag\n‘source likelihood below a certain\
65
+ \ threshold (Type: real value, Default: 50., Range: must be greater than 0.)’\
66
+ \ to T \n**prefix** (Optional): prefix for output name (‘prefix (Type: string,\n\
67
+ Default: flag_, Range: applies only for EPIC Maximum-Likelihood\ndetection lists)_\n\
68
+ [INPUT FILES]\ndpsssrc\n1. POOOOOOOOOODDSEEEEMSRLISXXX.FIT EPIC Maximum-Likelihood\
69
+ \ detection\n source List. The identifier for TTTTTT must be equal to EMSRLI.\
70
+ \ The\n naming convention given above is according to and SSC-LUX-TN-0038.\n\
71
+ \n[OUTPUT FILES]\ndpsssrc\n1. flag_POOOOOOOOOODDSEEEEMSRLISXXX.FIT\n\n Copy\
72
+ \ of a EPIC Maximum-Likelihood detection list. Three new column\n are created\
73
+ \ and the source flag setting for sources below a certain\n detection threshold\
74
+ \ is performed.\n\n[COMMENTS] dpsssrc\n- This task only applies for EPIC Maximum-Likelihood\
75
+ \ detection lists.\n[CAL USAGE] dpsssrc"
76
+ - "emosaicproc\nThe data preparation consists of:\n\n- transforming the sky coordinates\
77
+ \ by every event file corresponding\n to the different instruments and pointings\
78
+ \ to a common image centre,\n\n- extracting images per instrument, pointing\
79
+ \ and spectral selection,\n using common filtering and evtl. provided GTIs\n"
80
+ - source_sentence: In pn imaging mode event lists, what is the type of the OFFSETX
81
+ column?
82
+ sentences:
83
+ - 'Observation summary products
84
+
85
+ '
86
+ - "- The FITS format OM OSW images are rotated and rebinned to\n North-aligned\
87
+ \ sky coordinates.\n\n- The files are identified using the keyword\n\n \
88
+ \ CONTENT = 'OM OSW SKY IMAGE' (SIMAGE)\n for default and User configurations\n\
89
+ \ or\n CONTENT = 'OM FULL-FRAME SKY IMAGE' (FSIMAG)\n for\
90
+ \ low and high resolution full-frame configurations\n or\n CONTENT\
91
+ \ = 'OM FAST MODE OSW SKY IMAGE' (SIMAGF)\n for FAST mode window\n\n-\
92
+ \ This is a product of class OMSW.\n\n- These images may be rectified against\
93
+ \ the USNO-B1 catalogue in which\n case keywords RA_OFF, DEC_OFF and POSCOROK\
94
+ \ are added to the header.\n\n- This product is used for comparison between\
95
+ \ optical and X-ray images\n (also in sky coordinates). It is used in the production\
96
+ \ of OM OSW\n PNG images.\n\n- There is one OM OSW FITS sky image for each\
97
+ \ OM OSW FITS image, and\n each file, on average, occupies ∼1.2MB uncompressed\
98
+ \ while the FAST\n mode image is typically 14KB uncompressed. For the full\
99
+ \ frame images\n the size is typically 22MB uncompressed.\n"
100
+ - "- In pn event lists this extension contains the CCD columns to which\n an\
101
+ \ additional offset is applied to reduce noise (the offset is later\n subtracted\
102
+ \ again by the SAS). In the MOS event lists this extension\n currently defines\
103
+ \ columns outside the sensitive CCD window, to which\n formal very high values\
104
+ \ of the offset are associated. These columns\n are discarded by the data processing.\n\
105
+ \n- For MOS imaging mode event lists this extension contains the\n following\
106
+ \ columns:\n\n Name Type Description\n --------- ----------------\
107
+ \ ----------------------------------------------------------\n RAWX \
108
+ \ 2-byte INTEGER Row or column of the bad offset\n OFFSETX 2-byte INTEGER\
109
+ \ amplitude of additional column offset (0 for row offset)\n OFFSETY \
110
+ \ 2-byte INTEGER amplitude of additional row offset (0 for column offset)\n\
111
+ \ CCDNR 1-byte CCD where the offset occurs\n\n- For MOS timing\
112
+ \ mode event lists this extension contains the\n following columns:\n\n \
113
+ \ Name Type Description\n --------- ---------------- ---------------------------------------\n\
114
+ \ RAWX 2-byte INTEGER Row or column of the bad offset\n OFFSETX\
115
+ \ 2-byte INTEGER amplitude of additional column offset\n CCDNR 1-byte\
116
+ \ CCD where the offset occurs\n\n- For pn imaging mode event lists\
117
+ \ this extension contains the\n following columns:\n\n Name Type\
118
+ \ Description\n --------- ---------------- ---------------------------------------\n\
119
+ \ RAWX 2-byte INTEGER Row of the bad offset\n OFFSETX 2-byte\
120
+ \ INTEGER amplitude of additional column offset\n CCDNR 1-byte \
121
+ \ CCD where the offset occurs\n\n- For pn timing mode event lists this\
122
+ \ extension contains the following\n columns:\n\n Name Type \
123
+ \ Description\n --------- ---------------- ---------------------------------------\n\
124
+ \ RAWX 2-byte INTEGER Row of the bad offset\n OFFSETX 2-byte\
125
+ \ INTEGER amplitude of additional column offset\n CCDNR 1-byte \
126
+ \ CCD where the offset occurs\n\n- Note that currently, the offset table\
127
+ \ is always empty and only the\n CCDNR column is present and it is unfilled.\
128
+ \ This may change in the\n future.\n"
129
+ - source_sentence: What is the minimal time resolution with which entries into the
130
+ AHF can be made in case of instantaneous excursions in the stable pointing mode?
131
+ sentences:
132
+ - "- This extension is only present in pn event files. It gives the\n number\
133
+ \ of rejections of each column (discarded lines) of CCD nn over\n the course\
134
+ \ of the exposure.\n\n- There is one extension per CCD in the relevant mode\
135
+ \ (IMAGING or\n TIMING) during the exposure.\n\n- This extension contains\
136
+ \ the following columns:\n\n Name Type Description\n \
137
+ \ -------- ---------------- -----------------------------------------------\n\
138
+ \ DLIODF 4-byte INTEGER Number of rejections by onboard MIP algorithm\n\
139
+ \ DLISAS 4-byte INTEGER Number of subsequent rejections by the SAS\n\n\
140
+ - Each row corresponds to a column.\n"
141
+ - 'Attitude & Orbit Control Subsystem (AOCS)
142
+
143
+
144
+ The AOCS determines the attitude of the  while in orbit, based on the
145
+
146
+ information from one of ’s two star trackers (which are operated in cold
147
+
148
+ redundancy) and its “Fine Sun Sensors”. During slews and the post-slew
149
+
150
+ phase (comprising attitude determination, trim, and settling of the
151
+
152
+ spacecraft), entries are made into the Attitude History File (AHF) every
153
+
154
+ 10 seconds. Note that during slews the AHF will not contain attitudes
155
+
156
+ reconstructed from actual AOCS telemetry, but the results of a slew
157
+
158
+ time/path predictor, based on the actually observed slew start/end times
159
+
160
+ and attitudes. The accuracy of the attitude reconstruction during slews
161
+
162
+ is expected to be better than 1′.
163
+
164
+
165
+ In the “stable pointing mode” (i.e., after the slews and the post-slew
166
+
167
+ phase), the conditions under which entries into the AHF are made are
168
+
169
+ optimised parameters. An entry is made into the AHF only in case of
170
+
171
+ Relative Pointing Errors (RPEs) exceeding the programmed limit. The
172
+
173
+ minimal programmable limit (i.e., the smallest programmable deviation
174
+
175
+ from the nominal boresight) is 1″. The minimal time resolution with
176
+
177
+ which entries into the AHF can be made in case of such instantaneous
178
+
179
+ excursions is 2 seconds. For a single nominal pointing entry, only a
180
+
181
+ mean RPE will be provided.
182
+
183
+
184
+ The AOCS attitude information is independent of that from the OM
185
+
186
+ (http://www.cosmos.esa.int/web/xmm-newton/technical-details-om) ’s star
187
+
188
+ tracking windows (§ [uhb:par:omwindows]).
189
+
190
+
191
+ The AHF is a file containing processed AOCS telemetry. Clipped to the
192
+
193
+ start/end times of an observation or slew, the complete AHF for the
194
+
195
+ relevant revolution becomes an ODF or SDF component which is delivered
196
+
197
+ to the observer. For “stable pointing periods” the data records identify
198
+
199
+ intervals of time during which the spacecraft’s boresight did not
200
+
201
+ deviate by more than a configurable limit from the mean boresight during
202
+
203
+ that period. For open loop slews and post-slew attitude trims, the AHF
204
+
205
+ provides the instantaneous boresight at equidistant points in time
206
+
207
+ (typically 10 seconds). It should be noted that attitudes for open loop
208
+
209
+ slews are derived from a “slew model” into which the boundary conditions
210
+
211
+ (actual start/end times and attitudes) have been entered, i.e., the
212
+
213
+ intermediate attitudes provided for slews are not based on sensor data
214
+
215
+ telemetered during the slews.
216
+
217
+
218
+ It should be noted that there is an additional file that holds attitude
219
+
220
+ data, which can be used by the
221
+
222
+ http://www.cosmos.esa.int/web/xmm-newton/sas
223
+
224
+ (http://www.cosmos.esa.int/web/xmm-newton/sas). This is the so called
225
+
226
+ Raw Attitude File (RAF) which provides the attitude information at the
227
+
228
+ maximum possible rate (one entry every 0.5 s). The AHF is in fact a
229
+
230
+ smoothed and filtered version of the RAF. The online documentation of
231
+
232
+ the http://www.cosmos.esa.int/web/xmm-newton/sas
233
+
234
+ (http://www.cosmos.esa.int/web/xmm-newton/sas) package oal (section on
235
+
236
+ SAS_ATTITUDE) gives further info on how to select amongst the AHF and
237
+
238
+ RAF source of ODF attitude data.
239
+
240
+ '
241
+ - 'Generating EPIC images
242
+
243
+
244
+ EPIC (http://www.cosmos.esa.int/web/xmm-newton/technical-details-epic)
245
+
246
+ images can be created from an event file with the evselect
247
+
248
+ (http://xmm-tools.cosmos.esa.int/external/sas/current/doc/evselect/index.html)
249
+
250
+ task from the command line or with the xmmselect
251
+
252
+ (http://xmm-tools.cosmos.esa.int/external/sas/current/doc/xmmselect/index.html)
253
+
254
+ task in an interactive GUI driven way.
255
+
256
+ '
257
+ - source_sentence: What is the primary purpose of the FITS format?
258
+ sentences:
259
+ - 'Scope: xrt
260
+
261
+
262
+ There will be three instances of each of these files.
263
+
264
+
265
+ Calibration type: XAreaEf
266
+
267
+ Description: X-ray effective area of a single mirror module versus
268
+
269
+ energy, field angle, and azimuth
270
+
271
+
272
+ Calibration type: XEncirEn
273
+
274
+ Description: X-ray encircled energy function of a single mirror module
275
+
276
+ versus energy, field angle, and azimuth
277
+
278
+
279
+ Calibration type: XPSF
280
+
281
+ Description: X-ray point spread function of a single mirror module
282
+
283
+ versus energy, field angle, and field azimuth
284
+
285
+ '
286
+ - 'ASCII
287
+
288
+
289
+ ASCII files are used to present script and some tabular information. In
290
+
291
+ particular, each ODF/SDF contains a single summary file, with a summary
292
+
293
+ of the information relating to the observation or slew (see
294
+
295
+ Sect. [dfhb:par:odf]).
296
+
297
+ '
298
+ - 'FITS format
299
+
300
+
301
+ All of the ODF/SDF component files, with the exception of the summary
302
+
303
+ files, reconstructed orbit file, and raw attitude file, are FITS files
304
+
305
+ and conform to the standard. A description of the FITS format can be
306
+
307
+ found in , which is accessible also at the URL
308
+
309
+ The calibration files and the bulk of the PPS products also conform to
310
+
311
+ the FITS standard. Wherever possible and desirable the calibration files
312
+
313
+ and the PPS products follow the conventions of the OGIP
314
+
315
+ (http://heasarc.gsfc.nasa.gov/docs/heasarc/ofwg/ofwg_intro.html) (Office
316
+
317
+ of Guest Investigator Programs) FITS working group. The HEASARC FITS
318
+
319
+ Working Group activities are described at the following URL:
320
+
321
+ For FITS files where OGIP FITS standards are not applicable or
322
+
323
+ available, new standards closely following the OGIP approach are used.
324
+
325
+
326
+ The FITS format is primarily designed to store scientific data sets
327
+
328
+ consisting of multidimensional arrays (1-D spectra, 2-D images or 3-D
329
+
330
+ data cubes) and 2-dimensional tables containing rows and columns of
331
+
332
+ data. A FITS data file is composed of a sequence of Header + Data Units
333
+
334
+ (HDUs).
335
+
336
+
337
+ The general structure of a FITS file is as follows:
338
+
339
+
340
+ - a primary header;
341
+
342
+
343
+ - a primary data array of zero length;
344
+
345
+
346
+ - zero or more extensions
347
+
348
+
349
+ Each extension consists of an extension header and a data section.
350
+
351
+ Extensions are named and can appear in any order. Only the following
352
+
353
+ FITS extensions are used:
354
+
355
+
356
+ - ASCII table: XTENSION=TABLE
357
+
358
+
359
+ - binary table: XTENSION=BINTABLE
360
+
361
+
362
+ - image: XTENSION=IMAGE
363
+
364
+
365
+ The header consists of keyword=value statements, which describe the
366
+
367
+ organisation of the data in the HDU and the format of the contents. It
368
+
369
+ may also provide additional information, for example, about instrument
370
+
371
+ status or the history of the data. The following block contains the
372
+
373
+ data, which are structured as specified in the header. The data section
374
+
375
+ of the HDU may contain a digital image, a table or a multidimensional
376
+
377
+ matrix that is not an image. An HDU need not contain data.
378
+
379
+ '
380
+ pipeline_tag: sentence-similarity
381
+ library_name: sentence-transformers
382
+ ---
383
+
384
+ # SentenceTransformer based on nomic-ai/modernbert-embed-base
385
+
386
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [nomic-ai/modernbert-embed-base](https://huggingface.co/nomic-ai/modernbert-embed-base). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
387
+
388
+ ## Model Details
389
+
390
+ ### Model Description
391
+ - **Model Type:** Sentence Transformer
392
+ - **Base model:** [nomic-ai/modernbert-embed-base](https://huggingface.co/nomic-ai/modernbert-embed-base) <!-- at revision d556a88e332558790b210f7bdbe87da2fa94a8d8 -->
393
+ - **Maximum Sequence Length:** 8192 tokens
394
+ - **Output Dimensionality:** 768 dimensions
395
+ - **Similarity Function:** Cosine Similarity
396
+ <!-- - **Training Dataset:** Unknown -->
397
+ <!-- - **Language:** Unknown -->
398
+ <!-- - **License:** Unknown -->
399
+
400
+ ### Model Sources
401
+
402
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
403
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
404
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
405
+
406
+ ### Full Model Architecture
407
+
408
+ ```
409
+ SentenceTransformer(
410
+ (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: ModernBertModel
411
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
412
+ (2): Normalize()
413
+ )
414
+ ```
415
+
416
+ ## Usage
417
+
418
+ ### Direct Usage (Sentence Transformers)
419
+
420
+ First install the Sentence Transformers library:
421
+
422
+ ```bash
423
+ pip install -U sentence-transformers
424
+ ```
425
+
426
+ Then you can load this model and run inference.
427
+ ```python
428
+ from sentence_transformers import SentenceTransformer
429
+
430
+ # Download from the 🤗 Hub
431
+ model = SentenceTransformer("sentence_transformers_model_id")
432
+ # Run inference
433
+ sentences = [
434
+ 'What is the primary purpose of the FITS format?',
435
+ 'FITS format\n\nAll of the ODF/SDF component files, with the exception of the summary\nfiles, reconstructed orbit file, and raw attitude file, are FITS files\nand conform to the standard. A description of the FITS format can be\nfound in , which is accessible also at the URL\nThe calibration files and the bulk of the PPS products also conform to\nthe FITS standard. Wherever possible and desirable the calibration files\nand the PPS products follow the conventions of the OGIP\n(http://heasarc.gsfc.nasa.gov/docs/heasarc/ofwg/ofwg_intro.html) (Office\nof Guest Investigator Programs) FITS working group. The HEASARC FITS\nWorking Group activities are described at the following URL:\nFor FITS files where OGIP FITS standards are not applicable or\navailable, new standards closely following the OGIP approach are used.\n\nThe FITS format is primarily designed to store scientific data sets\nconsisting of multidimensional arrays (1-D spectra, 2-D images or 3-D\ndata cubes) and 2-dimensional tables containing rows and columns of\ndata. A FITS data file is composed of a sequence of Header + Data Units\n(HDUs).\n\nThe general structure of a FITS file is as follows:\n\n- a primary header;\n\n- a primary data array of zero length;\n\n- zero or more extensions\n\nEach extension consists of an extension header and a data section.\nExtensions are named and can appear in any order. Only the following\nFITS extensions are used:\n\n- ASCII table: XTENSION=TABLE\n\n- binary table: XTENSION=BINTABLE\n\n- image: XTENSION=IMAGE\n\nThe header consists of keyword=value statements, which describe the\norganisation of the data in the HDU and the format of the contents. It\nmay also provide additional information, for example, about instrument\nstatus or the history of the data. The following block contains the\ndata, which are structured as specified in the header. The data section\nof the HDU may contain a digital image, a table or a multidimensional\nmatrix that is not an image. An HDU need not contain data.\n',
436
+ 'ASCII\n\nASCII files are used to present script and some tabular information. In\nparticular, each ODF/SDF contains a single summary file, with a summary\nof the information relating to the observation or slew (see\nSect.\xa0[dfhb:par:odf]).\n',
437
+ ]
438
+ embeddings = model.encode(sentences)
439
+ print(embeddings.shape)
440
+ # [3, 768]
441
+
442
+ # Get the similarity scores for the embeddings
443
+ similarities = model.similarity(embeddings, embeddings)
444
+ print(similarities.shape)
445
+ # [3, 3]
446
+ ```
447
+
448
+ <!--
449
+ ### Direct Usage (Transformers)
450
+
451
+ <details><summary>Click to see the direct usage in Transformers</summary>
452
+
453
+ </details>
454
+ -->
455
+
456
+ <!--
457
+ ### Downstream Usage (Sentence Transformers)
458
+
459
+ You can finetune this model on your own dataset.
460
+
461
+ <details><summary>Click to expand</summary>
462
+
463
+ </details>
464
+ -->
465
+
466
+ <!--
467
+ ### Out-of-Scope Use
468
+
469
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
470
+ -->
471
+
472
+ <!--
473
+ ## Bias, Risks and Limitations
474
+
475
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
476
+ -->
477
+
478
+ <!--
479
+ ### Recommendations
480
+
481
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
482
+ -->
483
+
484
+ ## Training Details
485
+
486
+ ### Training Dataset
487
+
488
+ #### Unnamed Dataset
489
+
490
+ * Size: 3,625 training samples
491
+ * Columns: <code>anchor</code> and <code>positive</code>
492
+ * Approximate statistics based on the first 1000 samples:
493
+ | | anchor | positive |
494
+ |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
495
+ | type | string | string |
496
+ | details | <ul><li>min: 2 tokens</li><li>mean: 15.71 tokens</li><li>max: 38 tokens</li></ul> | <ul><li>min: 2 tokens</li><li>mean: 412.57 tokens</li><li>max: 3755 tokens</li></ul> |
497
+ * Samples:
498
+ | anchor | positive |
499
+ |:---------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
500
+ | <code>What is the purpose of the document described in the preface?</code> | <code>Preface<br><br>This is the reference document describing the individual XMM-Newton<br>Survey Science Centre (SSC) data product files. It is intended to be of<br>use to software developers, archive administrators and to scientists<br>analysing XMM-Newton data. Please see the SSC data products Interface<br>Control Document (XMM-SOC-ICD-0006-SSC, issue 4.0) for a description of<br>the product group files and other related files that are sent to the<br>SOC.<br><br>This version (4.3) includes changes related to the upgrade to SAS16.0 in<br>the processing pipeline originally developped in 2012 to uniformly<br>process all the XMM data at that time, from which the 3XMM catalogue was<br>derived. Revisions and additions since version 4.2 are identified by<br>change bars at the right of each page.<br><br>This document will continue to evolve through subsequent issues, under<br>indirect control from the SAS and SSC configuration control boards.<br><br>This document is the result of the work of many people. Contributors<br>have included:<br><br>Hermann Brunner, G...</code> |
501
+ | <code>What version of the document is described in the preface?</code> | <code>Preface<br><br>This is the reference document describing the individual XMM-Newton<br>Survey Science Centre (SSC) data product files. It is intended to be of<br>use to software developers, archive administrators and to scientists<br>analysing XMM-Newton data. Please see the SSC data products Interface<br>Control Document (XMM-SOC-ICD-0006-SSC, issue 4.0) for a description of<br>the product group files and other related files that are sent to the<br>SOC.<br><br>This version (4.3) includes changes related to the upgrade to SAS16.0 in<br>the processing pipeline originally developped in 2012 to uniformly<br>process all the XMM data at that time, from which the 3XMM catalogue was<br>derived. Revisions and additions since version 4.2 are identified by<br>change bars at the right of each page.<br><br>This document will continue to evolve through subsequent issues, under<br>indirect control from the SAS and SSC configuration control boards.<br><br>This document is the result of the work of many people. Contributors<br>have included:<br><br>Hermann Brunner, G...</code> |
502
+ | <code>What is the main change in version 4.3 of the document?</code> | <code>Preface<br><br>This is the reference document describing the individual XMM-Newton<br>Survey Science Centre (SSC) data product files. It is intended to be of<br>use to software developers, archive administrators and to scientists<br>analysing XMM-Newton data. Please see the SSC data products Interface<br>Control Document (XMM-SOC-ICD-0006-SSC, issue 4.0) for a description of<br>the product group files and other related files that are sent to the<br>SOC.<br><br>This version (4.3) includes changes related to the upgrade to SAS16.0 in<br>the processing pipeline originally developped in 2012 to uniformly<br>process all the XMM data at that time, from which the 3XMM catalogue was<br>derived. Revisions and additions since version 4.2 are identified by<br>change bars at the right of each page.<br><br>This document will continue to evolve through subsequent issues, under<br>indirect control from the SAS and SSC configuration control boards.<br><br>This document is the result of the work of many people. Contributors<br>have included:<br><br>Hermann Brunner, G...</code> |
503
+ * Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
504
+ ```json
505
+ {
506
+ "scale": 1.0,
507
+ "similarity_fct": "get_similarity"
508
+ }
509
+ ```
510
+
511
+ ### Evaluation Dataset
512
+
513
+ #### Unnamed Dataset
514
+
515
+ * Size: 30 evaluation samples
516
+ * Columns: <code>anchor</code> and <code>positive</code>
517
+ * Approximate statistics based on the first 30 samples:
518
+ | | anchor | positive |
519
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
520
+ | type | string | string |
521
+ | details | <ul><li>min: 10 tokens</li><li>mean: 16.73 tokens</li><li>max: 31 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 655.37 tokens</li><li>max: 6762 tokens</li></ul> |
522
+ * Samples:
523
+ | anchor | positive |
524
+ |:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
525
+ | <code>In pn imaging mode event lists, what is the type of the OFFSETX column?</code> | <code>- In pn event lists this extension contains the CCD columns to which<br> an additional offset is applied to reduce noise (the offset is later<br> subtracted again by the SAS). In the MOS event lists this extension<br> currently defines columns outside the sensitive CCD window, to which<br> formal very high values of the offset are associated. These columns<br> are discarded by the data processing.<br><br>- For MOS imaging mode event lists this extension contains the<br> following columns:<br><br> Name Type Description<br> --------- ---------------- ----------------------------------------------------------<br> RAWX 2-byte INTEGER Row or column of the bad offset<br> OFFSETX 2-byte INTEGER amplitude of additional column offset (0 for row offset)<br> OFFSETY 2-byte INTEGER amplitude of additional row offset (0 for column offset)<br> CCDNR 1-byte CCD where the offset occurs<br><br>- For MOS timing mode event lists this extension contains the...</code> |
526
+ | <code>What are the three binary table extensions created per source used for?</code> | <code>- This product lists bright sources detected by EPIC which fall in the<br> RGS field of view. It also includes the entries for the proposal<br> position and the on-axis location. EPIC and RGS positions are given,<br> as well as RGS spatial and energy-dispersion angle extraction<br> regions for the sources and a background region.<br><br>- These files are identified using the keyword<br><br> CONTENT = 'RGS SOURCE LIST' / File content<br><br> in the primary header.<br><br>- There are two binary table extensions (SRCLIST and RGSn_BACKGROUND),<br> plus a further three binary table extensions per source<br> (RGSn_SRCm_SPATIAL, RGSn_SRCm_ORDER_1 and RGSn_SRCm_ORDER_2, where n<br> is the number of the RGS (1 or 2) and m is the number of the source.<br><br>- The SRCLIST extension has the following columns:<br><br> Name Type Description<br> -------------- ------------------ ---------------------------------------------------------<br> INDEX 2-byte INTEGER Source inde...</code> |
527
+ | <code>What is the purpose of the analysis steps outlined in the document?</code> | <code>Structure of the document<br><br>The structure of the present document is as follows:<br><br>- Chapter [sasguide:par:analysis] introduces the investigator to the<br> analysis of XMM-Newton<br> (http://www.cosmos.esa.int/web/xmm-newton/technical-details) data.<br> It provides a brief description of XMM-Newton<br> (http://www.cosmos.esa.int/web/xmm-newton/technical-details)<br> observation and calibration files and outlines the analysis steps<br> required to produce calibrated event files and to extract scientific<br> products.<br><br>- Chapter [sasguide:par:gui] describes the SAS graphical user<br> interface (GUI), a user friendly tool which enables SAS interactive<br> analysis tasks to be run without using the command line.<br><br>- Chapters [sasguide:par:epic], [sasguide:par:rgs] and<br> [sasguide:par:om] describe the SAS analysis steps required to obtain<br> EPIC<br> (http://www.cosmos.esa.int/web/xmm-newton/technical-details-epic),<br> RGS (http://www.cosmos.esa.int/web/xmm-newton/technical-details-r...</code> |
528
+ * Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
529
+ ```json
530
+ {
531
+ "scale": 1.0,
532
+ "similarity_fct": "get_similarity"
533
+ }
534
+ ```
535
+
536
+ ### Training Hyperparameters
537
+ #### Non-Default Hyperparameters
538
+
539
+ - `eval_strategy`: steps
540
+ - `per_device_train_batch_size`: 16
541
+ - `per_device_eval_batch_size`: 4
542
+ - `num_train_epochs`: 2
543
+ - `lr_scheduler_type`: constant
544
+ - `warmup_ratio`: 0.1
545
+ - `bf16`: True
546
+ - `batch_sampler`: no_duplicates
547
+
548
+ #### All Hyperparameters
549
+ <details><summary>Click to expand</summary>
550
+
551
+ - `overwrite_output_dir`: False
552
+ - `do_predict`: False
553
+ - `eval_strategy`: steps
554
+ - `prediction_loss_only`: True
555
+ - `per_device_train_batch_size`: 16
556
+ - `per_device_eval_batch_size`: 4
557
+ - `per_gpu_train_batch_size`: None
558
+ - `per_gpu_eval_batch_size`: None
559
+ - `gradient_accumulation_steps`: 1
560
+ - `eval_accumulation_steps`: None
561
+ - `torch_empty_cache_steps`: None
562
+ - `learning_rate`: 5e-05
563
+ - `weight_decay`: 0.0
564
+ - `adam_beta1`: 0.9
565
+ - `adam_beta2`: 0.999
566
+ - `adam_epsilon`: 1e-08
567
+ - `max_grad_norm`: 1.0
568
+ - `num_train_epochs`: 2
569
+ - `max_steps`: -1
570
+ - `lr_scheduler_type`: constant
571
+ - `lr_scheduler_kwargs`: {}
572
+ - `warmup_ratio`: 0.1
573
+ - `warmup_steps`: 0
574
+ - `log_level`: passive
575
+ - `log_level_replica`: warning
576
+ - `log_on_each_node`: True
577
+ - `logging_nan_inf_filter`: True
578
+ - `save_safetensors`: True
579
+ - `save_on_each_node`: False
580
+ - `save_only_model`: False
581
+ - `restore_callback_states_from_checkpoint`: False
582
+ - `no_cuda`: False
583
+ - `use_cpu`: False
584
+ - `use_mps_device`: False
585
+ - `seed`: 42
586
+ - `data_seed`: None
587
+ - `jit_mode_eval`: False
588
+ - `use_ipex`: False
589
+ - `bf16`: True
590
+ - `fp16`: False
591
+ - `fp16_opt_level`: O1
592
+ - `half_precision_backend`: auto
593
+ - `bf16_full_eval`: False
594
+ - `fp16_full_eval`: False
595
+ - `tf32`: None
596
+ - `local_rank`: 0
597
+ - `ddp_backend`: None
598
+ - `tpu_num_cores`: None
599
+ - `tpu_metrics_debug`: False
600
+ - `debug`: []
601
+ - `dataloader_drop_last`: False
602
+ - `dataloader_num_workers`: 0
603
+ - `dataloader_prefetch_factor`: None
604
+ - `past_index`: -1
605
+ - `disable_tqdm`: False
606
+ - `remove_unused_columns`: True
607
+ - `label_names`: None
608
+ - `load_best_model_at_end`: False
609
+ - `ignore_data_skip`: False
610
+ - `fsdp`: []
611
+ - `fsdp_min_num_params`: 0
612
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
613
+ - `fsdp_transformer_layer_cls_to_wrap`: None
614
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
615
+ - `deepspeed`: None
616
+ - `label_smoothing_factor`: 0.0
617
+ - `optim`: adamw_torch
618
+ - `optim_args`: None
619
+ - `adafactor`: False
620
+ - `group_by_length`: False
621
+ - `length_column_name`: length
622
+ - `ddp_find_unused_parameters`: None
623
+ - `ddp_bucket_cap_mb`: None
624
+ - `ddp_broadcast_buffers`: False
625
+ - `dataloader_pin_memory`: True
626
+ - `dataloader_persistent_workers`: False
627
+ - `skip_memory_metrics`: True
628
+ - `use_legacy_prediction_loop`: False
629
+ - `push_to_hub`: False
630
+ - `resume_from_checkpoint`: None
631
+ - `hub_model_id`: None
632
+ - `hub_strategy`: every_save
633
+ - `hub_private_repo`: None
634
+ - `hub_always_push`: False
635
+ - `gradient_checkpointing`: False
636
+ - `gradient_checkpointing_kwargs`: None
637
+ - `include_inputs_for_metrics`: False
638
+ - `include_for_metrics`: []
639
+ - `eval_do_concat_batches`: True
640
+ - `fp16_backend`: auto
641
+ - `push_to_hub_model_id`: None
642
+ - `push_to_hub_organization`: None
643
+ - `mp_parameters`:
644
+ - `auto_find_batch_size`: False
645
+ - `full_determinism`: False
646
+ - `torchdynamo`: None
647
+ - `ray_scope`: last
648
+ - `ddp_timeout`: 1800
649
+ - `torch_compile`: False
650
+ - `torch_compile_backend`: None
651
+ - `torch_compile_mode`: None
652
+ - `dispatch_batches`: None
653
+ - `split_batches`: None
654
+ - `include_tokens_per_second`: False
655
+ - `include_num_input_tokens_seen`: False
656
+ - `neftune_noise_alpha`: None
657
+ - `optim_target_modules`: None
658
+ - `batch_eval_metrics`: False
659
+ - `eval_on_start`: False
660
+ - `use_liger_kernel`: False
661
+ - `eval_use_gather_object`: False
662
+ - `average_tokens_across_devices`: False
663
+ - `prompts`: None
664
+ - `batch_sampler`: no_duplicates
665
+ - `multi_dataset_batch_sampler`: proportional
666
+
667
+ </details>
668
+
669
+ ### Training Logs
670
+ | Epoch | Step | Training Loss | Validation Loss |
671
+ |:------:|:----:|:-------------:|:---------------:|
672
+ | 0.0441 | 10 | 2.3929 | - |
673
+ | 0.0881 | 20 | 2.2876 | - |
674
+ | 0.1322 | 30 | 2.2502 | - |
675
+ | 0.1762 | 40 | 2.2265 | - |
676
+ | 0.2203 | 50 | 2.176 | 0.9569 |
677
+ | 0.2643 | 60 | 2.1931 | - |
678
+ | 0.3084 | 70 | 2.1666 | - |
679
+ | 0.3524 | 80 | 2.1637 | - |
680
+ | 0.3965 | 90 | 2.1684 | - |
681
+ | 0.4405 | 100 | 2.1373 | 0.9265 |
682
+ | 0.4846 | 110 | 2.135 | - |
683
+ | 0.5286 | 120 | 2.1159 | - |
684
+ | 0.5727 | 130 | 2.113 | - |
685
+ | 0.6167 | 140 | 2.098 | - |
686
+ | 0.6608 | 150 | 2.0931 | 0.9054 |
687
+ | 0.7048 | 160 | 2.0954 | - |
688
+ | 0.7489 | 170 | 2.0882 | - |
689
+ | 0.7930 | 180 | 2.0926 | - |
690
+ | 0.8370 | 190 | 2.1139 | - |
691
+ | 0.8811 | 200 | 2.1151 | 0.8745 |
692
+ | 0.9251 | 210 | 2.1033 | - |
693
+ | 0.9692 | 220 | 2.1014 | - |
694
+ | 1.0132 | 230 | 2.0139 | - |
695
+ | 1.0573 | 240 | 2.0408 | - |
696
+ | 1.1013 | 250 | 2.0257 | 0.9039 |
697
+ | 1.1454 | 260 | 2.0401 | - |
698
+ | 1.1894 | 270 | 2.0189 | - |
699
+ | 1.2335 | 280 | 2.0521 | - |
700
+ | 1.2775 | 290 | 2.055 | - |
701
+ | 1.3216 | 300 | 2.0407 | 0.9321 |
702
+ | 1.3656 | 310 | 2.0252 | - |
703
+ | 1.4097 | 320 | 2.0126 | - |
704
+ | 1.4537 | 330 | 2.0431 | - |
705
+ | 1.4978 | 340 | 2.0293 | - |
706
+ | 1.5419 | 350 | 2.042 | 0.9105 |
707
+ | 1.5859 | 360 | 2.0557 | - |
708
+ | 1.6300 | 370 | 2.0481 | - |
709
+ | 1.6740 | 380 | 2.0169 | - |
710
+ | 1.7181 | 390 | 2.0402 | - |
711
+ | 1.7621 | 400 | 2.0376 | 0.8873 |
712
+ | 1.8062 | 410 | 2.045 | - |
713
+ | 1.8502 | 420 | 1.9934 | - |
714
+ | 1.8943 | 430 | 2.0335 | - |
715
+ | 1.9383 | 440 | 2.0278 | - |
716
+ | 1.9824 | 450 | 2.0313 | 0.8658 |
717
+
718
+
719
+ ### Framework Versions
720
+ - Python: 3.10.14
721
+ - Sentence Transformers: 3.4.1
722
+ - Transformers: 4.49.0
723
+ - PyTorch: 2.6.0+cu124
724
+ - Accelerate: 1.4.0
725
+ - Datasets: 3.3.2
726
+ - Tokenizers: 0.21.0
727
+
728
+ ## Citation
729
+
730
+ ### BibTeX
731
+
732
+ #### Sentence Transformers
733
+ ```bibtex
734
+ @inproceedings{reimers-2019-sentence-bert,
735
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
736
+ author = "Reimers, Nils and Gurevych, Iryna",
737
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
738
+ month = "11",
739
+ year = "2019",
740
+ publisher = "Association for Computational Linguistics",
741
+ url = "https://arxiv.org/abs/1908.10084",
742
+ }
743
+ ```
744
+
745
+ #### CachedMultipleNegativesRankingLoss
746
+ ```bibtex
747
+ @misc{gao2021scaling,
748
+ title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
749
+ author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
750
+ year={2021},
751
+ eprint={2101.06983},
752
+ archivePrefix={arXiv},
753
+ primaryClass={cs.LG}
754
+ }
755
+ ```
756
+
757
+ <!--
758
+ ## Glossary
759
+
760
+ *Clearly define terms in order to be accessible across audiences.*
761
+ -->
762
+
763
+ <!--
764
+ ## Model Card Authors
765
+
766
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
767
+ -->
768
+
769
+ <!--
770
+ ## Model Card Contact
771
+
772
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
773
+ -->
config.json ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "nomic-ai/modernbert-embed-base",
3
+ "architectures": [
4
+ "ModernBertModel"
5
+ ],
6
+ "attention_bias": false,
7
+ "attention_dropout": 0.0,
8
+ "bos_token_id": 50281,
9
+ "classifier_activation": "gelu",
10
+ "classifier_bias": false,
11
+ "classifier_dropout": 0.0,
12
+ "classifier_pooling": "mean",
13
+ "cls_token_id": 50281,
14
+ "decoder_bias": true,
15
+ "deterministic_flash_attn": false,
16
+ "embedding_dropout": 0.0,
17
+ "eos_token_id": 50282,
18
+ "global_attn_every_n_layers": 3,
19
+ "global_rope_theta": 160000.0,
20
+ "gradient_checkpointing": false,
21
+ "hidden_activation": "gelu",
22
+ "hidden_size": 768,
23
+ "initializer_cutoff_factor": 2.0,
24
+ "initializer_range": 0.02,
25
+ "intermediate_size": 1152,
26
+ "layer_norm_eps": 1e-05,
27
+ "local_attention": 128,
28
+ "local_rope_theta": 10000.0,
29
+ "max_position_embeddings": 8192,
30
+ "mlp_bias": false,
31
+ "mlp_dropout": 0.0,
32
+ "model_type": "modernbert",
33
+ "norm_bias": false,
34
+ "norm_eps": 1e-05,
35
+ "num_attention_heads": 12,
36
+ "num_hidden_layers": 22,
37
+ "pad_token_id": 50283,
38
+ "position_embedding_type": "absolute",
39
+ "reference_compile": true,
40
+ "repad_logits_with_grad": false,
41
+ "sep_token_id": 50282,
42
+ "sparse_pred_ignore_index": -100,
43
+ "sparse_prediction": false,
44
+ "torch_dtype": "float32",
45
+ "transformers_version": "4.49.0",
46
+ "vocab_size": 50368
47
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.4.1",
4
+ "transformers": "4.49.0",
5
+ "pytorch": "2.6.0+cu124"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5ebbf76fde0249579689534efc214781fbcab7aaac5631ab97851a216c9c7e23
3
+ size 596070136
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 8192,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": true,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,945 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "|||IP_ADDRESS|||",
5
+ "lstrip": false,
6
+ "normalized": true,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": false
10
+ },
11
+ "1": {
12
+ "content": "<|padding|>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "50254": {
20
+ "content": " ",
21
+ "lstrip": false,
22
+ "normalized": true,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": false
26
+ },
27
+ "50255": {
28
+ "content": " ",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": false
34
+ },
35
+ "50256": {
36
+ "content": " ",
37
+ "lstrip": false,
38
+ "normalized": true,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": false
42
+ },
43
+ "50257": {
44
+ "content": " ",
45
+ "lstrip": false,
46
+ "normalized": true,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": false
50
+ },
51
+ "50258": {
52
+ "content": " ",
53
+ "lstrip": false,
54
+ "normalized": true,
55
+ "rstrip": false,
56
+ "single_word": false,
57
+ "special": false
58
+ },
59
+ "50259": {
60
+ "content": " ",
61
+ "lstrip": false,
62
+ "normalized": true,
63
+ "rstrip": false,
64
+ "single_word": false,
65
+ "special": false
66
+ },
67
+ "50260": {
68
+ "content": " ",
69
+ "lstrip": false,
70
+ "normalized": true,
71
+ "rstrip": false,
72
+ "single_word": false,
73
+ "special": false
74
+ },
75
+ "50261": {
76
+ "content": " ",
77
+ "lstrip": false,
78
+ "normalized": true,
79
+ "rstrip": false,
80
+ "single_word": false,
81
+ "special": false
82
+ },
83
+ "50262": {
84
+ "content": " ",
85
+ "lstrip": false,
86
+ "normalized": true,
87
+ "rstrip": false,
88
+ "single_word": false,
89
+ "special": false
90
+ },
91
+ "50263": {
92
+ "content": " ",
93
+ "lstrip": false,
94
+ "normalized": true,
95
+ "rstrip": false,
96
+ "single_word": false,
97
+ "special": false
98
+ },
99
+ "50264": {
100
+ "content": " ",
101
+ "lstrip": false,
102
+ "normalized": true,
103
+ "rstrip": false,
104
+ "single_word": false,
105
+ "special": false
106
+ },
107
+ "50265": {
108
+ "content": " ",
109
+ "lstrip": false,
110
+ "normalized": true,
111
+ "rstrip": false,
112
+ "single_word": false,
113
+ "special": false
114
+ },
115
+ "50266": {
116
+ "content": " ",
117
+ "lstrip": false,
118
+ "normalized": true,
119
+ "rstrip": false,
120
+ "single_word": false,
121
+ "special": false
122
+ },
123
+ "50267": {
124
+ "content": " ",
125
+ "lstrip": false,
126
+ "normalized": true,
127
+ "rstrip": false,
128
+ "single_word": false,
129
+ "special": false
130
+ },
131
+ "50268": {
132
+ "content": " ",
133
+ "lstrip": false,
134
+ "normalized": true,
135
+ "rstrip": false,
136
+ "single_word": false,
137
+ "special": false
138
+ },
139
+ "50269": {
140
+ "content": " ",
141
+ "lstrip": false,
142
+ "normalized": true,
143
+ "rstrip": false,
144
+ "single_word": false,
145
+ "special": false
146
+ },
147
+ "50270": {
148
+ "content": " ",
149
+ "lstrip": false,
150
+ "normalized": true,
151
+ "rstrip": false,
152
+ "single_word": false,
153
+ "special": false
154
+ },
155
+ "50271": {
156
+ "content": " ",
157
+ "lstrip": false,
158
+ "normalized": true,
159
+ "rstrip": false,
160
+ "single_word": false,
161
+ "special": false
162
+ },
163
+ "50272": {
164
+ "content": " ",
165
+ "lstrip": false,
166
+ "normalized": true,
167
+ "rstrip": false,
168
+ "single_word": false,
169
+ "special": false
170
+ },
171
+ "50273": {
172
+ "content": " ",
173
+ "lstrip": false,
174
+ "normalized": true,
175
+ "rstrip": false,
176
+ "single_word": false,
177
+ "special": false
178
+ },
179
+ "50274": {
180
+ "content": " ",
181
+ "lstrip": false,
182
+ "normalized": true,
183
+ "rstrip": false,
184
+ "single_word": false,
185
+ "special": false
186
+ },
187
+ "50275": {
188
+ "content": " ",
189
+ "lstrip": false,
190
+ "normalized": true,
191
+ "rstrip": false,
192
+ "single_word": false,
193
+ "special": false
194
+ },
195
+ "50276": {
196
+ "content": " ",
197
+ "lstrip": false,
198
+ "normalized": true,
199
+ "rstrip": false,
200
+ "single_word": false,
201
+ "special": false
202
+ },
203
+ "50277": {
204
+ "content": "|||EMAIL_ADDRESS|||",
205
+ "lstrip": false,
206
+ "normalized": true,
207
+ "rstrip": false,
208
+ "single_word": false,
209
+ "special": false
210
+ },
211
+ "50278": {
212
+ "content": "|||PHONE_NUMBER|||",
213
+ "lstrip": false,
214
+ "normalized": true,
215
+ "rstrip": false,
216
+ "single_word": false,
217
+ "special": false
218
+ },
219
+ "50279": {
220
+ "content": "<|endoftext|>",
221
+ "lstrip": false,
222
+ "normalized": false,
223
+ "rstrip": false,
224
+ "single_word": false,
225
+ "special": true
226
+ },
227
+ "50280": {
228
+ "content": "[UNK]",
229
+ "lstrip": false,
230
+ "normalized": false,
231
+ "rstrip": false,
232
+ "single_word": false,
233
+ "special": true
234
+ },
235
+ "50281": {
236
+ "content": "[CLS]",
237
+ "lstrip": false,
238
+ "normalized": false,
239
+ "rstrip": false,
240
+ "single_word": false,
241
+ "special": true
242
+ },
243
+ "50282": {
244
+ "content": "[SEP]",
245
+ "lstrip": false,
246
+ "normalized": false,
247
+ "rstrip": false,
248
+ "single_word": false,
249
+ "special": true
250
+ },
251
+ "50283": {
252
+ "content": "[PAD]",
253
+ "lstrip": false,
254
+ "normalized": false,
255
+ "rstrip": false,
256
+ "single_word": false,
257
+ "special": true
258
+ },
259
+ "50284": {
260
+ "content": "[MASK]",
261
+ "lstrip": true,
262
+ "normalized": false,
263
+ "rstrip": false,
264
+ "single_word": false,
265
+ "special": true
266
+ },
267
+ "50285": {
268
+ "content": "[unused0]",
269
+ "lstrip": false,
270
+ "normalized": true,
271
+ "rstrip": false,
272
+ "single_word": false,
273
+ "special": false
274
+ },
275
+ "50286": {
276
+ "content": "[unused1]",
277
+ "lstrip": false,
278
+ "normalized": true,
279
+ "rstrip": false,
280
+ "single_word": false,
281
+ "special": false
282
+ },
283
+ "50287": {
284
+ "content": "[unused2]",
285
+ "lstrip": false,
286
+ "normalized": true,
287
+ "rstrip": false,
288
+ "single_word": false,
289
+ "special": false
290
+ },
291
+ "50288": {
292
+ "content": "[unused3]",
293
+ "lstrip": false,
294
+ "normalized": true,
295
+ "rstrip": false,
296
+ "single_word": false,
297
+ "special": false
298
+ },
299
+ "50289": {
300
+ "content": "[unused4]",
301
+ "lstrip": false,
302
+ "normalized": true,
303
+ "rstrip": false,
304
+ "single_word": false,
305
+ "special": false
306
+ },
307
+ "50290": {
308
+ "content": "[unused5]",
309
+ "lstrip": false,
310
+ "normalized": true,
311
+ "rstrip": false,
312
+ "single_word": false,
313
+ "special": false
314
+ },
315
+ "50291": {
316
+ "content": "[unused6]",
317
+ "lstrip": false,
318
+ "normalized": true,
319
+ "rstrip": false,
320
+ "single_word": false,
321
+ "special": false
322
+ },
323
+ "50292": {
324
+ "content": "[unused7]",
325
+ "lstrip": false,
326
+ "normalized": true,
327
+ "rstrip": false,
328
+ "single_word": false,
329
+ "special": false
330
+ },
331
+ "50293": {
332
+ "content": "[unused8]",
333
+ "lstrip": false,
334
+ "normalized": true,
335
+ "rstrip": false,
336
+ "single_word": false,
337
+ "special": false
338
+ },
339
+ "50294": {
340
+ "content": "[unused9]",
341
+ "lstrip": false,
342
+ "normalized": true,
343
+ "rstrip": false,
344
+ "single_word": false,
345
+ "special": false
346
+ },
347
+ "50295": {
348
+ "content": "[unused10]",
349
+ "lstrip": false,
350
+ "normalized": true,
351
+ "rstrip": false,
352
+ "single_word": false,
353
+ "special": false
354
+ },
355
+ "50296": {
356
+ "content": "[unused11]",
357
+ "lstrip": false,
358
+ "normalized": true,
359
+ "rstrip": false,
360
+ "single_word": false,
361
+ "special": false
362
+ },
363
+ "50297": {
364
+ "content": "[unused12]",
365
+ "lstrip": false,
366
+ "normalized": true,
367
+ "rstrip": false,
368
+ "single_word": false,
369
+ "special": false
370
+ },
371
+ "50298": {
372
+ "content": "[unused13]",
373
+ "lstrip": false,
374
+ "normalized": true,
375
+ "rstrip": false,
376
+ "single_word": false,
377
+ "special": false
378
+ },
379
+ "50299": {
380
+ "content": "[unused14]",
381
+ "lstrip": false,
382
+ "normalized": true,
383
+ "rstrip": false,
384
+ "single_word": false,
385
+ "special": false
386
+ },
387
+ "50300": {
388
+ "content": "[unused15]",
389
+ "lstrip": false,
390
+ "normalized": true,
391
+ "rstrip": false,
392
+ "single_word": false,
393
+ "special": false
394
+ },
395
+ "50301": {
396
+ "content": "[unused16]",
397
+ "lstrip": false,
398
+ "normalized": true,
399
+ "rstrip": false,
400
+ "single_word": false,
401
+ "special": false
402
+ },
403
+ "50302": {
404
+ "content": "[unused17]",
405
+ "lstrip": false,
406
+ "normalized": true,
407
+ "rstrip": false,
408
+ "single_word": false,
409
+ "special": false
410
+ },
411
+ "50303": {
412
+ "content": "[unused18]",
413
+ "lstrip": false,
414
+ "normalized": true,
415
+ "rstrip": false,
416
+ "single_word": false,
417
+ "special": false
418
+ },
419
+ "50304": {
420
+ "content": "[unused19]",
421
+ "lstrip": false,
422
+ "normalized": true,
423
+ "rstrip": false,
424
+ "single_word": false,
425
+ "special": false
426
+ },
427
+ "50305": {
428
+ "content": "[unused20]",
429
+ "lstrip": false,
430
+ "normalized": true,
431
+ "rstrip": false,
432
+ "single_word": false,
433
+ "special": false
434
+ },
435
+ "50306": {
436
+ "content": "[unused21]",
437
+ "lstrip": false,
438
+ "normalized": true,
439
+ "rstrip": false,
440
+ "single_word": false,
441
+ "special": false
442
+ },
443
+ "50307": {
444
+ "content": "[unused22]",
445
+ "lstrip": false,
446
+ "normalized": true,
447
+ "rstrip": false,
448
+ "single_word": false,
449
+ "special": false
450
+ },
451
+ "50308": {
452
+ "content": "[unused23]",
453
+ "lstrip": false,
454
+ "normalized": true,
455
+ "rstrip": false,
456
+ "single_word": false,
457
+ "special": false
458
+ },
459
+ "50309": {
460
+ "content": "[unused24]",
461
+ "lstrip": false,
462
+ "normalized": true,
463
+ "rstrip": false,
464
+ "single_word": false,
465
+ "special": false
466
+ },
467
+ "50310": {
468
+ "content": "[unused25]",
469
+ "lstrip": false,
470
+ "normalized": true,
471
+ "rstrip": false,
472
+ "single_word": false,
473
+ "special": false
474
+ },
475
+ "50311": {
476
+ "content": "[unused26]",
477
+ "lstrip": false,
478
+ "normalized": true,
479
+ "rstrip": false,
480
+ "single_word": false,
481
+ "special": false
482
+ },
483
+ "50312": {
484
+ "content": "[unused27]",
485
+ "lstrip": false,
486
+ "normalized": true,
487
+ "rstrip": false,
488
+ "single_word": false,
489
+ "special": false
490
+ },
491
+ "50313": {
492
+ "content": "[unused28]",
493
+ "lstrip": false,
494
+ "normalized": true,
495
+ "rstrip": false,
496
+ "single_word": false,
497
+ "special": false
498
+ },
499
+ "50314": {
500
+ "content": "[unused29]",
501
+ "lstrip": false,
502
+ "normalized": true,
503
+ "rstrip": false,
504
+ "single_word": false,
505
+ "special": false
506
+ },
507
+ "50315": {
508
+ "content": "[unused30]",
509
+ "lstrip": false,
510
+ "normalized": true,
511
+ "rstrip": false,
512
+ "single_word": false,
513
+ "special": false
514
+ },
515
+ "50316": {
516
+ "content": "[unused31]",
517
+ "lstrip": false,
518
+ "normalized": true,
519
+ "rstrip": false,
520
+ "single_word": false,
521
+ "special": false
522
+ },
523
+ "50317": {
524
+ "content": "[unused32]",
525
+ "lstrip": false,
526
+ "normalized": true,
527
+ "rstrip": false,
528
+ "single_word": false,
529
+ "special": false
530
+ },
531
+ "50318": {
532
+ "content": "[unused33]",
533
+ "lstrip": false,
534
+ "normalized": true,
535
+ "rstrip": false,
536
+ "single_word": false,
537
+ "special": false
538
+ },
539
+ "50319": {
540
+ "content": "[unused34]",
541
+ "lstrip": false,
542
+ "normalized": true,
543
+ "rstrip": false,
544
+ "single_word": false,
545
+ "special": false
546
+ },
547
+ "50320": {
548
+ "content": "[unused35]",
549
+ "lstrip": false,
550
+ "normalized": true,
551
+ "rstrip": false,
552
+ "single_word": false,
553
+ "special": false
554
+ },
555
+ "50321": {
556
+ "content": "[unused36]",
557
+ "lstrip": false,
558
+ "normalized": true,
559
+ "rstrip": false,
560
+ "single_word": false,
561
+ "special": false
562
+ },
563
+ "50322": {
564
+ "content": "[unused37]",
565
+ "lstrip": false,
566
+ "normalized": true,
567
+ "rstrip": false,
568
+ "single_word": false,
569
+ "special": false
570
+ },
571
+ "50323": {
572
+ "content": "[unused38]",
573
+ "lstrip": false,
574
+ "normalized": true,
575
+ "rstrip": false,
576
+ "single_word": false,
577
+ "special": false
578
+ },
579
+ "50324": {
580
+ "content": "[unused39]",
581
+ "lstrip": false,
582
+ "normalized": true,
583
+ "rstrip": false,
584
+ "single_word": false,
585
+ "special": false
586
+ },
587
+ "50325": {
588
+ "content": "[unused40]",
589
+ "lstrip": false,
590
+ "normalized": true,
591
+ "rstrip": false,
592
+ "single_word": false,
593
+ "special": false
594
+ },
595
+ "50326": {
596
+ "content": "[unused41]",
597
+ "lstrip": false,
598
+ "normalized": true,
599
+ "rstrip": false,
600
+ "single_word": false,
601
+ "special": false
602
+ },
603
+ "50327": {
604
+ "content": "[unused42]",
605
+ "lstrip": false,
606
+ "normalized": true,
607
+ "rstrip": false,
608
+ "single_word": false,
609
+ "special": false
610
+ },
611
+ "50328": {
612
+ "content": "[unused43]",
613
+ "lstrip": false,
614
+ "normalized": true,
615
+ "rstrip": false,
616
+ "single_word": false,
617
+ "special": false
618
+ },
619
+ "50329": {
620
+ "content": "[unused44]",
621
+ "lstrip": false,
622
+ "normalized": true,
623
+ "rstrip": false,
624
+ "single_word": false,
625
+ "special": false
626
+ },
627
+ "50330": {
628
+ "content": "[unused45]",
629
+ "lstrip": false,
630
+ "normalized": true,
631
+ "rstrip": false,
632
+ "single_word": false,
633
+ "special": false
634
+ },
635
+ "50331": {
636
+ "content": "[unused46]",
637
+ "lstrip": false,
638
+ "normalized": true,
639
+ "rstrip": false,
640
+ "single_word": false,
641
+ "special": false
642
+ },
643
+ "50332": {
644
+ "content": "[unused47]",
645
+ "lstrip": false,
646
+ "normalized": true,
647
+ "rstrip": false,
648
+ "single_word": false,
649
+ "special": false
650
+ },
651
+ "50333": {
652
+ "content": "[unused48]",
653
+ "lstrip": false,
654
+ "normalized": true,
655
+ "rstrip": false,
656
+ "single_word": false,
657
+ "special": false
658
+ },
659
+ "50334": {
660
+ "content": "[unused49]",
661
+ "lstrip": false,
662
+ "normalized": true,
663
+ "rstrip": false,
664
+ "single_word": false,
665
+ "special": false
666
+ },
667
+ "50335": {
668
+ "content": "[unused50]",
669
+ "lstrip": false,
670
+ "normalized": true,
671
+ "rstrip": false,
672
+ "single_word": false,
673
+ "special": false
674
+ },
675
+ "50336": {
676
+ "content": "[unused51]",
677
+ "lstrip": false,
678
+ "normalized": true,
679
+ "rstrip": false,
680
+ "single_word": false,
681
+ "special": false
682
+ },
683
+ "50337": {
684
+ "content": "[unused52]",
685
+ "lstrip": false,
686
+ "normalized": true,
687
+ "rstrip": false,
688
+ "single_word": false,
689
+ "special": false
690
+ },
691
+ "50338": {
692
+ "content": "[unused53]",
693
+ "lstrip": false,
694
+ "normalized": true,
695
+ "rstrip": false,
696
+ "single_word": false,
697
+ "special": false
698
+ },
699
+ "50339": {
700
+ "content": "[unused54]",
701
+ "lstrip": false,
702
+ "normalized": true,
703
+ "rstrip": false,
704
+ "single_word": false,
705
+ "special": false
706
+ },
707
+ "50340": {
708
+ "content": "[unused55]",
709
+ "lstrip": false,
710
+ "normalized": true,
711
+ "rstrip": false,
712
+ "single_word": false,
713
+ "special": false
714
+ },
715
+ "50341": {
716
+ "content": "[unused56]",
717
+ "lstrip": false,
718
+ "normalized": true,
719
+ "rstrip": false,
720
+ "single_word": false,
721
+ "special": false
722
+ },
723
+ "50342": {
724
+ "content": "[unused57]",
725
+ "lstrip": false,
726
+ "normalized": true,
727
+ "rstrip": false,
728
+ "single_word": false,
729
+ "special": false
730
+ },
731
+ "50343": {
732
+ "content": "[unused58]",
733
+ "lstrip": false,
734
+ "normalized": true,
735
+ "rstrip": false,
736
+ "single_word": false,
737
+ "special": false
738
+ },
739
+ "50344": {
740
+ "content": "[unused59]",
741
+ "lstrip": false,
742
+ "normalized": true,
743
+ "rstrip": false,
744
+ "single_word": false,
745
+ "special": false
746
+ },
747
+ "50345": {
748
+ "content": "[unused60]",
749
+ "lstrip": false,
750
+ "normalized": true,
751
+ "rstrip": false,
752
+ "single_word": false,
753
+ "special": false
754
+ },
755
+ "50346": {
756
+ "content": "[unused61]",
757
+ "lstrip": false,
758
+ "normalized": true,
759
+ "rstrip": false,
760
+ "single_word": false,
761
+ "special": false
762
+ },
763
+ "50347": {
764
+ "content": "[unused62]",
765
+ "lstrip": false,
766
+ "normalized": true,
767
+ "rstrip": false,
768
+ "single_word": false,
769
+ "special": false
770
+ },
771
+ "50348": {
772
+ "content": "[unused63]",
773
+ "lstrip": false,
774
+ "normalized": true,
775
+ "rstrip": false,
776
+ "single_word": false,
777
+ "special": false
778
+ },
779
+ "50349": {
780
+ "content": "[unused64]",
781
+ "lstrip": false,
782
+ "normalized": true,
783
+ "rstrip": false,
784
+ "single_word": false,
785
+ "special": false
786
+ },
787
+ "50350": {
788
+ "content": "[unused65]",
789
+ "lstrip": false,
790
+ "normalized": true,
791
+ "rstrip": false,
792
+ "single_word": false,
793
+ "special": false
794
+ },
795
+ "50351": {
796
+ "content": "[unused66]",
797
+ "lstrip": false,
798
+ "normalized": true,
799
+ "rstrip": false,
800
+ "single_word": false,
801
+ "special": false
802
+ },
803
+ "50352": {
804
+ "content": "[unused67]",
805
+ "lstrip": false,
806
+ "normalized": true,
807
+ "rstrip": false,
808
+ "single_word": false,
809
+ "special": false
810
+ },
811
+ "50353": {
812
+ "content": "[unused68]",
813
+ "lstrip": false,
814
+ "normalized": true,
815
+ "rstrip": false,
816
+ "single_word": false,
817
+ "special": false
818
+ },
819
+ "50354": {
820
+ "content": "[unused69]",
821
+ "lstrip": false,
822
+ "normalized": true,
823
+ "rstrip": false,
824
+ "single_word": false,
825
+ "special": false
826
+ },
827
+ "50355": {
828
+ "content": "[unused70]",
829
+ "lstrip": false,
830
+ "normalized": true,
831
+ "rstrip": false,
832
+ "single_word": false,
833
+ "special": false
834
+ },
835
+ "50356": {
836
+ "content": "[unused71]",
837
+ "lstrip": false,
838
+ "normalized": true,
839
+ "rstrip": false,
840
+ "single_word": false,
841
+ "special": false
842
+ },
843
+ "50357": {
844
+ "content": "[unused72]",
845
+ "lstrip": false,
846
+ "normalized": true,
847
+ "rstrip": false,
848
+ "single_word": false,
849
+ "special": false
850
+ },
851
+ "50358": {
852
+ "content": "[unused73]",
853
+ "lstrip": false,
854
+ "normalized": true,
855
+ "rstrip": false,
856
+ "single_word": false,
857
+ "special": false
858
+ },
859
+ "50359": {
860
+ "content": "[unused74]",
861
+ "lstrip": false,
862
+ "normalized": true,
863
+ "rstrip": false,
864
+ "single_word": false,
865
+ "special": false
866
+ },
867
+ "50360": {
868
+ "content": "[unused75]",
869
+ "lstrip": false,
870
+ "normalized": true,
871
+ "rstrip": false,
872
+ "single_word": false,
873
+ "special": false
874
+ },
875
+ "50361": {
876
+ "content": "[unused76]",
877
+ "lstrip": false,
878
+ "normalized": true,
879
+ "rstrip": false,
880
+ "single_word": false,
881
+ "special": false
882
+ },
883
+ "50362": {
884
+ "content": "[unused77]",
885
+ "lstrip": false,
886
+ "normalized": true,
887
+ "rstrip": false,
888
+ "single_word": false,
889
+ "special": false
890
+ },
891
+ "50363": {
892
+ "content": "[unused78]",
893
+ "lstrip": false,
894
+ "normalized": true,
895
+ "rstrip": false,
896
+ "single_word": false,
897
+ "special": false
898
+ },
899
+ "50364": {
900
+ "content": "[unused79]",
901
+ "lstrip": false,
902
+ "normalized": true,
903
+ "rstrip": false,
904
+ "single_word": false,
905
+ "special": false
906
+ },
907
+ "50365": {
908
+ "content": "[unused80]",
909
+ "lstrip": false,
910
+ "normalized": true,
911
+ "rstrip": false,
912
+ "single_word": false,
913
+ "special": false
914
+ },
915
+ "50366": {
916
+ "content": "[unused81]",
917
+ "lstrip": false,
918
+ "normalized": true,
919
+ "rstrip": false,
920
+ "single_word": false,
921
+ "special": false
922
+ },
923
+ "50367": {
924
+ "content": "[unused82]",
925
+ "lstrip": false,
926
+ "normalized": true,
927
+ "rstrip": false,
928
+ "single_word": false,
929
+ "special": false
930
+ }
931
+ },
932
+ "clean_up_tokenization_spaces": true,
933
+ "cls_token": "[CLS]",
934
+ "extra_special_tokens": {},
935
+ "mask_token": "[MASK]",
936
+ "model_input_names": [
937
+ "input_ids",
938
+ "attention_mask"
939
+ ],
940
+ "model_max_length": 8192,
941
+ "pad_token": "[PAD]",
942
+ "sep_token": "[SEP]",
943
+ "tokenizer_class": "PreTrainedTokenizer",
944
+ "unk_token": "[UNK]"
945
+ }
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba0dcf96faba51f961767dcfe072f0f65967c5b346356e2ce674afc613291937
3
+ size 5560