HariModelMaven commited on
Commit
d49a06e
·
verified ·
1 Parent(s): f2becfe

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,648 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:8000
8
+ - loss:MultipleNegativesRankingLoss
9
+ base_model: sentence-transformers/all-distilroberta-v1
10
+ widget:
11
+ - source_sentence: Is acute doxorubicin cardiotoxicity associated with p53-induced
12
+ inhibition of the mammalian target of rapamycin pathway?
13
+ sentences:
14
+ - Tyrosinase, the rate-limiting enzyme required for melanin production, has been
15
+ targeted to develop active brightening/lightening materials for skin products.
16
+ Unexpected depigmentation of the skin characterized with the diverse symptoms
17
+ was reported in some subjects who used a tyrosinase-competitive inhibiting quasi-drug,
18
+ rhododendrol. To investigate the mechanism underlying the depigmentation caused
19
+ by rhododendrol-containing cosmetics, this study was performed. The mechanism
20
+ above was examined using more than dozen of melanocytes derived from donors of
21
+ different ethnic backgrounds. The RNAi technology was utilized to confirm the
22
+ effect of tyrosinase to induce the cytotoxicity of rhododendrol and liquid chromatography-tandem
23
+ mass spectrometry was introduced to detect rhododendrol and its metabolites in
24
+ the presence of tyrosinase. Melanocyte damage was related to tyrosinase activity
25
+ at a certain threshold. Treatment with a tyrosinase-specific siRNA was shown to
26
+ dramatically rescue the rhododendrol-induced melanocyte impairment. Hydroxyl-rhododendrol
27
+ was detected only in melanocytes with higher tyrosinase activity. When an equivalent
28
+ amount of hydroxyl-rhododendrol was administered, cell viability was almost equally
29
+ suppressed even in melanocytes with lower tyrosinase activity.
30
+ - Doxorubicin is used to treat childhood and adult cancer. Doxorubicin treatment
31
+ is associated with both acute and chronic cardiotoxicity. The cardiotoxic effects
32
+ of doxorubicin are cumulative, which limits its chemotherapeutic dose. Free radical
33
+ generation and p53-dependent apoptosis are thought to contribute to doxorubicin-induced
34
+ cardiotoxicity. Adult transgenic (MHC-CB7) mice expressing cardiomyocyte-restricted
35
+ dominant-interfering p53 and their nontransgenic littermates were treated with
36
+ doxorubicin (20 mg/kg cumulative dose). Nontransgenic mice exhibited reduced left
37
+ ventricular systolic function (predoxorubicin fractional shortening [FS] 61+/-2%,
38
+ postdoxorubicin FS 45+/-2%, mean+/-SEM, P<0.008), reduced cardiac mass, and high
39
+ levels of cardiomyocyte apoptosis 7 days after the initiation of doxorubicin treatment.
40
+ In contrast, doxorubicin-treated MHC-CB7 mice exhibited normal left ventricular
41
+ systolic function (predoxorubicin FS 63+/-2%, postdoxorubicin FS 60+/-2%, P>0.008),
42
+ normal cardiac mass, and low levels of cardiomyocyte apoptosis. Western blot analyses
43
+ indicated that mTOR (mammalian target of rapamycin) signaling was inhibited in
44
+ doxorubicin-treated nontransgenic mice but not in doxorubicin-treated MHC-CB7
45
+ mice. Accordingly, transgenic mice with cardiomyocyte-restricted, constitutively
46
+ active mTOR expression (MHC-mTORca) were studied. Left ventricular systolic function
47
+ (predoxorubicin FS 64+/-2%, postdoxorubicin FS 60+/-3%, P>0.008) and cardiac mass
48
+ were normal in doxorubicin-treated MHC-mTORca mice, despite levels of cardiomyocyte
49
+ apoptosis similar to those seen in doxorubicin-treated nontransgenic mice.
50
+ - To examine the regulatory aspects of zinc-α2-glycoprotein (ZAG) association with
51
+ obesity-related insulin resistance. ZAG mRNA and protein were analyzed in subcutaneous
52
+ adipose tissue (AT) and circulation of lean, obese, prediabetic, and type 2 diabetic
53
+ men; both subcutaneous and visceral AT were explored in lean and extremely obese.
54
+ Clinical and ex vivo findings were corroborated by results of in vitro ZAG silencing
55
+ experiment. Subcutaneous AT ZAG was reduced in obesity, with a trend to further
56
+ decrease with prediabetes and type 2 diabetes. ZAG was 3.3-fold higher in subcutaneous
57
+ than in visceral AT of lean individuals. All differences were lost in extreme
58
+ obesity. Obesity-associated changes in AT were not paralleled by alterations of
59
+ circulating ZAG. Subcutaneous AT ZAG correlated with adiposity, adipocyte hypertrophy,
60
+ whole-body and AT insulin sensitivity, mitochondrial content, expression of GLUT4,
61
+ PGC1α, and adiponectin. Subcutaneous AT ZAG and adipocyte size were the only predictors
62
+ of insulin sensitivity, independent on age and BMI. Silencing ZAG resulted in
63
+ reduced adiponectin, IRS1, GLUT4, and PGC1α gene expression in primary human adipocytes.
64
+ - source_sentence: Is avoidance of polypharmacy and excessive blood pressure control
65
+ associated with improved renal function in older patients?
66
+ sentences:
67
+ - Elderly patients are particularly susceptible to polypharmacy. The present study
68
+ evaluated the renal effects of optimizing potentially nephrotoxic medications
69
+ in an older population. Retrospective study of patients' ≥ 60 years treated between
70
+ January of 2013 and February of 2015 in a Nephrology Clinic. The renal effect
71
+ of avoiding polypharmacy was studied. Sixty-one patients were studied. Median
72
+ age was 81 years (range 60-94). Twenty-five patients (41%) were male. NSAIDs alone
73
+ were stopped in seven patients (11.4%), a dose reduction in antihypertensives
74
+ was done in 11 patients (18%), one or more antihypertensives were discontinued
75
+ in 20 patients (32.7%) and discontinuation and dose reduction of multiple medications
76
+ was carried out in 23 patients (37.7%). The number of antihypertensives was reduced
77
+ from a median of 3 (range of 0-8) at baseline to a median of 2 (range 0-7), p
78
+ < 0.001 after intervention. After intervention, the glomerular filtration rate
79
+ (GFR) improved significantly, from a baseline of 32 ± 15.5 cc/min/1.73 m(2) to
80
+ 39.5 ± 17 cc/min/1.73 m(2) at t1 (p < 0.001) and 44.5 ± 18.7 cc/min/1.73 m(2)
81
+ at t2 (p < 0.001 vs. baseline). In a multivariate model, after adjusting for ACEIs/ARBs
82
+ discontinuation/dose reduction, NSAIDs use and change in DBP, an increase in SBP
83
+ at time 1 remained significantly associated with increments in GFR on follow-up
84
+ (estimate = 0.20, p = 0.01).
85
+ - Endothelial dysfunction and hypertension is more common in individuals with diabetes
86
+ than in the general population. This study was aimed to investigate the underlying
87
+ mechanisms responsible for endothelial dysfunction of type 1 diabetic rats fed
88
+ with high-salt diet. Type 1 diabetes (DM) was induced by intraperitoneal injection
89
+ of streptozotocin (70 mg·kg(-1)). Normal or diabetic rats were randomly fed high-salt
90
+ food (HS, 8% NaCl) or standard food (CON) for 6 weeks. Both HS (143±10 mmHg) and
91
+ DM+HS (169±11 mmHg) groups displayed significantly higher systolic blood pressure
92
+ than those in the CON group (112±12 mmHg, P<0.01). DM+HS rats exhibited more pronounced
93
+ impairment of vasorelaxation to acetylcholine and insulin compared with either
94
+ DM or HS. Akt/endothelial nitric oxide synthase (eNOS) phosphorylation levels
95
+ and nitric oxide (NO) concentration in DM+HS were significantly lower than in
96
+ DM. The levels of caveolin-1 (cav-1) in DM+HS were significantly higher than that
97
+ in DM and HS. Co-immunoprecipitation results showed increased interaction between
98
+ cav-1 and eNOS in the DM+HS group. In the presence of cav-1 small interfering
99
+ RNA (siRNA), eNOS phosphorylations in human umbilical vein endothelial cells (HUVEC)
100
+ were significantly increased compared with control siRNA. Cav-1 was slightly but
101
+ not significantly lower in HUVEC cultured with high glucose and high-salt buffer
102
+ solution and pretreated with wortmannin or l-nitro-arginine methyl ester.
103
+ - 'Several studies have revealed a correlation between sialosyl Tn antigen (STN)
104
+ and certain clinicopathologic features of various cancers, and that STN is an
105
+ independent prognostic factor. However, the clinical significance of the expression
106
+ of STN in gastric cancer has not been reported. Thus, the purpose of this study
107
+ was to evaluate immunohistochemically the clinical significance of expression
108
+ of STN in gastric cancer. The expression of STN in surgically resected specimens
109
+ of human gastric cancer was evaluated immunohistochemically using a monoclonal
110
+ antibody (TKH-2), in 60 patients whose serum STN levels were measured and in 54
111
+ patients with advanced cancer who had been followed for more than 5 years after
112
+ gastrectomy. The correlations between the level of STN expression and clinicopathologic
113
+ factors were analyzed. The staining intensity was graded as follows: (-), less
114
+ than 5% of the cancer cells expressed STN; (+), 5-50%; (++), more than 50%. Sialosyl
115
+ TN antigen staining was detected mainly on the cell membrane, in the cytoplasm,
116
+ and in the luminal contents, and 57.2% of the 60 specimens expressed STN, whereas
117
+ the corresponding value for positive serum levels was 15%. A higher percentage
118
+ of advanced tumors expressed STN than did the early cases, but the difference
119
+ was not statistically significant. All cases with strong staining, the (++) cases,
120
+ were advanced cases either with lymph node metastases or with cancer invading
121
+ in or beyond the muscle layer proper. The expression of STN appeared to be related
122
+ to the clinical stage, the extent of cancer invasion, and the presence of lymph
123
+ node metastases. Sialosyl TN antigen was detected in the serum in less than 6%
124
+ of the patients whose tumors were (-) or (+) for STN expression, and in 86.7%
125
+ of the patients whose tumors expressed high levels of STN (++). The estimated
126
+ 5-year survival in advanced cases (Stage III) was significantly better in those
127
+ with negative STN expression than in those with positive STN expression (P < 0.01).'
128
+ - source_sentence: Does platelet attachment stimulate endothelial cell regeneration
129
+ after arterial injury?
130
+ sentences:
131
+ - The efficacy of lansoprazole (LPZ) at inhibiting gastric acid secretion is influenced
132
+ by cytochrome P450 2C19 (CYP2C19) polymorphism. The purpose of the present study
133
+ was to investigate whether CYP2C19 polymorphism had an influence on the remission
134
+ of erosive reflux esophagitis (RE) during maintenance therapy with LPZ. Eighty-two
135
+ Japanese patients with initial healing of erosive RE by 8 weeks of LPZ therapy
136
+ were enrolled. As maintenance therapy, the patients were treated with LPZ (15
137
+ mg/day) for 6 months. The CYP2C19 genotype, Helicobacter pylori infection status,
138
+ and serum pepsinogen (PG) I/II ratio were assessed before treatment. The patients
139
+ were investigated for relapse by endoscopy at 6 months or when symptoms recurred.
140
+ The proportion of patients in remission after 6 months was 61.5%, 78.0%, and 100%
141
+ among homozygous extensive metabolizers (homo-EM), heterozygous EM (hetero-EM),
142
+ and poor metabolizers (PM), respectively. The percentage of PM patients who remained
143
+ in remission was significantly higher than that of homo-EM or hetero-EM.
144
+ - Arterial injury is associated with endothelial disruption and attachment of platelets
145
+ to an exposed subintimal layer. A variety of factors released by platelets may
146
+ affect the ability of endothelial cells bordering an injury to regenerate. In
147
+ this study an organ culture model of arterial injury was used to investigate the
148
+ relationship between attachment of platelets to a superficial arterial injury
149
+ and endothelial regeneration. A defined superficial endothelial injury was made
150
+ in whole vessel wall explants of rabbit thoracic aorta. Injured explants were
151
+ treated with either fresh whole platelets, the supernatant of platelets aggregated
152
+ by collagen, or basic fibroblast growth factor. Four days after injury and treatment,
153
+ the average distance of endothelial regeneration was determined. A dramatic increase
154
+ in the rate of endothelial cell regeneration was observed when injured vessels
155
+ were exposed to fresh whole platelets (p = 0.003). This increase in regeneration
156
+ was comparable to that observed with fibroblast growth factor. No increase in
157
+ the regenerative rate was found after exposure of explants to the supernatant
158
+ of aggregated platelets (p = 0.69).
159
+ - To introduce an elastomeric continuous infusion pump for pain control after outpatient
160
+ orbital implant surgery. Retrospective, noncomparative consecutive case series
161
+ of all patients undergoing enucleation, evisceration, or secondary orbital implantation
162
+ using the On-Q pain system between August 2004 and January 2006. Postoperative
163
+ pain score, need for narcotics, and adverse events were recorded. The On-Q catheter
164
+ is inserted intraoperatively through the lateral lower eyelid into the muscle
165
+ cone under direct visualization, prior to the orbital implant placement. The On-Q
166
+ system continually infuses anesthesia (bupivacaine) to the retrobulbar site for
167
+ 5 days. Among 20 patients, mean postoperative period pain score, with On-Q in
168
+ place, was 1.3 (scale of 0 to 10). Nine patients (45%) did not need any adjunctive
169
+ oral narcotics. Two patients experienced postoperative nausea. One catheter connector
170
+ leaked, thereby decreasing delivery of retrobulbar anesthetic resulting a pain
171
+ level of 6, the highest level in the study. There were no postoperative infections.
172
+ No systemic toxic effects from bupivacaine were observed clinically.
173
+ - source_sentence: Do mid-regional proadrenomedullin levels predict recurrence of
174
+ atrial fibrillation after catheter ablation?
175
+ sentences:
176
+ - We evaluated the prognostic value of mid-regional proadrenomedullin (MR-proADM)
177
+ in atrial fibrillation (AF) patients undergoing radiofrequency ablation. Plasma
178
+ concentrations of MR-proADM were measured at baseline and after 12months in 87
179
+ AF patients in whom radiofrequency ablation was performed. The association between
180
+ MR-proADM and AF recurrence was tested by univariable and multivariable Cox models.
181
+ In all 87 patients radiofrequency ablation was successfully performed. Of the
182
+ total population 54% had paroxysmal AF. The mean left ventricular ejection fraction
183
+ was 54% (minimum 25%). After 12months of follow-up, 71% of the patients were free
184
+ of AF recurrence. At baseline, mean MR-proADM in the total population was 0.72nmol/l±0.22.
185
+ Patients with AF recurrence had significantly higher baseline MR-proADM (0.89nmol/l±0.29)
186
+ as compared with patients without AF recurrence (0.65nmol/l±0.14; p<0.001). After
187
+ 12months, mean MR-proADM plasma concentration remained higher in patients with
188
+ AF recurrence (0.81nmol/l±0.22 as compared with patients free of AF 0.54nmol/l±0.20;
189
+ p<0.001). Receiver operating characteristic (ROC) curve analysis for MR-proADM
190
+ yields a specificity of 98% and a sensitivity of 64% with an optimal cut-off value
191
+ of 0.82nmol/l to predict recurrence of AF after catheter ablation. In the logistic
192
+ regression analysis only MR-proADM remained independently predictive for AF recurrence.
193
+ - There has been growing interest in the role that implicit processing of drug cues
194
+ can play in motivating drug use behavior. However, the extent to which drug cue
195
+ processing biases relate to the processing biases exhibited to other types of
196
+ evocative stimuli is largely unknown. The goal of the present study was to determine
197
+ how the implicit cognitive processing of smoking cues relates to the processing
198
+ of affective cues using a novel paradigm. Smokers (n = 50) and nonsmokers (n =
199
+ 38) completed a picture-viewing task, in which participants were presented with
200
+ a series of smoking, pleasant, unpleasant, and neutral images while engaging in
201
+ a distractor task designed to direct controlled resources away from conscious
202
+ processing of image content. Electroencephalogram recordings were obtained throughout
203
+ the task for extraction of event-related potentials (ERPs). Smokers exhibited
204
+ differential processing of smoking cues across 3 different ERP indices compared
205
+ with nonsmokers. Comparable effects were found for pleasant cues on 2 of these
206
+ indices. Late cognitive processing of smoking and pleasant cues was associated
207
+ with nicotine dependence and cigarette use.
208
+ - To evaluate the role of toll-like receptors (TLR) 2 and 4 in host responses to
209
+ Aspergillus fumigatus by use of cultured telomerase-immortalized human corneal
210
+ epithelial cells (HCECs). HCECs were stimulated with inactive antigens from A.
211
+ fumigatus. The expression of TLR2 and TLR4, phosphorylation of Ikappa B-alpha
212
+ (pIkappa B-alpha), and release of interleukin (IL)-1beta and IL-6 was measured
213
+ with and without inhibitors to TLR2 and TLR4. Exposure of HCECs to A. fumigatus
214
+ antigens resulted in up-regulation of TLR2 and TLR4, activation of pIkappa B,
215
+ and release of IL-1beta and IL-6 in HCECs, effects that could be inhibited by
216
+ treatment with TLR2 and TLR4 antibodies.
217
+ - source_sentence: Are peripheral blood lymphocytes from patients with rheumatoid
218
+ arthritis differentially sensitive to apoptosis induced by anti-tumour necrosis
219
+ factor-alpha therapy?
220
+ sentences:
221
+ - The aim of this study was to investigate the prognostic effect of serum free light
222
+ chain (sFLC) response after 2 cycles of first-line chemotherapy (CT) in multiple
223
+ myeloma (MM) patients. The data of 78 newly diagnosed MM patients who had sFLC
224
+ levels at diagnosis and after 2 cycles of first-line CT were included in the study.
225
+ The prognostic effect of sFLCs were evaluated with normalization of sFLC κ/λ ratio
226
+ after 2 cycles of CT and involved/uninvolved (i/u) sFLCs. At the end of follow-up
227
+ the probability of overall survival (OS) was 95.7% versus 68.5% in patients with
228
+ and without normalized sFLC κ/λ ratio, respectively (P = .072). The probability
229
+ of OS with i/u sFLC assessment was 97.4% versus 55.8% with regard to i/u sFLC
230
+ ≤ 10 and > 10, respectively (P = .001). In univariate and multivariate analysis
231
+ including sFLC ratio, age, sex, and International Staging System, i/u sFLC ratio
232
+ > 10 after 2 cycles of CT was identified as an independent risk factor for OS
233
+ (P = .015; hazard ratio [HR], 13.2; 95% confidence interval [CI], 1.668-104.65
234
+ vs. P = .011; HR, 15.17; 95% CI, 1.85-123.89).
235
+ - This study examined links between DNA methylation and birth weight centile (BWC),
236
+ and explored the impact of genetic variation. Using HumanMethylation450 arrays,
237
+ we examined candidate gene-associated CpGs in cord blood from newborns with low
238
+ (<15th centile), medium (40-60th centile) and high (>85th centile) BWC (n = 12).
239
+ Candidates were examined in an investigation cohort (n = 110) using pyrosequencing
240
+ and genotyping for putative methylation-associated polymorphisms performed using
241
+ standard PCR. Array analysis identified 314 candidate genes associated with BWC
242
+ extremes, four of which showed ≥ 4 BWC-linked CpGs. Of these, PM20D1 and MI886
243
+ suggested genetically determined methylation levels. However, methylation at three
244
+ CpGs in FGFR2 remained significantly associated with high BWC (p = 0.004-0.027).
245
+ - The efficacy of anti-tumour necrosis factor-alpha (TNF-alpha) therapies in rheumatoid
246
+ arthritis (RA) has been mainly attributed to TNF-alpha neutralisation. Other mechanism
247
+ as immune cell apoptosis, which is impaired in RA, may also be induced by anti-TNF-alpha
248
+ therapies. The aim of our study was to investigate whether TNF-alpha inhibitors
249
+ could induce apoptosis in vitro of the peripheral blood lymphocytes of RA patients.
250
+ Peripheral blood mononuclear cells (PBMC) isolated from 24 patients with RA and
251
+ 18 healthy donors were incubated with anti-TNF-alpha agents, infliximab or etanercept,
252
+ in comparison with no agent and including an isotypic control, for 48 hours. Apoptosis
253
+ was detected and quantified by annexin V labelling of phosphatidylserine externalization
254
+ using cytofluorometric analysis and compared with PBMC production TNF-alpha in
255
+ vitro. In healthy donors, induced apoptosis was observed in 0.3% to 3.8% of lymphocytes
256
+ with both therapies. In RA patients the treatment induced lymphocyte apoptosis
257
+ in 17 of 24 patients with a percentage of annexin V-positive lymphocytes ranging
258
+ from 0.1% to 25%. Among these 17 RA patients, a significant in vitro lymphocyte
259
+ apoptosis (> 4%) was observed in 11 patients (46%) compared with healthy donors
260
+ (p < 0.01). The variability of the response to anti-TNF-alpha within the RA population
261
+ was not dependent on TNF-alpha synthesis or disease activity.
262
+ datasets:
263
+ - HariModelMaven/pumed-finetuning
264
+ pipeline_tag: sentence-similarity
265
+ library_name: sentence-transformers
266
+ metrics:
267
+ - cosine_accuracy
268
+ model-index:
269
+ - name: SentenceTransformer based on sentence-transformers/all-distilroberta-v1
270
+ results:
271
+ - task:
272
+ type: triplet
273
+ name: Triplet
274
+ dataset:
275
+ name: ai pubmed validation
276
+ type: ai-pubmed-validation
277
+ metrics:
278
+ - type: cosine_accuracy
279
+ value: 1.0
280
+ name: Cosine Accuracy
281
+ ---
282
+
283
+ # SentenceTransformer based on sentence-transformers/all-distilroberta-v1
284
+
285
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-distilroberta-v1](https://huggingface.co/sentence-transformers/all-distilroberta-v1) on the [pumed-finetuning](https://huggingface.co/datasets/HariModelMaven/pumed-finetuning) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
286
+
287
+ ## Model Details
288
+
289
+ ### Model Description
290
+ - **Model Type:** Sentence Transformer
291
+ - **Base model:** [sentence-transformers/all-distilroberta-v1](https://huggingface.co/sentence-transformers/all-distilroberta-v1) <!-- at revision 842eaed40bee4d61673a81c92d5689a8fed7a09f -->
292
+ - **Maximum Sequence Length:** 512 tokens
293
+ - **Output Dimensionality:** 768 dimensions
294
+ - **Similarity Function:** Cosine Similarity
295
+ - **Training Dataset:**
296
+ - [pumed-finetuning](https://huggingface.co/datasets/HariModelMaven/pumed-finetuning)
297
+ <!-- - **Language:** Unknown -->
298
+ <!-- - **License:** Unknown -->
299
+
300
+ ### Model Sources
301
+
302
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
303
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
304
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
305
+
306
+ ### Full Model Architecture
307
+
308
+ ```
309
+ SentenceTransformer(
310
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: RobertaModel
311
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
312
+ (2): Normalize()
313
+ )
314
+ ```
315
+
316
+ ## Usage
317
+
318
+ ### Direct Usage (Sentence Transformers)
319
+
320
+ First install the Sentence Transformers library:
321
+
322
+ ```bash
323
+ pip install -U sentence-transformers
324
+ ```
325
+
326
+ Then you can load this model and run inference.
327
+ ```python
328
+ from sentence_transformers import SentenceTransformer
329
+
330
+ # Download from the 🤗 Hub
331
+ model = SentenceTransformer("HariModelMaven/distilroberta-pubmed-embeddings")
332
+ # Run inference
333
+ sentences = [
334
+ 'Are peripheral blood lymphocytes from patients with rheumatoid arthritis differentially sensitive to apoptosis induced by anti-tumour necrosis factor-alpha therapy?',
335
+ 'The efficacy of anti-tumour necrosis factor-alpha (TNF-alpha) therapies in rheumatoid arthritis (RA) has been mainly attributed to TNF-alpha neutralisation. Other mechanism as immune cell apoptosis, which is impaired in RA, may also be induced by anti-TNF-alpha therapies. The aim of our study was to investigate whether TNF-alpha inhibitors could induce apoptosis in vitro of the peripheral blood lymphocytes of RA patients. Peripheral blood mononuclear cells (PBMC) isolated from 24 patients with RA and 18 healthy donors were incubated with anti-TNF-alpha agents, infliximab or etanercept, in comparison with no agent and including an isotypic control, for 48 hours. Apoptosis was detected and quantified by annexin V labelling of phosphatidylserine externalization using cytofluorometric analysis and compared with PBMC production TNF-alpha in vitro. In healthy donors, induced apoptosis was observed in 0.3% to 3.8% of lymphocytes with both therapies. In RA patients the treatment induced lymphocyte apoptosis in 17 of 24 patients with a percentage of annexin V-positive lymphocytes ranging from 0.1% to 25%. Among these 17 RA patients, a significant in vitro lymphocyte apoptosis (> 4%) was observed in 11 patients (46%) compared with healthy donors (p < 0.01). The variability of the response to anti-TNF-alpha within the RA population was not dependent on TNF-alpha synthesis or disease activity.',
336
+ 'This study examined links between DNA methylation and birth weight centile (BWC), and explored the impact of genetic variation. Using HumanMethylation450 arrays, we examined candidate gene-associated CpGs in cord blood from newborns with low (<15th centile), medium (40-60th centile) and high (>85th centile) BWC (n = 12). Candidates were examined in an investigation cohort (n = 110) using pyrosequencing and genotyping for putative methylation-associated polymorphisms performed using standard PCR. Array analysis identified 314 candidate genes associated with BWC extremes, four of which showed ≥ 4 BWC-linked CpGs. Of these, PM20D1 and MI886 suggested genetically determined methylation levels. However, methylation at three CpGs in FGFR2 remained significantly associated with high BWC (p = 0.004-0.027).',
337
+ ]
338
+ embeddings = model.encode(sentences)
339
+ print(embeddings.shape)
340
+ # [3, 768]
341
+
342
+ # Get the similarity scores for the embeddings
343
+ similarities = model.similarity(embeddings, embeddings)
344
+ print(similarities.shape)
345
+ # [3, 3]
346
+ ```
347
+
348
+ <!--
349
+ ### Direct Usage (Transformers)
350
+
351
+ <details><summary>Click to see the direct usage in Transformers</summary>
352
+
353
+ </details>
354
+ -->
355
+
356
+ <!--
357
+ ### Downstream Usage (Sentence Transformers)
358
+
359
+ You can finetune this model on your own dataset.
360
+
361
+ <details><summary>Click to expand</summary>
362
+
363
+ </details>
364
+ -->
365
+
366
+ <!--
367
+ ### Out-of-Scope Use
368
+
369
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
370
+ -->
371
+
372
+ ## Evaluation
373
+
374
+ ### Metrics
375
+
376
+ #### Triplet
377
+
378
+ * Dataset: `ai-pubmed-validation`
379
+ * Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
380
+
381
+ | Metric | Value |
382
+ |:--------------------|:--------|
383
+ | **cosine_accuracy** | **1.0** |
384
+
385
+ <!--
386
+ ## Bias, Risks and Limitations
387
+
388
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
389
+ -->
390
+
391
+ <!--
392
+ ### Recommendations
393
+
394
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
395
+ -->
396
+
397
+ ## Training Details
398
+
399
+ ### Training Dataset
400
+
401
+ #### pumed-finetuning
402
+
403
+ * Dataset: [pumed-finetuning](https://huggingface.co/datasets/HariModelMaven/pumed-finetuning) at [02cc2cc](https://huggingface.co/datasets/HariModelMaven/pumed-finetuning/tree/02cc2cc18a0aad3651a3ee0164319a2281d4da6f)
404
+ * Size: 8,000 training samples
405
+ * Columns: <code>instruction</code>, <code>context</code>, and <code>context_neg</code>
406
+ * Approximate statistics based on the first 1000 samples:
407
+ | | instruction | context | context_neg |
408
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
409
+ | type | string | string | string |
410
+ | details | <ul><li>min: 11 tokens</li><li>mean: 26.36 tokens</li><li>max: 67 tokens</li></ul> | <ul><li>min: 31 tokens</li><li>mean: 320.03 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 82 tokens</li><li>mean: 321.22 tokens</li><li>max: 512 tokens</li></ul> |
411
+ * Samples:
412
+ | instruction | context | context_neg |
413
+ |:------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
414
+ | <code>Do competency assessment of primary care physicians as part of a peer review program?</code> | <code>To design and test a program that assesses clinical competence as a second stage in a peer review process and to determine the program's reliability. A three-cohort study of Ontario primary care physicians. Reference physicians (n = 26) randomly drawn from the Hamilton, Ontario, area; volunteer, self-referred physicians (n = 20); and physicians referred by the licensing body (n = 37) as a result of a disciplinary hearing or peer review. Standardized patients, structured oral examinations, chart-stimulated recall, objective structured clinical examination, and multiple-choice examination. Test reliability was high, ranging from 0.73 to 0.91, and all tests discriminated among subgroups. Demographic variables relating to the final category were age, Canadian or foreign graduates, and whether or not participants were certified in family medicine.</code> | <code>Static stretch is frequently observed in the lung. Both static stretch and cyclic stretch can induce cell death and Na(+)/K(+)-ATPase trafficking, but stretch-induced alveolar epithelial cell (AEC) functions are much less responsive to static than to cyclic stretch. AEC remodeling under static stretch may be partly explained. The aim of this study was to explore the AEC remodeling and functional changes under static stretch conditions. We used A549 cells as a model of AEC type II cells. We assessed F-actin content and cell viability by fluorescence staining at various static-stretch magnitudes and time points. Specifically, we used scanning electron microscopy to explore the possible biological mechanisms used by A549 cells to 'escape' static-stretch-induced injury. Finally, we measured choline cytidylyltransferase-alpha (CCT alpha) mRNA and protein by real-time PCR and Western blot to evaluate cellular secretory function. The results showed that the magnitude of static stretch was the...</code> |
415
+ | <code>Is age an important determinant of the growth hormone response to sprint exercise in non-obese young men?</code> | <code>The factors that regulate the growth hormone (GH) response to physiological stimuli, such as exercise, are not fully understood. The aim of the present study is to determine whether age, body composition, measures of sprint performance or the metabolic response to a sprint are predictors of the GH response to sprint exercise in non-obese young men. Twenty-seven healthy, non-obese males aged 18-32 years performed an all-out 30-second sprint on a cycle ergometer. Univariate linear regression analysis was employed to evaluate age-, BMI-, performance- and metabolic-dependent changes from pre-exercise to peak GH and integrated GH for 60 min after the sprint. GH was elevated following the sprint (change in GH: 17.0 +/- 14.2 microg l(-1); integrated GH: 662 +/- 582 min microg l(-1)). Performance characteristics, the metabolic response to exercise and BMI were not significant predictors of the GH response to exercise. However, age emerged as a significant predictor of both integrated GH (beta ...</code> | <code>We have previously reported the crucial roles of oncogenic Kirsten rat sarcoma viral oncogene homolog (KRAS) in inhibiting apoptosis and disrupting cell polarity via the regulation of phosphodiesterase 4 (PDE4) expression in human colorectal cancer HCT116 cells in three-dimensional cultures (3DC). Herein we evaluated the effects of resveratrol, a PDE4 inhibitor, on the luminal cavity formation and the induction of apoptosis in HCT116 cells. Apoptosis was detected by immunofluorescence using confocal laser scanning microscopy with an antibody against cleaved caspase-3 in HCT116 cells treated with or without resveratrol in a two-dimensional culture (2DC) or 3DC. Resveratrol did not induce apoptosis of HCT116 cells in 2DC, whereas the number of apoptotic HCT116 cells increased after resveratrol treatment in 3DC, leading to formation of a luminal cavity.</code> |
416
+ | <code>Is terlipressin more effective in decreasing variceal pressure than portal pressure in cirrhotic patients?</code> | <code>Terlipressin decreases portal pressure. However, its effects on variceal pressure have been poorly investigated. This study investigated the variceal, splanchnic and systemic hemodynamic effects of terlipressin. Twenty cirrhotic patients with esophageal varices grade II-III, and portal pressure > or =12 mmHg were studied. Hepatic venous pressure gradient, variceal pressure and systemic hemodynamic parameters were obtained. After baseline measurements, in a double-blind administration, 14 patients received a 2mg/iv injection of terlipressin and six patients received placebo. The same measurements were repeated 60 min later. No demographic or biochemical differences were observed in basal condition between groups. Terlipressin produced significant decreases in intravariceal pressure from 20.9+4.9 to 16.3+/-4.7 mmHg (p<0.01, -21+/- 16%), variceal pressure gradient from 18.9+/-4.8 to 13.5+/-6.0 mmHg (p<0.01, -28+/-27%), estimated variceal wall tension from 78+/-29 to 59+/-31 mmHg x mm (p<0...</code> | <code>Based on the theories of brain reserve and cognitive reserve, we investigated whether larger maximal lifetime brain growth (MLBG) and/or greater lifetime intellectual enrichment protect against cognitive decline over time. Forty patients with multiple sclerosis (MS) underwent baseline and 4.5-year follow-up evaluations of cognitive efficiency (Symbol Digit Modalities Test, Paced Auditory Serial Addition Task) and memory (Selective Reminding Test, Spatial Recall Test). Baseline and follow-up MRIs quantified disease progression: percentage brain volume change (cerebral atrophy), percentage change in T2 lesion volume. MLBG (brain reserve) was estimated with intracranial volume; intellectual enrichment (cognitive reserve) was estimated with vocabulary. We performed repeated-measures analyses of covariance to investigate whether larger MLBG and/or greater intellectual enrichment moderate/attenuate cognitive decline over time, controlling for disease progression. Patients with MS declined in...</code> |
417
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
418
+ ```json
419
+ {
420
+ "scale": 20.0,
421
+ "similarity_fct": "cos_sim"
422
+ }
423
+ ```
424
+
425
+ ### Evaluation Dataset
426
+
427
+ #### pumed-finetuning
428
+
429
+ * Dataset: [pumed-finetuning](https://huggingface.co/datasets/HariModelMaven/pumed-finetuning) at [02cc2cc](https://huggingface.co/datasets/HariModelMaven/pumed-finetuning/tree/02cc2cc18a0aad3651a3ee0164319a2281d4da6f)
430
+ * Size: 1,000 evaluation samples
431
+ * Columns: <code>instruction</code>, <code>context</code>, and <code>context_neg</code>
432
+ * Approximate statistics based on the first 1000 samples:
433
+ | | instruction | context | context_neg |
434
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
435
+ | type | string | string | string |
436
+ | details | <ul><li>min: 11 tokens</li><li>mean: 26.08 tokens</li><li>max: 52 tokens</li></ul> | <ul><li>min: 38 tokens</li><li>mean: 311.42 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 45 tokens</li><li>mean: 316.2 tokens</li><li>max: 512 tokens</li></ul> |
437
+ * Samples:
438
+ | instruction | context | context_neg |
439
+ |:----------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
440
+ | <code>Are pre-transplant impedance measures of reflux associated with early allograft injury after lung transplantation?</code> | <code>Acid reflux has been associated with poorer outcomes after lung transplantation. Standard pre-transplant reflux assessment has not been universally adopted. Non-acid reflux may also induce a pulmonary inflammatory cascade, leading to acute and chronic rejection. Esophageal multichannel intraluminal impedance and pH testing (MII-pH) may be valuable in standard pre-transplant evaluation. We assessed the association between pre-transplant MII-pH measures and early allograft injury in lung transplant patients. This was a retrospective cohort study of lung transplant recipients who underwent pre-transplant MII-pH at a tertiary center from 2007 to 2012. Results from pre-transplant MII-pH, cardiopulmonary function testing, and results of biopsy specimen analysis of the transplanted lung were recorded. Time-to-event analyses were performed using Cox proportional hazards and Kaplan-Maier methods to assess the associations between MII-pH measures and development of acute rejection or lymphocytic...</code> | <code>The yeast cell cycle is largely controlled by the cyclin-dependent kinase (CDK) Cdc28. Recent evidence suggests that both CDK complex stability as well as function during mitosis is determined by precise regulation of Swe1, a CDK inhibitory kinase and cyclin binding partner. A model of mitotic progression has been provided by study of filamentous yeast. When facing nutrient-limited conditions, Ras2-mediated PKA and MAPK signaling cascades induce a switch from round to filamentous morphology resulting in delayed mitotic progression. To delineate how the dimorphic switch contributes to cell cycle regulation, temperature sensitive cdc28 mutants exhibiting constitutive filamentation were subjected to epistasis analyses with RAS2 signaling effectors. It was found that Swe1-mediated inhibitory tyrosine phosphorylation of Cdc28 during filamentous growth is in part mediated by Ras2 activation of PKA, but not Kss1-MAPK, signaling. This pathway is further influenced by Cks1, a conserved CDK-bind...</code> |
441
+ | <code>Is predictive accuracy of the TRISS survival statistic improved by a modification that includes admission pH?</code> | <code>To determine if pH measured at the time of hospital admission and corrected for PCO2 was an independent predictor of trauma survival. Phase 1 was a retrospective case-control analysis of 1708 patients, followed by multivariate multiple logistic regression analysis of a subset of 919 patients for whom the Revised Trauma Score (RTS), Injury Severity Score (ISS), and pH were available. Phase 2 was a prospective comparison of a mathematical model of survival derived in phase 1 (pH-TRISS) with the TRISS method in 508 of 1325 subsequently admitted trauma patients. Urban level 1 trauma center. All patients admitted with blunt or penetrating trauma during the study period. Survival vs mortality. In phase 1, factors significantly associated with mortality by t test and chi 2 analysis included the RTS, ISS< Glasgow Coma Scale, corrected pH (CpH), and sum of the head, chest, and abdominal components of the Abbreviated Injury Scale-85 (AIS85) (HCAISS) (for all, P < .0001). The TRISS statistic was ...</code> | <code>Ovarian cancer is the most lethal gynecologic malignancy, and there is an unmet clinical need to develop new therapies. Although showing promising anticancer activity, Niclosamide may not be used as a monotherapy. We seek to investigate whether inhibiting IGF signaling potentiates Niclosamide's anticancer efficacy in human ovarian cancer cells. Cell proliferation and migration are assessed. Cell cycle progression and apoptosis are analyzed by flow cytometry. Inhibition of IGF signaling is accomplished by adenovirus-mediated expression of siRNAs targeting IGF-1R. Cancer-associated pathways are assessed using pathway-specific reporters. Subcutaneous xenograft model is used to determine anticancer activity. We find that Niclosamide is highly effective on inhibiting cell proliferation, cell migration, and cell cycle progression, and inducing apoptosis in human ovarian cancer cells, possibly by targeting multiple signaling pathways involved in ELK1/SRF, AP-1, MYC/MAX and NFkB. Silencing IGF...</code> |
442
+ | <code>Does exposure to intermittent nociceptive stimulation under pentobarbital anesthesia disrupt spinal cord function in rats?</code> | <code>Spinal cord plasticity can be assessed in spinal rats using an instrumental learning paradigm in which subjects learn an instrumental response, hindlimb flexion, to minimize shock exposure. Prior exposure to uncontrollable intermittent stimulation blocks learning in spinal rats but has no effect if given before spinal transection, suggesting that supraspinal systems modulate nociceptive input to the spinal cord, rendering it less susceptible to the detrimental consequences of uncontrollable stimulation. The present study examines whether disrupting brain function with pentobarbital blocks descending inhibitory systems that normally modulate nociceptive input, making the spinal cord more sensitive to the adverse effect of uncontrollable intermittent stimulation. Male Sprague-Dawley rats received uncontrollable intermittent stimulation during pentobarbital anesthesia after (experiment 1) or before (experiment 2) spinal cord transection. They were then tested for instrumental learning at ...</code> | <code>Increased serum hepcidin has been reported in patients receiving chronic hemodialysis, and hypothesized to contribute to the alterations of iron metabolism of end-stage renal disease. However, no quantitative assessment is available to date; the clinical determinants are still under definition; and the role of genetic factors, namely HFE mutations, has not yet been evaluated. The aim of this study was to quantitatively assess serum hepcidin-25 in hemodialysis patients versus controls, and analyze the relationship between hepcidin, iron indices, HFE genotype, and erythropoietic parameters. Sixty-five hemodialysis patients and 57 healthy controls were considered. Hepcidin-25 was evaluated by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry, HFE genotype by restriction analysis. Serum hepcidin-25 was higher in hemodialysis patients compared with controls. In patients, hepcidin-25 correlated positively with ferritin and C reactive protein, and negatively with s...</code> |
443
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
444
+ ```json
445
+ {
446
+ "scale": 20.0,
447
+ "similarity_fct": "cos_sim"
448
+ }
449
+ ```
450
+
451
+ ### Training Hyperparameters
452
+ #### Non-Default Hyperparameters
453
+
454
+ - `eval_strategy`: steps
455
+ - `per_device_train_batch_size`: 16
456
+ - `per_device_eval_batch_size`: 16
457
+ - `learning_rate`: 2e-05
458
+ - `num_train_epochs`: 1
459
+ - `warmup_ratio`: 0.1
460
+ - `batch_sampler`: no_duplicates
461
+
462
+ #### All Hyperparameters
463
+ <details><summary>Click to expand</summary>
464
+
465
+ - `overwrite_output_dir`: False
466
+ - `do_predict`: False
467
+ - `eval_strategy`: steps
468
+ - `prediction_loss_only`: True
469
+ - `per_device_train_batch_size`: 16
470
+ - `per_device_eval_batch_size`: 16
471
+ - `per_gpu_train_batch_size`: None
472
+ - `per_gpu_eval_batch_size`: None
473
+ - `gradient_accumulation_steps`: 1
474
+ - `eval_accumulation_steps`: None
475
+ - `torch_empty_cache_steps`: None
476
+ - `learning_rate`: 2e-05
477
+ - `weight_decay`: 0.0
478
+ - `adam_beta1`: 0.9
479
+ - `adam_beta2`: 0.999
480
+ - `adam_epsilon`: 1e-08
481
+ - `max_grad_norm`: 1.0
482
+ - `num_train_epochs`: 1
483
+ - `max_steps`: -1
484
+ - `lr_scheduler_type`: linear
485
+ - `lr_scheduler_kwargs`: {}
486
+ - `warmup_ratio`: 0.1
487
+ - `warmup_steps`: 0
488
+ - `log_level`: passive
489
+ - `log_level_replica`: warning
490
+ - `log_on_each_node`: True
491
+ - `logging_nan_inf_filter`: True
492
+ - `save_safetensors`: True
493
+ - `save_on_each_node`: False
494
+ - `save_only_model`: False
495
+ - `restore_callback_states_from_checkpoint`: False
496
+ - `no_cuda`: False
497
+ - `use_cpu`: False
498
+ - `use_mps_device`: False
499
+ - `seed`: 42
500
+ - `data_seed`: None
501
+ - `jit_mode_eval`: False
502
+ - `use_ipex`: False
503
+ - `bf16`: False
504
+ - `fp16`: False
505
+ - `fp16_opt_level`: O1
506
+ - `half_precision_backend`: auto
507
+ - `bf16_full_eval`: False
508
+ - `fp16_full_eval`: False
509
+ - `tf32`: None
510
+ - `local_rank`: 0
511
+ - `ddp_backend`: None
512
+ - `tpu_num_cores`: None
513
+ - `tpu_metrics_debug`: False
514
+ - `debug`: []
515
+ - `dataloader_drop_last`: False
516
+ - `dataloader_num_workers`: 0
517
+ - `dataloader_prefetch_factor`: None
518
+ - `past_index`: -1
519
+ - `disable_tqdm`: False
520
+ - `remove_unused_columns`: True
521
+ - `label_names`: None
522
+ - `load_best_model_at_end`: False
523
+ - `ignore_data_skip`: False
524
+ - `fsdp`: []
525
+ - `fsdp_min_num_params`: 0
526
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
527
+ - `fsdp_transformer_layer_cls_to_wrap`: None
528
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
529
+ - `deepspeed`: None
530
+ - `label_smoothing_factor`: 0.0
531
+ - `optim`: adamw_torch
532
+ - `optim_args`: None
533
+ - `adafactor`: False
534
+ - `group_by_length`: False
535
+ - `length_column_name`: length
536
+ - `ddp_find_unused_parameters`: None
537
+ - `ddp_bucket_cap_mb`: None
538
+ - `ddp_broadcast_buffers`: False
539
+ - `dataloader_pin_memory`: True
540
+ - `dataloader_persistent_workers`: False
541
+ - `skip_memory_metrics`: True
542
+ - `use_legacy_prediction_loop`: False
543
+ - `push_to_hub`: False
544
+ - `resume_from_checkpoint`: None
545
+ - `hub_model_id`: None
546
+ - `hub_strategy`: every_save
547
+ - `hub_private_repo`: None
548
+ - `hub_always_push`: False
549
+ - `gradient_checkpointing`: False
550
+ - `gradient_checkpointing_kwargs`: None
551
+ - `include_inputs_for_metrics`: False
552
+ - `include_for_metrics`: []
553
+ - `eval_do_concat_batches`: True
554
+ - `fp16_backend`: auto
555
+ - `push_to_hub_model_id`: None
556
+ - `push_to_hub_organization`: None
557
+ - `mp_parameters`:
558
+ - `auto_find_batch_size`: False
559
+ - `full_determinism`: False
560
+ - `torchdynamo`: None
561
+ - `ray_scope`: last
562
+ - `ddp_timeout`: 1800
563
+ - `torch_compile`: False
564
+ - `torch_compile_backend`: None
565
+ - `torch_compile_mode`: None
566
+ - `dispatch_batches`: None
567
+ - `split_batches`: None
568
+ - `include_tokens_per_second`: False
569
+ - `include_num_input_tokens_seen`: False
570
+ - `neftune_noise_alpha`: None
571
+ - `optim_target_modules`: None
572
+ - `batch_eval_metrics`: False
573
+ - `eval_on_start`: False
574
+ - `use_liger_kernel`: False
575
+ - `eval_use_gather_object`: False
576
+ - `average_tokens_across_devices`: False
577
+ - `prompts`: None
578
+ - `batch_sampler`: no_duplicates
579
+ - `multi_dataset_batch_sampler`: proportional
580
+
581
+ </details>
582
+
583
+ ### Training Logs
584
+ | Epoch | Step | Training Loss | Validation Loss | ai-pubmed-validation_cosine_accuracy |
585
+ |:-----:|:----:|:-------------:|:---------------:|:------------------------------------:|
586
+ | -1 | -1 | - | - | 1.0 |
587
+ | 0.2 | 100 | 0.0058 | 0.0046 | 1.0 |
588
+ | 0.4 | 200 | 0.0051 | 0.0035 | 1.0 |
589
+ | 0.6 | 300 | 0.0043 | 0.0031 | 1.0 |
590
+ | 0.8 | 400 | 0.0026 | 0.0034 | 1.0 |
591
+ | 1.0 | 500 | 0.0048 | 0.0034 | 1.0 |
592
+
593
+
594
+ ### Framework Versions
595
+ - Python: 3.11.11
596
+ - Sentence Transformers: 3.4.1
597
+ - Transformers: 4.48.3
598
+ - PyTorch: 2.5.1+cu124
599
+ - Accelerate: 1.3.0
600
+ - Datasets: 3.3.2
601
+ - Tokenizers: 0.21.0
602
+
603
+ ## Citation
604
+
605
+ ### BibTeX
606
+
607
+ #### Sentence Transformers
608
+ ```bibtex
609
+ @inproceedings{reimers-2019-sentence-bert,
610
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
611
+ author = "Reimers, Nils and Gurevych, Iryna",
612
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
613
+ month = "11",
614
+ year = "2019",
615
+ publisher = "Association for Computational Linguistics",
616
+ url = "https://arxiv.org/abs/1908.10084",
617
+ }
618
+ ```
619
+
620
+ #### MultipleNegativesRankingLoss
621
+ ```bibtex
622
+ @misc{henderson2017efficient,
623
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
624
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
625
+ year={2017},
626
+ eprint={1705.00652},
627
+ archivePrefix={arXiv},
628
+ primaryClass={cs.CL}
629
+ }
630
+ ```
631
+
632
+ <!--
633
+ ## Glossary
634
+
635
+ *Clearly define terms in order to be accessible across audiences.*
636
+ -->
637
+
638
+ <!--
639
+ ## Model Card Authors
640
+
641
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
642
+ -->
643
+
644
+ <!--
645
+ ## Model Card Contact
646
+
647
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
648
+ -->
config.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "/content/models/distilroberta-pubmed-embeddings/checkpoint-500",
3
+ "architectures": [
4
+ "RobertaModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "bos_token_id": 0,
8
+ "classifier_dropout": null,
9
+ "eos_token_id": 2,
10
+ "gradient_checkpointing": false,
11
+ "hidden_act": "gelu",
12
+ "hidden_dropout_prob": 0.1,
13
+ "hidden_size": 768,
14
+ "initializer_range": 0.02,
15
+ "intermediate_size": 3072,
16
+ "layer_norm_eps": 1e-05,
17
+ "max_position_embeddings": 514,
18
+ "model_type": "roberta",
19
+ "num_attention_heads": 12,
20
+ "num_hidden_layers": 6,
21
+ "pad_token_id": 1,
22
+ "position_embedding_type": "absolute",
23
+ "torch_dtype": "float32",
24
+ "transformers_version": "4.48.3",
25
+ "type_vocab_size": 1,
26
+ "use_cache": true,
27
+ "vocab_size": 50265
28
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.4.1",
4
+ "transformers": "4.48.3",
5
+ "pytorch": "2.5.1+cu124"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0722e8d0f953e10e7f3a3de0736b0461a2a847c854e3c52c53ff5d26352cd2bc
3
+ size 328485128
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "add_prefix_space": false,
3
+ "added_tokens_decoder": {
4
+ "0": {
5
+ "content": "<s>",
6
+ "lstrip": false,
7
+ "normalized": false,
8
+ "rstrip": false,
9
+ "single_word": false,
10
+ "special": true
11
+ },
12
+ "1": {
13
+ "content": "<pad>",
14
+ "lstrip": false,
15
+ "normalized": false,
16
+ "rstrip": false,
17
+ "single_word": false,
18
+ "special": true
19
+ },
20
+ "2": {
21
+ "content": "</s>",
22
+ "lstrip": false,
23
+ "normalized": false,
24
+ "rstrip": false,
25
+ "single_word": false,
26
+ "special": true
27
+ },
28
+ "3": {
29
+ "content": "<unk>",
30
+ "lstrip": false,
31
+ "normalized": false,
32
+ "rstrip": false,
33
+ "single_word": false,
34
+ "special": true
35
+ },
36
+ "50264": {
37
+ "content": "<mask>",
38
+ "lstrip": true,
39
+ "normalized": false,
40
+ "rstrip": false,
41
+ "single_word": false,
42
+ "special": true
43
+ }
44
+ },
45
+ "bos_token": "<s>",
46
+ "clean_up_tokenization_spaces": false,
47
+ "cls_token": "<s>",
48
+ "eos_token": "</s>",
49
+ "errors": "replace",
50
+ "extra_special_tokens": {},
51
+ "mask_token": "<mask>",
52
+ "max_length": 128,
53
+ "model_max_length": 512,
54
+ "pad_to_multiple_of": null,
55
+ "pad_token": "<pad>",
56
+ "pad_token_type_id": 0,
57
+ "padding_side": "right",
58
+ "sep_token": "</s>",
59
+ "stride": 0,
60
+ "tokenizer_class": "RobertaTokenizer",
61
+ "trim_offsets": true,
62
+ "truncation_side": "right",
63
+ "truncation_strategy": "longest_first",
64
+ "unk_token": "<unk>"
65
+ }
vocab.json ADDED
The diff for this file is too large to render. See raw diff