devnote5676 commited on
Commit
6a93326
·
1 Parent(s): 9e67f5f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -7
README.md CHANGED
@@ -27,10 +27,14 @@ This classifier is intended to predict the existence of social values from text
27
  10. tradition
28
 
29
  ## Datasets
30
- This model is finetuned on two datasets: ValueNet and Touche23-ValueEval
 
31
  We follow the original paper to convert both datasets into a binary classification task for each dimension.
32
- - ValueNet: a sentence has a positive label if the original label contains 1 (positive) or -1 (negative), and 0 if the original label is 0.
33
- - ValueEval: a sentence is assigned a positive label if the original label vector is marked 1 for that dimension. Since the original paper follows a 20-dimension refined categorization, we map them back to 10 dimensions. Therefore, the same sentence appears ten times, once for each dimension.
 
 
 
34
 
35
  ## How to use
36
  Start your sentence with a label that indicates which dimension to measure. An example would be:
@@ -40,10 +44,15 @@ Start your sentence with a label that indicates which dimension to measure. An e
40
  Please make sure to follow the exact format "<value\_name>" at the beginning of the sentence as this is a special token in the tokenizer: any spaces or different formats will not be encoded correctly.
41
 
42
  ## Performances
43
- - F1 score (macro)
44
- - ValueNet only: 0.648
45
- - ValueEval only: 0.744
46
  - Combined: 0.759
 
 
 
 
 
47
 
48
  ## Training details
49
  - Base model: bert-base-uncased
@@ -53,4 +62,4 @@ Please make sure to follow the exact format "<value\_name>" at the beginning of
53
  - Upsampled training set to maintain 1:1 balance for pos:neg labels
54
 
55
  ## References
56
- Do Differences in Values Influence Disagreements in Online Discussions? (EMNLP'23) [link](https://aclanthology.org/2023.emnlp-main.992/)
 
27
  10. tradition
28
 
29
  ## Datasets
30
+ This model is finetuned on two datasets: ValueNet (A New Dataset for Human Value Driven Dialogue System, Qiu et al. 2021) and Touche23-ValueEval (The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments, Mirzakhmedova et al., 2023).
31
+
32
  We follow the original paper to convert both datasets into a binary classification task for each dimension.
33
+ - ValueNet
34
+ - A sentence has a positive label if the original label contains 1 (positive) or -1 (negative), and 0 if the original label is 0.
35
+ - ValueEval
36
+ - A sentence is assigned a positive label if the original label vector is marked 1 for that dimension.
37
+ - Since the original paper follows a 20-dimension refined categorization, we map them back to 10 dimensions. Therefore, the same sentence appears ten times, once for each dimension.
38
 
39
  ## How to use
40
  Start your sentence with a label that indicates which dimension to measure. An example would be:
 
44
  Please make sure to follow the exact format "<value\_name>" at the beginning of the sentence as this is a special token in the tokenizer: any spaces or different formats will not be encoded correctly.
45
 
46
  ## Performances
47
+ - macro F1 score
48
+ - on ValueNet: 0.648
49
+ - on ValueEval: 0.744
50
  - Combined: 0.759
51
+
52
+ - ROC-AUC
53
+ - on ValueNet: 0.736
54
+ - on ValueEval:0.847
55
+ - Combined: 0.855
56
 
57
  ## Training details
58
  - Base model: bert-base-uncased
 
62
  - Upsampled training set to maintain 1:1 balance for pos:neg labels
63
 
64
  ## References
65
+ - Do Differences in Values Influence Disagreements in Online Discussions? (EMNLP'23) [link](https://aclanthology.org/2023.emnlp-main.992/)