IDEA-CCNL
/

Zhouwenwang-Unified-1.3B

Model card Files Files and versions Community

suolyer commited on Nov 25, 2021

Commit

a8400d3

·

1 Parent(s): b78f377

Update README.md

Files changed (1) hide show

README.md +32 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ There is no structure of Zhouwenwang-1.3B in [Transformers](https://github.com/h
  git clone https://github.com/IDEA-CCNL/Fengshenbang-LM.git
  ```
-### Load Model
 ```python
 from model.roformer.modeling_roformer import RoFormerModel
 from model.roformer.configuration_roformer import RoFormerConfig
@@ -27,6 +27,37 @@ model = RoFormerModel.from_pretrained("IDEA-CCNL/Zhouwenwang-1.3B")
 ```
 ## Scores on downstream chinese tasks (without any data augmentation)
 |     Model| afqmc    |  tnews  | iflytek    |  ocnli  |  cmnli  | wsc  | csl  |
 | :--------:    | :-----:  | :----:  | :-----:   | :----: | :----: | :----: | :----: |

  git clone https://github.com/IDEA-CCNL/Fengshenbang-LM.git
  ```
+### Load model
 ```python
 from model.roformer.modeling_roformer import RoFormerModel
 from model.roformer.configuration_roformer import RoFormerConfig
 ```
+### Generate task
+You can use Zhouwenwang-1.3B to continue writing
+```python
+from model.roformer.modeling_roformer import RoFormerModel
+from transformers import AutoTokenizer
+import torch
+import numpy as np
+sentence = '清华大学位于'
+max_length = 32
+model_pretrained_weight_path = '/home/'  # 预训练模型权重路径
+tokenizer = AutoTokenizer.from_pretrained(model_pretrained_weight_path)
+model = RoFormerModel.from_pretrained(model_pretrained_weight_path)
+for i in range(max_length):
+    encode = torch.tensor(
+        [[tokenizer.cls_token_id]+tokenizer.encode(sentence, add_special_tokens=False)]).long()
+    logits = model(encode)[0]
+    logits = torch.nn.functional.linear(
+        logits, model.embeddings.word_embeddings.weight)
+    logits = torch.nn.functional.softmax(
+        logits, dim=-1).cpu().detach().numpy()[0]
+    sentence = sentence + \
+        tokenizer.decode(int(np.random.choice(logits.shape[1], p=logits[-1])))
+    if sentence[-1] == '。':
+        break
+print(sentence)
+```
 ## Scores on downstream chinese tasks (without any data augmentation)
 |     Model| afqmc    |  tnews  | iflytek    |  ocnli  |  cmnli  | wsc  | csl  |
 | :--------:    | :-----:  | :----:  | :-----:   | :----: | :----: | :----: | :----: |