sukuya committed on
Commit 3740b04 · verified · 1 Parent(s): ab2272a

update model card

Files changed (1)
  1. README.md +67 -3
README.md CHANGED
@@ -1,3 +1,67 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ language:
+ - en
+ - ja
+ ---
+
+ # RakutenAI-2.0-mini
+ ## Model Description
+ RakutenAI-2.0-mini is a lightweight Japanese language model trained from scratch using a transformer architecture, designed for efficient performance in resource-constrained environments. As a foundation model, it serves as the backbone for instruct models.
+
+ *If you are looking for an instruct model, check [RakutenAI-2.0-mini-instruct](https://huggingface.co/Rakuten/RakutenAI-2.0-mini-instruct)*.
+
+
+ ## Model Usage
+ ## Usage
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+ model_path = "Rakuten/RakutenAI-2.0-mini"
+ tokenizer = AutoTokenizer.from_pretrained(model_path)
+ model = AutoModelForCausalLM.from_pretrained(model_path, torch_dtype="auto", device_map="auto")
+ model.eval()
+
+ requests = [
+     "南硫黄島原生自然環境保全地域は、自然",
+     "The capybara is a giant cavy rodent",
+ ]
+
+ for req in requests:
+     input_text = tokenizer(req, return_tensors="pt").to(device=model.device)
+     tokens = model.generate(
+         **input_text,
+         max_new_tokens=512,
+         do_sample=True,
+         pad_token_id=tokenizer.eos_token_id,
+     )
+     out = tokenizer.decode(tokens[0], skip_special_tokens=True)
+     print("INPUT:\n" + req)
+     print("OUTPUT:\n" + out)
+
+ ```
+
+ ## Model Details
+
+ * **Developed by**: [Rakuten Group, Inc.](https://ai.rakuten.com/)
+ * **Language(s)**: Japanese, English
+ * **License**: This model is licensed under the [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0).
+ * **Model Architecture**: Transformer
+
+ ### Limitations and Bias
+
+ The suite of RakutenAI-2.0 models is capable of generating human-like text on a wide range of topics. However, like all LLMs, they have limitations and can produce biased, inaccurate, or unsafe outputs. Please exercise caution and judgement while interacting with them.
+
+ ## Citation
+ To cite our work on the suite of RakutenAI-2.0 models, please use:
+
+ ```
+ @misc{rakutengroup2025rakutenai2.0,
+     author = {Rakuten Group, Inc.},
+     title = {RakutenAI-2.0},
+     year = {2025},
+     publisher = {Hugging Face},
+     url = {https://huggingface.co/Rakuten},
+ }
+
+ ```
+
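
Note on the usage snippet added in this commit: for decoder-only models, `generate` in `transformers` returns the prompt ids followed by the newly generated ids, so decoding `tokens[0]` prints the prompt again under `OUTPUT:`. If only the continuation is wanted, the prompt portion can be sliced off before decoding. The sketch below illustrates that slicing step on plain Python lists; the helper name `continuation_ids` and the toy ids are hypothetical and not part of the model card.

```python
from typing import List, Sequence


def continuation_ids(generated: Sequence[int], prompt_len: int) -> List[int]:
    """Return only the ids that follow the first `prompt_len` prompt ids.

    `continuation_ids` is a hypothetical helper for illustration only.
    """
    if prompt_len < 0 or prompt_len > len(generated):
        raise ValueError("prompt_len must be between 0 and len(generated)")
    return list(generated[prompt_len:])


# Toy ids standing in for a tokenized prompt and a model's generated output.
prompt = [101, 7, 42]
generated = prompt + [9, 13, 2]

new_ids = continuation_ids(generated, len(prompt))
print(new_ids)
```

With the variables from the README snippet, the equivalent step would presumably be `tokens[0][input_text["input_ids"].shape[1]:]` passed to `tokenizer.decode`, assuming a single unpadded prompt per call as in that loop.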