image/png

HRM8K ํ•œ๊ตญ์–ด ์ˆ˜ํ•™ ๋Šฅ๋ ฅ ๋ฒค์น˜๋งˆํฌ ์ค‘ ์ผ๋ถ€

  • thinking์€ ์ตœ๋Œ€ 8192ํ† ํฐ๋งŒ ํ—ˆ์šฉ
  • ์ตœ๋Œ€ ํ† ํฐ์„ ์†Œ๋ชจํ•ด๋„ </thought> ๋‚˜ </think>๋ฅผ ์ƒ์„ฑํ•˜์ง€ ๋ชปํ–ˆ์„ ๊ฒฝ์šฐ, ๊ฐ•์ œ๋กœ ํ•ด๋‹น ํ† ํฐ์„ ์ƒ์„ฑํ•œ ํ›„ ๋‹ต๋ณ€ ์ƒ์„ฑ ์š”์ฒญ
  • temperature = 0.6 (EXAONE), 0.7(Qwen2.5)
  • top_p = 0.95
Model GSM8K KSM MATH Reasoning Language
LGAI-EXAONE/EXAONE-Deep-7.8B 85.48 31.72 84.23 English
werty1248/EXAONE-Deep-7.8B-Ko-Thought 78.85 28.01 79.45 Korean
werty1248/EXAONE-3.5-7.8B-Stratos-Ko 83.02 15.97 67.49 English
werty1248/Qwen2.5-7B-s1.1-Ko-Native 76.27 15.48 66.45 Korean(Chinese/English)
LGAI-EXAONE/EXAONE-3.5-7.8B-Instruct 81.58 14.71 63.50 (CoT in Korean)

ํŠน์ด์‚ฌํ•ญ

  • EXAONE-Deep-7.8B์€ GSM8K์—์„œ ์„ฑ๋Šฅ ๊ฐœ์„ ์ด ์žˆ์—ˆ์œผ๋‚˜, ํ•œ๊ตญ์–ด ํŠœ๋‹ ๊ณผ์ •์—์„œ ์„ฑ๋Šฅ์ด ์›๋ณธ ๋ชจ๋ธ๋ณด๋‹ค ํ•˜๋ฝ.
  • LGAI-EXAONE/EXAONE-Deep-7.8B์€ ์ฃผ์–ด์ง„ 8192ํ† ํฐ์ด ๋ถ€์กฑํ•ด์„œ ๋‹ต๋ณ€์„ ๋‚ด์ง€ ๋ชปํ•˜๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ์•˜์Œ
    • KSM์˜ ๊ฒฝ์šฐ, EXAONE-Deep-7.8B์ด ์ผ๋‹จ ๋‹ต๋ณ€์„ ์ œ์ถœํ•˜๋Š” ๋ฐ ์„ฑ๊ณตํ•œ ์งˆ๋ฌธ ์ค‘ ์ •๋‹ต๋ฅ ์€ 75.61% (werty1248/EXAONE-Deep-7.8B-Ko-Thought๋Š” 52.36%)
  • ๊ตฌ์ฒด์ ์ธ ๊ธฐ๋ก์€ ํ•˜์ง€ ์•Š์•˜์ง€๋งŒ, werty1248/EXAONE-3.5-7.8B-s1.1-Ko-Native์˜ ์ ์ˆ˜๋Š” LGAI-EXAONE/EXAONE-3.5-7.8B-Instruct์™€ ๊ฑฐ์˜ ๋™์ผํ–ˆ์Œ
    • ๊ธฐ๋ณธ EXAONE ๋ชจ๋ธ์€ ์ถ”๋ก  ํ•™์Šต์—์„œ warm start๊ฐ€ ์•ˆ๋˜๋Š” ๊ฒƒ์ด ์•„๋‹๊นŒ?
    • ๋ฐ˜๋Œ€๋กœ EXAONE-Deep์˜ ์ถ”๋ก  ๋Šฅ๋ ฅ์„ ํ•œ๊ตญ์–ด๋กœ ์˜ฎ๊ธฐ๋Š” ๊ณผ์ •์€ ํ›จ์”ฌ ์ž˜ ํ•™์Šตํ•จ
  • ํ•œ๊ตญ์–ด ํ•™์Šต ๊ณผ์ •์—์„œ "ํ•œ ๋ฒˆ ์‹ค์ˆ˜ํ•˜๋ฉด ๋ญ๊ฐ€ ์ž˜๋ชป๋˜์—ˆ๋Š”์ง€ ๋ชจ๋ฅธ ์ฑ„ ๊ฐ™์€ ์˜ค๋ฅ˜๋ฅผ ๊ณ„์† ๋ฐ˜๋ณต ๊ฒ€์ฆํ•˜๋ ค๋Š” ํ˜„์ƒ"์ด ๋‹ค์‹œ ๋ฐœ์ƒํ•จ.
    • ์‹คํŒจ ์˜ˆ์‹œ

์งˆ๋ฌธ: Carla๋Š” 200GB ํŒŒ์ผ์„ ๋‹ค์šด๋กœ๋“œ ์ค‘์ž…๋‹ˆ๋‹ค. ๋ณดํ†ต 1๋ถ„์— 2GB๋ฅผ ๋‹ค์šด๋กœ๋“œํ•  ์ˆ˜ ์žˆ์ง€๋งŒ, ๋‹ค์šด๋กœ๋“œ์˜ 40%๊ฐ€ ์ง„ํ–‰๋์„ ๋•Œ Windows๊ฐ€ \
์—…๋ฐ์ดํŠธ ์„ค์น˜๋ฅผ ์œ„ํ•ด ์žฌ์‹œ์ž‘ํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์€ 20๋ถ„์ด ๊ฑธ๋ฆฝ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๊ณ  ๋‚˜์„œ Carla๋Š” ๋‹ค์šด๋กœ๋“œ๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ์‹œ์ž‘ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. ํŒŒ์ผ์„ \
๋‹ค์šด๋กœ๋“œํ•˜๋Š” ๋ฐ ์ด ์–ผ๋งˆ๋‚˜ ๊ฑธ๋ฆฌ๋‚˜์š”?

๋‹ต๋ณ€: <thought>
...(์ƒ๋žต)

๊ทธ๋Ÿฐ๋ฐ ๋‹ค๋ฅธ ๊ฐ€๋Šฅ์„ฑ์€ ์—†์„๊นŒ? ์˜ˆ๋ฅผ ๋“ค์–ด, ์žฌ์‹œ์ž‘ ํ›„์— ๋‹ค์šด๋กœ๋“œ๋ฅผ ์‹œ์ž‘ํ•  ๋•Œ, ์ด๋ฏธ ๋‹ค์šด๋กœ๋“œ๊ฐ€ ์ง„ํ–‰๋˜๊ณ  ์žˆ๋Š” ์ƒํƒœ์—์„œ ์žฌ์‹œ์ž‘์ด ๋˜๋ฉด, \
๊ทธ ๋ถ€๋ถ„์„ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•ด์•ผ ํ•˜๋Š” ๊ฒฝ์šฐ๊ฐ€ ์žˆ์„ ์ˆ˜ ์žˆ์ง€ ์•Š์„๊นŒ? ๊ทธ๋Ÿฐ๋ฐ ๋ฌธ์ œ์—์„œ๋Š” "๋‹ค์šด๋กœ๋“œ๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ์‹œ์ž‘ํ•ด์•ผ ํ•œ๋‹ค"๊ณ  ํ–ˆ์œผ๋‹ˆ๊นŒ, \
์žฌ์‹œ์ž‘ ํ›„์—๋Š” ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œ๋ฅผ ์‹œ์ž‘ํ•˜๋Š” ๊ฒƒ ๊ฐ™์•„. ์ฆ‰, ์žฌ์‹œ์ž‘ ํ›„์—๋Š” ๋‚จ์€ ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์šด๋กœ๋“œํ•˜๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ, ์ด๋ฏธ \
๋‹ค์šด๋กœ๋“œํ•œ 40%๋ฅผ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•˜๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ, ๊ทธ๋ƒฅ ๋‹ค์‹œ ์‹œ์ž‘ํ•ด์„œ ๋‚จ์€ 60%๋ฅผ ๋‹ค์šด๋กœ๋“œํ•˜๋Š” ๊ฑฐ์•ผ. ์•„๋‹ˆ๋ฉด, ์žฌ์‹œ์ž‘ ํ›„์— ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œ๋ฅผ \
์‹œ์ž‘ํ•  ๋•Œ, ์ด๋ฏธ ๋‹ค์šด๋กœ๋“œํ•œ 40%๋ฅผ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•ด์•ผ ํ•˜๋Š”์ง€?

์ด ๋ถ€๋ถ„์ด ํ—ท๊ฐˆ๋ฆฌ๋„ค. ๋ฌธ์ œ๋ฅผ ๋‹ค์‹œ ์ฝ์–ด๋ณด์ž. "๋‹ค์šด๋กœ๋“œ์˜ 40%๊ฐ€ ์ง„ํ–‰๋์„ ๋•Œ Windows๊ฐ€ ์—…๋ฐ์ดํŠธ ์„ค์น˜๋ฅผ ์œ„ํ•ด ์žฌ์‹œ์ž‘ํ•ฉ๋‹ˆ๋‹ค. ์ด ๊ณผ์ •์€ \
20๋ถ„์ด ๊ฑธ๋ฆฝ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๊ณ  ๋‚˜์„œ Carla๋Š” ๋‹ค์šด๋กœ๋“œ๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ์‹œ์ž‘ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค."

์—ฌ๊ธฐ์„œ "์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ์‹œ์ž‘ํ•ด์•ผ ํ•œ๋‹ค"๋Š” ๊ฒƒ์€, ์žฌ์‹œ์ž‘ ํ›„์— ๋‹ค์šด๋กœ๋“œ๋ฅผ ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ์‹œ์ž‘ํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, ์žฌ์‹œ์ž‘ ๊ณผ์ •์ด ๋๋‚˜๊ณ  ๋‚˜์„œ \
๋‹ค์šด๋กœ๋“œ๋ฅผ ๋‹ค์‹œ ์‹œ์ž‘ํ•˜๋Š”๋ฐ, ์ด๋•Œ ์ด๋ฏธ ๋‹ค์šด๋กœ๋“œํ•œ 40%๋ฅผ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•ด์•ผ ํ•˜๋Š” ๊ฑด๊ฐ€? ์•„๋‹ˆ๋ฉด, ์žฌ์‹œ์ž‘ ํ›„์—๋Š” ๋‚จ์€ 60%๋ฅผ ๋‹ค์šด๋กœ๋“œํ•˜๋Š” \
๊ฑด๊ฐ€?

๋ฌธ์ œ์˜ ๋งฅ๋ฝ์ƒ, ์žฌ์‹œ์ž‘์„ ์œ„ํ•ด ์‹œ์Šคํ…œ์ด ์žฌ์‹œ์ž‘๋˜๋ฉด ๋‹ค์šด๋กœ๋“œ๊ฐ€ ์ค‘๋‹จ๋˜๊ณ , ์žฌ์‹œ์ž‘์ด ์™„๋ฃŒ๋œ ํ›„์— ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œ๋ฅผ ์‹œ์ž‘ํ•˜๋Š”๋ฐ, ์ด๋•Œ๋Š” ์ฒ˜์Œ๋ถ€ํ„ฐ \
๋‹ค์‹œ ์‹œ์ž‘ํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, ์ด๋ฏธ ๋‹ค์šด๋กœ๋“œํ•œ ๋ถ€๋ถ„์€ ์œ ์ง€ํ•œ ์ฑ„๋กœ ๋‚จ์€ ๋ถ€๋ถ„์„ ๋‹ค์šด๋กœ๋“œํ•ด์•ผ ํ•  ๊ฒƒ ๊ฐ™์•„. ๊ทธ๋Ÿฐ๋ฐ ๋ฌธ์ œ์—์„œ๋Š” "์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ \
์‹œ์ž‘ํ•ด์•ผ ํ•œ๋‹ค"๊ณ  ํ–ˆ์œผ๋‹ˆ๊นŒ, ์žฌ์‹œ์ž‘ ํ›„์—๋Š” ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œ๋ฅผ ์‹œ์ž‘ํ•ด์„œ, ์ด๋ฏธ ๋‹ค์šด๋กœ๋“œํ•œ 40%๋ฅผ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•ด์•ผ ํ•˜๋Š” ์ƒํ™ฉ์ด ๋  ์ˆ˜ \
์žˆ๋‚˜?

๋งŒ์•ฝ ๊ทธ๋ ‡๋‹ค๋ฉด, ์žฌ์‹œ์ž‘ ํ›„์— ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œ๋ฅผ ์‹œ์ž‘ํ•  ๋•Œ, ์ด๋ฏธ 40%๊ฐ€ ๋‹ค์šด๋กœ๋“œ๋œ ์ƒํƒœ์—์„œ ์‹œ์ž‘ํ•˜๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ, ์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œ๋ฅผ \
์‹œ์ž‘ํ•ด์„œ 40%๋ฅผ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•ด์•ผ ํ•œ๋‹ค๋ฉด, ์‹œ๊ฐ„์ด ๋” ๊ฑธ๋ฆด ์ˆ˜ ์žˆ๊ฒ ์ง€. ํ•˜์ง€๋งŒ ๋ฌธ์ œ์—์„œ "์ฒ˜์Œ๋ถ€ํ„ฐ ๋‹ค์‹œ ์‹œ์ž‘ํ•ด์•ผ ํ•œ๋‹ค"๋Š” ํ‘œํ˜„์€ ์•„๋งˆ๋„ ์žฌ์‹œ์ž‘ \
๊ณผ์ •์ด ๋๋‚˜๊ณ  ๋‚˜์„œ ๋‹ค์šด๋กœ๋“œ๋ฅผ ๋‹ค์‹œ ์‹œ์ž‘ํ•˜๋Š”๋ฐ, ์ด๋•Œ๋Š” ์ด๋ฏธ ๋‹ค์šด๋กœ๋“œํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค์‹œ ๋‹ค์šด๋กœ๋“œํ•˜๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ, ๋‚จ์€ ๋ฐ์ดํ„ฐ๋ฅผ ๋‹ค์šด๋กœ๋“œํ•˜๋Š” \
๊ฒƒ์„ ์˜๋ฏธํ•  ์ˆ˜๋„ ์žˆ์–ด.

(ํ•˜๋žต)...
  • ์˜์–ด ๋ชจ๋ธ์—์„œ๋Š” ํ‹€๋ฆฌ๋”๋ผ๋„ ํ’€์ด๊ณผ์ •์ด ํ‹€๋ ค์„œ ํ‹€๋ฆฐ ๋‹ต์ด ๋‚˜์˜ฌ ๋ฟ, ์ด ์ •๋„๋กœ ์ด์ƒํ•œ ๋…ผ๋ฆฌ๋ฅผ ๋ฐ˜๋ณตํ•˜์ง€๋Š” ์•Š์Œ

Future work

  • ์›๋ž˜ EXAONE-Deep์ด ๊ฐ€์ง€๊ณ  ์žˆ์—ˆ๋˜, ๊ฐ„๋‹จํ•œ ์งˆ๋ฌธ์— ๋Œ€ํ•ด์„œ๋Š” ์ƒ๊ฐํ•˜์ง€ ์•Š๋Š” ๋Šฅ๋ ฅ์„ ์œ ์ง€์‹œํ‚ค๊ธฐ
  • Merge, GRPO ๋“ฑ์˜ Post Training์œผ๋กœ ์ถ”๋ก  ์ •ํ™•๋„ ๊ฐœ์„ 
  • ์ถ”๋ก  ๊ธธ์ด ์ตœ์ ํ™”
Downloads last month
13
Safetensors
Model size
7.82B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support