🧠 HyperCLOVAX-SEED-Counseling

This model is a fine-tuned version of naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-1.5B, specialized for empathetic counseling for teenagers.


πŸ” Model Overview

  • Base model: naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-1.5B
  • Fine-tuning objective: Provide warm, non-judgmental, and emotionally supportive counseling responses tailored to youth clients.
  • Language: Korean (ν•œκ΅­μ–΄)

The model has been trained on real and synthetic conversations between counselors and teenage clients. It emphasizes:

  • Empathy and emotional validation (e.g., "κ·Έλž¬κ΅¬λ‚˜", "μΆ©λΆ„νžˆ 이해돼")
  • Open-ended questions for self-exploration
  • Avoiding direct advice or judgment
  • Handling crisis situations with safe referrals

πŸ§‘β€βš•οΈ System Prompt Guideline

The system message used during training and inference is:

당신은 곡감 λŠ₯λ ₯이 λ›°μ–΄λ‚œ μ „λ¬Έ μ²­μ†Œλ…„ μƒλ‹΄μ‚¬μž…λ‹ˆλ‹€.  
(μ€‘λž΅: λ”°λœ»ν•˜κ³  곡감적인 상담 λŒ€ν™” κ·œμΉ™ 포함)

This ensures the assistant maintains a friendly, safe, and supportive tone.


πŸ’¬ Inference Example

from transformers import AutoTokenizer, AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Daehoya/HyperCLOVAX-SEED-Counseling")
tokenizer = AutoTokenizer.from_pretrained("Daehoya/HyperCLOVAX-SEED-Counseling")

prompt = "μš”μ¦˜ μΉœκ΅¬λ“€κ³Ό λ©€μ–΄μ§„ 것 κ°™μ•„..."

inputs = tokenizer.apply_chat_template(
    [
        {"role": "system", "content": "당신은 곡감 λŠ₯λ ₯이 λ›°μ–΄λ‚œ μ „λ¬Έ μ²­μ†Œλ…„ μƒλ‹΄μ‚¬μž…λ‹ˆλ‹€."},
        {"role": "user", "content": prompt}
    ],
    return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=300, temperature=0.7)
print(tokenizer.decode(output[0], skip_special_tokens=True))

πŸ§ͺ Training Details

  • Optimizer: AdamW
  • Batch size: 3 per device (gradient_accumulation_steps=20)
  • Epochs: 3
  • Max input length: up to 8192 tokens
  • Hardware: 4Γ—A100 GPUs
  • Precision: FP16
  • Framework: transformers.Trainer

πŸ“ Files Included

  • pytorch_model.bin or model-*.safetensors: Model weights
  • tokenizer.json, tokenizer_config.json: Tokenizer files
  • config.json: Model config
  • generation_config.json: Sampling configuration
  • README.md: This file

πŸ“œ License

This model is released under the same license as the base model. Please review NAVER CLOVA's licensing policy.


πŸ™ Acknowledgements

Thanks to NAVER CLOVA for the base model and the community for ongoing contributions in mental health AI.

Downloads last month
10
Safetensors
Model size
1.59B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support