Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
1.16k
Follow
Microsoft
10.3k
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
46
Train
Use this model
Update README.md
#38
by
daekeun-ml
- opened
6 days ago
base:
refs/heads/main
←
from:
refs/pr/38
Discussion
Files changed
+120
-0
daekeun-ml
6 days ago
Add Appendix. B: Fine-tuning Korean speech
See translation
Update README.md
0efc8967
daekeun-ml
changed pull request status to
closed
5 days ago
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Your need to confirm your account before you can post a new comment.
Comment
·
Sign up
or
log in
to comment