Update modeling_phi3.py
#102 opened 27 days ago
by
TarunSinghal
Update modeling_phi3.py for compatibility with transformers 4.49
1
1
#100 opened about 2 months ago
by
sylwia-kuros

Config class problems
#99 opened 3 months ago
by
VityaVitalich

Request: DOI
#98 opened 4 months ago
by
Hlias11
Phi-3 is not generating from input embeddings [BUG]RuntimeError: shape '[-1, 0]' is invalid for input of size 5
#97 opened 6 months ago
by
Ryz3n758
Thanks for Phi 3 mini and your inclusion of MedQA benchmark in your testing!
1
#95 opened 9 months ago
by
Hugman2345
KV cahing problem during the inference loop
#94 opened 9 months ago
by
mohamedlotfy50
Issue with Phi-3 Mini 4K Instruct Response Format
1
1
#93 opened 9 months ago
by
adameda
Is it required to add a BOS token?
1
#92 opened 9 months ago
by
iarbel
Thanks for sharing this model, I use it in my open source app for synthetic data :)
2
1
#91 opened 9 months ago
by
lhoestq

phi3 4K vs phi3 4K. dupplcated name on the leaderboard
#89 opened 10 months ago
by
bedio
Please add more AutoModel Mapping
#88 opened 10 months ago
by
qcqced
phi3 4K vs 128K
4
3
#87 opened 10 months ago
by
Emilio
CUDA error when using the code example with pipeline provided on the model page
2
#86 opened 10 months ago
by
saurabhkumar
Issues with llamacpp/LM studio and ollama
9
#85 opened 10 months ago
by
rombodawg

Model doesn't seem to tokenize new lines in chat template?
1
6
#84 opened 10 months ago
by
bartowski

Underreported HumanEval Scores?
2
#83 opened 10 months ago
by
VaibhavSahai
Uploaded GGUF and exl2 as Phi 3.1
7
5
#80 opened 10 months ago
by
bartowski

Thanks for the updated version!
4
#78 opened 10 months ago
by
Nafnlaus
The model stops after generating one new token
1
2
#76 opened 10 months ago
by
rajiv-data-chef
Create the tokenizer.json properly (with TemplateProcessing included).
#75 opened 10 months ago
by
Narsil

Jetson nano
#74 opened 11 months ago
by
idotr7
fixed generation_args in Sample inference code
1
#73 opened 11 months ago
by
dkleine

Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
2
#72 opened 11 months ago
by
Kenkentron
tflite convertion
1
#71 opened 11 months ago
by
henrywang0314
fine-tuning with structured data set
1
2
#68 opened 11 months ago
by
don412
fp16 normal weights
#62 opened 12 months ago
by
gioaca
Recent change on the rstrip property on special tokens
3
1
#59 opened 12 months ago
by
xxhansh
Help with merging LoRA layers back onto Phi3
1
#55 opened 12 months ago
by
SHIMURA0321
