Commit c1fc40e (verified) by akhilfau · Parent: caabd76

Create README.md

Files changed (1): README.md (+98 −0)
# smollm2-360m-physics-gguf

**Author**: [Akhil Vallala](https://www.linkedin.com/in/akhil-fau)  
**Base Model**: [`akhilfau/fine-tuned-smolLM2-360M-with-on-combined_Instruction_dataset`](https://huggingface.co/akhilfau/fine-tuned-smolLM2-360M-with-on-combined_Instruction_dataset)  
**Architecture**: LLaMA (SmolLM2)  
**Parameter Count**: 362M  
**Format**: GGUF (Q4_K_M, Q8_0, FP16)  
**License**: Apache 2.0  
**Model Type**: Instruction-tuned Small Language Model (SLM)  
**Use Case**: Solving physics word problems on mobile devices

## Model Overview

This GGUF model is a quantized version of the **Tiny-Physics** model, based on SmolLM2-360M and fine-tuned for **physics word problem solving** using both real and synthetic datasets. It is designed to deliver accurate, low-latency performance on **mobile and edge devices**.

## Datasets Used

- 📘 **camel-ai/physics**: Publicly available dataset with 20,000+ physics QA pairs
- 📘 **Seed dataset**: Extracted from *1000 Solved Problems in Classical Physics*
- 🧠 **Synthetic dataset**: 6,279 rigorously validated question–answer pairs generated using a GPT-4o-based multi-agent system

These datasets were formatted for instruction tuning using structured prompt–response pairs.
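As a sketch of what such formatting can look like, here is a minimal example of turning a QA pair into a prompt–response record. The field names and prompt layout are hypothetical illustrations, not the actual schema used for this model:

```python
def format_example(question: str, answer: str) -> dict:
    """Format one QA pair as a prompt-response record for instruction
    tuning. The "### ..." section markers and dict keys are illustrative
    assumptions, not the card's actual preprocessing."""
    return {
        "prompt": (
            "### Instruction:\nSolve the following physics problem.\n\n"
            f"### Question:\n{question}\n\n### Response:\n"
        ),
        "response": answer,
    }

record = format_example(
    "A 2 kg mass is dropped from rest. What is its acceleration?",
    "Near Earth's surface it accelerates at g ≈ 9.8 m/s², independent of mass.",
)
print(record["prompt"])
```

A tuning library such as TRL's `SFTTrainer` can then consume records like these as supervised pairs.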

## Training Details

- **Model**: SmolLM2-360M
- **Fine-tuning**: Instruction fine-tuning with LoRA (Low-Rank Adaptation)
- **Libraries**: Hugging Face Transformers, TRL, Lighteval
- **Training Epochs**: 3 (best accuracy observed at 3–5 epochs)
- **Fine-tuning Objective**: Maximize performance on MMLU College Physics
- **Best Model Accuracy**: `24.51%` on MMLU College Physics
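A quick sense check on why LoRA keeps fine-tuning cheap: for a weight matrix of shape `d_out × d_in`, LoRA trains only two rank-`r` factors instead of the full matrix. The rank and hidden size below are assumptions for illustration (r = 16; 960 is the SmolLM2-360M hidden size, which is not stated in this card):

```python
def lora_trainable_params(d_out: int, d_in: int, r: int) -> int:
    """LoRA replaces a full d_out x d_in weight update with two factors
    A (r x d_in) and B (d_out x r), so only r * (d_in + d_out) weights train."""
    return r * (d_in + d_out)

d = 960  # assumed SmolLM2-360M hidden size
r = 16   # illustrative LoRA rank
full = d * d
lora = lora_trainable_params(d, d, r)
print(f"full update: {full:,} params; LoRA update: {lora:,} params "
      f"({100 * lora / full:.1f}% of full)")
```

At these settings a single attention projection trains roughly 3% of the parameters a full update would, which is what makes local-GPU fine-tuning of even a 362M model inexpensive.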

## Evaluation

**Evaluated with**: [Lighteval](https://github.com/huggingface/lighteval)  
**Benchmark**: [MMLU College Physics](https://huggingface.co/datasets/hendrycks_test)  
**Performance**:

| Dataset                      | Accuracy (SmolLM2-360M-Instruct) |
|------------------------------|----------------------------------|
| MMLU: College Physics        | 24.51%                           |
| Instruction-Tuned camel-ai   | 25.49%                           |
| Combined Instruction Dataset | 24.51%                           |
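The reported percentages correspond to whole-question counts on the MMLU College Physics test split, which has 102 questions (a property of the public benchmark, assumed here rather than stated in this card):

```python
# MMLU College Physics test split size (assumed from the public benchmark)
TOTAL = 102

def accuracy(correct: int, total: int = TOTAL) -> float:
    """Percentage accuracy rounded to two decimals, as in the table above."""
    return round(100 * correct / total, 2)

print(accuracy(25))  # 25/102 -> 24.51
print(accuracy(26))  # 26/102 -> 25.49
```

So the 24.51% and 25.49% rows differ by exactly one correctly answered question.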

## GGUF Quantization

The model is provided in multiple quantization formats:

| Format   | Size   | Accuracy Retention | Inference Speed | RAM Usage  | Target Use                         |
|----------|--------|--------------------|-----------------|------------|------------------------------------|
| `Q4_K_M` | ~271MB | ~95–97%            | Fast            | ~600–800MB | Ideal for mid-range mobile devices |
| `Q8_0`   | ~386MB | ~99%               | Medium          | ~1–1.5GB   | Best for higher-end devices        |
| `FP16`   | ~800MB | 100%               | Slow            | ~2GB+      | Reference use only                 |
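The sizes in the table follow roughly from parameter count times bits per weight. A rough estimator (it ignores GGUF metadata and the tensors llama.cpp keeps at higher precision, such as embeddings, which is why the Q4_K_M file is larger than a naive 4-bit estimate):

```python
def gguf_size_mb(n_params: float, bits_per_weight: float) -> float:
    """Rough GGUF file-size estimate: parameters x bits per weight.
    Ignores metadata and mixed-precision tensors, so real files run larger."""
    return n_params * bits_per_weight / 8 / 1e6

N = 362e6  # parameter count from this card
print(f"FP16 ~{gguf_size_mb(N, 16):.0f} MB")   # vs ~800MB in the table
print(f"Q8_0 ~{gguf_size_mb(N, 8.5):.0f} MB")  # Q8_0 stores ~8.5 bits/weight
```

The Q8_0 estimate (~385MB) lands almost exactly on the table's ~386MB, since Q8_0 blocks store 8 bits per weight plus a per-block scale.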

## How to Use

```bash
# Using llama.cpp (newer builds name this binary `llama-cli`)
./main -m smollm2-360m-physics-gguf.Q4_K_M.gguf -p "What is the acceleration of a 2kg mass falling from 5 meters?"
```

Or via `llama-cpp-python`:

```python
from llama_cpp import Llama

llm = Llama(model_path="smollm2-360m-physics-gguf.Q4_K_M.gguf")
output = llm("What is the potential energy of a 3kg object at 10 meters?")
print(output["choices"][0]["text"])
```
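When prompting through raw completion APIs like the ones above, wrapping the question in the model's chat template usually improves instruction following. SmolLM2 instruct models use a ChatML-style template; the exact markup below is an assumption inherited from the base model (check the tokenizer's `chat_template` to be sure), and the system message is illustrative:

```python
def chatml_prompt(question: str,
                  system: str = "You are a helpful physics tutor.") -> str:
    """Wrap a question in the ChatML-style template used by SmolLM2
    instruct models (assumed; verify against the tokenizer's chat_template)."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{question}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

print(chatml_prompt("What is the potential energy of a 3kg object at 10 meters?"))
```

The resulting string can be passed as the `-p` argument to llama.cpp or as the prompt to `Llama(...)` above.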

## Intended Use

- 📚 **Physics tutoring apps**
- 📶 **Offline mobile inference**
- 🧑‍🏫 **Educational tools for conceptual reasoning**
- 🔋 **Low-power deployment scenarios**

## Limitations

- Not trained on multiple-choice formatted data (MCQ output mismatch possible)
- Topic imbalance in datasets may affect generalization
- Not suitable for non-physics or open-domain tasks

## Carbon Footprint

Training and fine-tuning consumed approximately **2.64 kg CO₂e**, equivalent to a ~7-mile car ride. This was achieved using local GPU resources (an RTX A5500) and energy-efficient batch tuning with LoRA.
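The car-ride comparison is consistent with a typical passenger-car emission factor of about 0.4 kg CO₂e per mile (an assumed average, not stated in this card):

```python
EMISSIONS_KG = 2.64  # total training emissions from this card
KG_PER_MILE = 0.4    # assumed average passenger-car emission factor

miles = EMISSIONS_KG / KG_PER_MILE
print(f"~{miles:.1f} miles of driving")
```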

## Citation

```bibtex
@misc{vallala2025tinyphysics,
  title={Tiny-Physics: A Compact Large Language Model for Physics Word Problems on Mobile Devices},
  author={Akhil Vallala},
  year={2025},
  howpublished={\url{https://huggingface.co/akhilfau/smollm2-360m-physics-gguf}},
}
```