|
--- |
|
library_name: transformers |
|
tags: |
|
- chat |
|
- tool-calling |
|
- instruction-tuned |
|
- edge-device |
|
- conversational |
|
license: mit |
|
datasets: |
|
- Open-Orca/OpenOrca |
|
base_model: |
|
- HuggingFaceTB/SmolLM2-360M-Instruct |
|
--- |
|
|
|
# Model Card for `thirdeyeai/elevate-360m` |
|
|
|
## Model Summary |
|
|
|
`elevate-360m` is a 360M-parameter transformer model fine-tuned for efficient chat completion and tool-call prediction on edge devices, making it suitable for low-latency applications.
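
A minimal usage sketch with the standard `transformers` API (the prompt and generation settings below are illustrative, not tuned recommendations):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "thirdeyeai/elevate-360m"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Build a chat prompt using the model's built-in chat template.
messages = [{"role": "user", "content": "What is the capital of France?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

# Greedy decoding keeps latency predictable on constrained hardware;
# max_new_tokens here is illustrative.
outputs = model.generate(inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```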
|
|
|
## Model Details |
|
|
|
- **Developed by:** Thirdeye AI |
|
- **Finetuned from model:** HuggingFaceTB/SmolLM2-360M-Instruct |
|
- **Model type:** Causal decoder-only transformer |
|
- **Language(s):** English |
|
- **License:** mit
|
- **Hardware:** Trained on 1x A100 GPU |
|
- **Training time:** < 24 hours |
|
|
|
## Model Sources |
|
|
|
- **Repository:** https://huggingface.co/thirdeyeai/elevate-360m |
|
|
|
## Uses |
|
|
|
### Direct Use |
|
|
|
Primarily intended for chat completion and tool-call prediction in resource-constrained edge environments, as sketched below.
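
For tool-call prediction, the following is a sketch assuming the model's chat template accepts the `tools` argument supported by recent `transformers` versions; the `get_weather` function is a hypothetical example tool, not part of this model:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "thirdeyeai/elevate-360m"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

def get_weather(city: str) -> str:
    """Get the current weather for a city.

    Args:
        city: Name of the city to look up.
    """
    ...  # hypothetical tool; transformers derives its JSON schema from this signature

messages = [{"role": "user", "content": "What's the weather in Paris right now?"}]
inputs = tokenizer.apply_chat_template(
    messages,
    tools=[get_weather],          # advertise the tool to the model
    add_generation_prompt=True,
    return_tensors="pt",
)

outputs = model.generate(inputs, max_new_tokens=128, do_sample=False)
# The model is expected to emit a structured tool call naming get_weather and
# its arguments; the caller parses it, runs the tool, and returns the result.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```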
|
|
|
### Out-of-Scope Use |
|
|
|
The model is not optimized for multilingual use, long-context reasoning, or open-ended generation without tool grounding.
|
|
|
## Bias, Risks, and Limitations |
|
|
|
The model was trained on publicly available instruction-following datasets and may reflect biases present in those datasets. It is not suitable for high-stakes or safety-critical applications.
|
|
|
### Recommendations |
|
|
|
Deploy only after task-specific evaluation and with appropriate safety checks in place. Validate model outputs, especially predicted tool calls, before acting on them.