lex-au commited on
Commit
9319b14
·
verified ·
1 Parent(s): e6d2768

Push model card

Browse files
Files changed (1) hide show
  1. README.md +123 -3
README.md CHANGED
@@ -1,3 +1,123 @@
1
- ---
2
- license: llama3
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ base_model:
4
+ - meta-llama/Meta-Llama-3-8B-Instruct
5
+ tags:
6
+ - Conversation
7
+ - Roleplay
8
+ - Spatial
9
+ - Temporal
10
+ ---
11
+
12
+ # 🧠 Model Card: LLaMA 3 8B Instruct – Conversational Roleplay Enhanced
13
+
14
+ **Model Name:** `lex-au/vocalis-q4_k_m.gguf`
15
+ **Base Model:** Meta LLaMA 3 8B Instruct
16
+ **Fine-Tuned By:** Lex (lex-au)
17
+ **Model Type:** Causal Decoder Transformer
18
+ **License:** Non-commercial (see Meta LLaMA 3 license)
19
+
20
+ ---
21
+
22
+ ## ✨ Overview
23
+
24
+ `vocalis-q4_k_m.gguf` is a fine-tuned and quantised variant of Meta’s LLaMA 3 8B Instruct model, purpose-built for **immersive roleplaying** and **low-latency conversational AI** in the [Vocalis web app](https://github.com/Lex-au/Vocalis). This model tracks **spatial and temporal context** across turns, enabling natural, persistent narrative flows and highly dynamic character interactions.
25
+
26
+ Designed for **low-resource environments**, it balances responsiveness with rich generative quality, making it ideal for real-time applications.
27
+
28
+ ---
29
+
30
+ ## 🛠 Fine-Tuning Objective
31
+
32
+ - **Purpose:** Enable vivid storytelling and contextually grounded interactions for roleplay-based use cases.
33
+ - **Focus Areas:**
34
+ - Spatial awareness (tracking objects, locations, movement)
35
+ - Temporal sequencing (event continuity, time-of-day, memory within thread)
36
+ - Expressive language generation with adaptable tone
37
+ - **Fine-Tuning Strategy:** LoRA-based adaptation on multi-turn dialogues, fantasy/sci-fi roleplay transcripts, and narrative world-state tracking prompts.
38
+ - **Deployment Goal:** Minimal latency, low memory footprint — optimised for running inside [Vocalis](https://github.com/Lex-au/Vocalis)
39
+
40
+ ---
41
+
42
+ ## 🗣 Use Cases
43
+
44
+ - Text-based RPG narration
45
+ - Interactive fiction engines
46
+ - AI-driven NPCs for browser games
47
+ - Persistent-world simulation bots
48
+ - Web-deployed immersive chat agents
49
+
50
+ ---
51
+
52
+ ## ✅ Strengths
53
+
54
+ - Maintains spatial & narrative continuity across turns
55
+ - Generates richly descriptive, tone-sensitive prose
56
+ - Adapts easily to character tone and dynamic environments
57
+ - Ideal for 1st-person, 2nd-person, or multi-speaker scenarios
58
+
59
+ ---
60
+
61
+ ## ⚠️ Known Limitations
62
+
63
+ - Context window constraints may cause memory loss over extended turns
64
+ - Can over-describe or hallucinate detail in ambiguous settings
65
+ - Not suited for task-based or factual Q&A applications
66
+ - No long-term memory or external knowledge integration
67
+
68
+ ---
69
+
70
+ ## 💬 Example Interaction
71
+
72
+ **User:**
73
+ You enter the crumbling temple. Where’s the light coming from?
74
+
75
+ **Assistant:**
76
+ _As you step into the ancient temple, a soft, ethereal glow emanates from within its walls. The light seems to seep through the weathered stone, casting an otherworldly illumination on the dusty artifacts and worn carvings that line the passageway._
77
+
78
+ _Upon closer inspection, you notice that the light is coming from a series of intricate, glowing symbols etched into the ceiling above. These mystical runes pulse with a gentle, blue-green luminescence, their soft radiance casting an enchanting atmosphere throughout the temple's interior._
79
+
80
+ _The air inside the temple feels heavy with age and mystery, yet the soothing light from the celestial symbols imbues the space with a sense of peace and ancient wisdom. As you venture deeper into the labyrinthine structure, the glow seems to grow stronger, guiding you towards secrets that have lain dormant for centuries within the sacred halls._
81
+
82
+ ---
83
+
84
+ ## 📎 Metadata
85
+
86
+ - **Format:** GGUF
87
+ - **Quantisation:** Q4_K_M
88
+ - **Optimised For:** Real-time inference (Vocalis-compatible)
89
+ - **Recommended Frontends:** OpenWebUI, KoboldCPP, LM Studio, or custom web clients (e.g., Flask/Vite apps)
90
+
91
+ ---
92
+
93
+ ## 🔗 Project Link
94
+
95
+ > 🧪 This model was fine-tuned specifically for the [**Vocalis** webapp project](https://github.com/Lex-au/Vocalis) — a low-latency, voice-enabled AI assistant platform.
96
+
97
+ ---
98
+
99
+ ## 📖 Citation & Attribution
100
+
101
+ The base model used for fine-tuning is **Meta’s LLaMA 3 8B Instruct**. For licensing, refer to [Meta's license terms](https://ai.meta.com/resources/models-and-libraries/llama-downloads/).
102
+
103
+ This repository contains a quantised version optimised for low-latency use in the Vocalis real-time webapp.
104
+
105
+ If you use this model in your research or application, please cite:
106
+
107
+ ```
108
+ @misc{llama3-8b-2024,
109
+ author = {Meta AI},
110
+ title = {LLaMA 3 8B Instruct},
111
+ year = {2024},
112
+ publisher = {Meta},
113
+ howpublished = {\url{https://ai.meta.com/llama/}}
114
+ }
115
+
116
+ @misc{vocalis-roleplay-2025,
117
+ author = {Lex-au},
118
+ title = {Vocalis-Q4_K_M: Fine-Tuned LLaMA 3 for Conversational Roleplay},
119
+ note = {Low-latency roleplay model optimised for use in the Vocalis webapp},
120
+ year = {2025},
121
+ howpublished = {\url{https://github.com/Lex-au/Vocalis}}
122
+ }
123
+ ```