πŸ›• Tamil Nadu Heritage Knowledge LLM

A domain-specific Language Model dedicated to capturing the rich cultural, architectural, and historical heritage of Tamil Nadu. This LLM is fine-tuned on a custom dataset and designed to answer questions or generate content related to Tamil Nadu's temples, history, monuments, and culture.


πŸš€ Project Status: Early Development Phase

This project is in its starting phase, and we welcome contributors to expand the dataset, improve the model, and help build a comprehensive open-source heritage model.


πŸ“¦ Base Model


πŸ—‚ Dataset

  • Source: Boobalamurugan/tn-heritage-sites-dataset
  • Size: ~1.93K entries
  • Description: This dataset contains information about heritage sites in Tamil Nadu, including temples, historical monuments, kings, architecture, and festivals.

πŸ“’ The dataset is still small. Feel free to contribute and help us grow this valuable resource!


🧠 Model Objective

  • Build a knowledgeable and culturally aware assistant focused on Tamil Nadu heritage.
  • Answer factual questions about heritage sites, kings, festivals, inscriptions, architecture, etc.
  • Generate informative content or summaries for educational or cultural purposes.

✨ Features (Planned)

  • βœ… Q&A on Tamil Nadu temples, kings, and architecture
  • βœ… Context-aware content generation (e.g., temple descriptions, cultural significance)
  • πŸ”„ Summarization of historical texts (Coming Soon)
  • πŸ”„ Integration into web or mobile apps (Planned)

🀝 How to Contribute

πŸ“ Dataset Contributions

  • Fork the dataset repository
  • Add more entries (temples, kings, historical facts)
  • Submit a pull request with the added data

🧠 Model Training

  • Fine-tune the model with additional data
  • Improve prompt formatting and pre-processing
  • Evaluate the model’s responses

πŸ“ License

This project is licensed under the Apache 2.0 License β€” free for commercial and non-commercial use with proper attribution.


🌐 Tags

#tamil-heritage #open-source #language-model #llm #tamil-nadu #text-generation


πŸ“¬ Stay Connected

For updates and discussions:

  • Follow the project creator: @Boobalamurugan
  • Join our community forum (Coming Soon)
Downloads last month
85
GGUF
Model size
8.03B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for Boobalamurugan/TN_Heritage_LLM

Quantized
(1)
this model

Dataset used to train Boobalamurugan/TN_Heritage_LLM