Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mesolitica 's Collections
Malaysian Finetuned Instruct
MaLLaM 🌙
Malaysian CausalLM
Malaysian LLM2Vec
Malaysian Seq2Seq
Malaysian MaskLM
Malaysian pretraining dataset
Malay instructions dataset
Malaysian synthetic dataset
Speech-to-Text dataset
Malaysian Whisper
Malaysian Text-to-Speech
Visual Multimodal dataset
Audio Multimodal dataset

MaLLaM 🌙

updated Dec 23, 2024

Pretrain from scratch 4096 context length on 90B tokens Malaysian text, https://huggingface.co/papers/2401.14680

Upvote
14

  • mesolitica/mallam-1.1B-4096

    Text Generation • Updated Oct 7, 2024 • 573 • 8

  • mesolitica/mallam-3B-4096

    Text Generation • Updated Oct 7, 2024 • 39 • 1

  • mesolitica/mallam-5B-4096

    Text Generation • Updated Oct 13, 2024 • 78 • 2

  • mesolitica/mallam-1.1b-20k-instructions

    Text Generation • Updated Dec 19, 2023 • 21 • 1

  • mesolitica/mallam-1.1b-20k-instructions-v2

    Text Generation • Updated Jan 25, 2024 • 26

  • mesolitica/mallam-3b-20k-instructions

    Text Generation • Updated Dec 16, 2023 • 3

  • mesolitica/mallam-5b-20k-instructions

    Text Generation • Updated Dec 17, 2023 • 13 • 1

  • mesolitica/mallam-5b-20k-instructions-v2

    Text Generation • Updated Jan 25, 2024 • 5 • 1
Upvote
14
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs