---
language:
- ms
library_name: transformers
---

# Safe for Work Classifier Model for Malaysian Data

The current version supports Malay. We are working towards also supporting English and Indonesian.

The base model is fine-tuned from https://huggingface.co/mesolitica/malaysian-mistral-191M-MLM-512 on Malaysian NSFW data.

Data Source: https://huggingface.co/datasets/malaysia-ai/Malaysian-NSFW

GitHub Repo: https://github.com/malaysia-ai/sfw-classifier

Project Board: https://github.com/orgs/malaysia-ai/projects/6

Current Labels Available:

- religion insult
- sexist
- racist
- psychiatric or mental illness
- harassment
- safe for work
- porn
- self-harm
- violence
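
The index-to-label mapping is stored in the checkpoint's config. A minimal way to inspect it (this assumes the repo populates `id2label`, as `transformers` sequence-classification checkpoints normally do):

```python
from transformers import AutoConfig

# Print the index-to-label mapping shipped with the checkpoint.
config = AutoConfig.from_pretrained('malaysia-ai/malaysian-sfw-classifier')
print(config.id2label)
```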

### How to use

```python
# MistralForSequenceClassification comes from the project's classifier.py
# (see the GitHub repo linked above).
from classifier import MistralForSequenceClassification

model = MistralForSequenceClassification.from_pretrained('malaysia-ai/malaysian-sfw-classifier')
```
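
A minimal inference sketch, not an official example: it assumes the model repo ships a compatible tokenizer and that `config.id2label` carries the label names listed above.

```python
import torch
from transformers import AutoTokenizer

# Assumption: the model repo includes the tokenizer files.
tokenizer = AutoTokenizer.from_pretrained('malaysia-ai/malaysian-sfw-classifier')

text = 'contoh teks untuk dikelaskan'  # hypothetical Malay input
inputs = tokenizer(text, return_tensors='pt')
with torch.no_grad():
    logits = model(**inputs).logits  # `model` from the snippet above
label_id = logits.argmax(dim=-1).item()
print(model.config.id2label[label_id])
```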

### Evaluation

```
                              precision    recall  f1-score  support

                       racist   0.85034   0.90307   0.87591     1661
              religion insult   0.86809   0.85966   0.86386     3399
psychiatric or mental illness   0.94707   0.84181   0.89134     5803
                       sexist   0.80637   0.78992   0.79806     1666
                   harassment   0.83653   0.88663   0.86085      935
                         porn   0.96709   0.95341   0.96020     1202
                safe for work   0.80132   0.91232   0.85322     3205
                    self-harm   0.85751   0.90934   0.88267      364
                     violence   0.77521   0.84049   0.80653     1235

                     accuracy                       0.86754    19470
                    macro avg   0.85661   0.87741   0.86585    19470
                 weighted avg   0.87235   0.86754   0.86822    19470
```