streamlit
transformers
torch
tokenizers
langdetect
huggingface_hub
pandas
nltk
sentencepiece