streamlit
llama-cpp-python
huggingface-hub
transformers
torch
bitsandbytes
accelerate