model_info: name: anemll-Meta-Llama-3.2-1B-ctx2048 version: 0.1.2 description: | Demonstarates running Meta-Llama-3.2-1B on Apple Neural Engine Context length: 2048 Batch size: 128 Chunks: 2 license: MIT author: Anemll framework: Core ML language: Python parameters: context_length: 2048 batch_size: 128 lut_embeddings: none lut_ffn: 4 lut_lmhead: 4 num_chunks: 2 model_prefix: llama