Whats the closest modeling code?
#9
by
michaelfeil
- opened
For frameworks support, from which model should be best start modeling (llama, bert, qwen)
Hello, the easiest model to start from is Llama, as EuroBERT follows the Llama architecture, but without the causal mask.
What are other key differences besides causal=False?