Ragged attention supported in vLLM
#18 by patrickvonplaten - opened
Will you add interleaved_sliding_window to the HF config.json as well? Are we going to use this parameter going forward?
patrickvonplaten changed pull request status to merged