Hi @k10 , marian type models are not yet supported by optimum-neuron. To add its cache, we will need to add the export and inference support for it first.
I opened a ticket here, feel free to pick the task up if you want to contribute!
Your need to confirm your account before you can post a new comment.