The MAMBA model with a language modeling head on top (a linear layer with weights tied to the input embeddings).
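As a rough illustration of what "weights tied to the input embeddings" means, the sketch below uses plain Python lists rather than any real library. The names `shared_weights`, `embed`, and `lm_head`, and the toy sizes, are assumptions made for this example and are not taken from the MAMBA implementation. The point is only that one matrix serves both as the embedding table and as the output projection.

```python
# Hypothetical toy sizes; not taken from any real MAMBA checkpoint.
VOCAB_SIZE = 4
HIDDEN_SIZE = 3

# One shared matrix of shape (VOCAB_SIZE, HIDDEN_SIZE).
# Row i is the embedding vector of token i.
shared_weights = [
    [0.1, 0.0, 0.2],
    [0.0, 0.3, 0.1],
    [0.5, 0.1, 0.0],
    [0.2, 0.2, 0.2],
]


def embed(token_id, weights):
    """Input embedding: look up the row for token_id."""
    return weights[token_id]


def lm_head(hidden, weights):
    """Language modeling head: compute hidden @ weights^T.

    Returns one logit per vocabulary token, reusing the same
    matrix that the embedding lookup reads from.
    """
    return [sum(h * w for h, w in zip(hidden, row)) for row in weights]


# Stand-in for the backbone output: here we simply pass the embedding through.
hidden = embed(2, shared_weights)
logits = lm_head(hidden, shared_weights)
```

Because both functions read the same `shared_weights` object, any update to it changes the embedding and the output projection together, which is the practical effect of weight tying.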