Centro Interdipartimentale Mente/Cervello
Efficiency as an Inductive Bias: Towards Tokenizer-free and Dynamically Sparse Language Models
2nd International Workshop of the Signature Initiative