The robust European language model benchmark.
Python 133 36
There was an error while loading. Please reload this page.
Collection of all evaluation results from the EuroEval framework.
Loading…