GPT MT Benchmark

cabreraalex

20.2k
4

TruthfulQA

a13x

817
8

The TruthfulQA task (https://arxiv.org/abs/2109.07958) as used in the Open LLM Leaderboard.

What does the Open LLM Leaderboard measure?

a13x

21

An investigation of the Open LLM Leaderboard and why you should double-check before using the top-ranked model.

GPT MT Benchmark Report

cabreraalex

14

Explore how LLMs compare to dedicated machine translation models, particularly for low-resource languages.