eval-leaderboard/test-yu.json

16 lines
317 B
JSON
Raw Normal View History

2025-07-18 07:26:26 +00:00
{
"id": "test-yu",
"description": "ChatGPT 3.5",
"owner": "",
"results": {
"ceval": 52.5,
"agieval": 39.9,
"mmlu": 69.1,
"GaokaoBench": 51.1,
"triviaqa": 63.8,
"hellaswag": 79.5,
"cmmlu": 53.9,
"C3": 85.6,
"lambada": 57.5
}
}