Yanolja Arena
Yanolja Arena helps find the best LLMs for summarizing and translating text. It compares two randomly selected models at a time and scores them with an Elo rating system.
This is an open-source project. Check it out on GitHub.
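
For readers unfamiliar with Elo ratings, here is a minimal sketch of how a single vote could move two models' scores. It uses the textbook Elo formula. The K-factor of 32, the 400-point scale, and the function names `expected_score` and `update_elo` are assumptions for illustration, not values or code taken from the project.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Probability that model A is preferred over model B under the Elo model.

    The 400-point scale is the conventional default (an assumption here).
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


def update_elo(
    rating_a: float,
    rating_b: float,
    score_a: float,
    k: float = 32.0,  # assumed K-factor; the arena's real value may differ
) -> tuple[float, float]:
    """Return the new (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A's output won the vote, 0.0 if B's won, 0.5 for a tie.
    """
    delta = k * (score_a - expected_score(rating_a, rating_b))
    # Elo is zero-sum: whatever A gains, B loses.
    return rating_a + delta, rating_b - delta


# Two equally rated models; the voter prefers model A.
new_a, new_b = update_elo(1000.0, 1000.0, score_a=1.0)
print(new_a, new_b)  # 1016.0 984.0
```

Because the update depends on the expected score, beating a higher-rated model moves the ratings more than beating a lower-rated one.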
- For Summaries:
  - Enter the text you want summarized into the prompt box.
- For Translations:
  - Choose the language you're translating from and to.
  - Enter the text you want translated into the prompt box.
- Voting:
  - After you see both results, pick which one you think is better.
- gpt-4o-2024-11-20
- gpt-4o-mini-2024-07-18
- claude-3-5-sonnet-20241022
- claude-3-5-haiku-20241022
- gemini-1.5-pro-002
- gemini-1.5-flash-002
- google/gemma-2-9b-it
- google/gemma-2-27b-it
- meta-llama/Meta-Llama-3.1-8B-Instruct
- meta-llama/Meta-Llama-3.1-70B-Instruct
- meta-llama/Meta-Llama-3.1-405B-Instruct
- meta-llama/Llama-3.2-3B-Instruct
- meta-llama/Llama-3.2-1B-Instruct
- Qwen/Qwen2.5-72B-Instruct
Summary language
Rank | Model | Elo rating |
---|---|---|
10 | meta-llama/Meta-Llama-3.1-405B-Instruct | 1013 |
The leaderboard is updated every 10 minutes.
Source language
Target language
Rank | Model | Elo rating |
---|---|---|
12 | meta-llama/Meta-Llama-3.1-405B-Instruct | 1019 |