Yanolja Arena

Yanolja Arena helps find the best LLMs for summarizing and translating text. We compare two random models at a time and use an ELO rating system to score them.

This is an open-source project. Check it out on GitHub.

  1. For Summaries:
  • Enter the text you want summarized into the prompt box.
  1. For Translations:
  • Choose the language you're translating from and to.
  • Enter the text you want translated into the prompt box.
  1. Voting:
  • After you see both results, pick which one you think is better.
  • gpt-4o-2024-11-20
  • gpt-4o-mini-2024-07-18
  • claude-3-5-sonnet-20241022
  • claude-3-5-haiku-20241022
  • gemini-1.5-pro-002
  • gemini-1.5-flash-002
  • google/gemma-2-9b-it
  • google/gemma-2-27b-it
  • meta-llama/Meta-Llama-3.1-8B-Instruct
  • meta-llama/Meta-Llama-3.1-70B-Instruct
  • meta-llama/Meta-Llama-3.1-405B-Instruct
  • meta-llama/Llama-3.2-3B-Instruct
  • meta-llama/Llama-3.2-1B-Instruct
  • Qwen/Qwen2.5-72B-Instruct
Category

The chosen category determines the instruction sent to the LLMs.

Summary language
Rank
Model
Elo rating
10
meta-llama/Meta-Llama-3.1-405B-Instruct
1013

The leaderboard is updated every 10 minutes.