Discover/arena.ai API
live

arena.ai APIarena.ai

Fetch AI model leaderboard rankings from arena.ai across agent, text, image, video, and code arenas. Get ELO scores, confidence intervals, and model metadata.

Endpoints
1
Updated
4h ago
Try it
The arena/modality to fetch rankings for. Accepts exactly one of: agent, text, text-to-image, image-edit, text-to-video, image-to-video, video-edit, code/webdev, code/image-to-webdev.
api.parse.bot/scraper/dee580cd-1286-483f-818e-02b6459a0d69/<endpoint>
Ready to send
Fill in the parameters and hit sign in to send to see live response data here.
Use it in your codegrab a free API key at signup
curl -X GET 'https://api.parse.bot/scraper/dee580cd-1286-483f-818e-02b6459a0d69/get_leaderboard' \
  -H 'X-API-Key: $PARSE_API_KEY'
All endpoints · 1 totalclick to expand

Get the full leaderboard rankings for a specified arena. Returns all ranked models with their scores, confidence intervals, and metadata. The agent arena returns signal-based scores (task outcome, steerability, tool hallucination, etc.), while other arenas return ELO-style ratings with vote counts.

Input
ParamTypeDescription
arenastringThe arena/modality to fetch rankings for. Accepts exactly one of: agent, text, text-to-image, image-edit, text-to-video, image-to-video, video-edit, code/webdev, code/image-to-webdev.
Response
{
  "type": "object",
  "fields": {
    "arena": "string",
    "models": "array of model ranking objects",
    "model_count": "integer",
    "last_updated": "string (ISO datetime, agent arena only)",
    "total_sessions": "integer (agent arena only)",
    "leaderboard_slug": "string (non-agent arenas only)"
  },
  "sample": {
    "data": {
      "arena": "agent",
      "models": [
        {
          "rank": 1,
          "model": "GPT 5.5 (High)",
          "license": "Proprietary",
          "sessions": 27140,
          "avg_score": {
            "ci": 0.0129,
            "value": 0.0922,
            "pipelines": 5
          },
          "signal_ci": {
            "steerability": 0.0239,
            "task_outcome_explicit": 0.023
          },
          "rank_spread": {
            "max": 5,
            "min": 1
          },
          "organization": "OpenAI",
          "signal_scores": {
            "steerability": 0.0959,
            "task_outcome_explicit": 0.0613
          }
        }
      ],
      "model_count": 20,
      "last_updated": "2026-06-08T13:00:00.000Z",
      "total_sessions": 463644
    },
    "status": "success"
  }
}

About the arena.ai API

The Arena AI Leaderboard API exposes a single get_leaderboard endpoint that returns ranked AI model data across five arenas: agent, text, text-to-image, image-edit, and code. Each response includes per-model scores, confidence intervals, and metadata — with the agent arena also returning signal-based sub-scores for task outcome, steerability, and tool hallucination, plus session-level totals and a last-updated timestamp.

What the API returns

The get_leaderboard endpoint accepts an arena parameter and returns a ranked list of AI models for that modality. The response always includes the arena name, a models array of ranking objects, and a model_count integer. The models array carries per-model scores and confidence intervals, allowing you to compare statistical separation between models rather than just raw rank order.

Agent arena vs. other arenas

The agent arena response includes fields that the other arenas do not: last_updated (ISO datetime), total_sessions (the number of evaluation sessions behind the rankings), and signal-based sub-scores such as task outcome, steerability, and tool hallucination rate. Non-agent arenas — text, text-to-image, image-edit — return ELO-style ratings and expose a leaderboard_slug string instead. These structural differences mean you should branch on the arena field in your response handling.

Supported arenas

The arena input accepts exactly one value per call: agent, text, text-to-image, image-edit, or text (code). There is no batch or multi-arena endpoint; separate calls are needed to compare rankings across modalities. The model_count field lets you quickly confirm how many models are ranked without iterating the full models array.

Common use cases
  • Build a model selection dashboard that surfaces the current top-ranked agents and their task-outcome scores from the agent arena.
  • Track ELO-style rating changes for text models over time by polling get_leaderboard with arena=text on a schedule.
  • Compare image generation models by pulling text-to-image and image-edit arena rankings side by side using model_count to normalize comparisons.
  • Alert engineering teams when a preferred model drops below a threshold rank in any arena, using confidence interval fields to filter noise.
  • Populate a live leaderboard UI that shows total_sessions and last_updated for the agent arena to signal data freshness to end users.
  • Audit tool hallucination rates across agent models to shortlist candidates for production agentic workflows.
Pricing & limitsSee full pricing →
TierPriceCredits/monthRate limit
Free$0/mo1005 req/min
Hobby$30/mo1,00020 req/min
Developer$100/mo5,000250 req/min

One credit = one API call regardless of which marketplace API you call. Exceeding the rate limit returns a 429 response. Authenticate with the X-API-Key header.

Frequently asked questions
Does arena.ai have an official developer API?+
Arena.ai does not publish a documented public developer API for leaderboard data. This Parse API provides structured access to the leaderboard data available at arena.ai/agent and related pages.
How does the agent arena response differ from other arenas?+
The agent arena returns signal-based sub-scores (task outcome, steerability, tool hallucination), a total_sessions count, and a last_updated ISO datetime. Non-agent arenas return ELO-style ratings and a leaderboard_slug field instead. Both include arena, models, and model_count.
Can I retrieve historical leaderboard rankings or track rank changes over time?+
The API returns the current leaderboard snapshot only; there is no built-in historical endpoint. You can store responses over time in your own database to build a history. If you need a dedicated historical-data endpoint, you can fork this API on Parse and revise it to add one.
Are video or code arenas available as separate arena values?+
The arena parameter currently accepts agent, text, text-to-image, and image-edit. A distinct video or code arena value is not currently exposed. You can fork this API on Parse and revise it to add support for additional arena slugs if they become available on arena.ai.
What does the `models` array contain for each ranked model?+
Each object in models includes the model's rank, score, and confidence interval bounds. The agent arena entries also carry the individual signal-based sub-scores. Exact field names per model depend on the arena; the agent arena entries are the most granular.
Page content last updated . Spec covers 1 endpoint from arena.ai.
Related APIs in Developer ToolsSee all →
artificialanalysis.ai API
Compare and rank LLM models and providers across performance benchmarks, then dive into detailed specifications for any model to find the best fit for your needs. Discover performance metrics for specialized AI systems handling speech, images, and video, plus benchmark data for different hardware configurations.
python.org API
Access comprehensive Python release information including downloads, versions, and supported operating systems, plus stay updated with the latest Python news and events. Search across Python.org's resources and browse release files, details, and the FTP index all in one place.
alienvault.com API
Search and analyze global threat intelligence data including indicators of compromise, threat pulses, and adversary profiles from the Open Threat Exchange community. Monitor recent security alerts and access detailed information about threats and adversaries to strengthen your cybersecurity defenses.
nvidia.com API
nvidia.com API
crt.sh API
Search for SSL/TLS certificates across public transparency logs by domain, fingerprint, serial number, or public key, and retrieve detailed certificate information including issuer, validity dates, and certificate chain details. Monitor certificate issuance for domains you care about to track security changes and detect unauthorized certificates.
lucide.dev API
Browse and download thousands of Lucide icons with instant search and category filtering to find exactly what you need. Get SVG files and metadata for each icon to integrate them seamlessly into your projects.
instantdomainsearch.com API
Check domain name availability instantly across over 800 TLD extensions and verify whether specific domains are registered. Search and monitor domain registration status to find your perfect web address or track competitor domains in real-time.
producthunt.com API
Access Product Hunt's daily leaderboards, detailed product pages, and search functionality. Retrieve ranked product launches for any date or date range, including upvote counts, descriptions, maker information, tags, and external links.