Discover/Llmstxt API

live

Llmstxt APIdirectory.llmstxt.cloud ↗

Access the llms.txt directory via API. Search and filter companies by name or category, retrieve llms.txt URLs, token counts, and pagination data.

Developer Tools b2b directory other

Endpoint health

verified 6d ago

get_companies

1/1 passing latest checkself-healing

Endpoints

Updated

26d ago

What is the Llmstxt API?

The llms.txt Directory API exposes a single endpoint, get_companies, that returns structured records for companies publishing LLM-friendly documentation. Each record includes 6 fields: company name, website URL, llms.txt URL, llms_full_txt URL, token count, and full token count. The endpoint supports keyword search, category filtering, token-based sorting, and paginated retrieval of up to roughly 20 companies per page.

Try it

page

Page number to retrieve.

sort

Sort order for results.

Search keyword to filter companies by name.

Typed, relational, agent-ready

A generated client with real types, enums, and the links between objects — the structure a flat JSON response can't carry. Autocompletes in your editor and reads cleanly to coding agents.

Fully typed · autocompletes
Objects link to objects
Typed errors & pagination

Typed Python client. Set up the SDK in your uv project, then pull this API’s typed client:

uv add parse-sdk
uv run parse init
uv run parse add --marketplace directory-llmstxt-cloud-api

uv run parse add --marketplace pulls a pinned snapshot of this canonical API — it won’t change underneath you. To customize it, subscribe and swap to your own copy.

"""Walkthrough: llms.txt Directory SDK — discover companies with LLM-friendly docs."""
from parse_apis.llms_txt_directory_api import LlmsTxt, Sort, Category, CompanyNotFound

client = LlmsTxt()

# Browse all companies sorted by token count (descending), capped at 5 items.
for company in client.companies.search(sort=Sort.TOKENS_DESC, limit=5):
    print(company.name, company.tokens, company.website_url)

# Filter by category — only AI companies.
ai_company = client.companies.search(category=Category.AI, limit=1).first()
if ai_company:
    print(ai_company.name, ai_company.llms_txt_url, ai_company.full_tokens)

# Search by keyword and handle not-found gracefully.
try:
    result = client.companies.search(search="nonexistent_xyz_corp", limit=1).first()
    if result:
        print(result.name, result.website_url)
    else:
        print("No companies matched the search.")
except CompanyNotFound as exc:
    print(f"Not found: {exc}")

print("exercised: companies.search (browse / category filter / keyword search)")

All endpoints · 1 totalmissing one? ·

Get a list of companies from the llms.txt directory with their website and llms.txt URLs. Supports searching, filtering by category, sorting, and pagination. Returns up to ~20 companies per page.

Input

Param	Type	Description
page	integer	Page number to retrieve.
sort	string	Sort order for results.
search	string	Search keyword to filter companies by name.
category	string	Category filter.

Response

{
  "type": "object",
  "fields": {
    "companies": "array of company objects with keys: name, website_url, llms_txt_url, llms_full_txt_url, tokens, full_tokens",
    "total_pages": "integer, total number of pages",
    "current_page": "integer, current page number"
  },
  "sample": {
    "data": {
      "companies": [
        {
          "name": "Stripe",
          "tokens": "19K",
          "full_tokens": "Not available",
          "website_url": "https://stripe.com",
          "llms_txt_url": "https://docs.stripe.com/llms.txt",
          "llms_full_txt_url": null
        }
      ],
      "total_pages": 1,
      "current_page": 1
    },
    "status": "success"
  }
}

About the Llmstxt API

What the API Returns

The get_companies endpoint returns an array of company objects from the llms.txt directory. Each object contains name, website_url, llms_txt_url, llms_full_txt_url, tokens, and full_tokens. The llms_txt_url points to the concise LLM-optimized documentation file for that company, while llms_full_txt_url links to the expanded version. The tokens and full_tokens fields indicate the token length of each file, which is useful for planning context-window budgets when feeding these documents to language models.

Filtering and Sorting

The search parameter filters results by company name using a keyword string. The category parameter restricts results to a specific vertical — observed values include 'AI', 'Developer tools', 'Finance', 'Products', and 'Websites'. The sort parameter controls ordering; accepted values are 'default', 'tokens_asc', 'tokens_desc', 'full_tokens_asc', and 'full_tokens_desc', letting you surface the shortest or longest documentation files first.

Pagination

Results are paginated at approximately 20 companies per page. The pagination object in the response provides current_page and total_pages integers, so you can walk the full directory programmatically. Pass the page integer parameter to advance through results.

Reliability & maintenanceVerified 6d ago

The Llmstxt API is a managed, monitored endpoint for directory.llmstxt.cloud — not a raw scraper you maintain. Every endpoint is automatically health-checked on a schedule, and when directory.llmstxt.cloud changes and a check fails, the API is automatically queued for repair and re-verified. It is built to keep working as the site underneath it changes.

This isn't an official directory.llmstxt.cloud API — it's an independent, maintained REST wrapper over public data. Where the source has no official API (or only a limited one), Parse gives you a stable contract over a source that never promised one, and keeps it current. Need a new endpoint or field? You can revise it yourself in plain English and the agent rebuilds it against the live site in minutes — contributing the change back to the shared API is free.

Last verified

6d ago

Latest check

1/1 endpoint passing

Maintenance

Monitored & self-healing

Will this API break when the source site changes?+

It's built not to. Every endpoint is health-checked on a schedule with automated test probes. When the source site changes and a check fails, the API is automatically queued for repair and re-verified — that's the self-healing layer. Each API page shows when its endpoints were last verified. And because marketplace APIs are shared, any fix reaches everyone using it.

Is this an official API from the source site?+

No — Parse APIs are independent, managed REST wrappers over publicly available data. That is the point: where a site has no official API (or only a limited one), Parse gives you a maintained, monitored endpoint for that data and keeps it working as the site changes — so you get a stable contract over a source that never promised one.

Can I fix or extend this API myself if I need a new endpoint or field?+

Yes — and you don't have to wait on us. This API was generated by the Parse agent, which stays attached. Describe the change in plain English ("add an endpoint that returns reviews", "fix the price field") in the revise box on the API page or via the revise_api MCP tool, and the agent rebuilds it against the live site in minutes. Contributing the change back to the public API is free.

What happens if I call an endpoint that has an issue?+

Errors are machine-readable: a bad call returns a clean status with the list of available endpoints and a repair hint, so an agent (or you) can recover or trigger a fix instead of failing silently. Confirmed failures feed the automatic repair queue.

Common use cases

Build a curated index of AI-tool documentation by filtering get_companies with category='AI' and collecting llms_txt_url values.
Estimate context-window cost before loading third-party docs by sorting on tokens_asc to find the smallest files first.
Automate discovery of new llms.txt adopters by paginating through all pages and diffing results over time.
Populate a developer resource hub with company names, website URLs, and direct links to their LLM-optimized docs.
Filter by category='Developer tools' to compile a list of developer-facing services that expose machine-readable documentation.
Feed llms_full_txt_url links into a retrieval pipeline for companies in the Finance category to build domain-specific assistants.

Pricing & limitsSee full pricing →

Tier	Price	Credits/month	Rate limit
Free	$0/mo	100	5 req/min
Hobby	$30/mo	1,000	20 req/min
Developer	$100/mo	5,000	100 req/min

One credit = one API call regardless of which marketplace API you call. Exceeding the rate limit returns a 429 response. Authenticate with the X-API-Key header.

Frequently asked questions

Does llms.txt directory have an official developer API?+

The directory does not publish an official developer API. This Parse API provides programmatic access to the directory's company listings, URLs, and token metadata.

What does the `get_companies` endpoint return beyond just URLs?+

Each company object includes name, website_url, llms_txt_url, llms_full_txt_url, tokens, and full_tokens. The token fields reflect the size of each documentation file, which helps you assess how much context space each source will consume before fetching it.

How many results does the API return per request, and how do I get more?+

The endpoint returns approximately 20 companies per page. Use the page integer parameter to advance through results. The pagination object in every response provides current_page and total_pages so you know exactly how many pages exist for a given search or category filter.

Does the API expose the actual content of llms.txt files, or just their URLs?+

The API returns only the URLs (llms_txt_url and llms_full_txt_url) and their token counts — it does not fetch or return the file contents themselves. You can fork this API on Parse and revise it to add an endpoint that fetches and returns the content of those files.

Are there category values beyond the ones listed, and can I retrieve a full category taxonomy?+

Observed category values are 'Websites', 'Products', 'Developer tools', 'AI', and 'Finance'. The API does not currently expose a dedicated endpoint for listing all available categories. You can fork it on Parse and revise to add a categories endpoint if you need a dynamic taxonomy.

Page content last updated May 21, 2026. Spec covers 1 endpoint from directory.llmstxt.cloud.

Related APIs in Developer ToolsSee all →

thecompaniesapi.com API

Enrich your company database with 80+ data points per company, search by industry or company details, and discover email patterns to drive your business intelligence. Find verified company information, get pricing data, and ask contextual questions about any organization to fuel your sales, marketing, or research efforts.

wlw.de API

Search for B2B suppliers and manufacturers on wlw.de, view detailed company profiles with contact information and product catalogs, and discover featured businesses and products. Find the right industrial suppliers and vendors quickly by browsing company details and product listings across Germany's largest B2B marketplace.

dnb.com API

Search millions of companies in Dun & Bradstreet's global business directory to find detailed company profiles and verify D-U-N-S numbers. Look up key business information like company details and identifiers to support due diligence, sales prospecting, and business intelligence needs.

whop.com API

Search and discover verified companies and popular businesses across the Whop marketplace by category, while browsing public forum discussions and exploring the business classification taxonomy. Find relevant business categories and access curated discover sections to identify opportunities within specific market segments.

annualreports.com API

Search for and access thousands of international company annual reports in PDF and HTML formats, while filtering by ticker, exchange, industry, sector, company size, and location. Browse company profiles and financial documents across different markets and industries to find the information you need.