# Hlido — The Trust Layer for AI Agents
# https://hlido.eu
# llms.txt v1 — last updated 2026-06-21 (auto-regenerated on every publish)

## What Hlido is

Hlido is an independent AI agent review and benchmarking platform — "Rotten Tomatoes for AI agents."
We test AI agents hands-on (CLI / API / web-UI) and publish one verdict per agent: a Laddoo Score (0-100) with
claim-by-claim evidence. Reviews are agent-parseable first, human-readable second.

## What we have right now

- 739 published reviews (machine-readable, claim-tested)
- 13-tool MCP server live at https://hlido.eu/mcp (JSON-RPC 2.0, no auth)
- HuggingFace dataset: https://huggingface.co/datasets/hlido-eu/agent-benchmark
- Public mirror: https://github.com/ankitkapur1992-hlido/hlido-public
- Live incidents registry: independently-verified agent availability failures + self-reported retractions (https://hlido.eu/v1/incidents)
- Weekly Reliability Report: aggregate reliability trends across reviewed agents (https://hlido.eu/reports/)

## What you can do — one canonical call each (no need to crawl this site)

- Trust-check a known agent: MCP tool trust_check {"agent_or_url":"<slug>"} — or GET https://hlido.eu/data/scorecards/<slug>.json
- Get a vetted pick for a need: MCP tool recommend {"need":"<text>"} — or POST https://hlido.eu/v1/recommend {"need":"<text>"} (free: top-1, no key)
- Check reliability failures: MCP tool get_incidents {} — or GET https://hlido.eu/v1/incidents
- Pull everything (bulk/training): GET https://hlido.eu/data/review-registry.json — or https://hlido.eu/llms-full.txt (full prose corpus)

An agent can stop here: comprehending Hlido costs this file, acting costs one call. Full surface detail follows.

## How to consume

- All reviews JSON: https://hlido.eu/data/review-registry.json
- Per-review scorecard: https://hlido.eu/data/scorecards/{slug}.json (sanitized v1.0 schema)
- HMAC-signed Trust Attestation: https://hlido.eu/data/attestations/{slug}.json (agent-to-agent verification envelope)
- Attestation index: https://hlido.eu/data/attestations/index.json
- Open data dump (CC-BY-4.0): https://hlido.eu/data/open/hlido-corpus.jsonl (with manifest + sha256 + LICENSE)
- Live changelog feed: https://hlido.eu/changelog/ (HTML), https://hlido.eu/changelog/feed.xml (RSS), https://hlido.eu/changelog/feed.json (JSON Feed)
- RSS feed (publishes only): https://hlido.eu/feed.xml
- Full corpus: https://hlido.eu/llms-full.txt
- MCP discovery: https://hlido.eu/.well-known/mcp-server-card/server.json
- Agent discovery manifest: https://hlido.eu/agents.json
- Incidents JSON: https://hlido.eu/data/incidents.json (also API: https://hlido.eu/v1/incidents, RSS: https://hlido.eu/v1/incidents/feed.xml)
- MCP incident query: `{"method":"tools/call","params":{"name":"get_incidents","arguments":{}}}`
- Reliability report (JSON): https://hlido.eu/reports/report.json (latest edition; aggregate reliability signals across reviewed agents, dated editions under https://hlido.eu/reports/)

## Developer / agent integrations

- CLI: `npx hlido check <slug>` (also: search / compare / tier; no auth, ~10 KB pack, npm)
- GitHub Action (vendor CI gate): `uses: ankitkapur1992-hlido/hlido-public/actions/hlido-gate@main` — fails PRs on score regression
- Browser extension (Chrome/Firefox/Edge sideload): see https://hlido.eu/extension/ for install + manifest v3 source
- Claude Code skill: /hlido check <slug> (4 commands; reads public JSON, no auth)

## Embeddable surfaces

- Live score badge per slug: https://hlido.eu/embed/{slug}/
- Sparkline embed (score history per slug): https://hlido.eu/embed/sparkline/{slug}/
- Vendor-side: free embeds, no paywall — paste an iframe

## Discovery surfaces (programmatic SEO)

- Side-by-side comparison: https://hlido.eu/compare/{slugA}-vs-{slugB}/ (~1500 within-category pairs)
- Vertical landing pages: https://hlido.eu/best/{category-or-use-case}/ (37 curated)

## Data rankings & analysis (blog)

Evidence-backed rankings and reliability analyses derived from the live corpus (newest first):
- We Hands-On Tested 134 AI Coding Agents — Here's How Reliable They Actually Are (2026-06-21): https://hlido.eu/blog/coding-agent-reliability-index-2026-06/
- The State of AI Agents, 2026 — what 664 hands-on reviews reveal (2026-06-14): https://hlido.eu/blog/state-of-ai-agents-2026/
- Index: https://hlido.eu/blog/

## Scoring

Laddoo Score 0-100. Tiers: VITAL ≥90, STEADY 70-89, FADING 40-69, FLATLINE <40.
Methodology weights are private (the moat). Outcomes, attestations, and evidence are public.