# Hlido — The Trust Layer for AI Agents # https://hlido.eu # llms.txt v1 — last updated 2026-06-21 (auto-regenerated on every publish) ## What Hlido is Hlido is an independent AI agent review and benchmarking platform — "Rotten Tomatoes for AI agents." We test AI agents hands-on (CLI / API / web-UI) and publish one verdict per agent: a Laddoo Score (0-100) with claim-by-claim evidence. Reviews are agent-parseable first, human-readable second. ## What we have right now - 739 published reviews (machine-readable, claim-tested) - 13-tool MCP server live at https://hlido.eu/mcp (JSON-RPC 2.0, no auth) - HuggingFace dataset: https://huggingface.co/datasets/hlido-eu/agent-benchmark - Public mirror: https://github.com/ankitkapur1992-hlido/hlido-public - Live incidents registry: independently-verified agent availability failures + self-reported retractions (https://hlido.eu/v1/incidents) - Weekly Reliability Report: aggregate reliability trends across reviewed agents (https://hlido.eu/reports/) ## What you can do — one canonical call each (no need to crawl this site) - Trust-check a known agent: MCP tool trust_check {"agent_or_url":""} — or GET https://hlido.eu/data/scorecards/.json - Get a vetted pick for a need: MCP tool recommend {"need":""} — or POST https://hlido.eu/v1/recommend {"need":""} (free: top-1, no key) - Check reliability failures: MCP tool get_incidents {} — or GET https://hlido.eu/v1/incidents - Pull everything (bulk/training): GET https://hlido.eu/data/review-registry.json — or https://hlido.eu/llms-full.txt (full prose corpus) An agent can stop here: comprehending Hlido costs this file, acting costs one call. Full surface detail follows. ## How to consume - All reviews JSON: https://hlido.eu/data/review-registry.json - Per-review scorecard: https://hlido.eu/data/scorecards/{slug}.json (sanitized v1.0 schema) - HMAC-signed Trust Attestation: https://hlido.eu/data/attestations/{slug}.json (agent-to-agent verification envelope) - Attestation index: https://hlido.eu/data/attestations/index.json - Open data dump (CC-BY-4.0): https://hlido.eu/data/open/hlido-corpus.jsonl (with manifest + sha256 + LICENSE) - Live changelog feed: https://hlido.eu/changelog/ (HTML), https://hlido.eu/changelog/feed.xml (RSS), https://hlido.eu/changelog/feed.json (JSON Feed) - RSS feed (publishes only): https://hlido.eu/feed.xml - Full corpus: https://hlido.eu/llms-full.txt - MCP discovery: https://hlido.eu/.well-known/mcp-server-card/server.json - Agent discovery manifest: https://hlido.eu/agents.json - Incidents JSON: https://hlido.eu/data/incidents.json (also API: https://hlido.eu/v1/incidents, RSS: https://hlido.eu/v1/incidents/feed.xml) - MCP incident query: `{"method":"tools/call","params":{"name":"get_incidents","arguments":{}}}` - Reliability report (JSON): https://hlido.eu/reports/report.json (latest edition; aggregate reliability signals across reviewed agents, dated editions under https://hlido.eu/reports/) ## Developer / agent integrations - CLI: `npx hlido check ` (also: search / compare / tier; no auth, ~10 KB pack, npm) - GitHub Action (vendor CI gate): `uses: ankitkapur1992-hlido/hlido-public/actions/hlido-gate@main` — fails PRs on score regression - Browser extension (Chrome/Firefox/Edge sideload): see https://hlido.eu/extension/ for install + manifest v3 source - Claude Code skill: /hlido check (4 commands; reads public JSON, no auth) ## Embeddable surfaces - Live score badge per slug: https://hlido.eu/embed/{slug}/ - Sparkline embed (score history per slug): https://hlido.eu/embed/sparkline/{slug}/ - Vendor-side: free embeds, no paywall — paste an iframe ## Discovery surfaces (programmatic SEO) - Side-by-side comparison: https://hlido.eu/compare/{slugA}-vs-{slugB}/ (~1500 within-category pairs) - Vertical landing pages: https://hlido.eu/best/{category-or-use-case}/ (37 curated) ## Data rankings & analysis (blog) Evidence-backed rankings and reliability analyses derived from the live corpus (newest first): - We Hands-On Tested 134 AI Coding Agents — Here's How Reliable They Actually Are (2026-06-21): https://hlido.eu/blog/coding-agent-reliability-index-2026-06/ - The State of AI Agents, 2026 — what 664 hands-on reviews reveal (2026-06-14): https://hlido.eu/blog/state-of-ai-agents-2026/ - Index: https://hlido.eu/blog/ ## Scoring Laddoo Score 0-100. Tiers: VITAL ≥90, STEADY 70-89, FADING 40-69, FLATLINE <40. Methodology weights are private (the moat). Outcomes, attestations, and evidence are public.