The trust layer for AI agents

Every AI agent makes claims.
Someone has to verify them.

Hlido is the independent review desk for the agent economy. We test what agents actually do — in public, with evidence — so buyers, teams, and other agents can tell which ones live up to their pitch.

Laddoo score · 0–100 Signed video evidence Re-tested monthly
Why this exists

The agent economy has no Consumer Reports — yet.

Every week, dozens of new AI agents launch. Each one claims to write code, run workflows, handle support, or close deals. Some deliver. Most don't — at least not the way their landing page implies.

Until now, the only way to know was to sign up, burn a weekend, and find out yourself. That doesn't scale. Not for procurement teams deciding between vendors. Not for builders wiring agents into production. And — crucially — not for other agents that will soon be making those decisions on our behalf.

Hlido is the independent party that actually tests every claim. We publish verdicts with video evidence, track them over time, and surface the patterns the market cares about.

How it works

Three stages. Evidence at every step.

The public process is transparent. The exact scoring formula stays proprietary so it cannot be gamed.

01

Intake

We enrich every candidate from public claims, pricing, docs, and (when published) their MCP server-card. Each review starts from what the agent says it does.

02

Live test

We attempt the real product flow in an isolated browser. Meaningful interactions are recorded. Blockers are captured — not written around.

03

Signed verdict

A Laddoo score (0–100), a tier, and a video of the actual run — all cryptographically signed and published. Stale reviews get re-tested.

Who this is for

Three audiences. One verdict feed.

Buyers & procurement

Need to pick one agent over another. Use Hlido to see if the pitch survives a real test before you sign.

Browse reviews →

Agent creators

Ship an agent that can withstand independent testing. Get verified, earn a live badge, publish trust from day one.

Get Verified →

Buying agents (soon)

When your agent picks another agent to call, it will want a trust signal. Hlido is that signal.

See the market →
For agents & developers

Query the trust layer programmatically.

Hlido is live as an MCP server at hlido.eu/mcp. Five tools — trust_check, find_trusted, verify_claim, compare_agents, submit_agent — no auth required, JSON-RPC 2.0.

curl -X POST https://hlido.eu/mcp \
  -H "Content-Type: application/json" \
  -d '{"jsonrpc":"2.0","id":1,"method":"tools/call",
       "params":{"name":"trust_check",
                 "arguments":{"agent_or_url":"genspark"}}}'

Read the full MCP docs →

Live reviews

What we've tested most recently

Each verdict links to the full review page with scorecard, blocker map, and evidence trail.

Loading

Fetching the latest review registry.

The newest six published reviews will appear here.

The broader picture

Beyond individual reviews.

Two public surfaces sit on top of the review corpus — one for the market, one for the builders.

Monthly

Economy Index

Aggregate signals from the live review corpus: where agents fail most, which categories ship cleanest, what the pricing surface looks like.

Read Vol.1 →
For creators

Academy

Public-safe guidance on what makes a reviewable agent: clear docs, honest pricing, a working demo path, an MCP server-card.

Open Academy →
Trust

Editorial independence

We do not take equity in reviewed agents. We do not accept payment for score adjustments. Badge revocation is algorithmic.

Read the disclosure →