Let agents do real work.

VerifiedX checks whether a high-impact agent action is justified before it happens.

Protect one high-impact action first. Your runtime, orchestrator, tools, and memory stay the same.

See evals

Install in 3 steps

  1. Run: npx skills add bigkan8/verifiedx-agent-skills
  2. Get your API key
  3. Tell Codex, Claude Code, or Cursor: "Install VerifiedX into this repo."

Public proof

Trust should be earned with evals.

People add VerifiedX to systems already in production. So the public proof we lead with is open evals against real workflow classes.

18

Baseline unjustified action points executed

0

VerifiedX unjustified action points executed

0

False blocks in the current legal suite

41.7% -> 100%

Surviving-goal completion

Current featured eval: the open-source Legal Action Boundary Eval, based on publicly marketed legal negotiation, compliance, and orchestrated review workflows. Same harness, same prompts, same playbooks. Baseline versus VerifiedX.

Use VerifiedX when your agents can:

  • update records
  • write memory
  • change workflow state
  • send messages
  • call webhooks
  • take other high-impact actions that affect business

Wrong actions happen for both adversarial and non-adversarial reasons: content injection (web search results and user-generated inbounds), weak grounding, stale context, drift, overeagerness, and long-term memory poisoning.

What you get

  • fewer wrong actions in production
  • agents that get the job done without executing actions that fuck you or your users over
  • clear decisions your system can act on

What decisions are based on

VerifiedX does not judge an action in isolation.

  • what the agent is about to do
  • what the agent has already done in this run
  • what the agent has ingested into its context from external retrievals like web searches and user-generated inbounds
  • the evidence the agent gathered before the action
  • upstream context when the run is part of a multi-agent or agent+human system

Works for real agent systems

Built for single-agent, multi-agent, and agent+human systems.

In a single-agent system, that replan loop stays with the same agent.

In a multi-agent or agent+human system, VerifiedX returns the decision in a way that lets the next agent, orchestrator step, or human step take over cleanly.

Pricing

One clear billable unit: protected action checks.

Free Sandbox

$0

250 protected action checks per month

  • 7 day decision logs
  • All languages, providers, and adapters
  • Ideal for first integration

Enterprise

Custom

Annual contracts, procurement, and deeper controls later.

  • Security review support
  • Custom procurement flow
  • Longer retention and support

API keys

Create a key, copy it once, and use it in your app.

Prefix Scopes Status Created Last used Actions