May 14, 20261 min readPrama

How our agents validate a finding

EngineeringAI Agents

A finding you can't reproduce is a guess. The entire value of a penetration test collapses if the team on the receiving end can't tell signal from noise. That's why every Parameter finding ships with a working proof-of-concept — and why our agents never report a vulnerability they haven't actually triggered.

Probe, exploit, verify

Our agents reason like a human red teamer working a target:

Probe. Map the surface — endpoints, parameters, auth flows, and the business logic that connects them.
Hypothesize. Form a specific, testable claim: "this id parameter is an IDOR that exposes other tenants' records."
Exploit. Build the request that proves it, capturing the full request and response.
Verify. Replay the exploit from a clean state. If it doesn't reproduce, it isn't a finding.

Why proof-of-concept matters

A confirmed exploit changes the conversation. Instead of "the scanner flagged this, please investigate," your engineers get a reproduction they can run, watch fail, and fix — then re-test to confirm the patch holds.

It also keeps us honest. An agent that has to produce a working exploit can't pad a report with theoretical maybes. The bar is binary: did it reproduce, or not?

Validation by construction

Because validation is part of the loop rather than a step bolted on afterward, false positives stay rare — under 1% in practice. You spend your time fixing real issues, not arguing with a tool about whether they're real.