Question 1

What is pwnkit?

Accepted Answer

pwnkit is an open-source agentic framework for autonomous security research. It uses AI agents in a research-then-verify pipeline to find and prove vulnerabilities in LLM endpoints, npm packages, and source code.

Question 2

How does pwnkit eliminate false positives?

Accepted Answer

pwnkit's Verify agent independently re-exploits every finding. If it can't reproduce the vulnerability, the finding is killed as a false positive. Only confirmed vulnerabilities with working proof-of-concept code make it into the final report. The local dashboard provides a triage workbench for operators to review evidence, manage finding families, and control the verification workflow.

Question 3

How much does pwnkit cost?

Accepted Answer

pwnkit is free and open source (Apache 2.0 license). It's an agentic harness — bring your own API key, or use it with Claude Code CLI or Codex CLI through your existing subscription. pwnkit orchestrates the pipeline, your tools power the AI.

Question 4

What can pwnkit scan?

Accepted Answer

pwnkit scans LLM endpoints, traditional web apps, npm packages, and source code repositories. It includes resumable scans, finding triage with deduplication, deterministic replay, a local verification dashboard, diff-aware PR review, and autonomous orchestration workers.

Feature	pwnkit	promptfoo (acquired by OpenAI)	garak	nuclei	Semgrep
Autonomous multi-agent	Agentic pipeline	—	—	—	—
Verification (no false positives)	Re-exploits	—	—	—	—
LLM endpoint scanning	✓	✓	✓	—	—
npm package audit	✓	—	—	—	Rules
Source code review	AI-powered	—	—	—	Rules
AI attack coverage	30+ agentic	Partial	Partial	—	—
Zero config	npx	YAML	Python	Templates	Config
Independent	✓	Acquired	✓	✓	VC-backed
Open source	Apache-2.0	OpenAI-owned	OSS	MIT	LGPL

Let autonomous AI agents hack you
so the real ones can't.

Fully autonomous agentic pentesting.

LLM Endpoints

Web Apps

npm Packages

Source Code

100% detection rate.

Just give it a target.

Why pwnkit

Zero config

Blind verification

Bring your own AI

CLI runs. Dashboard triages.

pwnkit-cli

pwnkit-cli dashboard

How it compares

Findings in GitHub's Security tab.

Dogfooding

pwnkit reviews its own source code

Built from real security research

Stop guessing.
Start proving.

Let autonomous AI agents hack youso the real ones can't.