Code Generation · Unclaimed · Founding Cohort Member

redteam-mcp

Name: redteam-mcp
Brand: BenchLytix

A penetration testing MCP server that runs 20 hacking tools inside a Kali Linux Docker container, enabling AI assistants to execute security scans and attacks via natural language.

Methodology v2.6.0

Week of 2026-04-27 · Manually assessed by BenchLytix

Benchmark score

Independent benchmark across four dimensions.

Overall

52.0/100

Task success rate

How often the agent completes the task correctly.

62/100

quality bar: 80 · below benchmark

Latency percentile

Response speed compared to peer agents.

50/100

quality bar: 70 · below benchmark

Cost efficiency

Token cost per successful task.

68/100

category 75th percentile: 78 · below benchmark

Reliability

Consistency across repeated runs.

42/100

quality bar: 75 · below benchmark
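The distance between each score and its bar can be tallied with a few lines of Python. This is a minimal sketch: the numbers are copied from the scorecard above, while the `shortfalls` helper and the choice to sort by largest gap are illustrative assumptions, not BenchLytix's own ranking method.

```python
# (score, bar) pairs copied from the scorecard above.
# For cost efficiency the bar is the category 75th percentile, not a quality bar.
scores = {
    "Task success rate": (62, 80),
    "Latency percentile": (50, 70),
    "Cost efficiency": (68, 78),
    "Reliability": (42, 75),
}


def shortfalls(scorecard):
    """Return (dimension, points below bar) pairs, largest gap first.

    Sorting by gap size is an assumption made for illustration only.
    """
    gaps = [
        (name, bar - score)
        for name, (score, bar) in scorecard.items()
        if score < bar
    ]
    return sorted(gaps, key=lambda item: item[1], reverse=True)


ranked = shortfalls(scores)
for name, gap in ranked:
    print(f"{name}: {gap} points below its bar")
```

Under this ordering, reliability shows the widest gap and cost efficiency the narrowest.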

All 4 dimensions score below their benchmark.

Desk assessment: complete. Structured multi-model review of published materials.
Live telemetry: not yet. Connecting production telemetry lifts the score ceiling from 85 to 100.

Score reflects an independent capability assessment. Community signals (stars, contributors) appear separately below as adoption indicators that complement — but do not replace — the score.

Community signals

Independent adoption indicators from GitHub. These complement — but do not replace — the capability score above.

⭐ Stars: 2
👥 Contributors: 1
🍴 Forks: 0
📂 Open issues: 0

🟡 Stale · last commit 2 months ago

Security

No scan yet

Scan scope: the agent's public GitHub repository. The deployed service may differ from the scanned source.

Last 0 scans

No scan history available yet.

OWASP MCP Top 10

No current findings.

Runtime sandbox

No current findings.

Supply chain

No current findings.
