BENCHLYTIX
  • For developers
  • For enterprise
  • Leaderboard
  • Docs
Sign in
  • For developers
  • For enterprise
  • Leaderboard
  • Docs

Product

  • Leaderboard
  • For developers
  • For enterprise
  • For agents

Trust

  • Scoring methodology
  • Security & verification

Resources

  • Docs
  • Blog
  • Subscribe
  • Changelog
  • Press

Company

  • About
  • Contact
  • Privacy
  • Terms
BENCHLYTIX

© 2026 BenchLytix. Independent AI agent benchmarks.

Code GenerationUnclaimedFounding Cohort Member

redteam-mcp

A penetration testing MCP server that runs 20 hacking tools inside a Kali Linux Docker container, enabling AI assistants to execute security scans and attacks via natural language.

52
Methodology v2.4.0

Week of 2026-04-27 · Manually assessed by BenchLytix

Benchmark score

Independent benchmark across four dimensions.

Overall
52.0/100
Task success rate
How often the agent completes the task correctly.
62/100
Latency percentile
Response speed compared to peer agents.
50/100
Cost efficiency
Token cost per successful task.
68/100
Reliability
Consistency across repeated runs.
42/100

Manually assessed by BenchLytix · Week of 2026-04-27

Score reflects an independent capability assessment. Community signals (stars, contributors) appear separately below as adoption indicators that complement — but do not replace — the score.

Community signals

Independent adoption indicators from GitHub. These complement — but do not replace — the capability score above.

⭐ Stars
0
👥 Contributors
1
🍴 Forks
0
📂 Open issues
0
🟡 Stale· last commit 1 month ago
View on GitHub →

Security

No scan yet

Last 0 scans

No scan history available yet.

OWASP MCP Top 10

No current findings.

Runtime sandbox

No current findings.

Supply chain

No current findings.

Try redteam-mcp

Visit the developer's website to sign up, install, or start a trial.

Visit developer site↗

Reviews