The eval + guardrail + red-team + audit platform built by + for security people. 249 attack plugins, 42 strategies, adaptive multi-turn engine — the deepest red-team coverage of any LLM-safety platform, period. Plus the audit trail your SOC + your customer's procurement security review need.
What ships today
Every checked item is in production today. In-progress items are flagged explicitly — no overclaiming, no vapor.
Built for buyer reality
Security team needs to test every customer-facing LLM app for jailbreak / PII-extraction / prompt-injection BEFORE prod release. Manual red-teaming doesn't scale; off-the-shelf tools test 30 attacks.
SOC team wants a continuous attack feed against the production LLM endpoint. Need synthetic-traffic that doesn't pollute real customer data + alerting that distinguishes real attacks from synthetic.
Customer's procurement team asks 'show me your LLM safety posture.' Need to hand them a control-mapping report, not a 30-page narrative.
Eng + product teams are shipping Claude / Custom GPTs / agent workflows internally. Security team needs to scan every config-bundle for risky tool grants + prompt-injection surfaces BEFORE they go live.
Wire it in 60 seconds
The evalguard CLI runs the full red-team locally — no API key required for the offline pack. Point it at your endpoint config + plug into your CI pipeline.
# Install the CLI
npm i -g @evalguard/cli
# Run the offline red-team against an endpoint config
# (OWASP LLM Top 10 + MITRE ATLAS plugins, JSON/YAML config)
evalguard scan-local ./red-team.config.yaml \
--output ./out/findings.json --verbose
# Scan model artifacts for trojans / backdoors before deploy
evalguard model-scan ./models/finetuned.safetensors \
--severity high --format json
# Scan a repo for risky agent tool grants + leaked secrets
evalguard repo-scan ./my-agent --format json > scan.jsonwrapOpenAI for wrapAnthropic.Stack
Eval, firewall, red-team, audit, BYOK, dashboard — every surface ships out of the box. No bolt-on vendors, no procurement cycle per capability.
Free trial includes all 249 plugins, the adaptive engine, OWASP + NIST control mapping, and the full audit trail. SOC2 + OWASP evidence bundles ready for your next customer security review.
Apache-2.0 source · SOC 2 Type II in progress · full trust center