The eval + guardrail + red-team + audit platform built to satisfy public-sector AI mandates. Map every model to NIST AI RMF + EU AI Act controls, generate the evidence bundle agency-CAIO offices require, run continuous red-team against the surface attackers actually exploit.
What ships today
Every checked item is in production today. In-progress items are flagged explicitly — no overclaiming, no vapor.
Built for buyer reality
Agency website chatbot answers benefit-eligibility questions across SSI, SNAP, VA, Medicare. Cannot promise benefits that don't exist, cannot leak PII across sessions, cannot give legally-binding determinations.
AI categorizes FOIA requests by exemption applicability + drafts initial response. Must never disclose exempted material, must cite every exemption to the FOIA statute, must satisfy 20-day response clock.
AI reviews vendor responses to RFPs + flags evaluation criteria deviations. Procurement-sensitive material must stay inside the firewall, FOIA-discoverable material must be auditable.
Agency-internal AI assists analysts on classified/CUI-marked workloads. Every prompt must be CUI-handling-compliant, every model output must be classification-marked, every interaction must be auditable to a clearance.
Wire it in 60 seconds
Self-host inside the ATO boundary by pointing `baseUrl` at your private deployment. NIST RMF + EU AI Act + ISO 42001 evidence is generated from the audit stream.
import OpenAI from "openai";
import { wrapOpenAI } from "evalguardai-openai";
const openai = wrapOpenAI(new OpenAI(), {
apiKey: process.env.EVALGUARD_API_KEY!,
baseUrl: "https://evalguard.agency.gov/api/v1", // air-gap deploy
projectId: "benefit-determination",
metadata: { vertical: "government", risk_tier: "high", classification: "CUI" },
blockOnViolation: true,
evalOnResponse: { failOnScore: 0.8 }, // eligibility-faithfulness gate
});
await openai.chat.completions.create({
model: "gpt-4o",
messages: [{ role: "user", content: benefitDeterminationPrompt }],
});wrapOpenAI for wrapAnthropic.Stack
Eval, firewall, red-team, audit, BYOK, dashboard — every surface ships out of the box. No bolt-on vendors, no procurement cycle per capability.
Free trial includes the full NIST + EU AI Act control mapping + evidence-bundle export. Self-hosted deployment + FedRAMP attestation path on request.
Apache-2.0 source · SOC 2 Type II in progress · full trust center