Concepts

Mental models for EvalGuard

The API reference catalogues every endpoint, but those pages assume you already know the difference between a scorer and a firewall, what the regeneration loop does, and when an evaluation mode is "basic" vs "deep". Read these seven pages first if you're new to the platform — they're the smallest set of concepts that make the rest of the docs parse.

Evaluation modes — basic vs deep
Cheap ML scorers vs LLM-as-judge rubrics. Cost/latency tradeoffs and when to pick each.
Scoring thresholds
0–1 score scale, the 0.8 default, MIN-of-dims gate semantics, calibrating thresholds for your domain.
The regeneration loop
Evaluate → if-failing-then-regenerate → re-evaluate. Stop conditions, cost-budget gate, audit row shape.
Policy engine
Declarative rules that map score thresholds and dim verdicts to actions (block, regenerate, redact, log).
Agent checkpoints
Input injection scan → tool-call gate → tool-result scan. Three places to insert safety in an agent loop.
Red teaming
Plugins (what to test) × strategies (how to obfuscate). 249 × 42 surface; choosing the right subset.
Firewall vs scorer
Sub-3ms inline gate vs LLM-judged eval. When each fires, how they compose, why you want both.

Evaluation modes — basic vs deep

Scoring thresholds

The regeneration loop

Policy engine

Agent checkpoints

Red teaming

Firewall vs scorer