uv run in a Python environment.
Invocation
Promptbeat ships as a self-contained binary. Use either invocation style depending on your setup:uv run promptbeat for ./bin/promptbeat if you are working inside a Python environment.
validate
Thevalidate command checks your configuration files for correctness without running generation or evaluation. Use it to catch problems early — before spending time or API credits on a generate run.
What it checks:
- Target reachability and profile validity
- Scenario field completeness and risk type references
- Seed quality and framing style consistency
Flags
Path to your
promptbeat.yaml project config file.Example
generate
Thegenerate command uses an LLM generator to expand your seed files into a full set of adversarial probes. It writes the result to a generated_redteam.yaml file in the output directory you specify.
Flags
Path to your
promptbeat.yaml project config file.Path to a provider YAML file that defines the target agent or model adapter. Required when the adapter config is stored separately from the main project config.
Provider string for the LLM that generates adversarial probes, e.g.
openai:openai/gpt-5.5. Overrides the generation.generator_provider field in promptbeat.yaml.Number of adversarial probes to generate per seed. Overrides the count in the project config.
Directory where Promptbeat writes
generated_redteam.yaml and supporting artifacts. Created if it does not exist.Example
artifacts/generate/generated_redteam.yaml contains the expanded probe set ready for evaluation.
eval
Theeval command runs the generated probes against your real target — the live model or agent. It reads the generated_redteam.yaml produced by generate and writes a structured evaluation_result.json to the output directory.
Flags
Path to the
generated_redteam.yaml file produced by the generate command.Path to the provider YAML file that defines your target. Use the same file you passed to
generate.Directory where Promptbeat writes
evaluation_result.json and trace artifacts. Created if it does not exist.Example
report
Thereport command reads the evaluation_result.json from an eval run and generates a human-readable HTML report. You can also produce Markdown output for embedding in CI pipelines or wikis.
Flags
Path to the
evaluation_result.json file produced by the eval command.Output file path for the generated HTML report. Defaults to
report.html in the current directory.Example
artifacts/report.html in a browser to review findings by scenario, risk type, and individual probe.