Parity Test Scenarios — Reference¶

The parity scenario files are retained as historical migration fixtures. The Python-based parity harness has been retired; native Rust tests and scripts/probe-no-python.sh now guard the shipped implementation.

This document describes every scenario tier file, the test cases each contains, and what behaviour each tier validates.

Contents¶

Running parity checks
Scenario file format
Case fields
Comparison targets
Environment variable expansion
Tier files
tier1.yaml — Mode detection
tier2-install.yaml — Install command
tier2-plugin.yaml — Plugin command
tier3-memory.yaml — Memory command
tier4-recipe-run.yaml — Recipe run
tier5-e2e.yaml — End-to-end launch
tier5-gap-tests.yaml — Known gaps
tier5-launcher.yaml — Launcher flags
tier5-live-recipe.yaml — Live recipe execution
tier5-malformed-yaml.yaml — Error handling
tier6-qa-bugfixes.yaml — QA regressions
tier7-launcher-parity.yaml — Launcher gaps
tier8-env-vars.yaml — Environment variable injection
Related

Running parity checks¶

# Run native tests that cover migrated behavior
cargo test --workspace --locked

# Verify the CLI works without Python on PATH
scripts/probe-no-python.sh

# Verify no Python implementation/package assets are tracked
scripts/check-no-python-assets.sh

Scenario file format¶

Each YAML file contains a top-level cases: list. Each entry is one test case.

cases:
  - name: example-case
    argv: ["launch"]
    timeout: 15
    env:
      PATH: "${SANDBOX_ROOT}/bin:${PATH}"
      AMPLIHACK_NONINTERACTIVE: "1"
    setup: |
      mkdir -p bin
      cat > bin/claude <<'SCRIPT'
      #!/usr/bin/env bash
      printf '%s\n' "$@" > "${SANDBOX_ROOT}/claude_args.txt"
      SCRIPT
      chmod +x bin/claude
    compare:
      - exit_code
      - stdout
      - fs:claude_args.txt

Case fields¶

Field	Required	Description
`name`	yes	Unique identifier for the test case
`argv`	yes	Argument list passed to the CLI (without the binary name)
`timeout`	no	Seconds before the case is killed (default: 30)
`env`	no	Extra environment variables for both Python and Rust runs
`setup`	no	Shell script run once per engine before execution, in `$SANDBOX_ROOT`
`compare`	yes	List of comparison targets (see below)
`cwd`	no	Working directory relative to `$SANDBOX_ROOT`
`stdin`	no	String piped to stdin of both engines

Comparison targets¶

Target	What is compared
`exit_code`	Process exit code
`stdout`	Captured stdout, newline-normalised
`stderr`	Captured stderr, newline-normalised
`fs:<path>`	Content of `$SANDBOX_ROOT/<path>` after execution

Environment variable expansion¶

${SANDBOX_ROOT} in env: values and setup: scripts is expanded to the absolute path of the per-engine temporary directory. Use double-quoted "${SANDBOX_ROOT}" in shell redirections to handle paths with spaces.

${PATH} expands to the inherited PATH at harness startup. ${HOME} expands to the user's home directory.

Tier files¶

tier1.yaml — Mode detection¶

Validates amplihack mode detect, mode to-plugin, and mode to-local commands. Tests both the dry-run and confirmation paths. Filesystem comparisons verify that .claude/ layout changes match between Python and Rust.