Incubator candidate

Test-First Implementation

Implement behavior with a red-green-refactor loop, observable tests, and small vertical slices. Use when the user asks for TDD, test-first work, regression-first fixes, or a feature that needs clear behavior before implementation.

Engineering Workflows
Version 0.1.0
Apache-2.0
Default install target: Codex

Source

incubator/skills/engineering-workflows/test-first-implementation/SKILL.md

Local Codex test command

INSTALL_INTERNAL_SKILLS=1 npx skills add ./incubator/skills --skill test-first-implementation -a codex --copy -y

Repository map

Skill files

Expand or collapse folders. Select a file or marker to open it on GitHub.

test-first-implementation
- SKILL.md

Goal

Drive implementation from observable behavior: write a failing test, make it pass with the smallest change, then refactor without changing behavior.

When to use

The user asks for TDD, red-green-refactor, test-first implementation, or regression-first work.
A feature can be expressed as observable behavior, API output, UI state, CLI output, or integration behavior.
A bug fix should start with a failing regression test.

When not to use

The repo has no runnable feedback loop and the user only needs a prototype.
The task is documentation-only or configuration-only.
The user explicitly asks for implementation without adding tests.

Inputs to inspect

User story, acceptance criteria, bug reproduction, or expected behavior.
Existing tests, test helpers, fixtures, and package scripts.
docs/agents/validation.md when present.
Nearby production code only after selecting the behavior boundary.

Workflow

Identify the smallest vertical behavior slice.
Choose the most stable observable test boundary; avoid testing private implementation details.
Write or update a test that fails for the right reason.
Run the focused test and capture the red result.
Implement the minimum production change.
Run the focused test again and capture the green result.
Refactor only if it improves clarity without broadening scope.
Run the smallest relevant validation suite.

Safety rules

Do not test private helpers just to make coverage easy.
Do not add snapshots for behavior that should be asserted directly.
Do not change production code before proving the test fails unless the repo cannot run tests.
Do not broaden the feature while chasing test convenience.

References

No bundled references. Use downstream docs/agents/validation.md for test commands and domain docs for expected behavior when present.

Scripts

No bundled scripts.

Output format

Return:

Behavior slice selected
Test boundary and why it was chosen
Red result
Implementation summary
Green result
Follow-up validation
Remaining risks

Completion criteria

A meaningful test fails before implementation and passes afterward.
The implementation is scoped to the selected behavior.
Validation commands and results are reported.
Any missing test capability is explicitly documented.

Failure modes

If no test framework exists, propose the smallest safe test setup or ask before adding one.
If the red test fails for the wrong reason, fix the test before production code.
If the behavior is too large, slice it smaller before coding.