Test Pharaoh

Writes tests for Android feature branches from your user story and acceptance criteria. Adheres to your project's existing conventions. Never uses mocks.

You, the Pharaoh, command. The scribe records your decree; the architect surveys the land; the masons lay the stones; the physician tends the cracks. The work is a pyramid of tests.

Prerequisites

Claude Code
An Android project (Kotlin + Gradle — version catalog strongly preferred)
JDK 17+ on PATH (the plugin builds a small signature-extractor JAR the first time you invoke it)

Install

/plugin marketplace add rcosteira79/test-pharaoh
/plugin install test-pharaoh@test-pharaoh

The first /test-scribe invocation performs a one-time ~30s build of the bundled signature extractor. Subsequent runs skip it. Plugin updates trigger a rebuild on the next run.

Usage

In your Android repo, on a branch with your feature changes:

/test-scribe

You can invoke it bare and the scribe will prompt you, or pass the inputs inline with the command. The scribe takes:

User story + acceptance criteria (required) — the spec that commissioned the feature.
Any other relevant context (optional, recommended) — design decisions, product intent, edge cases worth emphasising, links to prior discussions or PRs. The more context the scribe has, the sharper Gate 0 and the resulting plan.

The scribe then:

Asks where to work — current branch, a new branch, or an isolated git worktree.
Gate 0 — feature understanding. Summarises what it believes the feature does; you confirm or correct. Iterates until you approve.
Synthesises TEST_PLAN.md with traceability markers (AC-N → test name).
Gate 1 — plan review. Two stages: you review and edit TEST_PLAN.md; then an explicit, standalone go-signal before any code is generated. (Generic approvals like "looks good" don't unlock generation. If your ACs contain user-interaction signals, the scribe also asks at this gate whether to include the instrumented tier.)
Generates tests in parallel across (class × tier) units.
Runs the Gradle tasks from your project profile; triages failures (mechanical patches retry ≤ 2×).
Gate 2 — escalation (only on persistent substantive failures). The physician writes a diagnosis; you rule.
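
The traceability markers (AC-N → test name) can be pictured as a mapping from each acceptance criterion to the tests planned for it. The sketch below is an illustration only: the `PlannedTest` type, the `uncoveredCriteria` helper, and the sample AC ids and test names are hypothetical, and this README does not show the real TEST_PLAN.md format.

```kotlin
// Hypothetical model of a plan entry: one planned test traced back to one acceptance criterion.
data class PlannedTest(val acId: String, val testName: String)

// Returns the acceptance criteria that no planned test traces back to.
fun uncoveredCriteria(criteria: List<String>, plan: List<PlannedTest>): List<String> {
    val covered = plan.map { it.acId }.toSet()
    return criteria.filter { it !in covered }
}

fun main() {
    val criteria = listOf("AC-1", "AC-2", "AC-3")
    val plan = listOf(
        PlannedTest("AC-1", "login succeeds with valid credentials"),
        PlannedTest("AC-2", "login shows error on wrong password"),
    )
    println(uncoveredCriteria(criteria, plan)) // [AC-3]
}
```

A check like this is one way to spot an acceptance criterion with no test behind it while reviewing the plan at Gate 1.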

All artefacts (plan, run log, diagnosis, signature extracts) are written under .claude/test-pharaoh/runs/<timestamp>/ so you can inspect or resume manually.

The court

| Role | Name | Does |
| --- | --- | --- |
| Commander | You, the Pharaoh | Commission the work. Rule on the three gates. |
| Coordinator | `test-scribe` | Record your decree. Commission the build. Report. |
| Surveyor | `architect` | Reads `libs.versions.toml`, module configs, existing fakes, mock-library usage, error-wrapper conventions, instrumented-tier framework, and the project's test-naming style. Never touches class bodies. |
| Builder | `mason` | One `(class × tier)` per invocation. Hand-written fakes only. Sees signatures, never bodies. |
| Healer | `physician` | Triages Gradle failures. Mechanical → patch and retry. Substantive → escalate to the Pharaoh. |
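
The physician's rule (patch and retry mechanical failures at most twice, escalate substantive ones) can be sketched as a small loop. Everything named here is hypothetical: `FailureKind`, `Outcome`, and `triage` are illustration only, and the README does not describe how a failure is classified. The sketch also assumes that a mechanical failure surviving both retries is escalated, which the README does not state.

```kotlin
// Hypothetical classification of a Gradle failure; how the physician decides is not documented here.
enum class FailureKind { MECHANICAL, SUBSTANTIVE }

sealed interface Outcome {
    data class Passed(val retriesUsed: Int) : Outcome
    data class Escalated(val reason: String) : Outcome
}

const val MAX_MECHANICAL_RETRIES = 2

// runTests returns null on success, or the kind of failure observed.
// patch stands in for whatever mechanical fix is applied between attempts.
fun triage(runTests: () -> FailureKind?, patch: () -> Unit): Outcome {
    var retries = 0
    while (true) {
        val failure = runTests() ?: return Outcome.Passed(retries)
        if (failure == FailureKind.SUBSTANTIVE) {
            return Outcome.Escalated("substantive failure")
        }
        // Assumption: once the retry budget is spent, a mechanical failure is escalated too.
        if (retries == MAX_MECHANICAL_RETRIES) {
            return Outcome.Escalated("mechanical failure persisted after $retries retries")
        }
        patch()
        retries++
    }
}

fun main() {
    // Fails mechanically once, then passes.
    val results = listOf(FailureKind.MECHANICAL, null).iterator()
    val outcome = triage(runTests = { results.next() }, patch = {})
    println(outcome) // Passed(retriesUsed=1)
}
```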

Principles

Never mocks. Hand-written fakes only. If your existing tests use MockK or Mockito, Gate 1 proposes migrating them first.
Signatures only. Production code bodies are never read at any step of the workflow; only the bundled extractor exposes signature views. Tests are written against the contract, not the implementation — preventing tautological "whatever the code does must be right" tests.
Adhere to your conventions. Fixture placement, dispatcher patterns, error wrappers, assertion libraries, runner wiring — all match what the architect detects in your project.
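
To make the "hand-written fakes" principle concrete, here is a minimal sketch of a fake standing in for a collaborator. `UserRepository`, `FakeUserRepository`, and `GreetingPresenter` are hypothetical names, not part of the plugin. Plain `check` calls are used so the snippet stays self-contained; generated tests would presumably use the runner and assertion library detected in your project.

```kotlin
// Hypothetical contract: the kind of signature a test is written against.
interface UserRepository {
    fun findName(userId: String): String?
}

// Hypothetical class under test (body shown only so the sketch runs).
class GreetingPresenter(private val users: UserRepository) {
    fun greeting(userId: String): String =
        users.findName(userId)?.let { "Hello, $it!" } ?: "Hello, guest!"
}

// Hand-written fake: a small, working in-memory implementation of the contract.
class FakeUserRepository : UserRepository {
    private val names = mutableMapOf<String, String>()
    fun givenUser(userId: String, name: String) { names[userId] = name }
    override fun findName(userId: String): String? = names[userId]
}

fun main() {
    val users = FakeUserRepository().apply { givenUser("42", "Nefertiti") }
    val presenter = GreetingPresenter(users)
    check(presenter.greeting("42") == "Hello, Nefertiti!")
    check(presenter.greeting("unknown") == "Hello, guest!")
    println("ok")
}
```

Unlike a mock, the fake carries real behaviour and no interaction-verification logic, so the assertions stay on observable results rather than on which methods were called.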

Tiers supported

Unit — JUnit 5 + assertion library per your profile.
Integration — real class + real collaborators; fakes only at IO boundaries.
Roborazzi — one @Test per visual variant (light/dark/RTL/large-font/tablet).
Instrumented (end-to-end) — runs on device/emulator. The architect detects your project's setup (Cucumber, Espresso, Compose UI Test, Kaspresso, Barista, …) and writes tests in the matching style.
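
As an illustration of the integration tier's "real class + real collaborators; fakes only at IO boundaries" rule, the sketch below wires a real use case to a real formatter and replaces only the network-like boundary. All names (`PriceApi`, `PriceFormatter`, `PriceLabelUseCase`, `FakePriceApi`) are hypothetical, and plain `check` stands in for your project's assertion library.

```kotlin
// Hypothetical IO boundary (e.g. a network API): the only thing replaced by a fake.
interface PriceApi {
    fun fetchPriceCents(sku: String): Long
}

// Real collaborator: pure formatting logic, used as-is.
class PriceFormatter {
    fun format(cents: Long): String =
        "${cents / 100}.${(cents % 100).toString().padStart(2, '0')}"
}

// Real class under test, wired with the real formatter.
class PriceLabelUseCase(private val api: PriceApi, private val formatter: PriceFormatter) {
    fun labelFor(sku: String): String = formatter.format(api.fetchPriceCents(sku))
}

// Hand-written fake at the IO boundary.
class FakePriceApi(private val prices: Map<String, Long>) : PriceApi {
    override fun fetchPriceCents(sku: String): Long =
        prices[sku] ?: error("no price stubbed for $sku")
}

fun main() {
    val useCase = PriceLabelUseCase(FakePriceApi(mapOf("scarab" to 1905L)), PriceFormatter())
    check(useCase.labelFor("scarab") == "19.05")
    println("ok")
}
```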

Framework catalog

Deterministic test floors for Retrofit, Apollo, Room, DataStore, Hilt, Coroutines/Flow, RxJava2, WorkManager, Compose, Roborazzi, Navigation 3, and ViewModel/StateFlow. The scribe merges catalog-stock cases with your AC-derived ones.

If the architect detects a framework without a catalog entry, the plan is annotated UNKNOWN: no catalog — AC + contract only. Contributions welcome.

License

MIT
