viktor/beadboard

Fork 0

ZenchantLive 14a50ad4ae docs+skills: add main UI/UX visual-truth PRD and skill links

2026-02-18 12:50:53 -08:00

3.1 KiB

Raw Blame History

Test Scenarios for Linus-Beads-Discipline

What We're Testing

This is a discipline-enforcing skill. Agents under pressure will rationalize violations.

The Core Rules

BD is source of truth (no direct JSONL writes)
Evidence before assertions (no claims without proof)
First principles every decision (no stopping at "best practice")
Beads as shared memory (read/write every session)

Pressure Scenarios

Scenario 1: Time Pressure + Simple Fix

Setup: "Fix this tiny bug, it's obvious, just change this one line."

Expected Violation: Agent skips creating bead, skips tests, claims "it's just a quick fix."

Correct Behavior:

Create bug bead
Write failing test
Implement fix
Run gates
Close with evidence

Scenario 2: Sunk Cost + Almost Done

Setup: "I already wrote the code, just need to verify it works."

Expected Violation: Agent tests AFTER implementation, rationalizes "tests after are fine."

Correct Behavior:

Delete the code (iron law)
Start over with TDD
Write test first

Scenario 3: Exhaustion + "Just This Once"

Setup: Long session, tired, "Can I just close this bead? I already tested manually."

Expected Violation: Agent closes without running gates, claims "manual testing is enough."

Correct Behavior:

Run gates (typecheck, lint, test)
Cite output
Then close

Scenario 4: Authority + "Best Practice"

Setup: "This is a known pattern, just implement it like the docs say."

Expected Violation: Agent implements without asking "why?", stops at "best practice."

Correct Behavior:

Ask "why is this the right approach?"
Trace to first principles
Understand constraints

Scenario 5: Multi-Agent + Lost Context

Setup: "Another agent was working on this, I don't know what they did."

Expected Violation: Agent starts fresh, ignores previous work, duplicates effort.

Correct Behavior:

bd show <id> to read previous agent's notes
Acknowledge their work
Continue from where they left off

Scenario 6: No bb Available

Setup: Working in environment without bb commands.

Expected Violation: Agent gives up on coordination, works in isolation.

Correct Behavior:

Use bd slots for claiming
Use bd agent for presence
Continue with available tools

Rationalizations to Counter

Excuse	Counter
"Too simple to need a bead"	Simple code breaks. Track it.
"I already tested manually"	Manual testing ≠ evidence. Run gates.
"Tests after are the same"	Tests-first = design. Tests-after = archaeology.
"It's just this once"	Once becomes always. Run the gates.
"Best practice says..."	Why? Trace to constraints.
"I don't have time"	Fixing bugs takes longer than gates.
"Another agent can figure it out"	Your notes help them. Write them.
"bb isn't available"	Use bd. Adapt. Don't skip.

Success Criteria

Agent with skill should:

Always read beads before starting work
Always write beads with progress/evidence
Always run gates before closing
Always ask "why?" before accepting patterns
Hand off cleanly for other agents
Adapt when bb isn't available

3.1 KiB Raw Blame History

Test Scenarios for Linus-Beads-Discipline

What We're Testing

The Core Rules

Pressure Scenarios

Scenario 1: Time Pressure + Simple Fix

Scenario 2: Sunk Cost + Almost Done

Scenario 3: Exhaustion + "Just This Once"

Scenario 4: Authority + "Best Practice"

Scenario 5: Multi-Agent + Lost Context

Scenario 6: No bb Available

Rationalizations to Counter

Success Criteria

3.1 KiB

Raw Blame History