Adds app/afk/ — the "away-from-keyboard" control plane that watches the
issue tracker for ready-for-agent issues, dispatches each to a fresh
full-access T3 thread (with the issue-implementer preamble prepended,
because T3 does not honour ~/.claude/CLAUDE.md), and drives the resulting
run through its lifecycle: tests-red -> green -> pushed -> CI -> deployed,
escalating or fix-forwarding via a small pure state machine.
The loop is split into pure cores (no I/O, exhaustively unit-tested) and
thin injected adapters (the only edges that ever touch T3, the tracker,
CI, or Slack — faked in every test, so nothing here talks to a real
server, GitHub/Forgejo, or the cluster):
pure: types, dispatch_policy, run_state_machine, phase_checklist,
config, issue_implementer_prompt
adapters: t3_client (two-POST dispatch + snapshot), tracker, ci_watcher,
notifier
loops: poller — CronJob tick #1: list_ready -> select_dispatchable
-> dispatch + stamp the in-progress lock (label only
AFTER a successful dispatch, so a failed dispatch
never leaves a phantom lock). Per-repo lock derived
from the ready set, since the CronJob is stateless
between ticks.
watcher — CronJob tick #2: assemble RunState from snapshot +
CI -> next_action -> act (close on success; relabel
ready-for-human + ring the doorbell on the two
escalations; dispatch a corrective turn on
fix-forward; refresh the progress checklist).
SHIPS DISABLED, on purpose: Config defaults to kill_switch=True AND an
empty allowlist, so a freshly-loaded config dispatches nothing and does
zero I/O. The package is not imported by the running service and has no
auto-enable path. Arming it is a deliberate, later, manual step requiring
BOTH gates (clear the kill switch AND enrol the exact repos) so one
fat-fingered env var can't arm every repo.
Test-first throughout: 412 tests pass (poller + watcher add integration
tests wiring the real pure cores to in-memory fakes). mypy clean.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
84 lines
4 KiB
Python
84 lines
4 KiB
Python
"""Run state machine: assembled ``RunState`` -> next ``Action`` (ADR-0002).
|
|
|
|
This is the heart of the AFK loop's per-issue control: each tick the loop
|
|
assembles a :class:`~app.afk.types.RunState` (thread liveness from the
|
|
orchestration snapshot, CI verdict from the watcher, plus its own ``pushed`` /
|
|
``fix_forward_attempts`` / ``elapsed_seconds`` bookkeeping) and calls
|
|
:func:`next_action` to decide what to do next.
|
|
|
|
The function is **pure** — it reads only its two arguments, never the clock, the
|
|
network, or any global. That keeps the lifecycle policy a plain decision table
|
|
the test suite can exhaust combinatorially; the loop owns all the I/O (closing
|
|
issues, dispatching corrective turns, escalating) based on the Action returned.
|
|
|
|
The decision table (first match wins):
|
|
|
|
* pushed AND CI green -> CLOSE_SUCCESS
|
|
The run is healthy and verified; close the issue. The thread's own status
|
|
is irrelevant once a pushed commit is green.
|
|
* pushed AND CI red, budget remaining -> FIX_FORWARD
|
|
A pushed commit broke CI. Dispatch another corrective turn — but only
|
|
while BOTH budgets hold: ``fix_forward_attempts < fix_forward_max_attempts``
|
|
AND ``elapsed_seconds < fix_forward_max_seconds`` (strict; at/over either
|
|
bound is exhausted).
|
|
* pushed AND CI red, budget exhausted -> FREEZE_ESCALATE
|
|
Out of fix-forward attempts or wall-clock; stop churning and hand to a
|
|
human with the broken commit left in place.
|
|
* not pushed AND thread ERROR/IDLE -> ESCALATE_PREPUSH
|
|
The agent will never reach green: it errored, or its turn finished /
|
|
stalled with nothing pushed. There is no pushed commit to fix forward, so
|
|
escalate before-push (a different remediation path than FREEZE_ESCALATE).
|
|
* everything else -> WAIT
|
|
Still in flight: working toward a first push (thread running / unknown), or
|
|
pushed with CI not yet decided. Poll again next tick.
|
|
"""
|
|
from .types import Action, CIStatus, Config, RunState, ThreadStatus
|
|
|
|
# Thread states that mean the agent is finished with this turn — it will not push
|
|
# any further on its own. Reaching one of these with nothing pushed is terminal
|
|
# (escalate), whereas RUNNING / None (no snapshot entry yet) means keep waiting.
|
|
_TERMINAL_THREAD_STATES: frozenset[ThreadStatus] = frozenset(
|
|
{ThreadStatus.ERROR, ThreadStatus.IDLE}
|
|
)
|
|
|
|
|
|
def next_action(state: RunState, config: Config) -> Action:
|
|
"""Decide the next :class:`Action` for one issue's run.
|
|
|
|
Pure and total: every reachable ``(thread_status, ci_status, pushed,
|
|
attempts, elapsed)`` combination maps to exactly one Action via the table in
|
|
the module docstring. See that table for the rationale of each branch.
|
|
"""
|
|
if state.pushed:
|
|
# A commit is out; the CI verdict on it drives everything from here.
|
|
if state.ci_status is CIStatus.GREEN:
|
|
return Action.CLOSE_SUCCESS
|
|
if state.ci_status is CIStatus.RED:
|
|
return (
|
|
Action.FIX_FORWARD
|
|
if _fix_forward_budget_remaining(state, config)
|
|
else Action.FREEZE_ESCALATE
|
|
)
|
|
# CI pending / not yet reported -> wait for the verdict.
|
|
return Action.WAIT
|
|
|
|
# Nothing pushed yet. If the turn is over (errored or gone idle) the run can
|
|
# never reach green on its own -> escalate before-push; otherwise it is still
|
|
# working toward a first push -> wait.
|
|
if state.thread_status in _TERMINAL_THREAD_STATES:
|
|
return Action.ESCALATE_PREPUSH
|
|
return Action.WAIT
|
|
|
|
|
|
def _fix_forward_budget_remaining(state: RunState, config: Config) -> bool:
|
|
"""True while another fix-forward turn is allowed.
|
|
|
|
Both bounds must hold (strict ``<``): the run has spent fewer than
|
|
``fix_forward_max_attempts`` corrective turns AND fewer than
|
|
``fix_forward_max_seconds`` of wall-clock. Hitting either cap exhausts the
|
|
budget.
|
|
"""
|
|
return (
|
|
state.fix_forward_attempts < config.fix_forward_max_attempts
|
|
and state.elapsed_seconds < config.fix_forward_max_seconds
|
|
)
|