Migrated from monorepo during Forgejo registry consolidation 2026-05-07
Find a file
Viktor Barzin a29bffdda3
Some checks failed
Build and Push / lint-and-test (push) Has been cancelled
Build and Push / build (push) Has been cancelled
Build and Push / deploy (push) Has been cancelled
Build and Push / notify-failure (push) Has been cancelled
chat-completions: stream conversational turns (SSE token relay) for realtime voice
Adds stream=true support to POST /v1/chat/completions (it previously 400'd).
When streaming, it runs the no-tools `conversational` agent via
`claude -p --output-format stream-json --include-partial-messages --verbose`
and relays each content_block_delta as an OpenAI chat.completion.chunk SSE
event, ending with finish_reason=stop + [DONE]. Free CLI/subscription auth, no
tools, no API key.

Stateless by design: the full message history is flattened into the prompt
(prior assistant turns kept), so an OpenAI-style client that re-sends history
each turn — e.g. Pipecat's OpenAILLMService — can stream from us directly. The
non-streaming path (recruiter-triage workspace agent) is unchanged.

This is phase 1 of the Pipecat realtime full-duplex voice-agent rebuild for
portal-assistant (continuous audio, VAD endpointing, barge-in, ~seconds to
first words). New pure helpers (stream_argv/delta_text/openai_chunk/
synthesise_chat_prompt) are unit-tested; the SSE endpoint has a mocked-subprocess
integration test. 429 passing.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-17 22:22:38 +00:00
.github/workflows ci: move image build off-infra to GHA -> ghcr (ADR-0002) 2026-06-13 01:45:36 +00:00
.woodpecker ci: move image build off-infra to GHA -> ghcr (ADR-0002) 2026-06-13 01:45:36 +00:00
agents conversational: add no-tools multi-turn Brain endpoint for portal-assistant 2026-06-17 18:38:44 +00:00
app chat-completions: stream conversational turns (SSE token relay) for realtime voice 2026-06-17 22:22:38 +00:00
beads Initial extraction from monorepo 2026-05-07 17:07:12 +00:00
docs docs: capture AFK implementation pipeline design + ADRs 0002-0004 2026-06-14 19:09:12 +00:00
frontend breakglass UI v2: attachable sessions (tmux model) + mobile-first redesign 2026-06-14 19:19:03 +00:00
tests chat-completions: stream conversational turns (SSE token relay) for realtime voice 2026-06-17 22:22:38 +00:00
.dockerignore Initial extraction from monorepo 2026-05-07 17:07:12 +00:00
.gitignore breakglass: in-cluster emergency-recovery UI for the devvm 2026-06-12 21:36:05 +00:00
CONTEXT.md docs: capture breakglass design (CONTEXT glossary + ADR 0001) 2026-06-12 20:59:13 +00:00
docker-entrypoint-breakglass.sh breakglass: in-cluster emergency-recovery UI for the devvm 2026-06-12 21:36:05 +00:00
Dockerfile breakglass: in-cluster emergency-recovery UI for the devvm 2026-06-12 21:36:05 +00:00
LICENSE.txt Initial extraction from monorepo 2026-05-07 17:07:12 +00:00
requirements-dev.txt Initial extraction from monorepo 2026-05-07 17:07:12 +00:00
requirements.txt Initial extraction from monorepo 2026-05-07 17:07:12 +00:00