[ci skip] add CI build caching implementation plan

This commit is contained in:
Viktor Barzin 2026-02-28 17:46:44 +00:00
parent 5ef03cc0e0
commit 2adfa86401

View file

@ -0,0 +1,436 @@
# CI Build Caching Implementation Plan
> **For Claude:** REQUIRED SUB-SKILL: Use superpowers:executing-plans to implement this plan task-by-task.
**Goal:** Speed up Woodpecker CI Docker image builds by adding BuildKit layer caching via a local private registry, with dual-push to Docker Hub and local.
**Architecture:** Extend the existing Docker Compose registry stack on `10.0.20.10` with a new R/W `registry-private` service (port 5050). Configure Woodpecker `plugin-docker-buildx` pipelines with `cache_from`/`cache_to` pointing to `registry.viktorbarzin.lan:5050`. Push final images to both Docker Hub and local registry.
**Tech Stack:** Docker Registry v2, nginx, Docker Compose, Woodpecker CI, BuildKit, Technitium DNS
**Design doc:** `docs/plans/2026-02-28-ci-build-caching-design.md`
---
### Task 1: Create private registry config file
**Files:**
- Create: `modules/docker-registry/config-private.yml`
**Step 1: Create the config file**
This is a standard `registry:2` config WITHOUT the `proxy` section (which is what makes it R/W instead of read-only pull-through). Based on the existing `config.yaml` but with 100GiB storage and no proxy/auth.
```yaml
version: 0.1
log:
fields:
service: registry-private
storage:
cache:
blobdescriptor: inmemory
filesystem:
rootdirectory: /var/lib/registry
maxsize: 100GiB
delete:
enabled: true
maintenance:
uploadpurging:
enabled: true
age: 168h
interval: 4h
dryrun: false
http:
addr: :5000
headers:
X-Content-Type-Options: [nosniff]
health:
storagedriver:
enabled: true
interval: 10s
threshold: 3
```
Key differences from the proxy configs:
- No `proxy` section → allows pushes
- `maxsize: 100GiB` (user requested 100GB)
- `uploadpurging.age: 168h` (7 days, since build cache layers are re-pushed frequently)
**Step 2: Commit**
```bash
git add modules/docker-registry/config-private.yml
git commit -m "[ci skip] add private R/W registry config for CI build caching"
```
---
### Task 2: Add registry-private service to Docker Compose
**Files:**
- Modify: `modules/docker-registry/docker-compose.yml`
**Step 1: Add the registry-private service**
Add this service block after `registry-kyverno` (before `nginx`):
```yaml
registry-private:
image: registry:2
container_name: registry-private
restart: always
volumes:
- /opt/registry/data/private:/var/lib/registry
- /opt/registry/config-private.yml:/etc/docker/registry/config.yml:ro
networks:
- registry
healthcheck:
test: ["CMD", "sh", "-c", "wget -qO- http://localhost:5000/v2/ >/dev/null 2>&1"]
interval: 30s
timeout: 10s
retries: 3
start_period: 10s
```
**Step 2: Add port 5050 to the nginx service**
In the `nginx` service `ports` list, add:
```yaml
- "5050:5050"
```
**Step 3: Add registry-private to nginx depends_on**
```yaml
registry-private:
condition: service_healthy
```
**Step 4: Commit**
```bash
git add modules/docker-registry/docker-compose.yml
git commit -m "[ci skip] add registry-private service to Docker Compose stack"
```
---
### Task 3: Add nginx upstream and server block for private registry
**Files:**
- Modify: `modules/docker-registry/nginx_registry.conf`
**Step 1: Add upstream block**
After the existing `upstream kyverno` block, add:
```nginx
upstream private {
server registry-private:5000;
keepalive 32;
}
```
**Step 2: Add server block**
After the last server block (kyverno on port 5040), add:
```nginx
# --- Private R/W Registry (port 5050) ---
server {
listen 5050;
server_name _;
client_max_body_size 0;
proxy_request_buffering off;
proxy_buffering off;
chunked_transfer_encoding on;
location /v2/ {
proxy_pass http://private;
proxy_http_version 1.1;
proxy_set_header Host $host;
proxy_set_header Connection "";
proxy_set_header X-Real-IP $remote_addr;
proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
proxy_read_timeout 900;
proxy_send_timeout 900;
}
location / {
return 200 'ok';
add_header Content-Type text/plain;
}
}
```
Key differences from the read-only proxy server blocks:
- **No `proxy_cache`** directives — caching uploads would corrupt pushes
- **`proxy_buffering off`** — important for large layer uploads
- **`chunked_transfer_encoding on`** — Docker push uses chunked uploads
- **`X-Real-IP` / `X-Forwarded-For`** headers — useful for debugging
**Step 3: Commit**
```bash
git add modules/docker-registry/nginx_registry.conf
git commit -m "[ci skip] add nginx upstream and server block for private registry on port 5050"
```
---
### Task 4: Deploy the private registry on 10.0.20.10
**Step 1: SSH to the registry VM and create the storage directory**
```bash
ssh root@10.0.20.10 "mkdir -p /opt/registry/data/private"
```
**Step 2: Copy updated files to the VM**
```bash
scp modules/docker-registry/docker-compose.yml root@10.0.20.10:/opt/registry/docker-compose.yml
scp modules/docker-registry/config-private.yml root@10.0.20.10:/opt/registry/config-private.yml
scp modules/docker-registry/nginx_registry.conf root@10.0.20.10:/opt/registry/nginx.conf
```
Note: The nginx config is stored as `/opt/registry/nginx.conf` on the VM (the docker-compose mounts it as `nginx.conf`).
**Step 3: Restart the Docker Compose stack**
```bash
ssh root@10.0.20.10 "cd /opt/registry && docker compose up -d"
```
This will create the new `registry-private` container and reload nginx with the new port.
**Step 4: Verify the private registry is accessible**
```bash
# From the local machine or any node on the 10.0.20.0/24 network:
curl -s http://10.0.20.10:5050/v2/ | head
# Expected: {} (empty JSON object = registry is up)
curl -s http://10.0.20.10:5050/v2/_catalog
# Expected: {"repositories":[]} (empty, no images pushed yet)
```
---
### Task 5: Add DNS record for registry.viktorbarzin.lan
**Step 1: Add A record via Technitium API**
```bash
# Technitium DNS API endpoint (web UI is on port 5380)
# First, get API token from tfvars (technitium_password)
curl -s "http://10.0.20.204:5380/api/zones/records/add?token=<TECHNITIUM_TOKEN>&domain=registry.viktorbarzin.lan&zone=viktorbarzin.lan&type=A&ipAddress=10.0.20.10&overwrite=true"
```
Alternatively, add via Technitium web UI at `https://technitium.viktorbarzin.me`:
- Zone: `viktorbarzin.lan`
- Record: `registry` → A → `10.0.20.10`
**Step 2: Verify DNS resolution from a K8s pod**
```bash
kubectl run -it --rm dns-test --image=alpine --restart=Never -- nslookup registry.viktorbarzin.lan
# Expected: Address: 10.0.20.10
```
**Step 3: Verify registry is accessible via DNS name**
```bash
curl -s http://registry.viktorbarzin.lan:5050/v2/
# Expected: {}
```
---
### Task 6: Update build-cli.yml pipeline with BuildKit caching
**Files:**
- Modify: `.woodpecker/build-cli.yml`
**Step 1: Update the pipeline**
Replace the entire file content with:
```yaml
when:
event: push
clone:
git:
image: woodpeckerci/plugin-git
settings:
attempts: 5
backoff: 10s
steps:
- name: build-image
image: woodpeckerci/plugin-docker-buildx
settings:
username: "viktorbarzin"
password:
from_secret: dockerhub-pat
repo:
- viktorbarzin/infra
- registry.viktorbarzin.lan:5050/infra
logins:
- registry: https://index.docker.io/v1/
username: viktorbarzin
password:
from_secret: dockerhub-pat
dockerfile: cli/Dockerfile
context: cli
auto_tag: true
cache_from: type=registry,ref=registry.viktorbarzin.lan:5050/infra:buildcache
cache_to: type=registry,ref=registry.viktorbarzin.lan:5050/infra:buildcache,mode=max
buildkit_config: |
[registry."registry.viktorbarzin.lan:5050"]
http = true
insecure = true
```
Key changes:
- `repo` is now a list — pushes to both Docker Hub and local registry
- `logins` provides Docker Hub credentials explicitly (needed when `repo` is a list)
- `cache_from`/`cache_to` use registry-based BuildKit cache on the local registry
- `buildkit_config` allows HTTP access to the insecure local registry
- `mode=max` caches ALL layers (including intermediate build stages)
**Step 2: Commit**
```bash
git add .woodpecker/build-cli.yml
git commit -m "[ci skip] add BuildKit layer caching and dual-push to build-cli pipeline"
```
---
### Task 7: Update f1-stream.yml pipeline with BuildKit caching
**Files:**
- Modify: `.woodpecker/f1-stream.yml`
**Step 1: Update the pipeline**
Replace the entire file content with:
```yaml
when:
event: push
path: "stacks/f1-stream/files/**"
clone:
git:
image: woodpeckerci/plugin-git
settings:
attempts: 5
backoff: 10s
steps:
- name: build-image
image: woodpeckerci/plugin-docker-buildx
settings:
username: "viktorbarzin"
password:
from_secret: dockerhub-pat
repo:
- viktorbarzin/f1-stream
- registry.viktorbarzin.lan:5050/f1-stream
logins:
- registry: https://index.docker.io/v1/
username: viktorbarzin
password:
from_secret: dockerhub-pat
dockerfile: stacks/f1-stream/files/Dockerfile
context: stacks/f1-stream/files
platforms: linux/amd64
provenance: false
tags: latest
cache_from: type=registry,ref=registry.viktorbarzin.lan:5050/f1-stream:buildcache
cache_to: type=registry,ref=registry.viktorbarzin.lan:5050/f1-stream:buildcache,mode=max
buildkit_config: |
[registry."registry.viktorbarzin.lan:5050"]
http = true
insecure = true
- name: deploy
image: bitnami/kubectl
commands:
- kubectl -n f1-stream rollout restart deployment f1-stream
- kubectl -n f1-stream rollout status deployment f1-stream --timeout=120s
```
Same pattern as build-cli: dual-push + BuildKit cache. The `deploy` step is unchanged.
**Step 2: Commit**
```bash
git add .woodpecker/f1-stream.yml
git commit -m "[ci skip] add BuildKit layer caching and dual-push to f1-stream pipeline"
```
---
### Task 8: Test end-to-end with a manual build trigger
**Step 1: Push changes to trigger the build-cli pipeline**
```bash
git push origin master
```
The `build-cli.yml` pipeline triggers on every push. Monitor it at `https://ci.viktorbarzin.me`.
**Step 2: Verify cache was populated**
After the first build completes, check the local registry has the cache:
```bash
curl -s http://registry.viktorbarzin.lan:5050/v2/_catalog
# Expected: {"repositories":["infra"]}
curl -s http://registry.viktorbarzin.lan:5050/v2/infra/tags/list
# Expected: tags include "buildcache" and the auto-tagged version
```
**Step 3: Trigger a second build to verify cache hit**
Make a trivial change (e.g., update a comment in `cli/`) and push again. The build logs should show "importing cache manifest from registry.viktorbarzin.lan:5050/infra:buildcache" and skip unchanged layers.
**Step 4: Verify Docker Hub also has the image**
```bash
# Check Docker Hub has the image too
curl -s https://hub.docker.com/v2/repositories/viktorbarzin/infra/tags/ | python3 -m json.tool | head -20
```
---
### Task 9: Extend cleanup script for private registry
**Files:**
- Modify: `modules/docker-registry/cleanup-tags.sh`
**Step 1: Verify the script already handles multiple registries**
The existing script walks ALL subdirectories under `BASE` (`/opt/registry/data`). Since the private registry stores data at `/opt/registry/data/private/docker/registry/v2/repositories/`, it will automatically be picked up by the existing script without changes.
Verify by reading the script logic — `os.listdir(BASE)` iterates `dockerhub`, `ghcr`, `quay`, `k8s`, `kyverno`, and now `private`.
**Step 2: Consider whether to adjust the keep count**
The default `KEEP=10` may be too aggressive for the private registry since buildcache tags are few (usually just one `buildcache` tag per repo). The script only deletes when there are MORE than `KEEP` tags, so with typically 2-3 tags per repo (e.g., `latest`, `buildcache`, maybe a version tag), no cleanup will happen. This is fine.
No code changes needed — the script already works with the new registry.
**Step 3: Commit (no-op — just verify)**
No changes needed. The cleanup script automatically covers the private registry.