Viktor asked to review Authentik and the web tier and make first-time signin to apps faster. Review found the slowness is screens and round trips, not server time. Changes: - values.yaml: the authentik.* Helm values (gunicorn workers, cache timeouts, conn_max_age) were silently INERT because existingSecret skips chart env rendering — pods ran defaults (2 workers, 300s caches, no persistent DB conns). Moved all tuning into server.env/worker.env, which actually reaches the pods. - authentik_provider.tf: adopt the identification stage and pin password_stage so username+password render on ONE screen (the separate order-20 password binding is deleted via API — authentik requires that when embedding). Outpost log_level trace->info and 1->2 replicas (it is on the hot path of every forward-auth request; PG-backed sessions make 2 replicas safe). - authentik module: /static ingress carve-out with immutable Cache-Control (assets are version-fingerprinted but served with no max-age — internal split-horizon users got zero caching). - traefik auth-proxy nginx: upstream keepalive 32 + HTTP/1.1 (was opening a fresh TCP connection to the outpost per subrequest) + config-checksum annotation so config changes roll the pods. - docs: authentication.md + authentik-state.md updated; fixed stale 'postgresql.dbaas has no endpoints' claim in CLAUDE.md/CONTEXT.md (it is a live CNPG primary-selector compatibility service). Done via API in the same change (UI-managed objects): 6 OIDC providers (Vault, Forgejo, Immich, Headscale, linkwarden, Cloudflare Access) switched from explicit to implicit consent — all first-party, the 4-weekly consent screen only slowed first-time signin. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
16 KiB
Authentication & Authorization
Overview
The homelab uses Authentik as a centralized identity provider (IdP) for all services, providing single sign-on (SSO) via OIDC and forward authentication for ingress protection. Authentik integrates with social login providers (Google, GitHub, Facebook), manages user groups and RBAC policies, and enforces authentication at the Traefik ingress layer. The system supports both human authentication (OIDC SSO) and service-to-service authentication (Kubernetes SA JWT for CI/CD).
Architecture Diagram
graph TB
User[User Browser]
Traefik[Traefik Ingress]
ForwardAuth[ForwardAuth Middleware]
Authentik[Authentik<br/>3 server + 3 worker<br/>+ embedded outpost]
Backend[Protected Backend Service]
Social[Social Providers<br/>Google/GitHub/Facebook]
K8s[Kubernetes API]
Vault[Vault]
User -->|1. HTTPS Request| Traefik
Traefik -->|2. Auth Check| ForwardAuth
ForwardAuth -->|3. Verify Session| Authentik
Authentik -->|4a. Not Authenticated| User
User -->|4b. Login Flow| Authentik
Authentik -->|5. Social Login| Social
Social -->|6. OAuth Callback| Authentik
Authentik -->|7. Session Cookie| User
User -->|8. Retry Request| Traefik
ForwardAuth -->|9. Authenticated| Backend
Traefik -->|10. Forward Request| Backend
K8s -->|OIDC Groups| Authentik
Vault -->|OIDC Auth| Authentik
Components
| Component | Version | Location | Purpose |
|---|---|---|---|
| Authentik Server | 2026.2.2 | stacks/authentik/ |
Core IdP application servers (3 replicas) |
| Authentik Worker | 2026.2.2 | stacks/authentik/ |
Background task processors (2 replicas) |
| PgBouncer | Latest | stacks/authentik/ |
PostgreSQL connection pooler (3 replicas) |
| Embedded Outpost | - | Standalone deployment, managed by Authentik | Forward auth endpoint for Traefik (2 replicas, PG-backed sessions) |
| Traefik ForwardAuth | - | modules/kubernetes/ingress_factory/ |
Middleware attached when auth = "required" or "public" |
| Vault OIDC Method | - | stacks/vault/ |
Human SSO authentication to Vault |
| Vault K8s Auth | - | stacks/vault/ |
Service account JWT authentication |
How It Works
Forward Authentication Flow
Services pick an auth tier via the auth enum on the ingress_factory module (default "required", fail-closed):
| Tier | Effect | When to use |
|---|---|---|
"required" |
Authentik forward-auth gates every request | Backend has no own user auth — Authentik is the only gate |
"app" |
No Authentik middleware; backend's own login is the gate | Backend handles its own user auth (NextAuth, Django, OAuth, bearer-token API) |
"public" |
Authentik anonymous binding via public outpost |
Audit trail without gating; only works for top-level browser navigation |
"none" |
No Authentik middleware at all | Anubis-fronted content, webhooks, OAuth callbacks, native-client APIs (CalDAV, WebDAV, Git) |
When auth = "required", an unauthenticated request flows:
- Request hits Traefik ingress
- ForwardAuth middleware calls the
auth-proxynginx (basicAuth fallback when Authentik is down), which proxies to the Authentik embedded outpost over a keepalive connection pool - Authentik checks for valid session cookie (domain-level
authentik_proxy_*cookie on.viktorbarzin.me, 4-week validity — one cookie covers all forward-auth apps) - If missing/invalid, redirects to Authentik login page (authentik.viktorbarzin.me)
- User authenticates on a single screen: username + password together (the identification stage embeds the password stage), or a social provider button (Google/GitHub/Facebook), then MFA validation
- Authentik creates session, sets cookie, redirects back to original URL
- Subsequent requests include session cookie, pass auth check, reach backend
Authentik adds authentication headers (user, email, groups) to forwarded requests. These headers are stripped before reaching the backend to prevent confusion.
First-time signin performance (2026-06-10)
Signin latency is dominated by screen count and round trips, not server time (DB avg 1.6ms). Standing decisions:
- Single-screen login: the identification stage carries
password_stage, so username+password is one round trip. The separate password-stage binding was removed fromdefault-authentication-flow(required by authentik when embedding). Pinned in TF:authentik_stage_identification.default_identification. - Implicit consent everywhere: all OIDC providers are first-party, so none use the explicit-consent flow (it re-prompted every 4 weeks per app).
- Live tuning via
server.env/worker.env(theauthentik.*Helm values are inert due toexistingSecret): 3 gunicorn workers, 30m flow-plan cache, 15m policy cache, 60s persistent DB connections. - Static assets cached immutable:
/staticingress carve-out addsCache-Control: public, max-age=31536000, immutable(assets are version-fingerprinted; authentik itself sends no max-age). - Outpost: 2 replicas,
log_level=info(was 1 replica attrace). - auth-proxy nginx: upstream
keepalive 32+ HTTP/1.1 — no per-request TCP setup on the forward-auth subrequest path.
Anti-exposure guard: every auth = "app" or auth = "none" line MUST have a preceding # auth = "<tier>": <reason> comment documenting what gates the backend (for "app") or why the endpoint is intentionally public (for "none"). The convention is enforced by scripts/check-ingress-auth-comments.py, which scripts/tg runs on every plan/apply/destroy/refresh and blocks the terragrunt invocation if violated. Stack-scoped — each stack documents itself.
Social Login & Invitation Flow
All new users must use an invitation link to register. The invitation-enrollment flow:
- invitation-validation - Validates invitation token
- enrollment-identification - Social login (Google/GitHub/Facebook) + passkey registration
- enrollment-prompt - Collect name/email
- enrollment-user-write - Create user account
- enrollment-login - Auto-login after creation
Group membership is auto-assigned from the invitation's fixed_data field. This prevents open registration while maintaining SSO convenience.
OIDC Applications
Authentik provides OIDC for 10 applications:
| Application | Type | Purpose |
|---|---|---|
| Cloudflare Access | OIDC | Cloudflare Zero Trust tunnels |
| Domain-wide catch-all | Proxy (Forward Auth) | Protect all *.viktorbarzin.me services |
| Forgejo | OIDC | Git repository SSO |
| Grafana | OIDC | Monitoring dashboard SSO |
| Headscale | OIDC | Tailscale control plane auth |
| Immich | OIDC | Photo management SSO |
| Kubernetes | OIDC (public client) | K8s API authentication (kubectl / kubelogin CLI) |
| Kubernetes Dashboard | OIDC (confidential) | Built for dashboard SSO — currently idle (apiserver OIDC blocked; dashboard uses forward-auth + token-paste) |
| Linkwarden | OIDC | Bookmark manager SSO |
| Wrongmove | OIDC | Real estate app SSO |
Kubernetes API authentication (OIDC) — CURRENTLY NON-FUNCTIONAL
⚠️ apiserver OIDC does not work in this cluster (as of 2026-06-04). The kube-apiserver rejects every valid Authentik OIDC token — with both the legacy
--oidc-*flags AND a structuredAuthenticationConfiguration, for both thekubernetesandk8s-dashboardissuers — despite verified signature, issuer, audience,email_verified=true, synced clock, and a reachable + publicly-trusted JWKS. Root cause is still open; seedocs/plans/2026-06-04-k8s-dashboard-sso-design.md§12. A kubeadm v1.34 upgrade had earlier silently wiped the apiserver--oidc-*flags, so OIDC CLI/dashboard login has effectively been off. Do not assumekubectlOIDC (kubelogin) works until this is resolved.
The intended model (binds by email, see stacks/rbac/modules/rbac/main.tf):
admin → cluster-admin; power-user → custom read-mostly ClusterRole;
namespace-owner → admin RoleBinding in their namespace(s) + cluster read-only.
Kubernetes Dashboard access (auto-injected SA token)
Because OIDC SSO is blocked, the web dashboard at k8s.viktorbarzin.me uses a
token-injector instead — users never see the dashboard's token prompt:
- Authentik forward-auth (
auth=required) gates access AND injectsX-authentik-username(the user's email). Theadmin-services-restrictionpolicy admitsHome Server Adminspluskubernetes-admins/kubernetes-power-users/kubernetes-namespace-ownersfor this host (stacks/authentik/admin-services-restriction.tf). - Token-injector (
stacks/k8s-dashboard/dashboard_injector.tf): an nginx that mapsX-authentik-username→ that user's ServiceAccount token and setsAuthorization: Bearerbefore proxying to kong-proxy, so the dashboard auto-authenticates. Namespace-owners →dashboard-<user>SA (admin on their namespace + read-only on the namespace list & nodes only (dashboard-nav-readonly, NOT cross-tenant resource reads);stacks/rbac/modules/rbac/dashboard-sa.tf), auto-derived fromk8s_users. Admins → the cluster-adminkubernetes-dashboardSA token (admin identities listed explicitly indashboard_injector.tf, since their Authentik login email ≠ theirk8s_usersemail). The injected token is the per-namespace security boundary; the map lives in a Secret (namespace-owners' cluster-read covers configmaps, not secrets).
Manual token (fallback / break-glass):
kubectl -n <ns> get secret dashboard-<user>-token -o jsonpath='{.data.token}' | base64 -d, orkubectl create token kubernetes-dashboard -n kubernetes-dashboardfor admin.
The oauth2-proxy + k8s-dashboard Authentik OIDC app (built for the
seamless-SSO design) remain deployed but idle/unwired pending the
apiserver-OIDC fix.
Authentik Groups
9 groups manage authorization:
- Allow Login Users - Base group, can authenticate to any OIDC app
- authentik Admins - Full Authentik admin UI access
- Headscale Users - Can access Headscale control plane
- Home Server Admins - Admin access to homelab services
- Wrongmove Users - Access to Wrongmove real estate app
- kubernetes-admins - K8s cluster-admin role
- kubernetes-power-users - K8s read-mostly access
- kubernetes-namespace-owners - K8s namespace-scoped admin
- Task Submitters - Can submit tasks to cluster task runner
Vault Authentication
For humans:
- OIDC method using Authentik as provider
- SSO login to Vault UI and CLI
- Group-based policy assignment
For services (CI/CD):
- Kubernetes SA JWT authentication
- Woodpecker CI uses service account token
- Vault K8s secrets engine roles:
dashboard-admin- K8s dashboard admin tokenci-deployer- Deploy workloads via CI/CDopenclaw- AI assistant cluster accesslocal-admin- Local development access
Configuration
Key Config Files
| Path | Purpose |
|---|---|
stacks/authentik/ |
Authentik deployment (servers, workers, PgBouncer) |
modules/kubernetes/ingress_factory/ |
Auth-tier enum + per-ingress middleware composition |
stacks/traefik/modules/traefik/middleware.tf |
ForwardAuth middleware definitions (required + public outposts) |
scripts/check-ingress-auth-comments.py |
Comment-convention guard wired into scripts/tg |
stacks/vault/auth.tf |
Vault OIDC and K8s auth methods |
Vault Paths
- OIDC config:
auth/oidc- Authentik integration settings - K8s auth:
auth/kubernetes- SA JWT validation - K8s secrets engine:
kubernetes/- Dynamic kubeconfig/SA token generation
Terraform Stacks
stacks/authentik/- Authentik infrastructurestacks/platform/- Traefik ingress with ForwardAuthstacks/vault/- Vault auth methods
Ingress Protection Examples
Authentik-gated admin UI (default):
module "myapp_ingress" {
source = "../../modules/kubernetes/ingress_factory"
name = "myapp"
namespace = "myapp"
tls_secret_name = var.tls_secret_name
# auth = "required" is the default — Authentik forward-auth is the gate.
}
Backend with its own user auth (no Authentik in the way):
module "myapp_ingress" {
source = "../../modules/kubernetes/ingress_factory"
name = "myapp"
namespace = "myapp"
tls_secret_name = var.tls_secret_name
# auth = "app": myapp uses NextAuth + Google OAuth; mobile clients can't follow Authentik 302.
auth = "app"
}
Intentionally public webhook receiver:
module "myapp_ingress" {
source = "../../modules/kubernetes/ingress_factory"
name = "webhook"
namespace = "webhooks"
tls_secret_name = var.tls_secret_name
# auth = "none": upstream signs payloads with HMAC; no user identity expected.
auth = "none"
}
Decisions & Rationale
Why Authentik over Keycloak?
- Lighter weight: Lower resource footprint (3+3+3 replicas vs Keycloak's heavier Java runtime)
- Better UX: Modern UI, simpler admin experience, better mobile support
- Python-based: Easier to extend, faster startup times, better developer experience
- Active development: More frequent releases, responsive community
Why Forward Auth over Sidecar?
- Simpler architecture: Single auth check at ingress, no sidecar per pod
- Works with any backend: Language/framework agnostic, no SDK required
- Centralized policy: All auth logic in Authentik, not distributed across sidecars
- Performance: Single auth check per session, not per request
Why OIDC for Kubernetes?
- SSO integration: Same login as all other services, no separate credentials
- No credential management: No kubeconfig secrets to rotate, tokens are short-lived
- Group-based RBAC: Centralized group management in Authentik, automatic K8s role mapping
- Public client flow: No client secret needed, works in kubectl plugins and dashboards
Why Invitation-Only Enrollment?
- Security: Prevents open internet access to homelab services
- Controlled onboarding: Explicit approval before granting access
- Social login convenience: No password management, leverages trusted providers
- Group auto-assignment: Invitation encodes initial group membership
Troubleshooting
Headers Not Stripped
Problem: Backend receives X-Authentik-Username, X-Authentik-Email, X-Authentik-Groups headers and breaks.
Fix: Traefik middleware should strip these headers before forwarding. Check ingress_factory module for header stripping config.
OIDC Token Expired
Problem: kubectl returns 401 Unauthorized.
Fix: Re-authenticate to refresh token:
kubectl oidc-login setup --oidc-issuer-url=https://authentik.viktorbarzin.me/application/o/kubernetes/
Social Login Redirect Loop
Problem: After social login, redirects to Authentik login page instead of destination.
Fix: Check Authentik application's redirect URIs. Must include https://authentik.viktorbarzin.me/source/oauth/callback/* for social providers.
User Not in Correct Group
Problem: User authenticated but lacks permissions.
Fix: Check group membership in Authentik admin UI. Verify invitation fixed_data specified correct group. Manually add to group if needed.
Vault OIDC Login Fails
Problem: Vault UI redirects to Authentik but returns error.
Fix:
- Verify Vault OIDC client credentials in Authentik
- Check Vault OIDC issuer URL matches Authentik
- Ensure Vault redirect URI (
https://vault.viktorbarzin.me/ui/vault/auth/oidc/oidc/callback) is registered in Authentik
K8s Auth Group Mapping Not Working
Problem: User authenticated but kubectl shows limited permissions despite being in kubernetes-admins.
Fix:
- Verify group claim is present in token:
kubectl oidc-login get-token | jq -R 'split(".") | .[1] | @base64d | fromjson' - Check ClusterRoleBinding maps group correctly:
kubectl get clusterrolebinding -o yaml | grep kubernetes-admins - Ensure Authentik OIDC app includes
groupsscope
Related
- Security & L7 Protection - CrowdSec, anti-AI scraping, rate limiting
- Networking - Ingress, DNS, load balancing
- Vault Runbook - Vault operations and troubleshooting
- Kubernetes Access Runbook - Setting up kubectl with OIDC