# infra/modules/kubernetes/ollama/values.yaml

ollama:
  gpu:
    # -- Enable GPU integration
    enabled: true

    # -- GPU type: 'nvidia' or 'amd'
    type: "nvidia"

    # -- Number of GPUs to allocate to the Ollama container
    number: 1

  # -- List of models to pull at container startup
  models:
    pull:
      - llama3

persistentVolume:
  enabled: true
  existingClaim: "ollama-pvc"
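  # NOTE: existingClaim expects a PersistentVolumeClaim named "ollama-pvc"
  # to already exist in the release namespace.
  # A minimal sketch of such a claim follows; the access mode and storage
  # size are assumptions for illustration, not values from this repository:
  #
  #   apiVersion: v1
  #   kind: PersistentVolumeClaim
  #   metadata:
  #     name: ollama-pvc
  #   spec:
  #     accessModes:
  #       - ReadWriteOnce        # assumed: single-node read/write
  #     resources:
  #       requests:
  #         storage: 50Gi        # assumed: sized to hold pulled models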

nodeSelector:
  gpu: "true"

tolerations:
  - key: "nvidia.com/gpu"
    operator: "Equal"
    value: "true"
    effect: "NoSchedule"
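
# NOTE: the nodeSelector and toleration above only take effect if the GPU
# nodes carry a matching label and taint. A sketch of the relevant Node
# fields, assuming the cluster labels and taints GPU nodes this way
# (the node name is hypothetical):
#
#   apiVersion: v1
#   kind: Node
#   metadata:
#     name: gpu-node-1           # hypothetical node name
#     labels:
#       gpu: "true"              # matched by nodeSelector
#   spec:
#     taints:
#       - key: nvidia.com/gpu    # matched by the toleration's key
#         value: "true"          # operator "Equal" requires this exact value
#         effect: NoSchedule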