Supply Chain Security

CAIPE maintains a layered supply chain security posture that spans dependency management, static analysis, container hardening, and continuous vulnerability scanning. Every control described here is enforced automatically in CI — no manual steps are required to stay compliant.

Dependency Pinning

Python

Every Python dependency across all subpackages (pyproject.toml files) must be pinned to an exact version using ==. Range specifiers (>=, ~=, ^) are not allowed for direct dependencies.

# ✅ Correct — exact pin
"fastmcp==3.2.3"
"pydantic==2.12.5"

# ❌ Not allowed — range specifier
"fastmcp>=3.2.0"

Why exact pins? Range specifiers allow the resolver to silently upgrade a package the next time a lock file is regenerated. An exact pin means the version you test is the version that ships.

Enforcement: The check-pinned-deps CI gate runs scripts/check_pinned_deps.py on every PR and main push. The job fails if any pyproject.toml or package.json contains an unpinned dependency.

Node.js / JavaScript

The same exact-pin requirement applies to package.json files in the UI and bot packages. The check-pinned-deps workflow enforces this for Node.js alongside Python.

Reproducible Builds with `uv.lock`

Every Python subpackage with its own pyproject.toml ships a committed uv.lock file. uv.lock captures the fully-resolved dependency graph — including all transitive dependencies — at a specific commit.

What this means:

uv sync --locked reproduces the exact environment used in CI and container builds, regardless of when it runs
No transitive dependency drift between developer machines, CI, and production images
The lock file is the source of truth; pyproject.toml defines the constraints

Lock file locations:

Component	Lock file
Supervisor agent	`uv.lock` (repo root)
RAG server	`ai_platform_engineering/knowledge_bases/rag/server/uv.lock`
RAG ingestors	`ai_platform_engineering/knowledge_bases/rag/ingestors/uv.lock`
RAG common	`ai_platform_engineering/knowledge_bases/rag/common/uv.lock`
RAG ontology agent	`ai_platform_engineering/knowledge_bases/rag/agent_ontology/uv.lock`
Each A2A/MCP agent	`ai_platform_engineering/agents/<name>/{a2a,mcp}/uv.lock`

Enforcement: The uv-lock-check CI gate runs scripts/check_uv_lock_sync.sh on every PR and main push. It re-runs uv lock --check across all subpackages and fails if any lock file is out of sync with its pyproject.toml.

Vulnerability Scanning (Grype)

CAIPE uses Anchore Grype for vulnerability scanning, configured via the security-scan workflow. Scan results are uploaded as SARIF to GitHub Code Scanning.

Filesystem scan (PRs and main)

Runs on every pull request to main and every push to main. Scans the full repository filesystem against Grype's vulnerability database.

Setting	Value	Why
`severity-cutoff`	`critical`	Only critical-severity findings are surfaced
`fail-build`	`false`	Grype itself never hard-fails — the gate is GitHub Code Scanning PR diff mode
`output-format`	`sarif`	Results uploaded to GitHub Code Scanning for PR diff analysis

Why fail-build: false?

fail-build: true blocks a PR the moment Grype finds any critical CVE — including CVEs in upstream libraries that the PR author did not introduce and cannot fix. That forces contributors to take responsibility for vulnerabilities they have no control over, which creates noise without improving security.

Instead, CAIPE uses GitHub Code Scanning's PR diff mode as the blocking gate. When SARIF is uploaded for a PR, GitHub compares the results against the base branch scan. Only alerts that are new in the PR surface as a blocking check:

A PR that introduces a new vulnerable dependency → new alert → PR blocked, with a direct link to the CVE and the package name so the author knows exactly what to fix
An upstream CVE already present on main → not new → PR not blocked; the alert remains visible in the Security tab for maintainers to track and upgrade when a patch ships

The required check is Code scanning results / Grype under Settings → Branches → Branch protection rules for main.

Container image scan (tags and manual dispatch)

Runs on every version tag push (e.g., 0.2.3) and via workflow_dispatch. Scans all published container images from GHCR.

Setting	Value	Why
`severity-cutoff`	`high`	Wider net for published images (high + critical)
`fail-build`	`false`	Informational — results visible in Security tab, do not block tagging
`output-format`	`sarif`	Uploaded to Code Scanning per image for independent tracking

Container images scanned include all A2A sub-agents, MCP servers, the supervisor, UI, RAG components, and bots. Each image gets its own SARIF category (e.g., grype-agent-github, grype-caipe-ui) for independent tracking in GitHub Code Scanning.

Trigger matrix

Event	Filesystem scan	Container scan
Pull request → `main`	✅ (new alerts block via Code Scanning)	✗
Push to `main`	✅ informational	✗
Push tag (e.g. `0.2.3`)	✅ informational	✅ informational
`workflow_dispatch`	✅ informational	✅ informational

Static Analysis (CodeQL)

GitHub's CodeQL engine runs on every PR and main push, covering all four language categories present in the repository:

Language	Scope
`python`	Supervisor agent, sub-agents, MCP servers, RAG stack
`javascript-typescript`	CAIPE UI (Next.js / React), bot frontends
`go`	Go-based tooling and utilities
`actions`	GitHub Actions workflow files

CodeQL results are also uploaded as SARIF to GitHub Code Scanning. Any error-severity finding blocks the PR.

GitHub Actions Security

SHA-pinned actions

Every third-party GitHub Action in .github/workflows/ is pinned to a full commit SHA rather than a mutable tag:

# ✅ Immutable — SHA will never be reassigned
uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6

# ❌ Not used — tag can be force-pushed
uses: actions/checkout@v4

This prevents tag-hijacking supply chain attacks where a compromised upstream maintainer pushes malicious code to an existing tag.

Step Security Harden Runner

Every workflow job starts with the Step Security Harden Runner:

- name: Harden runner
  uses: step-security/harden-runner@fa2e9d605c4eeb9fcad4c99c224cee0c6c7f3594 # v2.16.0
  with:
    egress-policy: audit

The harden runner monitors network egress during the job and generates a report of all outbound connections. In audit mode it logs without blocking; this baseline is used to identify unexpected network calls that could indicate a compromised action.

Least-privilege permissions

Workflow files declare only the permissions they actually need. Most jobs run with contents: read only. The security scan workflow adds security-events: write solely to upload SARIF results:

permissions:
  contents: read
  security-events: write  # required to upload SARIF to GitHub Code Scanning

Commit and Contribution Controls

DCO (Developer Certificate of Origin)

Every commit must include a DCO sign-off line:

Signed-off-by: Your Name <your.email@example.com>

This is enforced by the Probot DCO app on every PR. Sign-off is added automatically with git commit -s. The DCO certifies that the contributor has the right to submit the code under the project's open-source license.

Conventional Commits

The conventional_commits workflow enforces the Conventional Commits specification on every PR title and commit message. This provides a structured audit trail and enables automated changelog generation.

No proprietary content

The check-proprietary-content workflow scans every changed file in a PR for patterns associated with proprietary infrastructure (internal domains, email suffixes, internal team names). Any match blocks the PR and posts a comment identifying the exact lines. This prevents accidental leakage of internal tooling references into the open-source repository.

Container Image Hardening

All runtime container images follow a two-stage build pattern to minimize the attack surface:

Multi-stage builds

# Stage 1: builder — full toolchain, installs dependencies
FROM python:3.13-slim AS builder
RUN uv sync --locked --no-dev

# Stage 2: runtime — only what's needed to run
FROM python:3.13-slim
COPY --from=builder /app /app

The builder stage installs compilers and build tools that are not copied to the final image. Only the compiled artifacts and application code ship.

Non-root user

All agent containers run as a dedicated non-root user with a fixed UID:

RUN groupadd -r appuser && useradd -r -g appuser -u 1001 -m appuser
USER appuser

Running as UID 1001 (non-root) limits the blast radius of a container breakout — a compromised process cannot write to system paths or install packages.

Locked dependency installs

Container builds install from the committed uv.lock file using uv sync --locked. This guarantees the image contains exactly the dependency versions that passed CI, with no resolver re-execution at build time.

Dependabot

GitHub Dependabot is configured for the repository and generates alerts when a dependency version is matched against a known vulnerability in the GitHub Advisory Database. Dependabot alerts feed into the same GitHub Code Scanning interface as Grype and CodeQL results, providing a unified view.

Alert Lifecycle

GitHub Code Scanning alerts are lifecycle-managed as follows:

Auto-close on fix — When a scan runs on main and a previously-reported finding is no longer present (e.g., because a dependency was upgraded), GitHub automatically closes the alert.
PR diff mode keeps the alert list clean — Because fail-build: false is paired with GitHub Code Scanning PR diff mode, upstream CVEs that have no fix yet don't accumulate as blocking noise. They remain in the Security tab as tracked items until a patch is available and the dependency is upgraded.
Ghost alert dismissal — Alerts created by scans of PR merge-refs (refs/pull/N/merge) or deleted branches can persist in the alert list even after the CVE is remediated, because GitHub has no new scan of that ref to close them against. These can be dismissed via the GitHub API with reason "false positive" or "won't fix" as appropriate.
SARIF categories — Each scan type uploads to a named category (grype-filesystem, grype-<image-name>, codeql-python, etc.). GitHub tracks alerts per category, so a filesystem alert and a container alert for the same CVE are tracked independently.

Dependency Pinning​

Python​

Node.js / JavaScript​

Reproducible Builds with uv.lock​

Vulnerability Scanning (Grype)​

Filesystem scan (PRs and main)​

Container image scan (tags and manual dispatch)​

Trigger matrix​

Static Analysis (CodeQL)​

GitHub Actions Security​

SHA-pinned actions​

Step Security Harden Runner​

Least-privilege permissions​

Commit and Contribution Controls​

DCO (Developer Certificate of Origin)​

Conventional Commits​

No proprietary content​

Container Image Hardening​

Multi-stage builds​

Non-root user​

Locked dependency installs​

Dependabot​

Alert Lifecycle​