Question 36 of 46

How do we intake and govern open-weight and self-hosted AI models?

By Cody Maxwell · AI Governance Institute · March 2026

A governance framework for organizations that download, fine-tune, or self-host open-weight models — covering intake review, deployment controls, and ongoing maintenance obligations that differ from API-based vendor relationships.

If you only do 3 things, do this:

1.Open-weight models shift the attack surface inward — you own the weights, so you own the prompt injection risk, the output guardrails, and the update cadence. There is no vendor to call when something goes wrong.
2.Treat model weight files as critical software artifacts: verify integrity on download, store them in access-controlled repositories, and document the provenance chain from the originating release.
3.The biggest governance gap in open-weight deployments is the absence of a model update process — teams download a model, deploy it, and never revisit it. Define an update cadence and a process for evaluating new releases before deploying them.

The Situation

Who this is for: AI Engineering, Security, and GRC teams at organizations considering or actively using Llama, Mistral, Falcon, or other open-weight models in production

When you need this: Before the first open-weight model is deployed to production, or when conducting a governance audit of existing self-hosted deployments

The Decision

What governance controls apply specifically to open-weight models, and how do they differ from API-based vendor controls?

The Steps

1Create an intake checklist for open-weight models: licensing review, security assessment, performance benchmarking, and risk classification
2Verify model weight integrity on download using published checksums or cryptographic signatures
3Store weights in an access-controlled artifact repository with audit logging
4Document the deployment configuration: quantization level, inference infrastructure, guardrail configuration, and any fine-tuning applied
5Run pre-deployment adversarial testing — no vendor has done this for you
6Define a model update policy: how often do you evaluate newer releases, and what criteria trigger an upgrade or replacement
7Assign a named owner responsible for monitoring security disclosures and community findings related to the deployed model

The Artifacts

—Open-weight model intake checklist (license review, security assessment, integrity verification, risk classification)
—Model deployment configuration template (weights version, quantization, guardrails, fine-tuning log)
—Model update and retirement policy for self-hosted deployments
—Weight integrity verification log (download date, checksum, verification status)

The Output

Every open-weight model in production has a documented intake record, a named owner, verified weight integrity, and a scheduled update review.

How open-weight governance differs from vendor API governance

When an organization uses a model through an API, most of the infrastructure-level governance burden sits with the vendor: they manage model updates, apply safety fine-tuning, monitor for misuse, and publish security advisories. When an organization downloads and self-hosts an open-weight model, that burden transfers entirely to the organization. This is a significant governance shift that is frequently underestimated.

The most important differences are: the organization owns the output guardrails (no vendor-side safety filters apply unless you build them); the organization is responsible for monitoring security disclosures and community findings about model vulnerabilities; and the organization must manage its own update cadence, including evaluating whether new model releases improve or change the risk profile.

Open-weight models also introduce weight integrity as a distinct control surface. Downloaded model weights can be tampered with in transit or at rest. Organizations that do not verify the integrity of downloaded weights against published checksums or cryptographic signatures cannot confirm they are running the model they believe they are.

The intake process

Every open-weight model entering production should go through a structured intake process before deployment. This should cover four areas. First, licensing: open-weight models carry widely varying license terms — some prohibit commercial use, some restrict specific industries, some impose attribution requirements, and some (including the Llama 2 license) have restrictions on use by large organizations. Legal must review the license before deployment.

Second, security assessment: run the model through your standard adversarial testing suite before deployment. For models that will be exposed to external user input, include prompt injection and jailbreak testing. Third, performance benchmarking: validate that the model performs acceptably on your specific use case — do not assume published benchmark scores translate to your domain.

Fourth, risk classification: apply your standard AI risk classification framework. A self-hosted LLM handling customer queries is still a Significant-risk system regardless of how it was deployed. The intake process should produce a signed record showing who reviewed what and when.

Ongoing maintenance obligations

The most common governance failure in open-weight deployments is the absence of a model maintenance process. Teams deploy a model and do not revisit it. Over time, the gap between the deployed model and current releases grows; new attack techniques emerge that the model has not been tested against; and fine-tuning applied at deployment becomes misaligned with the current production context.

Define a model update review cadence — quarterly for Critical-tier deployments, semi-annually for others. Assign a named owner who monitors the model's originating community (Hugging Face, GitHub, the model developer's security channels) for safety disclosures and significant new releases. When a new release appears, the owner evaluates whether it warrants an upgrade and documents the decision.

Also define a retirement process: what happens to the weights when the model is decommissioned? Weights should be deleted from all environments, and the deletion should be logged. Leaving unused model weights in accessible storage is a security risk.

Governance Controls

Operational controls that implement the guidance in this playbook.

AGT-001Agent Permission Boundaries AGT-015Agent OAuth Scope Drift Detection DGC-001Training Data Provenance DGC-006AI-Generated Code and Open-Source License Compliance PRC-005AI Procurement Risk Assessment SEC-005Adversarial Robustness Testing

Related frameworks

NIST AI RMF EU AI Act NIST AI 600-1 Generative AI Profile

Recent Coverage

News and developments relevant to this playbook topic.

DeepSeek Agent Conducts Autonomous Cyberattacks, Bypassing Human-in-the-Loop Controls Distillation Study Finds Censorship Does Not Transfer, But Supply Chain Risk Does CVE-2026-59726: CVSS 10.0 Flaw in Ruflo Lets Attackers Seize AI Agents, Steal Credentials, and Poison Agent Memory Through a Single HTTP Request Debian's Vote to Ban LLM-Assisted Contributions Signals Open-Source Supply Chain Risk Enterprises Cannot Ignore UK AISI and CAISI Find Kimi K3 Safeguards Failed to Block Offensive Cyber Attempts Ahead of Open-Weight Release

Not sure where to start? Answer 3 questions and get a tailored compliance action plan.

What applies to me? →

More guidance like this, every week

New playbook articles, governance controls, and the regulatory changes driving them. Every Thursday.

← How do we report AI risk to the board and audit committee?How do we map AI compliance obligations across multiple jurisdictions? →