AI Governance Institute logo
AI Governance Institute

Practical Governance for Enterprise AI

← AI Governance Playbook

Question 36 of 45

How do we intake and govern open-weight and self-hosted AI models?

Published by AI Governance Institute · Practical Governance for Enterprise AI

A governance framework for organizations that download, fine-tune, or self-host open-weight models — covering intake review, deployment controls, and ongoing maintenance obligations that differ from API-based vendor relationships.

If you only do 3 things, do this:

  1. 1.Open-weight models shift the attack surface inward — you own the weights, so you own the prompt injection risk, the output guardrails, and the update cadence. There is no vendor to call when something goes wrong.
  2. 2.Treat model weight files as critical software artifacts: verify integrity on download, store them in access-controlled repositories, and document the provenance chain from the originating release.
  3. 3.The biggest governance gap in open-weight deployments is the absence of a model update process — teams download a model, deploy it, and never revisit it. Define an update cadence and a process for evaluating new releases before deploying them.

The Situation

Who this is for: AI Engineering, Security, and GRC teams at organizations considering or actively using Llama, Mistral, Falcon, or other open-weight models in production

When you need this: Before the first open-weight model is deployed to production, or when conducting a governance audit of existing self-hosted deployments

The Decision

What governance controls apply specifically to open-weight models, and how do they differ from API-based vendor controls?

The Steps

  1. 1Create an intake checklist for open-weight models: licensing review, security assessment, performance benchmarking, and risk classification
  2. 2Verify model weight integrity on download using published checksums or cryptographic signatures
  3. 3Store weights in an access-controlled artifact repository with audit logging
  4. 4Document the deployment configuration: quantization level, inference infrastructure, guardrail configuration, and any fine-tuning applied
  5. 5Run pre-deployment adversarial testing — no vendor has done this for you
  6. 6Define a model update policy: how often do you evaluate newer releases, and what criteria trigger an upgrade or replacement
  7. 7Assign a named owner responsible for monitoring security disclosures and community findings related to the deployed model

The Artifacts

  • Open-weight model intake checklist (license review, security assessment, integrity verification, risk classification)
  • Model deployment configuration template (weights version, quantization, guardrails, fine-tuning log)
  • Model update and retirement policy for self-hosted deployments
  • Weight integrity verification log (download date, checksum, verification status)

The Output

Every open-weight model in production has a documented intake record, a named owner, verified weight integrity, and a scheduled update review.

How open-weight governance differs from vendor API governance

When an organization uses a model through an API, most of the infrastructure-level governance burden sits with the vendor: they manage model updates, apply safety fine-tuning, monitor for misuse, and publish security advisories. When an organization downloads and self-hosts an open-weight model, that burden transfers entirely to the organization. This is a significant governance shift that is frequently underestimated.

The most important differences are: the organization owns the output guardrails (no vendor-side safety filters apply unless you build them); the organization is responsible for monitoring security disclosures and community findings about model vulnerabilities; and the organization must manage its own update cadence, including evaluating whether new model releases improve or change the risk profile.

Open-weight models also introduce weight integrity as a distinct control surface. Downloaded model weights can be tampered with in transit or at rest. Organizations that do not verify the integrity of downloaded weights against published checksums or cryptographic signatures cannot confirm they are running the model they believe they are.

The intake process

Every open-weight model entering production should go through a structured intake process before deployment. This should cover four areas. First, licensing: open-weight models carry widely varying license terms — some prohibit commercial use, some restrict specific industries, some impose attribution requirements, and some (including the Llama 2 license) have restrictions on use by large organizations. Legal must review the license before deployment.

Second, security assessment: run the model through your standard adversarial testing suite before deployment. For models that will be exposed to external user input, include prompt injection and jailbreak testing. Third, performance benchmarking: validate that the model performs acceptably on your specific use case — do not assume published benchmark scores translate to your domain.

Fourth, risk classification: apply your standard AI risk classification framework. A self-hosted LLM handling customer queries is still a Significant-risk system regardless of how it was deployed. The intake process should produce a signed record showing who reviewed what and when.

Ongoing maintenance obligations

The most common governance failure in open-weight deployments is the absence of a model maintenance process. Teams deploy a model and do not revisit it. Over time, the gap between the deployed model and current releases grows; new attack techniques emerge that the model has not been tested against; and fine-tuning applied at deployment becomes misaligned with the current production context.

Define a model update review cadence — quarterly for Critical-tier deployments, semi-annually for others. Assign a named owner who monitors the model's originating community (Hugging Face, GitHub, the model developer's security channels) for safety disclosures and significant new releases. When a new release appears, the owner evaluates whether it warrants an upgrade and documents the decision.

Also define a retirement process: what happens to the weights when the model is decommissioned? Weights should be deleted from all environments, and the deletion should be logged. Leaving unused model weights in accessible storage is a security risk.

Not sure where to start? Answer 3 questions and get a tailored compliance action plan.

What applies to me? →