Which regulatory agencies enforce this type of clause?

{'agency': 'FTC', 'reason': 'FTC oversight of deceptive AI practices and data-related consumer harms is relevant where training data disclosure is absent or inaccurate in ways that affect consumer-facing AI products', 'complaint_url': 'https://reportfraud.ftc.gov/'}

What is the severity of this clause?

ConductAtlas classifies this Training Data Disclosure clause as medium severity. Severity reflects the magnitude of rights affected, the breadth of users impacted, and the degree of discretion the platform retains.

Training Data Disclosure — Hugging Face

Share 𝕏 Share in Share 🔒 PDF

Monitor governance changes for Hugging Face Create a free account to receive the weekly governance digest and monitor one platform for governance changes.

Create free account No credit card required.

Document Record

What it is

Model publishers are encouraged to disclose what datasets were used to train their model, which helps users assess potential biases, data provenance issues, and licensing implications of the training data.

ⓘ

This analysis describes what Hugging Face's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

Training data disclosure is directly relevant to intellectual property compliance, data provenance assessments, and bias risk evaluation, particularly as regulatory frameworks increasingly require transparency about AI training data sources.

⚠

Interpretive note: Training data disclosure is described as a recommendation rather than a mandatory field, so completeness and accuracy depend on individual model publisher behavior and cannot be assumed.

Consumer impact (what this means for users)

The training data section of a model card, when completed by the publisher, provides users with information about where the model's capabilities come from, including whether training data may have included copyrighted material, personal data, or datasets with known demographic biases.

Cross-platform context

See how other platforms handle Training Data Disclosure and similar clauses.

Compare across platforms →

Monitoring

Hugging Face has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 10 platforms.

Start Watcher free trial Or create a free account →

▸ View Original Clause Language DOCUMENT RECORD

"
Model cards should include information about the datasets used to train the model. This information helps users understand the potential biases and limitations of the model.

— Excerpt from Hugging Face's Hugging Face Model Card Guidelines

ConductAtlas Analysis

Institutional analysis (Compliance & governance intelligence)

(1) REGULATORY LANDSCAPE: Training data disclosure engages the EU AI Act's requirements for data governance and transparency for AI systems. The EU General Data Protection Regulation may apply where training data included personal data. Copyright law in multiple jurisdictions is increasingly relevant to AI training data, following litigation and regulatory attention in the US, EU, and UK. (2) GOVERNANCE EXPOSURE: Medium to High. Organizations deploying models without reviewing training data disclosures may unknowingly use models trained on data that creates copyright infringement exposure or violates data protection regulations applicable to the training data subjects. (3) JURISDICTION FLAGS: EU organizations face heightened exposure under GDPR where models were trained on personal data without adequate legal basis. US organizations should assess training data disclosures in light of ongoing copyright litigation involving AI training datasets. UK organizations should review training data against the UK's data protection framework. (4) CONTRACT AND VENDOR IMPLICATIONS: Procurement teams should treat training data disclosure as a material due diligence item and consider requesting contractual representations from model publishers regarding the lawfulness of training data collection and processing. (5) COMPLIANCE CONSIDERATIONS: Compliance teams should assess training data disclosures for intellectual property and data protection risk before commercial deployment, and maintain documentation of this assessment as part of AI governance records.

Full compliance analysis

Regulatory citations, enforcement risk, and due diligence action items.

Track 1 platform — free Try Watcher free for 14 days

Free: track 1 platform + weekly digest. Watcher: 10 platforms + same-day alerts. No credit card required.

Applicable agencies

FTC

FTC oversight of deceptive AI practices and data-related consumer harms is relevant where training data disclosure is absent or inaccurate in ways that affect consumer-facing AI products
File a complaint →

Provision details

Document information

Document

Hugging Face Model Card Guidelines

Entity

Hugging Face

Document last updated

May 12, 2026

Tracking information

First tracked

May 12, 2026

Last verified

May 12, 2026

Record ID

CA-P-012041

Document ID

CA-D-00842

Evidence Provenance

Source URL

https://huggingface.co/docs/hub/model-cards

Wayback Machine

View archived versions →

Content hash (SHA-256)

5ab2ffdb4775639318cbe1f59c37b7cc7ae22717418f27552c120ec31e09fc37

Analysis generated

May 12, 2026 17:16 UTC

Methodology

summarize_document-v8

Evidence

✓ Snapshot stored ✓ Hash verified

Citation Record

Entity: Hugging Face
Document: Hugging Face Model Card Guidelines
Record ID: CA-P-012041
Captured: 2026-05-12 17:16:37 UTC
SHA-256: 5ab2ffdb47756393…
URL: https://conductatlas.com/platform/hugging-face/hugging-face-model-card-guidelines/training-data-disclosure/
Accessed: May 13, 2026

Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.

Classification

Severity

Medium

Other risks in this policy

Model Card Mandatory Metadata Fields medium
Carbon Emissions Disclosure low
Bias and Limitations Disclosure medium
Intended Use and Out-of-Scope Use Declaration medium
License Declaration Requirement high
Evaluation Results Structured Reporting medium

Professional Governance Intelligence

Need to monitor specific governance provisions?

Professional includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies

Start Professional free trial

Or start with Watcher →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does Hugging Face's Training Data Disclosure clause do?

How does this clause affect you?

Is ConductAtlas affiliated with Hugging Face?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by Hugging Face.