Which regulatory agencies enforce this type of clause?

{'agency': 'FTC', 'reason': 'The FTC has authority over unfair data practices affecting US consumers, including the use of personal data scraped from public sources for AI training without notice to affected individuals.', 'complaint_url': 'https://reportfraud.ftc.gov/'}

What is the severity of this clause?

ConductAtlas classifies this Third-Party Training Datasets clause as medium severity. Severity reflects the magnitude of rights affected, the breadth of users impacted, and the degree of discretion the platform retains.

Third-Party Training Datasets — Mistral AI

Share 𝕏 Share in Share 🔒 PDF

Recent governance activity Mistral AI recorded 4 documented changes in the last 30 days.

Start monitoring updates

Monitor governance changes for Mistral AI Create a free account to receive the weekly governance digest and monitor one platform for governance changes.

Create free account No credit card required.

Document Record

What it is

Mistral AI trains its AI models using data collected from the public internet and from third-party datasets, both of which may contain personal data about individuals who never interacted with Mistral AI and did not consent to this use.

ⓘ

This analysis describes what Mistral AI's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

Your personal data may be included in Mistral AI's AI training even if you have never used any Mistral AI product, because the company sources training data from public internet content and third-party datasets that may contain your information.

Consumer impact (what this means for users)

Personal data from public internet sources and third-party datasets, potentially including data about individuals who are not Mistral AI users, may be used for model training; this provision affects a broader population than just registered users.

How other platforms handle this

Strava Medium

We use information to enhance the quality, reliability, and/or accuracy of our AI Features by creating, developing, training, testing, improving, and maintaining AI and ML models run by Strava or our service providers. We use aggregated, de-identified data for this purpose. We also use personal info...

Ledger Medium

At Ledger, earning and maintaining our users' trust is a top priority. That's why we are deeply committed not only to protecting your privacy and securing your personal data, but also to being fully transparent about how we handle it.

Garmin Medium

If you are located in the European Economic Area, Switzerland, or the United Kingdom, you have the right to access, correct, or erase your personal data; the right to restrict or object to our processing of your personal data; the right to data portability; and, where our processing is based on your...

See all platforms with this clause type →

Monitoring

Mistral AI has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 25 platforms.

Start Monitor free trial Or create a free account →

▸ View Original Clause Language DOCUMENT RECORD

"
Training Datasets. In some cases, we access datasets provided by third parties for our model training purposes. These datasets may include personal data (even if such third parties and Mistral AI use good practices to filter out such personal data), proprietary data, or public data. [...] Data publicly available on the Internet. Our artificial intelligence models are trained on data that is publicly available on the Internet by third parties, which may contain personal data, even if we use good practices to filter out such personal data.

— Excerpt from Mistral AI's Mistral AI Privacy Policy

ConductAtlas Analysis

Institutional analysis (Compliance & governance intelligence)

1. REGULATORY LANDSCAPE: This provision engages GDPR's requirements for lawful basis and purpose limitation when personal data is sourced from third parties or publicly available sources. The CNIL and the European Data Protection Board have issued guidance indicating that publicly available data is not automatically exempt from GDPR requirements when repurposed for AI training. The EU AI Act's provisions on training data transparency and documentation may also apply to Mistral AI as a general-purpose AI model provider. 2. GOVERNANCE EXPOSURE: Medium. The acknowledgment that training datasets 'may include personal data' despite filtering efforts is a transparency disclosure, but it does not specify the lawful basis for processing that personal data. Regulators may require Mistral AI to demonstrate that individuals whose data appears in training datasets have their rights respected, including the right to object and the right to erasure where technically feasible. 3. JURISDICTION FLAGS: EU and EEA individuals whose data appears in public internet scrapes or third-party datasets may have GDPR rights that Mistral AI must honor, regardless of whether those individuals are registered users. This creates a broad and difficult-to-scope population of potentially affected data subjects. Jurisdictions with active AI governance frameworks, including France, Germany, and Italy, may apply heightened scrutiny. 4. CONTRACT AND VENDOR IMPLICATIONS: Third-party dataset providers supplying training data to Mistral AI should be subject to due diligence on their own data sourcing practices and legal basis for sharing. Procurement teams should confirm that third-party data providers have documented lawful basis for transfer and have conducted appropriate filtering. 5. COMPLIANCE CONSIDERATIONS: Legal teams should evaluate whether Mistral AI's legitimate interest basis extends to personal data sourced from public internet scrapes and third-party datasets, and whether a formal privacy impact assessment has been conducted for training data sourcing. Data subject rights mechanisms should address how individuals who are not registered users can exercise GDPR rights such as erasure or objection with respect to data used in model training.

Full compliance analysis

Regulatory citations, enforcement risk, and due diligence action items.

Track 1 platform — free Try Monitor free for 14 days

Free: track 1 platform + weekly digest. Monitor: 25 platforms + same-day alerts. No credit card required.

Applicable agencies

FTC

The FTC has authority over unfair data practices affecting US consumers, including the use of personal data scraped from public sources for AI training without notice to affected individuals.
File a complaint →

Applicable regulations

Connecticut Data Privacy Act Amendments

US-CT

EU AI Act - High Risk Provisions

FTC Act Section 5

United States Federal

GDPR

European Union

Indiana Consumer Data Protection Act

US-IN

Kentucky Consumer Data Protection Act

US-KY

Universal Opt-Out Mechanism Expansion 2026

Provision details

Document information

Document

Mistral AI Privacy Policy

Entity

Mistral AI

Document last updated

May 5, 2026

Tracking information

First tracked

May 11, 2026

Last verified

May 11, 2026

Record ID

CA-P-010427

Document ID

CA-D-00443

Evidence Provenance

Source URL

https://legal.mistral.ai/terms/privacy-policy

Wayback Machine

View archived versions →

Content hash (SHA-256)

a3774c814d80737846c7ac8379ec7dcc1c55ee8e0300de40dccee951ff5d0230

Analysis generated

May 11, 2026 05:55 UTC

Methodology

summarize_document-v8

Evidence

✓ Snapshot stored ✓ Hash verified

Citation Record

Entity: Mistral AI
Document: Mistral AI Privacy Policy
Record ID: CA-P-010427
Captured: 2026-05-11 05:55:06 UTC
SHA-256: a3774c814d807378…
URL: https://conductatlas.com/platform/mistral-ai/mistral-ai-privacy-policy/third-party-training-datasets/
Accessed: June 28, 2026

Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.

Classification

Severity

Medium

Other risks in this policy

Related Analysis

OpenAI Changed Its Privacy Policy 4 Times in One Week. Here Is What Actually Changed.
Ad personalization controls removed. Contact scanning added. Advertiser data partnerships quietly dropped. A timeline of every change.

Compliance Governance Intelligence

Need to monitor specific governance provisions?

Compliance includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies

Start Compliance free trial

Or start with Monitor →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does Mistral AI's Third-Party Training Datasets clause do?

How does this clause affect you?

Is ConductAtlas affiliated with Mistral AI?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by Mistral AI.