Mistral AI · Mistral AI Privacy Policy · View original document ↗

Training Data from Publicly Available Internet Sources

Medium severity Unique · 0 of 325 platforms
Share 𝕏 Share in Share 🔒 PDF
Recent governance activity Mistral AI recorded 4 documented changes in the last 30 days.
Start monitoring updates
Monitor governance changes for Mistral AI Create a free account to receive the weekly governance digest and monitor one platform for governance changes.
Create free account No credit card required.

This analysis describes what Mistral AI's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

People whose personal information appears in publicly available internet data — such as in articles, social media posts, or forums — may have their data used to train Mistral AI's models without their knowledge or consent.

Consumer impact (what this means for users)

Mistral AI's privacy policy authorizes the use of your chat inputs and AI outputs for model training under a legitimate interest basis, meaning this occurs by default without requiring your affirmative consent unless you opt out. The Memory feature may also store sensitive personal details, such as health information you mention in prompts, and the policy states this is handled under explicit consent for sensitive data, though the mechanism for obtaining that consent in practice is not fully detailed in this document. You can opt out of having your conversations used for model training by adjusting your preferences in your Mistral AI account settings.

How other platforms handle this

Writer Medium

Writer does not use Customer Data to train its AI models without explicit customer permission. Customer Data means the data, content, and information that customers and their end users submit to or through the Services.

Ideogram Medium

We may use the content you provide to us, including prompts and generated images, to train and improve our AI models and services.

Roblox Medium

We are simplifying our Terms of Use, including clarifications around the use of AI tools, and their data use. We have moved the terms that describe AI Features, which were previously written for a Creator audience and located under the AI-Based Tools Supplemental Terms and Disclaimer, into the User ...

See all platforms with this clause type →

Monitoring

Mistral AI has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 10 platforms.

Start Watcher free trial Or create a free account →
▸ View Original Clause Language DOCUMENT RECORD
"
Data publicly available on the Internet. Our artificial intelligence models are trained on data that is publicly available on the Internet by third parties, which may contain personal data, even if we use good practices to filter out such personal data. [...] Training Datasets. In some cases, we access datasets provided by third parties for our model training purposes. These datasets may include personal data (even if such third parties and Mistral AI use good practices to filter out such personal data), proprietary data, or public data.

— Excerpt from Mistral AI's Mistral AI Privacy Policy

Applicable regulations

EU AI Act
European Union
California AB 2013 AI Training Data Transparency
US-CA
Colorado AI Act
US-CO
EU AI Act - High Risk Provisions
EU
GDPR
European Union
Texas AI Act
Texas, USA
Trump Executive Order on AI Policy Framework
US

Provision details

Document information
Document
Mistral AI Privacy Policy
Entity
Mistral AI
Document last updated
May 5, 2026
Tracking information
First tracked
May 11, 2026
Last verified
May 11, 2026
Record ID
CA-P-007014
Document ID
CA-D-00443
Evidence Provenance
Source URL
Wayback Machine
Content hash (SHA-256)
a3774c814d80737846c7ac8379ec7dcc1c55ee8e0300de40dccee951ff5d0230
Analysis generated
May 11, 2026 05:55 UTC
Methodology
Evidence
✓ Snapshot stored   ✓ Hash verified
Citation Record
Entity: Mistral AI
Document: Mistral AI Privacy Policy
Record ID: CA-P-007014
Captured: 2026-05-11 05:55:06 UTC
SHA-256: a3774c814d807378…
URL: https://conductatlas.com/platform/mistral-ai/mistral-ai-privacy-policy/training-data-from-publicly-available-internet-sources/
Accessed: May 13, 2026
Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.
Classification
Severity
Medium
Categories

Other risks in this policy

Related Analysis

Professional Governance Intelligence

Need to monitor specific governance provisions?

Professional includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies
Start Professional free trial

Or start with Watcher →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does Mistral AI's Training Data from Publicly Available Internet Sources clause do?

People whose personal information appears in publicly available internet data — such as in articles, social media posts, or forums — may have their data used to train Mistral AI's models without their knowledge or consent.

Is ConductAtlas affiliated with Mistral AI?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by Mistral AI.