Mistral AI trains its AI models using data collected from the public internet and from third-party datasets, both of which may contain personal data about individuals who never interacted with Mistral AI and did not consent to this use.
This analysis describes what Mistral AI's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology
Your personal data may be included in Mistral AI's AI training even if you have never used any Mistral AI product, because the company sources training data from public internet content and third-party datasets that may contain your information.
Personal data from public internet sources and third-party datasets, potentially including data about individuals who are not Mistral AI users, may be used for model training; this provision affects a broader population than just registered users.
How other platforms handle this
We use information to enhance the quality, reliability, and/or accuracy of our AI Features by creating, developing, training, testing, improving, and maintaining AI and ML models run by Strava or our service providers. We use aggregated, de-identified data for this purpose. We also use personal info...
At Ledger, earning and maintaining our users' trust is a top priority. That's why we are deeply committed not only to protecting your privacy and securing your personal data, but also to being fully transparent about how we handle it.
If you are located in the European Economic Area, Switzerland, or the United Kingdom, you have the right to access, correct, or erase your personal data; the right to restrict or object to our processing of your personal data; the right to data portability; and, where our processing is based on your...
Monitoring
Mistral AI has changed this document before.
Receive same-day alerts, structured change summaries, and monitoring for up to 25 platforms.
"Training Datasets. In some cases, we access datasets provided by third parties for our model training purposes. These datasets may include personal data (even if such third parties and Mistral AI use good practices to filter out such personal data), proprietary data, or public data. [...] Data publicly available on the Internet. Our artificial intelligence models are trained on data that is publicly available on the Internet by third parties, which may contain personal data, even if we use good practices to filter out such personal data.— Excerpt from Mistral AI's Mistral AI Privacy Policy
1. REGULATORY LANDSCAPE: This provision engages GDPR's requirements for lawful basis and purpose limitation when personal data is sourced from third parties or publicly available sources. The CNIL and the European Data Protection Board have issued guidance indicating that publicly available data is not automatically exempt from GDPR requirements when repurposed for AI training. The EU AI Act's provisions on training data transparency and documentation may also apply to Mistral AI as a general-purpose AI model provider. 2. GOVERNANCE EXPOSURE: Medium. The acknowledgment that training datasets 'may include personal data' despite filtering efforts is a transparency disclosure, but it does not specify the lawful basis for processing that personal data. Regulators may require Mistral AI to demonstrate that individuals whose data appears in training datasets have their rights respected, including the right to object and the right to erasure where technically feasible. 3. JURISDICTION FLAGS: EU and EEA individuals whose data appears in public internet scrapes or third-party datasets may have GDPR rights that Mistral AI must honor, regardless of whether those individuals are registered users. This creates a broad and difficult-to-scope population of potentially affected data subjects. Jurisdictions with active AI governance frameworks, including France, Germany, and Italy, may apply heightened scrutiny. 4. CONTRACT AND VENDOR IMPLICATIONS: Third-party dataset providers supplying training data to Mistral AI should be subject to due diligence on their own data sourcing practices and legal basis for sharing. Procurement teams should confirm that third-party data providers have documented lawful basis for transfer and have conducted appropriate filtering. 5. COMPLIANCE CONSIDERATIONS: Legal teams should evaluate whether Mistral AI's legitimate interest basis extends to personal data sourced from public internet scrapes and third-party datasets, and whether a formal privacy impact assessment has been conducted for training data sourcing. Data subject rights mechanisms should address how individuals who are not registered users can exercise GDPR rights such as erasure or objection with respect to data used in model training.
Full compliance analysis
Regulatory citations, enforcement risk, and due diligence action items.
Free: track 1 platform + weekly digest. Monitor: 25 platforms + same-day alerts. No credit card required.
Ad personalization controls removed. Contact scanning added. Advertiser data partnerships quietly dropped. A timeline of every change.
Compliance Governance Intelligence
Need to monitor specific governance provisions?
Compliance includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.
Built from archived source documents, structured governance mappings, and historical version tracking.
Your personal data may be included in Mistral AI's AI training even if you have never used any Mistral AI product, because the company sources training data from public internet content and third-party datasets that may contain your information.
Personal data from public internet sources and third-party datasets, potentially including data about individuals who are not Mistral AI users, may be used for model training; this provision affects a broader population than just registered users.
No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by Mistral AI.