Which regulatory agencies enforce this type of clause?

{'agency': 'FTC', 'reason': 'The FTC has authority over commercial data collection practices including the use of publicly available data for AI model training in ways that may constitute unfair or deceptive practices.', 'complaint_url': 'https://reportfraud.ftc.gov/'}

What is the severity of this clause?

ConductAtlas classifies this Publicly Available Data for Model Training clause as medium severity. Severity reflects the magnitude of rights affected, the breadth of users impacted, and the degree of discretion the platform retains.

Publicly Available Data for Model Training — Character.AI

Share 𝕏 Share in Share 🔒 PDF

Recent governance activity Character.AI recorded 15 documented changes in the last 30 days.

Start monitoring updates

Monitor governance changes for Character.AI Create a free account to receive the weekly governance digest and monitor one platform for governance changes.

Create free account No credit card required.

Document Record

What it is

Character.AI collects publicly available information from the internet to train its AI models, in addition to data collected directly from users.

ⓘ

This analysis describes what Character.AI's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

The use of publicly available internet data for commercial AI model training has become a subject of regulatory and legal scrutiny, including questions about intellectual property rights and whether publicly available data retains privacy protections under applicable law.

⚠

Interpretive note: The policy does not specify what types of publicly available data are collected or from which sources, creating uncertainty about the scope of this collection practice and the applicable compliance obligations.

Clause Stability Stable

Changes

Months Monitored

May 11, 2026

First Seen

May 22, 2026

Last Seen

This clause type exists across 1153 other provisions on other platforms.

Change history

added Jun 16, 2026

This new provision clarifies that Character.AI uses publicly available internet data for model training, expanding the scope of data sources beyond user-provided content.

View full change record →

Consumer impact (what this means for users)

Information about you that is publicly available online may be collected and used by Character.AI for AI model training purposes, beyond what you directly provide to the platform.

How other platforms handle this

MetaMask Medium

We may share your personal information with our affiliates, meaning entities that control, are controlled by, or are under common control with Consensys. We also share information with service providers who assist in operating our services, subject to confidentiality obligations.

Ledger Medium

At Ledger, earning and maintaining our users' trust is a top priority. That's why we are deeply committed not only to protecting your privacy and securing your personal data, but also to being fully transparent about how we handle it.

Target Medium

RedCard. We share information with our financial partners to operate the Target RedCard program.

See all platforms with this clause type →

Monitoring

Character.AI has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 25 platforms.

Start Monitor free trial Or create a free account →

▸ View Original Clause Language DOCUMENT RECORD

"
We also collect information that is available on the Internet or from other publicly available sources to evaluate and improve our Services, including for model training and development.

— Excerpt from Character.AI's Character.ai Privacy Policy

ConductAtlas Analysis

Institutional analysis (Compliance & governance intelligence)

REGULATORY LANDSCAPE: The collection of publicly available data for AI model training engages GDPR Article 6 lawful basis requirements and Article 14 transparency obligations for data not collected directly from data subjects, as well as emerging EU AI Act training data governance provisions. In the US, this practice interacts with FTC guidance on commercial data practices and state privacy law definitions of personal information. The European Data Protection Board has issued guidance relevant to whether publicly available data retains personal data status under GDPR. GOVERNANCE EXPOSURE: Medium. Scraping publicly available data for AI model training is a widespread industry practice but has attracted regulatory scrutiny in the EU regarding GDPR Article 14 notification obligations and in the UK from the ICO. The policy's brief disclosure does not specify what types of publicly available data are collected or from which sources, limiting the ability to assess compliance exposure without additional information. JURISDICTION FLAGS: EU and UK users whose information appears in publicly available sources may have Article 14 notification rights under GDPR that require the data controller to provide transparency disclosures within a reasonable time. California users may have CCPA rights over personal information collected from public sources depending on how the data is categorized. The breadth of the disclosure, referencing internet and other publicly available sources without limitation, creates uncertainty about scope. CONTRACT AND VENDOR IMPLICATIONS: If publicly available data collection is conducted by third-party data providers or web scraping services, those relationships should be reviewed for compliance with applicable terms of service and privacy laws. Data provenance documentation is increasingly expected by regulators reviewing AI training data practices. COMPLIANCE CONSIDERATIONS: Compliance teams should document the categories of publicly available data collected, the sources, and the legal basis under GDPR and applicable US law. GDPR Article 14 notification obligations should be assessed and, if applicable, a mechanism for providing those notifications should be developed. Intellectual property review of training data sources should also be considered given current litigation trends in this area.

Full compliance analysis

Regulatory citations, enforcement risk, and due diligence action items.

Track 1 platform — free Try Monitor free for 14 days

Free: track 1 platform + weekly digest. Monitor: 25 platforms + same-day alerts. No credit card required.

Applicable agencies

FTC

The FTC has authority over commercial data collection practices including the use of publicly available data for AI model training in ways that may constitute unfair or deceptive practices.
File a complaint →

Applicable regulations

CCPA/CPRA

California, USA

Connecticut Data Privacy Act Amendments

US-CT

CAN-SPAM

United States Federal

FTC Act Section 5

United States Federal

GDPR

European Union

Indiana Consumer Data Protection Act

US-IN

Kentucky Consumer Data Protection Act

US-KY

UK GDPR

United Kingdom

Universal Opt-Out Mechanism Expansion 2026

VPPA

United States Federal

Provision details

Document information

Document

Character.ai Privacy Policy

Entity

Character.AI

Document last updated

May 5, 2026

Tracking information

First tracked

May 8, 2026

Last verified

May 11, 2026

Record ID

CA-P-010335

Document ID

CA-D-00120

Evidence Provenance

Source URL

https://character.ai/privacy

Wayback Machine

View archived versions →

Content hash (SHA-256)

6ad8585d7de8834f45d45863325899d3602d6584f208eff63eb099fffa024748

Analysis generated

May 8, 2026 14:58 UTC

Methodology

summarize_document-v8

Evidence

✓ Snapshot stored ✓ Hash verified

Citation Record

Entity: Character.AI
Document: Character.ai Privacy Policy
Record ID: CA-P-010335
Captured: 2026-05-08 14:58:37 UTC
SHA-256: 6ad8585d7de8834f…
URL: https://conductatlas.com/platform/characterai/characterai-privacy-policy/publicly-available-data-for-model-training/
Accessed: June 30, 2026

Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.

Classification

Severity

Medium

Other risks in this policy

AI Model Training Use of Chat and Voice Data high
Popular Characters Post-Deletion Retention high
Sensitive Personal Information Handling high
Advertising and Analytics Data Sharing medium
Children's Privacy Age Restriction high
Business Transfer Disclosure medium

Related Analysis

Meta Removed 438 Sentences From Its Privacy Policy. Here Is What Disappeared.
ConductAtlas detected a major restructuring of Meta’s privacy policy that removed detailed consumer rights disclosures and relocated them to separate documents.
23andMe Is Bankrupt. What Happens to Your DNA Now?
Your genetic data may be transferred to a new owner as a business asset. Here is what the Terms of Service actually say and what you can do right now.

Compliance Governance Intelligence

Need to monitor specific governance provisions?

Compliance includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies

Start Compliance free trial

Or start with Monitor →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does Character.AI's Publicly Available Data for Model Training clause do?

How does this clause affect you?

Information about you that is publicly available online may be collected and used by Character.AI for AI model training purposes, beyond what you directly provide to the platform.

Is ConductAtlas affiliated with Character.AI?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by Character.AI.