GitHub · GitHub Privacy Statement · View original document ↗

AI/ML Model Training Data Use

High severity Unique · 0 of 325 platforms
Share 𝕏 Share in Share 🔒 PDF
Recent governance activity GitHub recorded 2 documented changes in the last 30 days.
Start monitoring updates
Monitor governance changes for GitHub Create a free account to receive the weekly governance digest and monitor one platform for governance changes.
Create free account No credit card required.

This analysis describes what GitHub's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

This is a default opt-in practice, meaning your data is used for AI training automatically unless you take action to opt out, which many users may not know to do.

Recent Activity

This document changed recently

High Apr 28, 2026

The updated terms now explicitly authorize GitHub to collect AI outputs generated within the platform alongside user-provided code and content, and to share personal data with Microsoft and other Git…

Consumer impact (what this means for users)

The policy states GitHub collects identifiers, device information, usage activity, payment data, and user-generated content, and authorizes sharing this data with Microsoft affiliates, service providers, and analytics and advertising partners. Public repository content is described as globally visible and potentially indexed by search engines, meaning code and associated metadata posted publicly is not treated as private personal data under the policy. You can submit data access, deletion, or correction requests through GitHub's privacy contact form at https://support.github.com/contact/privacy.

How other platforms handle this

Windsurf Medium

We may leverage OpenAI models independent of user selection for processing other tasks (e.g. for summarization). We may leverage Anthropic models independent of user selection for processing other tasks (e.g. for summarization). We may leverage these models independent of user selection for processi...

Writer Medium

Writer does not use Customer Data to train its AI models without explicit customer permission. Customer Data means the data, content, and information that customers and their end users submit to or through the Services.

Ideogram Medium

We may use the content you provide to us, including prompts and generated images, to train and improve our AI models and services.

See all platforms with this clause type →

Monitoring

GitHub has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 10 platforms.

Start Watcher free trial Or create a free account →
▸ View Original Clause Language DOCUMENT RECORD
"
We may use the personal data we collect to improve our Services, develop new Services, and conduct research. This includes using the data to train and improve AI and machine learning models for features like GitHub Copilot. You can opt out of your personal data being used to train these models by adjusting your settings.

— Excerpt from GitHub's GitHub Privacy Statement

Applicable regulations

EU AI Act
European Union
Colorado AI Act
US-CO
GDPR
European Union
Texas AI Act
Texas, USA
UK GDPR
United Kingdom

Provision details

Document information
Document
GitHub Privacy Statement
Entity
GitHub
Document last updated
May 5, 2026
Tracking information
First tracked
May 10, 2026
Last verified
May 12, 2026
Record ID
CA-P-005601
Document ID
CA-D-00254
Evidence Provenance
Source URL
Wayback Machine
Content hash (SHA-256)
d21b58443ca0b4402240dbd06996ada072c72ed842fcccc6b13acab2d7bc6c4d
Analysis generated
May 10, 2026 09:46 UTC
Methodology
Evidence
✓ Snapshot stored   ✓ Hash verified
Citation Record
Entity: GitHub
Document: GitHub Privacy Statement
Record ID: CA-P-005601
Captured: 2026-05-10 09:46:36 UTC
SHA-256: d21b58443ca0b440…
URL: https://conductatlas.com/platform/github/github-privacy-statement/aiml-model-training-data-use/
Accessed: May 14, 2026
Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.
Classification
Severity
High
Categories

Other risks in this policy

Related Analysis

Professional Governance Intelligence

Need to monitor specific governance provisions?

Professional includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies
Start Professional free trial

Or start with Watcher →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does GitHub's AI/ML Model Training Data Use clause do?

This is a default opt-in practice, meaning your data is used for AI training automatically unless you take action to opt out, which many users may not know to do.

Is ConductAtlas affiliated with GitHub?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by GitHub.