GitHub · GitHub Privacy Statement

AI/ML Training Data Use

High severity Unique · 0 of 325 platforms
Recent governance activity: GitHub recorded 3 documented changes in the last 30 days.
Document Record

What it is

GitHub may use your personal data, activity, and content to train artificial intelligence and machine learning models, including those behind features like GitHub Copilot, though certain opt-out options are available.

This analysis describes what GitHub's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances.

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

This provision establishes GitHub's operational authority to apply user-generated content to AI model training, which affects how the platform monetizes and develops its technical infrastructure. The clause defines the boundaries of permitted data use beyond the immediate service delivery.

Recent Activity

This document changed recently

High Apr 28, 2026

The updated terms now explicitly authorize GitHub to collect AI outputs generated within the platform alongside user-provided code and content, and to share personal data with Microsoft and other GitHub affiliates for purposes including training and improving artificial intelligence and machine learning technologies. The privacy statement indicates that aggregate and de-identified data will be used where feasible, but the updated language establishes broader authority for affiliate data sharing and AI model development than the previous version stated. The revised terms also remove specific disclosure of the conditions under which GitHub personnel may access private repositories, replacing that detail with a cross-reference to the Terms of Service, which means the scope of internal GitHub access to private repositories is now defined in a separate contract document rather than the privacy statement itself.


Consumer impact (what this means for users)

Your code contributions, usage patterns, and other data could be used to improve GitHub's AI features, in some cases without your explicit consent. You can opt out of applicable AI training uses through your privacy settings.

What you can do

⚠️ These actions may provide transparency or partial mitigation but may not fully address the underlying issue. Effectiveness varies by jurisdiction and individual circumstances.
  • Opt Out of AI Training
    Log in to your GitHub account, navigate to Settings > Privacy, and locate the AI training data preferences to opt out of your data being used for AI/ML model training.

How other platforms handle this

HubSpot Medium

We may use the information we collect to help us improve our products and services, to develop new features, and to perform analytics. We may also use your information to personalize your experience and to allow us to deliver the type of content and product offerings in which you are most interested...

Waze Medium

We may use aggregated, anonymized, or de-identified information that cannot reasonably be used to identify you for any purpose, including sharing it with partners, advertisers, and other third parties. This information is not subject to the restrictions in this Privacy Policy.

Threads Medium

We use the information we collect to send you ads and other commercial and sponsored content. We use the information we have to deliver our products, including to personalize features and content and make suggestions for you on and off our products. We share information across the Meta Companies.


ConductAtlas Analysis

Institutional analysis (Compliance & governance intelligence)

This provision implicates GDPR Article 6 legitimate interests as a lawful basis for AI training, requiring a balancing test; under CPRA, use of sensitive personal information for AI training may require additional disclosures and opt-out mechanisms. Enterprises should assess whether employee use of GitHub results in organizational data being incorporated into AI training pipelines.


Applicable agencies

  • FTC
    The FTC has jurisdiction over unfair or deceptive data practices, including undisclosed use of consumer data to train commercial AI products.

Applicable regulations

  • EU AI Act (European Union)
  • CCPA/CPRA (California, USA)
  • Colorado AI Act (Colorado, USA)
  • CAN-SPAM (United States Federal)
  • ePrivacy Directive (European Union)
  • FTC Act Section 5 (United States Federal)
  • GDPR (European Union)
  • UK GDPR (United Kingdom)

Provision details

Document information
Document: GitHub Privacy Statement
Entity: GitHub
Document last updated: May 5, 2026

Tracking information
First tracked: March 20, 2026
Last verified: March 20, 2026
Record ID: CA-P-001343
Document ID: CA-D-00254

Evidence Provenance
Content hash (SHA-256): 6ffd0bca7ee8ec2746c4351f0452b4941b4e2157175f395ff9607b70d0463c07
Analysis generated: March 20, 2026 12:20 UTC
Evidence: ✓ Snapshot stored   ✓ Hash verified

Citation Record
Entity: GitHub
Document: GitHub Privacy Statement
Record ID: CA-P-001343
Captured: 2026-03-20 12:20:26 UTC
SHA-256: 6ffd0bca7ee8ec27…
URL: https://conductatlas.com/platform/github/github-privacy-statement/aiml-training-data-use/
Accessed: May 20, 2026
Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.
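
A reader who holds a copy of the archived snapshot can, in principle, check it against the content hash recorded above. The following is a minimal sketch only: it assumes the published digest is a plain SHA-256 over the snapshot's raw bytes, which this record does not confirm (the snapshot may be normalized before hashing). The function names and the file name `snapshot.html` are hypothetical.

```python
import hashlib

# Full digest as published in the record's "Content hash (SHA-256)" field.
RECORDED_SHA256 = "6ffd0bca7ee8ec2746c4351f0452b4941b4e2157175f395ff9607b70d0463c07"


def sha256_hex(data: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of raw bytes."""
    return hashlib.sha256(data).hexdigest()


def matches_record(data: bytes, recorded: str = RECORDED_SHA256) -> bool:
    """Compare the digest of `data` with a recorded digest.

    A digest ending in an ellipsis (for example "6ffd0bca7ee8ec27…", as in
    the citation record) is treated as a prefix; otherwise the full digest
    must match exactly. Assumption: the hash covers the raw bytes as given.
    """
    digest = sha256_hex(data)
    recorded = recorded.strip().lower()
    if recorded.endswith("…"):
        prefix = recorded.rstrip("…")
        # An empty prefix would match anything, so reject it.
        return bool(prefix) and digest.startswith(prefix)
    return digest == recorded
```

With a local copy saved under the hypothetical name `snapshot.html`, the check would be `matches_record(open("snapshot.html", "rb").read())`. A mismatch does not by itself show tampering; it may only mean the bytes hashed here differ from whatever form of the snapshot was hashed for the record.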
Classification
Severity: High


Frequently Asked Questions

What does GitHub's AI/ML Training Data Use clause do?

This provision establishes GitHub's operational authority to apply user-generated content to AI model training, which affects how the platform monetizes and develops its technical infrastructure. The clause defines the boundaries of permitted data use beyond the immediate service delivery.

How does this clause affect you?

Your code contributions, usage patterns, and other data could be used to improve GitHub's AI features, in some cases without your explicit consent. You can opt out of applicable AI training uses through your privacy settings.

Is ConductAtlas affiliated with GitHub?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by GitHub.