GitHub · GitHub Privacy Statement · View original document ↗

AI/ML Training Data Use

High severity Unique · 0 of 343 platforms
Share 𝕏 Share in Share 🔒 PDF
Recent governance activity GitHub recorded 2 documented changes in the last 30 days.
Start monitoring updates
Monitor governance changes for GitHub Create a free account to receive the weekly governance digest and monitor one platform for governance changes.
Create free account No credit card required.
Document Record

What it is

GitHub may use your personal data, activity, and content to train artificial intelligence and machine learning models, including features like GitHub Copilot, though certain opt-out options are available.

This analysis describes what GitHub's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

This provision establishes GitHub's operational authority to apply user-generated content to AI model training, which affects how the platform monetizes and develops its technical infrastructure. The clause defines the boundaries of permitted data use beyond the immediate service delivery.

Recent Activity

This document changed recently

High Apr 28, 2026

The updated terms now explicitly authorize GitHub to collect AI outputs generated within the platform alongside user-provided code and content, and to share personal data with Microsoft and other GitHub affiliates for purposes including training and improving artificial intelligence and machine learning technologies. The privacy statement indicates that aggregate and de-identified data will be used where feasible, but the updated language establishes broader authority for affiliate data sharing and AI model development than the previous version stated. The revised terms also remove specific disclosure of the conditions under which GitHub personnel may access private repositories, replacing that detail with a cross-reference to the Terms of Service, which means the scope of internal GitHub access to private repositories is now defined in a separate contract document rather than the privacy statement itself.

View change record →

Clause Stability Stable

0
Changes
3
Months Monitored
Apr 3, 2026
First Seen
Apr 10, 2026
Last Seen
This clause type exists across 381 other provisions on other platforms.

Consumer impact (what this means for users)

Your code contributions, usage patterns, and other data could be used to improve GitHub's AI features without explicit consent in all cases. Opting out of applicable AI training uses is possible through privacy settings.

What you can do

⚠️ These actions may provide transparency or partial mitigation but may not fully address the underlying issue. Effectiveness varies by jurisdiction and individual circumstances.
  • Opt Out of Arbitration
    Log in to your GitHub account, navigate to Settings > Privacy, and locate the AI training data preferences to opt out of your data being used for AI/ML model training.

How other platforms handle this

HubSpot Medium

We may use the information we collect to help us improve our products and services, to develop new features, and to perform analytics. We may also use your information to personalize your experience and to allow us to deliver the type of content and product offerings in which you are most interested...

LinkedIn Medium

We target (and measure the performance of) ads to Members, Visitors and others both on and off our Services directly or through a variety of partners, using the following data, whether separately or combined: Data from advertising technologies on and off our Services, like web beacons, pixels, ad ta...

Microsoft Azure Medium

Microsoft uses data we collect to provide you with rich, interactive experiences. In particular, we may use data to show you advertising or serve Microsoft-selected content within Microsoft products and services. Microsoft does not use what you say in email, chat, video calls, or voice mail to targe...

See all platforms with this clause type →

Monitoring

GitHub has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 25 platforms.

Start Monitor free trial Or create a free account →
ConductAtlas Analysis

Institutional analysis (Compliance & governance intelligence)

This provision implicates GDPR Article 6 legitimate interests as a lawful basis for AI training, requiring a balancing test; under CPRA, use of sensitive personal information for AI training may require additional disclosures and opt-out mechanisms. Enterprises should assess whether employee use of GitHub results in organizational data being incorporated into AI training pipelines.

Full compliance analysis

Regulatory citations, enforcement risk, and due diligence action items.

Track 1 platform — free Try Monitor free for 14 days

Free: track 1 platform + weekly digest. Monitor: 25 platforms + same-day alerts. No credit card required.

Applicable agencies

  • FTC
    The FTC has jurisdiction over unfair or deceptive data practices, including undisclosed use of consumer data to train commercial AI products.
    File a complaint →

Applicable regulations

EU AI Act
European Union
CCPA/CPRA
California, USA
Colorado AI Act
US-CO
CAN-SPAM
United States Federal
ePrivacy Directive
European Union
FTC Act Section 5
United States Federal
GDPR
European Union
UK GDPR
United Kingdom

Provision details

Document information
Document
GitHub Privacy Statement
Entity
GitHub
Document last updated
May 5, 2026
Tracking information
First tracked
March 20, 2026
Last verified
March 20, 2026
Record ID
CA-P-001343
Document ID
CA-D-00254
Evidence Provenance
Source URL
Wayback Machine
Content hash (SHA-256)
6ffd0bca7ee8ec2746c4351f0452b4941b4e2157175f395ff9607b70d0463c07
Analysis generated
March 20, 2026 12:20 UTC
Methodology
Evidence
✓ Snapshot stored   ✓ Hash verified
Citation Record
Entity: GitHub
Document: GitHub Privacy Statement
Record ID: CA-P-001343
Captured: 2026-03-20 12:20:26 UTC
SHA-256: 6ffd0bca7ee8ec27…
URL: https://conductatlas.com/platform/github/github-privacy-statement/aiml-training-data-use/
Accessed: July 4, 2026
Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.
Classification
Severity
High
Categories

Other risks in this policy

Related Analysis

Compliance Governance Intelligence

Need to monitor specific governance provisions?

Compliance includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies
Start Compliance free trial

Or start with Monitor →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does GitHub's AI/ML Training Data Use clause do?

This provision establishes GitHub's operational authority to apply user-generated content to AI model training, which affects how the platform monetizes and develops its technical infrastructure. The clause defines the boundaries of permitted data use beyond the immediate service delivery.

How does this clause affect you?

Your code contributions, usage patterns, and other data could be used to improve GitHub's AI features without explicit consent in all cases. Opting out of applicable AI training uses is possible through privacy settings.

Is ConductAtlas affiliated with GitHub?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by GitHub.