GitHub · GitHub Privacy Statement

AI/ML Training Data Use

High severity

Why it matters

Developers storing code on GitHub, including potentially proprietary or sensitive code, should be aware that their contributions and behavior may feed into commercial AI products.

Consumer impact

GitHub collects a broad range of personal data, including account information, browsing behavior, code contributions, and payment details. This data may be shared with Microsoft affiliates and third-party service providers, and disclosed in response to legal requests. It may also be used to train AI and machine learning models, including those behind features such as GitHub Copilot, which has significant implications for developers who store proprietary or sensitive code on the platform. You can opt out of AI training data use and manage your privacy preferences through your GitHub account settings at https://github.com/settings/privacy.

What you can do

⚠️ These actions may provide transparency or partial mitigation but may not fully address the underlying issue. Effectiveness varies by jurisdiction and individual circumstances.
  • Opt Out of AI Training Data Use
    Log in to your GitHub account, navigate to Settings > Privacy, and locate the AI training data preferences to opt out of your data being used for AI/ML model training.

Applicable agencies

  • FTC
    The FTC has jurisdiction over unfair or deceptive data practices, including undisclosed use of consumer data to train commercial AI products.

Provision details

Document information
Document: GitHub Privacy Statement
Entity: GitHub
Document last updated: March 24, 2026

Tracking information
First tracked: March 20, 2026
Last verified: March 20, 2026
Record ID: CA-P-001343
Document ID: CA-D-00254

Evidence Provenance
Source URL
Wayback Machine
SHA-256: 6ffd0bca7ee8ec2746c4351f0452b4941b4e2157175f395ff9607b70d0463c07
Verified: ✓ Snapshot stored   ✓ Change verified
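
The SHA-256 value above lets a reader check that a saved copy of the snapshot has not changed. The following is a minimal sketch of such a check, not a description of how ConductAtlas computes its digest: it assumes the hash covers the raw bytes of the stored snapshot, and the helper names and the local file name in the usage note are hypothetical.

```python
import hashlib
import hmac

# Digest published in this record (copied from the SHA-256 field).
PUBLISHED_SHA256 = "6ffd0bca7ee8ec2746c4351f0452b4941b4e2157175f395ff9607b70d0463c07"


def sha256_hex(data: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def matches_published_digest(snapshot: bytes, published_hex: str = PUBLISHED_SHA256) -> bool:
    """Check whether the snapshot bytes hash to the published digest.

    Assumption: the published digest was taken over the raw snapshot bytes.
    The comparison ignores surrounding whitespace and letter case.
    """
    return hmac.compare_digest(sha256_hex(snapshot), published_hex.strip().lower())
```

To use it, read a locally saved copy in binary mode (for example `open("snapshot.html", "rb").read()`, where the file name is a placeholder) and pass the bytes to `matches_published_digest`. A `False` result only shows that the bytes differ from what was hashed; it could also mean the digest was computed over a different representation, such as extracted text.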
How to Cite
ConductAtlas Policy Archive
Entity: GitHub | Document: GitHub Privacy Statement | Record: CA-P-001343
Captured: 2026-03-20 12:20:26 UTC | SHA-256: 6ffd0bca7ee8ec27…
URL: https://conductatlas.com/platform/github/github-privacy-statement/aiml-training-data-use/
Accessed: April 4, 2026
Classification
Severity: High