GitHub · GitHub Privacy Statement

De-identified and Aggregate Data Use

Medium severity
Share 𝕏 Share in Share 🔒 PDF

What it is

GitHub can take your personal data, strip out identifying details, and use the resulting anonymized data for any purpose without being bound by the privacy protections in this policy.

Clause Stability Highly Volatile

1
Change
1
Month Monitored
Apr 27, 2026
First Seen
Apr 27, 2026
Last Seen
This clause has changed once in 1 month of monitoring.

Change history

added Apr 28, 2026

This new provision explicitly exempts de-identified and aggregate data from privacy protections, enabling unrestricted use and sharing for any purpose.

View full change record →

Consumer impact (what this means for users)

GitHub can derive de-identified or aggregate data from your personal information and repository activity and use or share it without restriction, including potentially for training AI models like GitHub Copilot, with no opt-out provided under this provision.

Cross-platform context

See how other platforms handle De-identified and Aggregate Data Use and similar clauses.

Compare across platforms →
Need full compliance memos? See Professional →

Why it matters (compliance & risk perspective)

The broad right to use de-identified data derived from user content without restriction — including for AI training and product improvement — means GitHub can extract commercial value from user behavior and code without the privacy policy's protections applying.

View original clause language
We may use de-identified or aggregate information derived from your personal data for research, analytics, and to improve our products and services. Such de-identified or aggregate data is not subject to this Privacy Statement and may be used and shared by GitHub without restriction.

Institutional analysis (Compliance & legal intelligence)

REGULATORY FRAMEWORK: GDPR Recital 26 and Art. 4(1) establish that truly anonymized data falls outside GDPR scope, but the standard for anonymization is high (FPF and EDPB guidance); CCPA/CPRA §1798.140(m) defines 'deidentified' data with specific technical and contractual requirements including public commitments not to re-identify; FTC has issued guidance on the re-identification risk of 'anonymized' datasets.

🔒

Compliance intelligence locked

Regulatory citations, enforcement risk, and due diligence action items.

Watcher $9.99/mo Professional $149/mo

Watcher: regulatory citations. Professional: full compliance memo.

Applicable agencies

  • FTC
    FTC has issued guidance and brought enforcement actions regarding deceptive claims about data anonymization and unrestricted use of consumer-derived data.
    File a complaint →

Provision details

Document information
Document
GitHub Privacy Statement
Entity
GitHub
Document last updated
April 29, 2026
Tracking information
First tracked
April 27, 2026
Last verified
April 27, 2026
Record ID
CA-P-003600
Document ID
CA-D-00254
Evidence Provenance
Source URL
Wayback Machine
SHA-256
6b5f0a9a524d3261cfe25f12abc65ee86bfcca11dcb979d0a2c6fa30d7aa36e8
Verified
✓ Snapshot stored   ✓ Change verified
How to Cite
ConductAtlas Policy Archive
Entity: GitHub | Document: GitHub Privacy Statement | Record: CA-P-003600
Captured: 2026-04-27 14:59:43 UTC | SHA-256: 6b5f0a9a524d3261…
URL: https://conductatlas.com/platform/github/github-privacy-statement/de-identified-and-aggregate-data-use/
Accessed: May 2, 2026
Classification
Severity
Medium
Categories

Other provisions in this document