GitHub · GitHub Privacy Statement · View original document ↗

De-identified and Aggregate Data Use

Medium severity Unique · 0 of 343 platforms
Share 𝕏 Share in Share 🔒 PDF
Monitor governance changes for GitHub Create a free account to receive the weekly governance digest and monitor one platform for governance changes.
Create free account No credit card required.
Document Record

What it is

GitHub can take your personal data, strip out identifying details, and use the resulting anonymized data for any purpose without being bound by the privacy protections in this policy.

This analysis describes what GitHub's agreement states, permits, or reserves. It does not constitute a legal determination about enforceability. Regulatory applicability and practical outcomes may vary by jurisdiction, enforcement context, and individual circumstances. Read our methodology

ConductAtlas Analysis

Why it matters (compliance & governance perspective)

This clause establishes a carve-out from privacy protections for data in de-identified or aggregated form, permitting internal research operations and product development to proceed without the notice and consent requirements that apply to personal data handling.

Recent Activity

This document changed recently

High Apr 28, 2026

The updated terms now explicitly authorize GitHub to collect AI outputs generated within the platform alongside user-provided code and content, and to share personal data with Microsoft and other GitHub affiliates for purposes including training and improving artificial intelligence and machine learning technologies. The privacy statement indicates that aggregate and de-identified data will be used where feasible, but the updated language establishes broader authority for affiliate data sharing and AI model development than the previous version stated. The revised terms also remove specific disclosure of the conditions under which GitHub personnel may access private repositories, replacing that detail with a cross-reference to the Terms of Service, which means the scope of internal GitHub access to private repositories is now defined in a separate contract document rather than the privacy statement itself.

View change record →

Consumer impact (what this means for users)

GitHub can derive de-identified or aggregate data from your personal information and repository activity and use or share it without restriction, including potentially for training AI models like GitHub Copilot, with no opt-out provided under this provision.

How other platforms handle this

Mixpanel Medium

Mixpanel may use aggregated or de-identified data derived from customer event data for its own purposes, including improving its services, developing new features, and generating analytics insights, provided that such data cannot reasonably be used to identify individual users.

LinkedIn Medium

We target (and measure the performance of) ads to Members, Visitors and others both on and off our Services directly or through a variety of partners, using the following data, whether separately or combined: Data from advertising technologies on and off our Services, like web beacons, pixels, ad ta...

Microsoft Azure Medium

Microsoft uses data we collect to provide you with rich, interactive experiences. In particular, we may use data to show you advertising or serve Microsoft-selected content within Microsoft products and services. Microsoft does not use what you say in email, chat, video calls, or voice mail to targe...

See all platforms with this clause type →

Monitoring

GitHub has changed this document before.

Receive same-day alerts, structured change summaries, and monitoring for up to 25 platforms.

Start Monitor free trial Or create a free account →
▸ View Original Clause Language DOCUMENT RECORD
"
We may use de-identified or aggregate information derived from your personal data for research, analytics, and to improve our products and services. Such de-identified or aggregate data is not subject to this Privacy Statement and may be used and shared by GitHub without restriction.

— Excerpt from GitHub's GitHub Privacy Statement

ConductAtlas Analysis

Institutional analysis (Compliance & governance intelligence)

REGULATORY FRAMEWORK: GDPR Recital 26 and Art. 4(1) establish that truly anonymized data falls outside GDPR scope, but the standard for anonymization is high (FPF and EDPB guidance); CCPA/CPRA §1798.140(m) defines 'deidentified' data with specific technical and contractual requirements including public commitments not to re-identify; FTC has issued guidance on the re-identification risk of 'anonymized' datasets.

Full compliance analysis

Regulatory citations, enforcement risk, and due diligence action items.

Track 1 platform — free Try Monitor free for 14 days

Free: track 1 platform + weekly digest. Monitor: 25 platforms + same-day alerts. No credit card required.

Applicable agencies

  • FTC
    FTC has issued guidance and brought enforcement actions regarding deceptive claims about data anonymization and unrestricted use of consumer-derived data.
    File a complaint →

Applicable regulations

EU AI Act
European Union
CCPA/CPRA
California, USA
Colorado AI Act
US-CO
CAN-SPAM
United States Federal
ePrivacy Directive
European Union
FTC Act Section 5
United States Federal
GDPR
European Union
UK GDPR
United Kingdom

Provision details

Document information
Document
GitHub Privacy Statement
Entity
GitHub
Document last updated
May 5, 2026
Tracking information
First tracked
April 27, 2026
Last verified
April 27, 2026
Record ID
CA-P-003600
Document ID
CA-D-00254
Evidence Provenance
Source URL
Wayback Machine
Content hash (SHA-256)
6b5f0a9a524d3261cfe25f12abc65ee86bfcca11dcb979d0a2c6fa30d7aa36e8
Analysis generated
April 27, 2026 14:59 UTC
Methodology
Evidence
✓ Snapshot stored   ✓ Hash verified
Citation Record
Entity: GitHub
Document: GitHub Privacy Statement
Record ID: CA-P-003600
Captured: 2026-04-27 14:59:43 UTC
SHA-256: 6b5f0a9a524d3261…
URL: https://conductatlas.com/platform/github/github-privacy-statement/de-identified-and-aggregate-data-use/
Accessed: June 17, 2026
Permanent archival reference. Stable identifier suitable for legal filings, compliance documentation, and research citation.
Classification
Severity
Medium
Categories

Other risks in this policy

Related Analysis

Compliance Governance Intelligence

Need to monitor specific governance provisions?

Compliance includes provision-level monitoring, governance timelines, regulatory mapping, and audit-ready analysis.

Arbitration clauses AI governance Data rights Indemnification Retention policies
Start Compliance free trial

Or start with Monitor →

Built from archived source documents, structured governance mappings, and historical version tracking.

Frequently Asked Questions

What does GitHub's De-identified and Aggregate Data Use clause do?

This clause establishes a carve-out from privacy protections for data in de-identified or aggregated form, permitting internal research operations and product development to proceed without the notice and consent requirements that apply to personal data handling.

How does this clause affect you?

GitHub can derive de-identified or aggregate data from your personal information and repository activity and use or share it without restriction, including potentially for training AI models like GitHub Copilot, with no opt-out provided under this provision.

Is ConductAtlas affiliated with GitHub?

No. ConductAtlas is an independent monitoring service. We are not affiliated with, endorsed by, or sponsored by GitHub.