GitHub's Innovation Graph marks its second anniversary with expanded academic research, policy impact, and new bar chart race visualizations tracking global software development trends.
Today marks the second anniversary of the GitHub Innovation Graph, a comprehensive dataset tracking aggregated statistics on public software development activity worldwide. Since its launch, the Innovation Graph has become an essential resource for researchers, policymakers, and organizations seeking to understand the dynamics of open source development and its broader economic and social implications.

Updated Visualizations and Data Releases
The latest data release brings refreshed bar chart race videos to the git pushes, repositories, developers, and organizations global metrics pages. These visualizations provide dynamic insights into how software development activity has evolved across different regions and communities over time.
Academic Research Leveraging Innovation Graph Data
One of the most significant developments over the past year has been the growing body of academic research utilizing Innovation Graph data. Researchers have applied various methodologies to explore questions ranging from global collaboration networks to the institutional foundations of digital capabilities.
Historical Institutions and Modern Digital Capabilities
Research by an economist at the Federal Reserve Board examines how the density of Protestant mission stations correlates with present-day participation in digital production across African countries. The study, titled "Historical Institutions and Modern Digital Capabilities: New Evidence from GitHub in Africa", demonstrates how historical factors continue to influence contemporary digital engagement patterns.
Cross-National Collaboration Patterns
A collaborative effort between researchers from MIT, Carnegie Mellon, and the University of Chicago analyzes international collaboration patterns in the Innovation Graph's economy collaborators dataset. Their paper, "The Structure of Cross-National Collaboration in Open-Source Software Development", explores how common colonial histories influence modern software development collaboration activities. The replication package is available for further exploration.
Small-World Phenomenon in Global OSS Collaboration
Researchers at Midwestern State University and Tarleton State University conducted a social network analysis revealing the tightly connected, small-world structure of global open source software collaboration. Their findings, published in the Journal of Global Information Management, highlight the interconnected nature of the global developer community.
Software Complexity of Nations
An innovative study extends countries' software economic complexity into the digital economy by leveraging the geographic distribution of programming languages in open source software. The research, "The Software Complexity of Nations", demonstrates that software economic complexity predicts GDP, income inequality, and emissions, with significant policy implications.
Policy and Industry Impact
Beyond academic research, the Innovation Graph has influenced policy discussions and industry analysis at major conferences and in prominent publications.
Conference Presentations
The dataset has been featured at numerous venues including:
- ATLC25: The 10th Atlanta Conference on Science and Innovation Policy
- OpenForum Academy Symposium 2025
- 2nd CEU Vienna Data Analytics Jamboree
- Wharton Human-AI Research: 3rd Annual Business & Generative AI Conference
Media Coverage
Major international publications have referenced Innovation Graph data in their analysis. The Economist published two pieces in 2025 drawing on GitHub data: one examining China's approach to open technology (June 17, 2025) and another exploring India's potential role as an AI superpower (September 18, 2025). This coverage demonstrates how open source activity data can illuminate geopolitical and economic shifts.
Flagship Reports
The Innovation Graph has contributed to several influential reports:
- The 2025 Stanford AI Index Report
- The 2025 WIPO Global Innovation Index
- The Rise of FOSS in India report from the National Law School of India University
Future Directions
As we move through 2026, the GitHub team is focused on deepening collaboration with the research community, welcoming new perspectives, and creating clearer pathways for people to apply Innovation Graph data in their own contexts. The goal is to support applications ranging from strategy and research to product development and policy-making.
"We're grateful for the community that has formed around the Innovation Graph," says Kevin Xu, Staff Software Engineer at GitHub. "Our focus will be on deepening collaboration, welcoming new perspectives, and creating clearer pathways for people to apply the Innovation Graph data in their own contexts, from strategy and research to product development and policy."

The Innovation Graph continues to evolve as a vital resource for understanding the global software development ecosystem, providing evidence-based insights that inform decisions across academia, industry, and government. With its second year complete, the dataset is poised to play an even more significant role in shaping our understanding of digital innovation and its impact on the global economy.
For those interested in exploring the data themselves, the Innovation Graph repository provides access to the datasets and documentation needed to conduct independent analysis.

Comments
Please log in or register to join the discussion