Home Data for Artificial Intelligence The role of governments as a provider of data for artificial intelligence

Data for Artificial Intelligence

The role of governments as a provider of data for artificial intelligence

Is this project of interest to you ? Get involved today.



Governments are uniquely placed not only to create the policy environment in which others share data for the development of AI, but also to be a provider of data themselves.

The data assets stewarded by governments – both at a national and multinational level – are unique and authoritative. Data assets in public hands include administrative data, health records in countries with national health systems, census information, statute and case law, statutory registers, and data collected by regulators. This project aims to support governments to make decisions about whether and how to share data they steward with AI developers.

Supporting governments in enabling AI innovation through responsible data stewardship

This project will survey the different ways in which governments are, could, and should, use data that they steward to support the development of AI. It will adopt or produce a suitable framework for understanding the types of data held by governments – such as administrative data, statutory registers, or routine monitoring data – and their potential to support the development of AI systems.

The first phase of the project will be to conduct a survey of data sharing by governments for the development of AI around the world, generating three illustrative case studies of current practices.

The second phase will examine how governments are making this data available to AI developers in well-governed ways – including how they are navigating issues around technology, culture, public attitudes, and fair financial models. This will create recommendations for governments who are trying to prioritise which data to make public, inform the discussion of how to navigate data governance challenges, and provide insight on the types of resources required to successfully govern the hosting of public data resources, particularly when these are sensitive in nature. The third phase will analyse state-of-the-art data sharing technologies and facilitators and who has access to them, and produce general guiding principles for the formulation of domestic rules in the interest of more transnational coherence.à

The intended impact is to increase the availability of publicly held data for AI grounded in the principles of human rights, inclusion, diversity, innovation, and economic growth by helping governments to prioritise their efforts and to reduce their concerns about the risks of sharing public data for AI by providing clear guidance, use cases and examples that demonstrate how it can be done safely and responsibly.


Stephanie King


Director of AI Initiatives

Ching-Yi Liu

National Taiwan University

GPAI expert and Project co-lead

Jhalak Kakkar

Centre for Communication Governance

GPAI expert and Project co-lead

Project Advisory Group

Kim McGrail

Andrea A. Jacobs

Christian Reimsbach

Bertrand Monthubert

Maja Bogataj

Collaborative opportunities with CEIMIA

Collaborating with CEIMIA means contributing to the development of responsible AI solutions.

We welcome all GPAI member governments who are navigating these challenges who would be willing to participate in this project via offering applied case studies, by sharing their experiences and approaches as stewards of value data for AI.

Additionally, we welcome expert representatives from external organisations in this domain to join the team as members of the Project Advisory Group.