Data for Artificial Intelligence
The role of governments as a provider of data for artificial intelligence
Objectives
The data assets stewarded by governments – both at a national and multinational level – are unique and authoritative. Data assets in public hands include administrative data, health records in countries with national health systems, census information, statute and case law, statutory registers, and data collected by regulators. This project aims to support governments to make decisions about whether and how to share data they steward with AI developers.
Supporting governments in enabling AI innovation through responsible data stewardship
This project will survey the different ways in which governments are, could, and should, use data that they steward to support the development of AI. It will adopt or produce a suitable framework for understanding the types of data held by governments – such as administrative data, statutory registers, or routine monitoring data – and their potential to support the development of AI systems.
The first phase of the project will be to conduct a survey of data sharing by governments for the development of AI around the world, generating three illustrative case studies of current practices.
The second phase will examine how governments are making this data available to AI developers in well-governed ways – including how they are navigating issues around technology, culture, public attitudes, and fair financial models. This will create recommendations for governments who are trying to prioritise which data to make public, inform the discussion of how to navigate data governance challenges, and provide insight on the types of resources required to successfully govern the hosting of public data resources, particularly when these are sensitive in nature. The third phase will analyse state-of-the-art data sharing technologies and facilitators and who has access to them, and produce general guiding principles for the formulation of domestic rules in the interest of more transnational coherence.à
The intended impact is to increase the availability of publicly held data for AI grounded in the principles of human rights, inclusion, diversity, innovation, and economic growth by helping governments to prioritise their efforts and to reduce their concerns about the risks of sharing public data for AI by providing clear guidance, use cases and examples that demonstrate how it can be done safely and responsibly.
Team
Stephanie King
Director of AI Initiatives
Ching-Yi Liu
GPAI expert and Project co-lead
Jhalak Kakkar
GPAI expert and Project co-lead
Project Advisory Group
Kim McGrail
Andrea A. Jacobs
Christian Reimsbach
Bertrand Monthubert
Maja Bogataj
Collaborative opportunities with CEIMIA
Collaborating with CEIMIA means contributing to the development of responsible AI solutions.
We welcome all GPAI member governments who are navigating these challenges who would be willing to participate in this project via offering applied case studies, by sharing their experiences and approaches as stewards of value data for AI.
Additionally, we welcome expert representatives from external organisations in this domain to join the team as members of the Project Advisory Group.