Better Data for AI – A Possible Task


UN-CCSA/ UNCTAD/ITU/ Colombia, Norway, UK

Session 239

Thursday, 9 July 2026 16:00–16:45 (UTC+02:00) Physical (on-site) and Virtual (remote) participation Room K, Palexpo Interactive Session
Register »

Physical (on-site) and Virtual (remote) participation


Large language models and generative AI increasingly shape how knowledge is used and shared, but their reliability depends on the quality and provenance of training data. As AI expands into governance, public services and markets, identifying trustworthy, well-documented sources has become a global priority.

Official statistics provide a key public good: they are produced under professional standards, transparent methodologies and public oversight, drawing on administrative records, surveys and privately held data. They offer essential ground truth for validating, calibrating and benchmarking AI outputs. In a context of growing data volumes and uneven quality, authoritative statistical datasets help verify results and ensure consistency with established evidence.

Combining diverse data for AI raises challenges around quality assurance, metadata, representativeness, bias, intellectual property and privacy. The WSIS+20 and GDC processes emphasize shared approaches for safe, inclusive and interoperable digital ecosystems.

This session will examine how statistical and geospatial communities, private companies, NGOs and initiatives such as the Financing for Development work on the Future of Data and the Trusted Data Observatory can develop joint action on “better data for AI.” It will explore principles for identifying, curating and sharing high-quality datasets and metadata. Experts from national statistical offices, international organizations, academia and the private sector will discuss roles of official and private data, practices for documenting AI-ready datasets, innovations in AI-for-data, and opportunities for cooperation to ensure the next generation of AI is built on trusted, accountable and globally beneficial data foundations.

Topics
Digital Economy Emerging Technologies Global Digital Compact (GDC)
WSIS Action Lines
  • AL C1 logo C1. The role of governments and all stakeholders in the promotion of ICTs for development
  • AL C2 logo C2. Information and communication infrastructure
  • AL C6 logo C6. Enabling environment
  • AL C11 logo C11. International and regional cooperation
Sustainable Development Goals
  • Goal 17 logo Goal 17: Revitalize the global partnership for sustainable development
GDC Objectives
  • Objective 1: Close all digital divides and accelerate progress across the Sustainable Development Goals
  • Objective 2: Expand inclusion in and benefits from the digital economy for all