Page 40 - The Annual AI Governance Report 2025 Steering the Future of AI

P. 40

The Annual AI Governance Report 2025: Steering the Future of AI

training datasets or code. A wide range of actors—including major AI labs, open science
172
consortia, and civil society organisations—are involved in shaping norms and practices around
openness.

Defining Practices and Actors in Open(ish) AI. Current open source practices vary widely across Theme 6: AI Safety
labs and projects. Meta's LLaMA and Mistral models, while referred to as “open,” are released
173
with restrictive licences. ‘Truly’ open models, such as those by EleutherAI, Hugging Face’s
174
BigScience project, or StabilityAI, publish code, weights, and sometimes training data. In
175
China, Alibaba Cloud’s Qwen family , DeepSeek-LM , and Baidu’s forthcoming open-weight
177
176
ERNIE models all release or pledge to release weights (and, in Qwen’s case, code) on public
178
repositories. India’s AI4Bharat programme at IIT Madras follows a similar ethos, publishing
multilingual Indic models such as IndicTrans 2 under permissive licences to support low-
resource languages and local innovation. In contrast, some models from Google DeepMind
179
or OpenAI are fully closed. Initiatives such as the AI Alliance (led by IBM and Meta) promote
“open innovation,” but with few enforceable standards. This fragmented landscape reflects
ongoing negotiations between openness, safety, and commercial interest. The common thread
is that outside researchers can, in some capacity, inspect, fine-tune, and deploy the models
locally rather than calling a remote API, a shift advocates say restores scientific reproducibility
and lowers entry costs. 180

Benefits and Challenges—Accountability vs. Proliferation Risk. Proponents argue that open
source models enhance auditability, reproducibility, and global participation, allowing
actors—including those in Global Majority countries—to experiment with and build on frontier
capabilities. Critics, including some government agencies and labs, argue that open weights
increase proliferation risks for misuse, including disinformation or bioterrorism. Tensions
181
between openness and control have led to calls for differentiated governance: e.g., restricting
capabilities above certain thresholds, while keeping smaller models fully open.

Adoption and Advocacy. Open source AI is increasingly used for capacity building and
technological sovereignty. For instance, South Africa’s Masakhane project and Latin America’s
182
Cohere For AI have developed multilingual language models based on open source
183

172 Seger, E., Dreksler, N., Moulange, R., Dardaman, E., Schuett, J., Wei, K., Winter, C., Arnold, M., Héigeartaigh,
S. Ó., Korinek, A., Anderljung, M., Bucknall, B., Chan, A., Stafford, E., Koessler, L., Ovadya, A., Garfinkel, B.,
Bluemke, E., Aird, M., Gupta, A. (2023, September 29). Open-Sourcing Highly Capable Foundation Models:
An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives. arXiv.org.
173 Meta’s LLaMa license is still not Open Source. (2025, February 18). Open Source Initiative.
174 Wiggers, K. (2025, March 19). ‘Open’ AI model licenses often carry concerning restrictions. TechCrunch.
175 Wiggers, K. (2025b, June 6). EleutherAI releases massive AI training dataset of licensed and open domain
text. TechCrunch.
176 QwenLM. (n.d.). GitHub - QwenLM/Qwen: The official repo of Qwen (通义千问) chat & pretrained large
language model proposed by Alibaba Cloud. GitHub.
177 DeepSeek’s release of an open-weight frontier AI model. (April 2025). IISS.
178 Williams, K. (2025, June 30). China’s biggest public AI drop since DeepSeek, Baidu’s open source Ernie, is
about to hit the market. CNBC.
179 Gala, J., Chitale, P. A., Ak, R., Gumma, V., Doddapaneni, S., Kumar, A., Nawale, J., Sujatha, A., Puduppully, R.,
Raghavan, V., Kumar, P., Khapra, M. M., Dabre, R., & Kunchukuttan, A. (2023, May 25). IndicTrans2: Towards
High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages. arXiv.org.
180 White, M., Haddad, I., Osborne, C., Liu, X. Y., Abdelmonsef, A., Varghese, S., & Hors, A. L. (2024, March 20).
The Model Openness Framework: Promoting completeness and openness for reproducibility, transparency,
and usability in artificial intelligence. arXiv.org.
181 Harris, D. E. (2023, December 6). How to regulate unsecured “Open-Source” AI: No exemptions. Tech Policy
Press.
182 Masakhane. Masakhane: A grassroots NLP community for Africa, by Africans. Retrieved June 19, 2025.
183 Cohere Labs. Aya. Retrieved June 19, 2025.

35 36 37 38 39 40 41 42 43 44 45