


AI Agent Meetup Paris #2 – Open Data for Open Models and Agents
This AI Alliance event is sponsored by Ekimetrics & IBM.
The Paris AI Agent Meetup is a community for AI Agent Developers, Engineers, UX, Ops, and Applied Researchers exploring and leading the evolution of AI agents. Whether you're building autonomous systems, experimenting with LLM-powered assistants, or integrating AI agents into real-world applications, this meetup is the place to discover, share insights, and collaborate with your peers. Join us for talks, demos, discussions and networking with like-minded innovators shaping the next generation of AI.
📆 Agenda
6:00 pm – Check-in & Networking (arrive early, grab a drink, and connect with fellow participants)
6:30 pm – Welcome & Introductions - Launching AI Alliance SYNTH Initiative, Agata Ferretti (IBM / AI Alliance)
6:40 pm – The Next Gem Catalog: Increasing Transparency in Open Data to Trust Models and Agents, Joe Olson (IBM / AI Alliance)
7:00 pm – SYNTH: open synthetic data for the new generation of open frontier reasoning models, Anastasia Stasenko & Pierre-Carl Langlais (Pleias)
7:20 pm – Advancing Generative Models for Scientific Discovery with Open Synthetic Data, Nelson Fernandez-Pinto (AirLiquide)
7:40 pm – Clair.bot : open access multi-agent responsible AI Q&A, Annabelle Blangero & Jean Lelong (Ekimetrics)
8:00 pm – Open Networking (continue the conversation, exchange ideas, and meet potential collaborators)
9:00 pm – Close
Talk descriptions:
The Next Gem Catalog: Increasing Transparency in Open Data to Trust Models and Agents
In this talk we will present NextGem, an AI-powered tool designed to make open data more transparent and usable, thereby increasing trust in everything built on top of it, from models to agents. By enriching dataset documentation with GenAI, NextGem enables precise searches not only by topic but also by transparency and trust specifications, such as the AI Alliance’s OTDI framework. For data producers, it identifies gaps in documentation, suggests improvements, and makes datasets more attractive to enterprise consumers. For data consumers, it reveals lineage, governance, and compliance - empowering innovation with confidence. Rather than replacing existing data catalogs, NextGem supercharges them, helping to build a more open, trusted, and sustainable data ecosystem.SYNTH: open synthetic data for the new generation of open frontier reasoning models
This talk introduces SYNTH, an initiative addressing a critical bottleneck in open source AI development: the lack of specialized datasets needed for advanced reasoning capabilities. While the open-source community has successfully created competitive base models through large-scale pretraining, developing frontier-level reasoning capabilities requires access to specialized datasets for advanced training techniques, such as midtraining and reinforcement learning. These critical training phases currently rely heavily on proprietary datasets or synthetic data generated from closed frontier models, creating a dependency that undermines the open-source ecosystem's ability to develop advanced reasoning capabilities independently.Advancing Generative Models for Scientific Discovery with Open Synthetic Data
Generative models are rapidly transforming scientific discovery. But their progress is often held back by the lack of high-quality training data. In this talk, I’ll show how open synthetic datasets can help bridge this gap by providing scalable, reproducible, and domain-relevant supervision. As a case study, I’ll highlight MEGA, our recently released large-scale open dataset for molecular editing, and share reinforcement learning training recipes that push the data efficiency of LLMs to unprecedented levels.Clair.bot : open access multi-agent responsible AI Q&A
In today's world, conversations about AI are everywhere. Every corner of society is buzzing with excitement, fear, and curiosity. What's often missing, however, is a voice of reason—a balanced perspective that bridges between extreme enthusiasm and paralyzing fear. CLAIR aims to provide the user with free general information on best practices in responsible AI. CLAIR. is based on a multi-agent technology. User's questions will be simultaneously answered by our three expert agents: Ada, Norma and Sophia, and CLAIR. will summarize their points of view in a condensed response.
Speaker Bios:
Dr. Agata Ferretti is the European Lead of the AI Alliance at IBM. She fosters cross-sector collaborations and leads projects that build, adopt, and advocate for open-source AI solutions. Agata draws on her expertise in AI ethics and data governance to help ensure that AI is developed transparently, inclusively, and for the public good.
Dr. Anastasia Stasenko is a co-founder and CEO of pleias, French startup training fully open foundation SLMs. In addition to her role at pleias, she holds the position of associate lecturer at Sorbonne-Nouvelle, where she co-directs the master's program in data analysis and digital communications.
Dr. Pierre-Carl Langlais is a French LLM researcher and the co-founder of Pleias. He coordinated the « Common Corpus», an international initiative that released the largest collection of open data for training LLMs. A long time advocate of the digital commons, he is an admin of Wikipedia since 2012 and has authored several influential policy reports on open science such as the Open Diamond Study (2020).
Joe Olson is an open source GenAI system architect for IBM, and is currently working to develop tools and application by consensus with the AI Alliance.
Nelson Fernandez-Pinto is a Senior Machine-Learning Engineer leading generative-AI projects at Air Liquide. Since 2022 he has been deploying large-language-model and diffusion-model solutions for R&D and industrial operations. Previously, he developed autonomous-driving perception systems at Renault-Nissan-Mitsubishi and led computer-vision R&D at Axionable. He is particularly interested in the intersection of generative modeling, scientific discovery, and industrial applications.
We look forward to bringing Paris's AI Agent community together! 🚀
Food and drink, engaging conversation, and incredible company will all be provided!
This AI Alliance event is sponsored by Ekimetrics.
Why join?
This is your chance to dive deep into the world of AI agents, connect with a new and emerging developer community. This event is designed for experienced developers, and those who are curious about agents.
Please be advised: Unfortunately, space is very limited at these community events and we can not always accept everyone we would like to. We appreciate your application and are looking forward to seeing you at a future even!
Code of Conduct: We expect our members to treat each other like family, so please click here to read our Code of Conduct. If you see someone violating it, please speak with your event organizer or reach out to [email protected].
The AI Alliance is a non-profit, grassroots community of 190+ companies, startups, researchers, and individuals. Through world-class events, online events, and workshops, members leverage their diverse perspectives and innovative minds to foster meaningful relationships, solve challenging problems, and define the future of AI.
