DataTalks.Club is a global online community of people who love data.
599 Went

How to Reduce LLM Hallucinations with Wikidata: Hands-On Fact-Checking Using MCP

YouTube
Past Event
About Event

LLMs are powerful, but they still hallucinate facts, especially when asked about entities, relationships, or claims that require up-to-date or structured knowledge.

In this hands-on workshop, we'll explore how to use Wikidata as a grounding and fact-checking layer for LLMs to reduce hallucinations and make AI systems more reliable.

We'll start with a short introduction to Wikidata and then set up the Wikidata MCP (Model Context Protocol) server so an LLM can retrieve and verify facts rather than relying solely on its internal memory. This already provides a practical way to ground LLM outputs in verifiable data.

From there, we’ll go beyond LLM-only approaches and build a small experimental fact-checking pipeline. The system combines semantic retrieval, LLM-based reranking, and natural language inference (NLI) to validate claims against evidence in a more controlled and interpretable way.

This workshop focuses on evidence-driven verification pipelines that make an LLM's reasoning steps explicit and easier to inspect, debug, and improve.
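To make the three stages of the pipeline described above concrete, here is a minimal, self-contained sketch in Python. It is not the workshop's code. Every name in it (`Statement`, `retrieve`, `rerank`, `nli`, `verify`) is hypothetical, the statements are hard-coded toy data rather than live Wikidata results, and each stage is a deliberately simple stand-in: bag-of-words cosine similarity instead of embedding-based semantic search, a subject-match heuristic instead of an LLM reranker, and a string-matching rule instead of a trained NLI model.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Statement:
    """A Wikidata-style (subject, property, value) triple. Toy data, not fetched live."""
    subject: str
    prop: str
    value: str

    def text(self) -> str:
        return f"{self.subject} {self.prop} {self.value}"


@dataclass(frozen=True)
class Verdict:
    label: str                      # "entailment", "contradiction", or "neutral"
    evidence: Optional[Statement]   # the statement that decided the label, if any


def _tokens(text: str) -> Counter:
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def _cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(claim: str, statements: List[Statement], k: int = 3) -> List[Statement]:
    """Stage 1 stand-in: rank statements by bag-of-words cosine similarity to the claim."""
    query = _tokens(claim)
    ranked = sorted(statements, key=lambda s: _cosine(query, _tokens(s.text())), reverse=True)
    return ranked[:k]


def rerank(claim: str, candidates: List[Statement]) -> List[Statement]:
    """Stage 2 stand-in: move statements whose subject is mentioned in the claim to the front."""
    return sorted(candidates, key=lambda s: s.subject.lower() in claim.lower(), reverse=True)


def nli(premise: Statement, claim: str) -> str:
    """Stage 3 stand-in: crude rule that treats every property as single-valued."""
    text = claim.lower()
    if premise.subject.lower() in text and premise.prop.lower() in text:
        return "entailment" if premise.value.lower() in text else "contradiction"
    return "neutral"


def verify(claim: str, statements: List[Statement]) -> Verdict:
    """Run retrieval, reranking, and NLI; return the first decisive label with its evidence."""
    for evidence in rerank(claim, retrieve(claim, statements)):
        label = nli(evidence, claim)
        if label != "neutral":
            return Verdict(label, evidence)
    return Verdict("neutral", None)


# Hard-coded toy statements standing in for retrieved Wikidata facts.
TOY_STATEMENTS = [
    Statement("Germany", "capital", "Berlin"),
    Statement("France", "capital", "Paris"),
    Statement("Germany", "currency", "euro"),
]

if __name__ == "__main__":
    for claim in ("The capital of Germany is Berlin",
                  "The capital of Germany is Munich",
                  "Mars has two moons"):
        print(claim, "->", verify(claim, TOY_STATEMENTS))
```

Because each verdict carries the statement that produced it, a wrong answer can be traced to a specific stage (bad retrieval, bad ranking, or bad inference), which is the kind of inspectability the pipeline approach aims for. In a real system each stand-in would be replaced by an actual embedding search, an LLM call, and an NLI model.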

What we'll cover

  • Wikidata as a structured source for factual verification

  • Setting up and querying Wikidata using MCP

  • Verifying claims with MCP + an LLM

  • Moving beyond pure GenAI to evidence-based fact-checking

  • Finding relevant Wikidata statements with semantic search

  • Ranking candidate evidence with an LLM

  • Verifying claims using an NLI model

What you'll leave with

By the end of the workshop, you'll be able to:

  • Ground LLM outputs in structured data to reduce hallucinations

  • Understand when LLM-only fact-checking is not enough

  • Build a small, transparent fact-checking pipeline you can adapt to real projects


About the speaker:

Philippe Saadé is the AI/ML project manager at Wikimedia Deutschland. His current work focuses on making Wikidata accessible to AI applications through projects such as the Wikidata vector database and the Wikidata Model Context Protocol (MCP) server.

DataTalks.Club is the place to talk about data. Join our Slack community!

This event is sponsored by Wikimedia
