Cover Image for LLMs and Alignment. Research Talks: Francesco Croce
Cover Image for LLMs and Alignment. Research Talks: Francesco Croce
Hosted By
12 Going

LLMs and Alignment. Research Talks: Francesco Croce

Hosted by AaltoAI
Registration
Welcome! To join the event, please register below.
About Event

At the second event in the “Research Talks” series by LLMs and Alignment, Professor Francesco Croce will present his research on
Multimodal Chain-Of-Thought Reasoning Generalization.

Description of the talk:
Integrating reasoning in large language models and large vision-language models has recently led to significant improvement of their capabilities. However, the generalization of reasoning models is still vaguely defined and poorly understood. In this work, we present an evaluation framework to rigorously examine how well chain-of-thought (CoT) approaches generalize on a simple planning task. The versatility of the task and its data allows us to fine-tune model variants using different input representations (visual and textual) and CoT reasoning strategies, and systematically evaluate them under both in-distribution (ID) and out-of-distribution (OOD) test conditions. Our experiments show that, while CoT reasoning improves in-distribution generalization across all representations, out-of-distribution generalization remains very limited in most cases when controlling for trivial matches with the ID data. Surprisingly, we find that reasoning traces which combine multiple text formats yield the best (and non-trivial) OOD generalization.

Location
Maarintie 8
02150 Espoo, Finland
Hosted By
12 Going