Building Mini Minds: Our Adventures in Teaching Small LLMs to Reason

Building Mini Minds: Our Adventures in Teaching Small LLMs to Reason

Friday, February 28, 2025 11:00 AM to 11:30 AM · 30 min. (Europe/Amsterdam)
Main Stage
Session
Future of AI

Information

Join us for a candid exploration of our attempts to train small language models to solve puzzles through reinforcement learning. We'll share both the technical hurdles and melted GPUs as we try to replicate reasoning capabilities of models like OpenAI's o1 and Deepseek R1 at a smaller scale - all in pursuit of giving these tiny models their very own 'aha!' moments.

Log in

See all the content and easy-to-use features by logging in or registering!