Building Mini Minds: Our Adventures in Teaching Small LLMs to Reason
Friday, February 28, 2025 11:00 AM to 11:30 AM · 30 min. (Europe/Amsterdam)
Main Stage
Session
Future of AI
Information
Join us for a candid exploration of our attempts to train small language models to solve puzzles through reinforcement learning. We'll share both the technical hurdles and melted GPUs as we try to replicate reasoning capabilities of models like OpenAI's o1 and Deepseek R1 at a smaller scale - all in pursuit of giving these tiny models their very own 'aha!' moments.


