2025 | Nouha Dziri

Sep 25, 2025	RL Grokking Recipe -- How Can We Enable LLMs to Solve Previously Unsolvable Tasks with RL?
Jun 24, 2025	Can LLMs Reason Outside the Box in Math?
Jan 30, 2025	DeepSeek R1: Innovative Research and Engineering Can Rival Brute-Force Scaling