RL Grokking Recipe -- How Can We Enable LLMs to Solve Previously Unsolvable Tasks with RL?

September 25, 2025

2025

OpenAI set the AI world abuzz with the release of their o1 models. As the dust settles on this news, I can’t help but feel this is the perfect moment to share my thoughts on LLMs reasoning as someone who’s spent a good chunk of my research on understanding the capabilities of LLMs on compositional reasoning tasks…

Enjoy Reading This Article?

Here are some more articles you might like to read next:

Can LLMs Reason Outside the Box in Math?

DeepSeek R1: Innovative Research and Engineering Can Rival Brute-Force Scaling

Current Paradigms of LLMs Safety Alignment are superficial

Have o1 Models Cracked Human Reasoning?