RL Grokking Recipe -- How Can We Enable LLMs to Solve Previously Unsolvable Tasks with RL?
OpenAI set the AI world abuzz with the release of their o1 models. As the dust settles on this news, I can’t help but feel this is the perfect moment to share my thoughts on LLMs reasoning as someone who’s spent a good chunk of my research on understanding the capabilities of LLMs on compositional reasoning tasks…
Enjoy Reading This Article?
Here are some more articles you might like to read next: