| Nov 2025 | Invited talk at IVADO/Mila Workshop: Deploying Autonomous Agents: Lessons, Risks, and Real-World Impact   | 
  | Oct 2025 |  Attending COLM with 2 papers     | 
  | Sep 2025 |  1 oral and 2 posters accepted at NeurIPS. See you in San Diego   | 
  | Sep 2025 |    New work “RL Grokking Recipe: How Does RL Unlock New Algorithms in LLMs?” [Code & Data] | 
  | Sep 2025 | Invited talk on LLM reasoning at D. E. Shaw in NYC   | 
  | Jul 2025 | Invited lecture about LLM reasoning at the Armenian LLM Summer School 2025   | 
  | Jul 2025 | Invited talk at the Apple Reasoning and Planning Workshop in Cupertino. | 
  | Jul 2025 | Invited talk & panel at the Data in Generative Models Workshop at ICML 2025   | 
  | Jul 2025 | Invited talk & panel at the Computer Use Agents Workshop at ICML 2025   | 
  | Jul 2025 | Invited talk & panel at the Cross Future AI & Technology Summit in Vancouver   | 
  | Jul 2025 |  1 poster, “SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behavior”, at ICML 2025   | 
  | Jun 2025 |  Check out our new work “OMEGA: Can LLMs Reason Outside the Box in Math?” | 
  | May 2025 | Invited talk and panel at the International Symposium on Trustworthy Foundation Models (MBZUAI) | 
  | May 2025 |  Our paper Rel-A.I received the Best Paper Runner Up award at NAACL 2025   | 
  | Apr 2025 |  1 oral and 2 posters in ICLR 2025 Singapore   | 
  | Feb 2025 |  Guest lecture “Red-Teaming and Safeguarding Language Models: Current Practices, Challenges, and Future Directions” at Carnegie Mellon University (CMU) | 
  | Feb 2025 | Honored to have been part of the Paris AI Action Summit     | 
  | Jan 2025 |  Check out the new blog post: DeepSeek R1: Innovative Research and Engineering Can Rival Brute-Force Scaling | 
  | Dec 2024 | System 2 Reasoning at Scale workshop (NeurIPS 2024) was a success, including keynote talks, a panel, posters, and lightning talks. Check out details! | 
  | Dec 2024 | Invited talk “In-Context Learning in LLMs: Potential and Limits” at the Language Gamification Workshop @ NeurIPS 2024   | 
  | Dec 2024 | Invited as a panelist at the Meta-Generation Algorithms for Large Language Models Tutorial at NeurIPS 2024   | 
  | Oct 2024 |  WildTeaming and WildGuard got accepted at NeurIPS 2024. See you in Vancouver   | 
  | Sep 2024 |    New blogpost about AI safety “Current Paradigms of LLMs Safety Alignment are superficial” | 
  | Sep 2024 |    New blogpost about o1 models and LLMs reasoning “Have o1 Models Cracked Human Reasoning?” | 
  | Aug 2024 |  Super excited that our workshop “System 2 Reasoning At Scale” was accepted to NeurIPS 2024 in Vancouver! Mark your calendar for Dec 15, 2024! | 
  | Jul 2024 | Check out our  new safety moderation tool  WildGuard: a state-of-the-art open tool for assessing safety risks, jailbreaks, and refusals in LLMs. | 
  | Jul 2024 | New red-teaming method WildTeaming: an automatic red-teaming framework that discovers novel jailbreaks based on in-the-wild user-LLM interactions. | 
  | Jul 2024 | Check out my interview with Science News Magazine about “LLMs reasoning skills” featuring “Faith and Fate” and “Generative AI Paradox”. | 
  | Jul 2024 | I will serve as a Demo Chair for NAACL 2025. | 
  | Jun 2024 | I will serve as a Senior Area Chair for ACL 2025 in the area of Ethics, Bias, and Fairness. | 
  | Jun 2024 | Check out my interview with Le Monde (equivalent of NYT in France) about hallucinations in LLMs. | 
  | May 2024 | Invited Talk “What it can create, it may not understand: Studying the Limits of Transformers” at the University of Cambridge. | 
  | May 2024 | I served as an Area Chair for COLM 2024 in the area of Safety in LLMs. | 
  | Feb 2024 | Featured in TechCrunch talking about why LMs perform better when we “motivate” them or ask them “nicely”. | 
  | Jan 2024 | 3 papers accepted at ICLR 2024. 1 Oral and 2 posters. See you in Vienna       | 
  | Dec 2023 | Guest Lecture: “Limits of Generative AI Models and their Societal Implications” for the “Generative AI” course taught by Prof. Adji Bousso at Princeton University. | 
  | Nov 2023 | Invited Talk: Presented “Faith and Fate” & “Generative AI Paradox” at LLM evaluation workshop at The Alan Turing Institute. | 
  | Nov 2023 | Invited Talk: Presented “Faith and Fate” in ILCC CDT/NLP seminar, University of Edinburgh. | 
  | Nov 2023 | Invited Talk: Presented “Faith and Fate” at SAIL workshop on fundamental limits of LLMs. | 
  | Nov 2023 | New paper  “The Generative AI Paradox: What It Can Create, It May Not Understand” is out. [Paper] | 
  | Oct 2023 | New paper  “Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement” is out. (ICLR Oral 2024) [Paper][Code] | 
  | Oct 2023 | Invited Talk: Presented “Faith and Fate” at the University of Pittsburgh. | 
  | Oct 2023 | 2 papers accepted at EMNLP. | 
  | Sep 2023 | Invited Talk: Presented “Faith and Fate” at the Formal Languages and Neural Networks Seminar [Video] | 
  | Sep 2023 | 3 papers accepted at NeurIPS. See you in New Orleans   | 
  | Jun 2023 | New paper  “Fine-Grained Human Feedback Gives Better Rewards for Language Model Training” is out. [Paper] [Code/Data] (NeurIPS Spotlight 2023) | 
  | May 2023 | New paper    “Faith and Fate: Limits of Transformers in Compositionality”  is out. [Paper][Code][Blog] (NeurIPS Spotlight 2023) | 
  | Mar 2023 | New paper  “Self-Refine: Iterative Refinement with Self-Feedback”  is out. [Paper][Website][Demo] (NeurIPS 2023) | 
  | Dec 2022 | Defended my PhD successfully  and was nominated for the best thesis award   | 
  | Nov 2022 | Joined AI2 (Mosaic) as a postdoc. | 
  | Jun 2022 | Invited Talk: Stanford NLP Seminar. | 
  | May 2022 | Our work “Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark” got accepted at TACL 2022. [Paper] | 
  | Apr 2022 | New Benchmark FaithDial for building faithful information-seeking dialogue systems got accepted at TACL 2022. [Preprint] [Data] [Code] [Project Page] | 
  | Apr 2022 | Our work “On the Origin of Hallucination in Conversational Models: Is it the Datasets or the Models?” got accepted at NAACL 2022. [Paper] [Data] | 
  | Jan 2022 | Joined Google Research as a student researcher. | 
  | Aug 2021 | Our work “Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding” got accepted at EMNLP 2021. [Paper] [Code] | 
  | May 2021 | Joined Mila as a visiting researcher to work with Siva Reddy. | 
  | May 2021 | Our work “Decomposed Mutual Information Estimation for Contrastive Representation Learning” got accepted at ICML 2021 [Paper]. | 
  | Apr 2021 | Passed my PhD candidacy successfully! | 
  | Apr 2021 | New Benchmark BEGIN about evaluating groundedness in dialogue systems. [Preprint] [Data] | 
  | Mar 2021 | Invited Talk: DSC Women in Tech Conference about inspiring females to pursue a career in STEM. [Video] | 
  | Sep 2020 | Invited Talk: Montreal NLP Meetups about conversational AI. | 
  | Jun 2020 | Interned at Google Research Language team NYC under Tal Linzen and David Reitter. | 
  | Dec 2019 | Invited Talk: DeepMind Montreal about evaluating consistency in dialogue systems. | 
  | Sep 2019 | Interned at Microsoft Research Montreal. | 
  | Sep 2019 | Invited talk at Rasa Developer Summit. [Slides] [Video] | 
  | May 2019 | Interned at Google Research Language team NYC. | 
  | May 2019 | Our work “Augmenting Neural Response Generation with Context-Aware Topical Attention” got accepted at ACL workshop NLP4ConvAI. [Paper] [Code] | 
  | Mar 2019 | Invited to the Amazon Graduate Research Symposium among top student researchers across North America to present my work in Seattle. | 
  | Feb 2019 | Our paper “Evaluating Coherence in Dialogue Systems using Entailment” got accepted at NAACL 2019. [Paper][Code] | 
  | Dec 2018 | Presented a poster “Response Generation For Open-Ended Conversational Agent” at NeurIPS Workshop on Women in Machine Learning (WiML), Montreal, Canada. | 
  | Oct 2018 | Check out my interview with l’Express about my studies at the University of Alberta. | 
  | Sep 2018 | Attended the Grace Hopper Celebration 2018 in Houston, Texas. | 
  | Jan 2018 | Our work “Automatic Dialogue Generation with Expressed Emotions” got accepted at NAACL 2018. [Paper] [Code] |