news | Nouha Dziri

Jan 2026	Invited talk at IVADO/Mila “Cognitive Basis of Reasoning (in Minds and AI)” Workshop
Dec 2025	Super proud of receiving the Best Paper Award at NeurIPS 2025
Dec 2025	Invited panel at NeurIPS’25 tutorial: Science of Trustworthy Generative Foundation Models
Nov 2025	Invited talk at IVADO/Mila Workshop: Deploying Autonomous Agents: Lessons, Risks, and Real-World Impact
Oct 2025	Attending COLM with 2 papers
Sep 2025	1 oral and 2 posters accepted at NeurIPS. See you in San Diego
Sep 2025	New work “RL Grokking Recipe: How Does RL Unlock New Algorithms in LLMs?” [Code & Data]
Sep 2025	Invited talk at D.E.Shaw about LLM Reasoning in NYC
Jul 2025	Invited lecture about LLM reasoning at the Armenian LLM Summer School 2025
Jul 2025	Invited talk at the Apple Reasoning and Planning Workshop in Cupertino.
Jul 2025	Invited talk & panel at the Data in Generative Models Workshop at ICML 2025
Jul 2025	Invited talk & panel at the Computer Use Agents Workshop at ICML 2025
Jul 2025	Invited talk & panel at the Cross Future AI & Technology Summit in Vancouver
Jul 2025	1 poster SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behaviorat ICML 2025
Jun 2025	Check out our new work “OMEGA: Can LLMs Reason Outside the Box in Math?”
May 2025	Invited talk and panel at the International Symposium on Trustworthy Foundation Models (MBZUAI)
May 2025	Our paper Rel-A.I received the Best Paper Runner Up award at NAACL 2025
Apr 2025	1 oral and 2 posters in ICLR 2025 Singapore
Feb 2025	Guest lecture “Red-Teaming and Safeguarding Language Models: Current Practices, Challenges, and Future Directions” at Carnegie Mellon University (CMU)
Feb 2025	Honored to have been part of the Paris AI Action Summit
Jan 2025	Check out the new blog post: DeepSeek R1: Innovative Research and Engineering Can Rival Brute-Force Scaling
Dec 2024	System 2 Reasoning at Scale workshop (NeurIPS 2024) was a success including keynote talks, panel, posters and lightening talks. Check out details!
Dec 2024	Invited talk “In-Context Learning in LLMs: Potential and Limits” at the Language Gamification Workshop @ NeurIPS 2024
Dec 2024	Invited as a panelist at the Meta-Generation Algorithms for Large Language Models Tutorial at NeurIPS 2024
Oct 2024	WildTeaming and WildGuard got accepted at NeurIPS 2024. See you in Vancouver
Sep 2024	New blogpost about AI safety “Current Paradigms of LLMs Safety Alignment are superficial”
Sep 2024	New blogpost about o1 models and LLMs reasoning “Have o1 Models Cracked Human Reasoning?”
Aug 2024	Super excited that our workshop “System 2 Reasoning At Scale” was accepted to NeurIPS24, Vancouver! Mark your calendar for Dec 15, 2024!
Jul 2024	Check out our new safety moderation tool WildGuard: a state-of-the-art open tool for assessing safety risks, jailbreaks, and refusals in LLMs.
Jul 2024	New red-teaming method WildTeaming: an automatic red-teaming framework that discovers novel jailbreaks based on in-the-wild user-LLMs interactions.
Jul 2024	Check out my interview with Science News Magazine about “LLMs reasoning skills” featuring “Faith and Fate” and “Generative AI Paradox”.
Jul 2024	I will serve as a Demo Chair for NAACL 2025.
Jun 2024	I will serve as a Senior Area Chair for ACL 2025 in the area of Ethics, Bias, and Fairness.
Jun 2024	Check out my interview with LeMonde (equivalent of NYT in France) about hallucinations in LLMs.
May 2024	Invited Talk “What it can create, it may not understand: Studying the Limits of Transformers” at the University of Cambridge.
May 2024	I served as an Area Chair for COLM 2024 in the area of Safety in LLMs.
Feb 2024	Featured in TechCrunch talking about why LMs perform better when we “motivate” them or ask them “nicely”?
Jan 2024	3 papers accepted at ICLR 2024. 1 Oral and 2 posters. See you in Vienna
Dec 2023	Guest Lecture: “Limits of Generative AI Models and their Societal Implications” for the “Generative AI” course taught by Prof. Adji Bousso at the Princeton University.
Nov 2023	Invited Talk: Presented “Faith and Fate” & “Generative AI Paradox” at LLM evaluation workshop at The Alan Turing Institute.
Nov 2023	Invited Talk: Presented “Faith and Fate” in ILCC CDT/NLP seminar, University of Edinburgh.
Nov 2023	Invited Talk: Presented “Faith and Fate” at SAIL workshop on fundamental limits of LLMs.
Nov 2023	New paper “The Generative AI Paradox: What It Can Create, It May Not Understand” is out. [Paper]
Oct 2023	New paper “Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement” is out. (ICLR Oral 2024) [Paper][Code]
Oct 2023	Invited Talk: Presented “Faith and Fate” at the University of Pittisburgh
Oct 2023	2 papers accepted at EMNLP.
Sep 2023	Invited Talk: Presented “Faith and Fate” at the Formal Languages and Neural Networks Seminar [Video]
Sep 2023	3 papers accepted at NeurIPS. See you in New Orleans
Jun 2023	New paper “Fine-Grained Human Feedback Gives Better Rewards for Language Model Training” is out. [Paper] [Code/Data] (NeurIPS Spotlight 2023)
May 2023	New paper “Faith and Fate: Limits of Transformers in Compositionality” is out. [Paper][Code][Blog] (NeurIPS Spotlight 2023)
Mar 2023	New paper “Self-Refine: Iterative Refinement with Self-Feedback” is out. [Paper][Website][Demo] (NeurIPS 2023)
Dec 2022	Defended my PhD successfully and was nominated for the best thesis award
Nov 2022	Joined AI2 (Mosaic) as a postdoc.
Jun 2022	Invited Talk: Stanford NLP Seminar.
May 2022	Our work “Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark” got accepted at TACL 2022 [Paper] .
Apr 2022	New Benchmark FaithDial for building faithful information-seeking dialogue systems got accepted at TACL 2022. [Preprint] [Data] [Code] [Project Page]
Apr 2022	Our work “On the Origin of Hallucination in Conversational Models: Is it the Datasets or the Models?” got accepted at NAACL 2022. [Paper] [Data]
Jan 2022	Joined Google Research as a student researcher.
Aug 2021	Our work “Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding” got accepted at EMNLP 2021. [Paper] [Code]
May 2021	Joined Mila as a visiting researcher to work with Siva Reddy.
May 2021	Our work “Decomposed Mutual Information Estimation for Contrastive Representation Learning” got accepted at ICML 2021 [Paper].
Apr 2021	Passed my PhD candidacy successfully!
Apr 2021	New Benchmark BEGIN about evaluating groundedness in dialogue systems. [Preprint] [Data]
Mar 2021	Invited Talk: DSC Women in Tech Conference about inspiring females to pursue a career in STEM. [Video]
Sep 2020	Invited Talk: Montreal NLP Meetups about conversational AI.
Jun 2020	Interned at Google Research Language team NYC under Tal Linzen and David Reitter.
Dec 2019	Invited Talk: DeepMind Montreal about evaluating consistency in dialogue systems.
Sep 2019	Interned at Microsoft Research Montreal.
Sep 2019	Invited talk at Rasa Developer Summit. [Slides] [Video]
May 2019	Interned at Google Research Language team NYC.
May 2019	Our work “Augmenting Neural Response Generation with Context-Aware Topical Attention” got accepted at ACL workshop NLP4ConvAI. [Paper] [Code]
Mar 2019	Invited to the Amazon Graduate Research Symposium among top student researchers across North America to present my work in Seattle.
Feb 2019	Our paper “Evaluating Coherence in Dialogue Systems using Entailment” got accepted at NAACL 2019. [Paper][Code]
Dec 2018	Presented a poster “Response Generation For Open-Ended Conversational Agent” at NeurIPS Workshop on Women in Machine Learning (WiML), Montreal, Canada.
Oct 2018	Check out my interview with l’Express about my studies at the University of Alberta.
Sep 2018	Attended the Grace Hopper Celebration 2018 at Houston, Texas.
Jan 2018	Our work “Automatic Dialogue Generation with Expressed Emotions” got accepted at NAACL 2018. [Paper] [Code]