Jul 2025 |
Invited lecture about LLM reasoning at the Armenian LLM Summer School 2025
|
Jul 2025 |
Invited talk at the Apple Reasoning and Planning Workshop in Cupertino.
|
Jul 2025 |
Invited talk & panel at the Data in Generative Models Workshop at ICML 2025
|
Jul 2025 |
Invited talk & panel at the Computer Use Agents Workshop at ICML 2025
|
Jul 2025 |
Invited talk & panel at the Cross Future AI & Technology Summit in Vancouver
|
Jul 2025 |
1 poster SafetyAnalyst: Interpretable, Transparent, and Steerable Safety Moderation for AI Behaviorat ICML 2025
|
Jun 2025 |
Check out our new work “OMEGA: Can LLMs Reason Outside the Box in Math?”
|
May 2025 |
Invited talk and panel at the International Symposium on Trustworthy Foundation Models (MBZUAI)
|
May 2025 |
Our paper Rel-A.I received the Best Paper Runner Up award at NAACL 2025
|
Apr 2025 |
1 oral and 2 posters in ICLR 2025 Singapore
|
Feb 2025 |
Guest lecture “Red-Teaming and Safeguarding Language Models: Current Practices, Challenges, and Future Directions” at Carnegie Mellon University (CMU)
|
Feb 2025 |
Honored to have been part of the Paris AI Action Summit
|
Jan 2025 |
Check out the new blog post: DeepSeek R1: Innovative Research and Engineering Can Rival Brute-Force Scaling
|
Dec 2024 |
System 2 Reasoning at Scale workshop (NeurIPS 2024) was a success including keynote talks, panel, posters and lightening talks. Check out details!
|
Dec 2024 |
Invited talk “In-Context Learning in LLMs: Potential and Limits” at the Language Gamification Workshop @ NeurIPS 2024
|
Dec 2024 |
Invited as a panelist at the Meta-Generation Algorithms for Large Language Models Tutorial at NeurIPS 2024
|
Oct 2024 |
WildTeaming and WildGuard got accepted at NeurIPS 2024. See you in Vancouver
|
Sep 2024 |
New blogpost about AI safety “Current Paradigms of LLMs Safety Alignment are superficial”
|
Sep 2024 |
New blogpost about o1 models and LLMs reasoning “Have o1 Models Cracked Human Reasoning?”
|
Aug 2024 |
Super excited that our workshop “System 2 Reasoning At Scale” was accepted to NeurIPS24, Vancouver! Mark your calendar for Dec 15, 2024!
|
Jul 2024 |
Check out our new safety moderation tool WildGuard: a state-of-the-art open tool for assessing safety risks, jailbreaks, and refusals in LLMs.
|
Jul 2024 |
New red-teaming method WildTeaming: an automatic red-teaming framework that discovers novel jailbreaks based on in-the-wild user-LLMs interactions.
|
Jul 2024 |
Check out my interview with Science News Magazine about “LLMs reasoning skills” featuring “Faith and Fate” and “Generative AI Paradox”.
|
Jul 2024 |
I will serve as a Demo Chair for NAACL 2025.
|
Jun 2024 |
I will serve as a Senior Area Chair for ACL 2025 in the area of Ethics, Bias, and Fairness.
|
Jun 2024 |
Check out my interview with LeMonde (equivalent of NYT in France) about hallucinations in LLMs.
|
May 2024 |
Invited Talk “What it can create, it may not understand: Studying the Limits of Transformers” at the University of Cambridge.
|
May 2024 |
I served as an Area Chair for COLM 2024 in the area of Safety in LLMs.
|
Feb 2024 |
Featured in TechCrunch talking about why LMs perform better when we “motivate” them or ask them “nicely”?
|
Jan 2024 |
3 papers accepted at ICLR 2024. 1 Oral and 2 posters. See you in Vienna
|
Dec 2023 |
Guest Lecture: “Limits of Generative AI Models and their Societal Implications” for the “Generative AI” course taught by Prof. Adji Bousso at the Princeton University.
|
Nov 2023 |
Invited Talk: Presented “Faith and Fate” & “Generative AI Paradox” at LLM evaluation workshop at The Alan Turing Institute.
|
Nov 2023 |
Invited Talk: Presented “Faith and Fate” in ILCC CDT/NLP seminar, University of Edinburgh.
|
Nov 2023 |
Invited Talk: Presented “Faith and Fate” at SAIL workshop on fundamental limits of LLMs.
|
Nov 2023 |
New paper “The Generative AI Paradox: What It Can Create, It May Not Understand” is out. [Paper]
|
Oct 2023 |
New paper “Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement” is out. (ICLR Oral 2024) [Paper][Code]
|
Oct 2023 |
Invited Talk: Presented “Faith and Fate” at the University of Pittisburgh
|
Oct 2023 |
2 papers accepted at EMNLP.
|
Sep 2023 |
Invited Talk: Presented “Faith and Fate” at the Formal Languages and Neural Networks Seminar [Video]
|
Sep 2023 |
3 papers accepted at NeurIPS. See you in New Orleans
|
Jun 2023 |
New paper “Fine-Grained Human Feedback Gives Better Rewards for Language Model Training” is out. [Paper] [Code/Data]
(NeurIPS Spotlight 2023)
|
May 2023 |
New paper “Faith and Fate: Limits of Transformers in Compositionality” is out. [Paper][Code][Blog]
(NeurIPS Spotlight 2023)
|
Mar 2023 |
New paper “Self-Refine: Iterative Refinement with Self-Feedback” is out. [Paper][Website][Demo]
(NeurIPS 2023)
|
Dec 2022 |
Defended my PhD successfully and was nominated for the best thesis award
|
Nov 2022 |
Joined AI2 (Mosaic) as a postdoc.
|
Jun 2022 |
Invited Talk: Stanford NLP Seminar.
|
May 2022 |
Our work “Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark” got accepted at TACL 2022 [Paper] .
|
Apr 2022 |
New Benchmark FaithDial for building faithful information-seeking dialogue systems got accepted at TACL 2022. [Preprint] [Data] [Code] [Project Page]
|
Apr 2022 |
Our work “On the Origin of Hallucination in Conversational Models: Is it the Datasets or the Models?” got accepted at NAACL 2022. [Paper] [Data]
|
Jan 2022 |
Joined Google Research as a student researcher.
|
Aug 2021 |
Our work “Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding” got accepted at EMNLP 2021. [Paper] [Code]
|
May 2021 |
Joined Mila as a visiting researcher to work with Siva Reddy.
|
May 2021 |
Our work “Decomposed Mutual Information Estimation for Contrastive Representation Learning” got accepted at ICML 2021 [Paper].
|
Apr 2021 |
Passed my PhD candidacy successfully!
|
Apr 2021 |
New Benchmark BEGIN about evaluating groundedness in dialogue systems. [Preprint] [Data]
|
Mar 2021 |
Invited Talk: DSC Women in Tech Conference about inspiring females to pursue a career in STEM. [Video]
|
Sep 2020 |
Invited Talk: Montreal NLP Meetups about conversational AI.
|
Jun 2020 |
Interned at Google Research Language team NYC under Tal Linzen and David Reitter.
|
Dec 2019 |
Invited Talk: DeepMind Montreal about evaluating consistency in dialogue systems.
|
Sep 2019 |
Interned at Microsoft Research Montreal.
|
Sep 2019 |
Invited talk at Rasa Developer Summit. [Slides] [Video]
|
May 2019 |
Interned at Google Research Language team NYC.
|
May 2019 |
Our work “Augmenting Neural Response Generation with Context-Aware Topical Attention” got accepted at ACL workshop NLP4ConvAI. [Paper] [Code]
|
Mar 2019 |
Invited to the Amazon Graduate Research Symposium among top student researchers across North America to present my work in Seattle.
|
Feb 2019 |
Our paper “Evaluating Coherence in Dialogue Systems using Entailment” got accepted at NAACL 2019. [Paper][Code]
|
Dec 2018 |
Presented a poster “Response Generation For Open-Ended Conversational Agent” at NeurIPS Workshop on Women in Machine Learning (WiML), Montreal, Canada.
|
Oct 2018 |
Check out my interview with l’Express about my studies at the University of Alberta.
|
Sep 2018 |
Attended the Grace Hopper Celebration 2018 at Houston, Texas.
|
Jan 2018 |
Our work “Automatic Dialogue Generation with Expressed Emotions” got accepted at NAACL 2018. [Paper] [Code]
|