I am a PhD candidate in Computer Science at the University of Illinois Urbana-Champaign, working at the Conversational AI Lab with Prof. Dilek Hakkani-Tür. My research focuses on AI safety and LLM alignment, especially in dialogue settings. My work draws on both NLP and human-computer interaction to find what we should be measuring, and what we should be aligning to. Previously, I completed my B.S. in Computer Science at the University of Tehran.
Research
See Google Scholar for a full list.
Introduced a framework for measuring dialogic deference: how LLM judges shift from verifying statements to deferring to speakers. Benchmarked 4 LLMs across 9 domains (3k+ instances) and 280 Reddit conversations, finding an average deference shift of +18pp (2-4x higher in naturalistic settings). Developed fine-tuning mitigations that achieved +22pp accuracy and -24pp deference.
Investigated how task framing affects LLM conviction in dialogue systems. Showed that reframing evaluation from statement verification to speaker judgment significantly reduced model conviction, revealing high sensitivity to social framing across multiple LLMs.
Surveyed multimodal approaches (speech, vision, physiological signals) to human behavior understanding and examined their potential for building socially-aware language models.
Identified design opportunities and goals for AI-assisted documentation in speech-language intervention through interviews and co-design sessions with clinical practitioners.
Developed a human-AI collaborative system for documenting child behavior in therapy videos using GPT-4V and Whisper, in collaboration with the AI Institute for Exceptional Education. Designed with 17 speech-language pathologists through an expert-in-the-loop approach with automatic prompt engineering from clinician preferences.
Coursework
CS 598 · Spring 2025
Evaluated GPT-4o and Gemini on 180 CrowS-Pairs sentences across 4 languages using embedding-based bias metrics. Found bias shifts in up to 90% of gender-related translations (Russian) and 90% of religion-related translations (Korean).
CS 598 · Fall 2024
Studied the effect of AI integration on platform trust by analyzing 68k Reddit comments using SBERT classification and Poisson regression. Found a 26% decrease in competence distrust post-Meta AI launch, with tech subreddits exhibiting higher competence concerns vs. non-tech communities focusing on sincerity.
CS 467 · Spring 2024
Visualized CS faculty diversity using ML-generated facial composites from 3,900+ images across 48 U.S. universities. Found 67% masculine representation overall; top-ranked institutions predominantly reflect white male features, while minority-serving institutions mirror their student body demographics.
Talks
Invited Poster · 2026
DialDefer: Detecting Dialogic Deference in LLM Judges
Amazon Trusted AI Symposium
Oral · 2024
Designing for Human Behavior Analysis using Multi-Modal AI
AI Institute for Exceptional Education Workshop
Teaching
UIUC · 2024 - 2025
CS 465 User Interface Design
CS 416 Data Visualization
CS 101 Intro to Programming
Community
Reviewing
ACL Rolling Review January 2026
ACM CHI 2025
NeurIPS MTI-LLM Workshop, 2025
Spring 2025
Girls Who Code Spotlight Speaker
HCI research presentation to middle school students.
Spring 2024, 2025
CS Visit Days Graduate Ambassador
Diversity @ Illinois sessions and Q&As for prospective international PhD students.
2021 - 2022
Outreach Content Creation
Educational content on math and computer logic for K-12 students.
Personal
A few snapshots from my life, past and present.
Blu, my research assistant
Our Conversational AI Lab
First day on the UIUC campus!
I like Photography
University of Tehran
Playing with fire in high school!
The first PCI (Parisa-Computer Interaction)