US R1 NLP & Language Model Faculty Directory

Tenure-track & research-track faculty at US R1 universities active in NLP/LM research (2023–2025)
Covers: Carnegie Mellon · Columbia · Cornell · CU Boulder · Georgia Tech · Johns Hopkins · MIT · Notre Dame · NYU · Ohio State · Penn · Princeton · Stanford · TTI-Chicago · UC Berkeley · UChicago · UCLA · UC San Diego · UC Santa Cruz · UMass Amherst · UMD College Park · UNC Chapel Hill · USC · UIUC · UT Austin · University of Michigan · University of Washington
Data current through mid-2025 (knowledge cutoff). Faculty on full leave are excluded. Tier distinctions removed — all institutions listed together alphabetically.
Venues: ACL · EMNLP · NAACL · COLM · NeurIPS · ICML · ICLR · TACL · CL
Legend: ★ = approaching tenure (est.) · 🟢 = recent hire or institutional move · 🟠 = major award / recognition
Tag styles: topic = research area tag · hot topic = active frontier area · ACL = publication venue
Name · Title & Affiliation · Topics & Selected Recent Papers (2023–2025) · Career Notes · PhD · Tenure
Carnegie Mellon University — Language Technologies Institute (LTI) & CS
CMU LTI
Assistant Professor
Language Technologies Institute, CMU
grounded language · embodied AI · vision-language · multimodal
PIQA: Reasoning about Physical Commonsense in Natural Language AAAI 2020 (highly cited)
Experience Grounds Language EMNLP 2020 (highly cited)
Served as ACL 2022 publications co-chair; active in embodied NLP
2016
Univ. of Texas, Dallas
CMU LTI
Associate Professor
Language Technologies Institute, CMU
information retrieval · fairness in IR · evaluation
Towards Principled Evaluation of LLMs for Retrieval SIGIR 2024
Mixture of Relevance Distributions for IR ECIR 2023
Joined CMU LTI from MSR Montreal ~2022
2008
UMass Amherst
CMU LTI
Assistant Professor
Language Technologies Institute, CMU
code generation · grounded language · pragmatics · agents
InCoder: A Generative Model for Code Infilling and Synthesis ICLR 2023
MiniChain: A Small Library for Coding with LLMs EACL 2023
Pragmatic Code Autocomplete EMNLP 2023
Joined CMU LTI 2022; previously postdoc at Berkeley
2021
UC Berkeley
★ ~2028
CMU LTI
Assistant Professor
Language Technologies Institute, CMU
text generation · LLM watermarking · memorization · creative writing AI
Preventing Verbatim Memorization in Language Models ICLR 2023
Watermarking LLMs via Soft-Green List ICML 2023
Creative Writing with an AI Collaborator ACL 2024
Joined CMU LTI 2023; previously Google Brain
2022
Univ. of Pennsylvania
★ ~2029
CMU LTI
Associate Professor
Language Technologies Institute, CMU
machine translation · text generation · multilingual LLMs
Extrapolating LLM Capabilities to New Tasks ACL 2024
ELRT: Efficient Low-Rank Training for LLMs EMNLP 2023
Moved to CMU LTI from UCSB ~2022
2011
Peking University
CMU LTI
Associate Professor (Leonardo Chair)
Language Technologies Institute, CMU
multimodal AI · sentiment analysis · social signals
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning NeurIPS 2021 (highly cited)
Foundations & Trends in Multimodal Machine Learning ACM CSUR 2023
Directs Multicomp Lab; leading CMU multimodal AI efforts
2007
USC
CMU LTI
Associate Professor
Language Technologies Institute, CMU
LLM agents · code generation · multilingual NLP · evaluation
SWE-bench: Can Language Models Resolve Real GitHub Issues? ICLR 2024
OpenHands: An Open Platform for AI Software Developers ICLR 2025
MTEB: Massive Text Embedding Benchmark EMNLP 2023
SWE-bench and OpenHands among most influential 2024 agent papers; NSF CAREER
2012
Kyoto University
CMU LTI
Kavčić-Moura Professor
LTI & HCII, CMU
dialogue systems · educational NLP · collaborative learning
LLMs for Education: Emerging Practices and Principles ACL 2024 workshop
Evaluating Conversational AI for STEM Learning EDM 2023
Served as Interim LTI Director 2020–2022; ACL Fellow
1997
CMU
CMU LTI
Assistant Professor
LTI, CMU; part-time AI2
social intelligence · commonsense reasoning · AI safety · bias & fairness
Artificial Hivemind: The Open-Ended Homogeneity of LLMs NeurIPS 2025 (Best Paper)
AnnotatorBias: Diagnosing Annotator Disagreement in NLP ACL 2024
Moral Stories: Situated Reasoning about Norms EMNLP 2021 (highly cited)
2025 Packard Fellow; 2025 Okawa Research Award; NeurIPS 2025 Best Paper
2021
Univ. of Washington
★ ~2028
CMU LTI
Assistant Professor
Language Technologies Institute, CMU
efficient NLP · structured prediction · energy & compute
Energy and Policy Considerations for Deep Learning in NLP ACL 2019 (highly cited)
SLING: More than Just a Kitchen Sink for Structured Prediction ACL 2023
Joined CMU LTI 2020; NSF CAREER 2023
2020
UMass Amherst
★ ~2027
CMU LTI
Associate Professor
Language Technologies Institute, CMU
speech processing · ASR · spoken language · end-to-end models
ESPnet2: An Upgraded E2E Speech Processing Toolkit INTERSPEECH 2021 (highly cited)
Investigating Whisper's Multilingual ASR Capabilities ICASSP 2024
Lead of ESPnet open toolkit used worldwide
2006
Waseda University
CMU LTI
Assistant Professor
Language Technologies Institute, CMU
reasoning · theorem proving · LLM training · self-improvement
Generating Sequences by Learning to Self-Correct ICLR 2023
LLEMMA: An Open Language Model for Mathematics ICLR 2024
NATURALPROOFS: Mathematical Theorem Proving in Natural Language NeurIPS 2021 (highly cited)
Joined CMU LTI 2023; previously postdoc at UW/AI2; NSF CAREER 2025
2021
NYU
★ ~2029
CMU LTI
Associate Professor
Language Technologies Institute, CMU
information retrieval · dense retrieval · RAG
ANCE: Approximate Nearest Neighbor Negative Contrastive Estimation ICLR 2021 (highly cited)
DRAGON: Pre-training with Dense-Sparse Retrieval EMNLP 2023
Bridging the Training-Inference Gap in Dense Retrieval ACL 2024
Joined CMU LTI from MSR 2021
2018
CMU
CMU LTI
Assistant Professor
Language Technologies Institute, CMU
human-AI interaction · NLP evaluation · data augmentation · NLP tools
PromptChainer: Chaining LLM Prompts through Visual Programming CHI 2022
AI Chains: Transparent and Controllable Human-AI Interaction CHI 2022
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot LLM Coaching ACL 2024
Joined CMU LTI 2022; NSF CAREER 2025
2021
Univ. of Washington
★ ~2028
Columbia University — Computer Science & Barnard College
Columbia CS
Assistant Professor
Computer Science, Columbia University
LM interpretability · probing · instruction following · backpack LMs
Backpack Language Models ACL 2023 (Outstanding Paper)
Instruction Following without Instruction Tuning ICLR 2025
Closing the Curious Case of Neural Text Degeneration ICLR 2024
Joined Columbia CS 2024; PhD from Stanford (Manning & Liang); Visiting Researcher, Google DeepMind
2024
Stanford University
★ ~2031
Columbia CS
Professor
Computer Science, Columbia University
speech prosody · spoken language · deception detection · charisma
Acoustic-Prosodic Indicators of Deception in LLM-Generated Text NAACL 2024
Detecting Persuasion in Speech INTERSPEECH 2023
ACL Fellow; IEEE Fellow; ISCA Fellow; major figure in spoken NLP
1985
Univ. of Pennsylvania
Columbia CS
Professor; Director, Data Science Institute
Computer Science, Columbia University
summarization · text generation · clinical NLP
Benchmarking LLMs for News Summarization TACL 2024
Multi-document Summarization with LLMs EMNLP 2023
ACL Fellow; NAS member; pioneer of NLG and summarization
1982
Univ. of Pennsylvania
Barnard / Columbia CS
Associate Professor
Computer Science, Barnard College, Columbia University
argumentation · figurative language · creative NLP · social good NLP
Art or Artifice? LLMs and the False Promise of Creativity CHI 2024
Connecting the Dots: Evaluating Abstract Reasoning via NYT Connections EMNLP 2024
ICLEF: In-Context Learning with Expert Feedback for Style Transfer ACL 2024
Joined Barnard/Columbia as Associate Professor 2024; Amazon Scholar; previously Columbia DSI Research Scientist
2006
Rutgers University
Columbia CS
Associate Professor
Computer Science, Columbia University
dialogue systems · conversational AI · multimodal dialogue
Collaborative Role-Play for LLM Evaluation NAACL 2024
MindDial: Belief Dynamics Tracking in Task-Oriented Dialogue ACL 2023
Moved from UC Davis to Columbia ~2021; NSF CAREER
2017
Carnegie Mellon University
Cornell University — CS & Information Science
Cornell CS
Associate Professor
Computer Science, Cornell University
semantic parsing · grounded language · instruction following
Weakly Supervised Learning of Semantic Parsers for Instructions TACL 2013 (ACL Test-of-Time 2024)
Evaluating Open-Domain Dialogue with ACUTE-EVAL EMNLP 2024 (Best Paper)
EMNLP 2024 Best Paper; ACL Test-of-Time Award 2024
2013
Univ. of Washington
Cornell CS
Professor
Computer Science, Cornell University
information extraction · sentiment · opinion mining · argument mining
Argument Mining with LLMs: Survey and Perspectives Computational Linguistics 2023
ACL Fellow; pioneered IE and argumentation mining
1994
UMass Amherst
Cornell IS
Associate Professor
Information Science, Cornell University
social NLP · online communities · language change · persuasion
Conversations Gone Awry: Detecting Early Signs of Failure ACL 2018 (highly cited)
Conversational Dynamics and Community Norms with LLMs ACL 2024
NSF CAREER 2018; leading computational social science work
2012
Cornell University
Cornell CS
Professor
CS & Information Science, Cornell
sentiment analysis · opinion mining · social NLP · language & power
Power Dynamics in Conversations with LLMs ACL 2024
ACL 25-Year Test-of-Time Award 2024; ACL Fellow
1997
Harvard University
Cornell CS
Assistant Professor
Computer Science, Cornell University
sequence models · music generation · LM theory · watermarking
Robust Distortion-Free Watermarks for Language Models TMLR 2024
Expressive Piano Performance Generation via Score Infilling ISMIR 2023
Joined Cornell CS 2024; previously postdoc at Stanford (Percy Liang)
2023
Univ. of Washington
★ ~2031
Georgia Institute of Technology — College of Computing
Georgia Tech CS
Associate Professor
School of Interactive Computing, Georgia Tech
information extraction · social media NLP · event detection · LLM robustness
Having Beer after Prayer? Measuring Cultural Bias in LLMs ACL 2024 (Best Social Impact Award)
NEO-BENCH: Evaluating LLM Robustness with Neologisms ACL 2024
ACL 2024 Best Social Impact Award; NSF CAREER
2013
Univ. of Washington
Georgia Tech CS
Associate Professor
School of Interactive Computing, Georgia Tech
text simplification · NLG · social media NLP · LLM evaluation
Automatic and Human-AI Interactive Text Simplification ACL 2024 tutorial
ReadMe++: Benchmarking Multilingual LMs for Readability EMNLP 2024
Tenured and promoted to associate professor ~2023; NSF CAREER
2014
NYU
Johns Hopkins University — CLSP & Computer Science
JHU CS / CLSP
John C. Malone Professor of CS
CS & CLSP, Johns Hopkins University
clinical NLP · health informatics · social media NLP · public health AI
Large Language Models for Healthcare: Successes and Limitations EMNLP 2023
Tobacco Use Identification from Clinical Notes via LLMs ACL 2024
ACM Fellow; extensive health NLP research; Malone Professor chair
2009
Univ. of Pennsylvania
JHU CS / CLSP
Professor
CS & CLSP, Johns Hopkins University
probabilistic NLP · structured prediction · LM theory · program synthesis
Contrastive Decoding: Open-ended Text Generation as Optimization ACL 2023
Formal Language Theory and LLMs TACL 2023
ACL Fellow; creator of influential semiring/CRF frameworks
1997
Univ. of Pennsylvania
JHU CS / CLSP
Associate Professor
CS & CLSP, Johns Hopkins University; Research Scientist, HLTCOE
semantic NLP · LLM safety · knowledge representation · process supervision
SemStamp: A Semantic Watermark for Text Generation NAACL 2024
Rationalyst: Pre-training Process-Supervision for Reasoning ACL 2025
Controllable Safety Alignment: Inference-Time Adaptation ICLR 2025
Leads HLTCOE text research; extensive work on annotation-efficient NLP
2009
Univ. of Rochester
Massachusetts Institute of Technology — CSAIL & EECS
MIT CSAIL / EECS
Associate Professor (X Consortium)
EECS & CSAIL, MIT
language learning · compositionality · pragmatics · LM interpretability
Eliciting Human Preferences with Language Models ICLR 2025
Algorithmic Capabilities of Random Transformers NeurIPS 2024
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes NAACL 2024 (Best Paper)
NAACL 2024 Best Paper; promoted to associate professor ~2024
2017
UC Berkeley
★ ~2026
MIT CSAIL / EECS
School of Engineering Distinguished Professor of AI & Health
EECS & CSAIL, MIT; AI Lead, Jameel Clinic
clinical NLP · drug discovery · NLP for health
Antibiotic Discovery with Deep Learning Cell 2020 (highly cited)
Taming LLMs for Clinical Use EMNLP 2023
TIME100 AI 2025; IEEE Frances E. Allen Medal 2025; NAE & NAM member; MacArthur Fellow 2017
2003
Columbia University
MIT CSAIL / EECS
Assistant Professor
EECS & CSAIL, MIT
LM theory · in-context learning · structured prediction
In-Context Language Learning: Architectures and Algorithms ICML 2024
Transformers as Statisticians: Provable In-Context Learning NeurIPS 2023
Scan and Snap: Understanding Token Composition in 1L Attention NeurIPS 2023
Joined MIT 2020; NSF CAREER 2023
2019
Harvard University
★ ~2027
MIT Brain & Cognitive Sci.
Professor
Brain & Cognitive Sciences, MIT
psycholinguistics · computational cognitive science · LM and human language processing
How Well Do LLMs Predict Human Reading Times? ACL 2023
Language Models and Human Language Acquisition NAACL 2024
Leading voice on cognitive plausibility of LLMs
2005
Stanford University
University of Michigan — EECS & School of Information
Univ. of Michigan EECS
Professor; Associate Director, MIDAS
EECS, Univ. of Michigan
grounded NLP · human-robot interaction · situated dialogue · multimodal
MMLI: A Multimodal Math Language Instruction Dataset ACL 2024
Evaluating Situated Dialogue Systems EMNLP 2023
ACL Fellow; Associate Director of Michigan MIDAS 2023
1998
Duke University
Univ. of Michigan EECS
Assistant Professor
EECS, Univ. of Michigan
LLM safety · causality in NLP · alignment · AI for social good
Multilingual Alignment Benchmark for LLMs NeurIPS 2024 (Best Paper)
Can LLMs Express Their Uncertainty? Epistemic Calibration ICLR 2024
Causal Reasoning and Language Models EMNLP 2023
Joined Michigan EECS 2024; PhD from ETH Zurich/MPI; NeurIPS 2024 Best Paper
2023
ETH Zurich / MPI Intelligent Systems
★ ~2031
Univ. of Michigan EECS
Janice M. Jenkins Collegiate Professor; Director, AI Lab
EECS, Univ. of Michigan
computational social science · multilingual NLP · deception detection · multimodal NLP
Gender Bias and Multilingual Alignment in LLMs NeurIPS 2024 (Best Paper)
MIND: LLM Knowledge Graph for Mental Health Counseling ACL 2024
NeurIPS 2024 Best Paper; ACL Fellow 2025; AAAI Fellow 2021; Past President of ACL
2001
Southern Methodist University
Univ. of Michigan EECS
Associate Professor
EECS, Univ. of Michigan
summarization · argument mining · factuality · media bias
Long-Document Abstractive Summarization with LLMs ACL 2024 (Area Chair Award)
Detecting Media Bias and Framing with NLP EMNLP 2023
Moved from Northeastern to Michigan ~2020; NSF CAREER; ACL Equity Director 2024
2015
Cornell University
New York University — Courant CS & Center for Data Science
NYU CS / CDS
Glen de Vries Chaired Professor of CS & Data Science
Courant CS & CDS, NYU
machine translation · generative models · seq2seq · AI for biology
Protein Discovery with Discrete Walk-Jump Sampling ICLR 2024 (Outstanding Paper)
On the Ability of Monolingual Models to Learn Cross-lingual Representations EMNLP 2023
Co-inventor of the GRU and neural attention mechanisms; NAE of Korea member; ICLR 2024 Outstanding Paper
2014
Aalto University
NYU CS / CDS
Associate Professor
Courant CS & CDS, NYU
LLM alignment · robustness · evaluation · human-AI collaboration
Is Reinforcement Learning (Not) the Solution to Robust NLP? ACL 2024
Evaluating Language Models via Calibrated Uncertainty Estimation NAACL 2024
Revisiting the Role of Language Priors in Visual Question Answering EMNLP 2021 (highly cited)
Co-leads NYU ML² group; NSF CAREER; active in LLM alignment research
2016
Univ. of Maryland
NYU CS / CDS
Associate Professor of CS & Data Science
Courant CS & CDS, NYU
question answering · entity understanding · multilingual QA · LLM alignment
CaLMQA: Culturally Specific Long-Form QA in 23 Languages ACL 2025
Understanding Retrieval Augmentation for Long-Form QA COLM 2024
AmbigDocs: Reasoning Across Documents on Ambiguous Entities COLM 2024
Moved from UT Austin to NYU CDS/Courant ~Fall 2024; NSF CAREER; co-leads ML² group
2019
Univ. of Washington
NYU CS / CDS
Associate Professor of CS & Data Science
Courant CS & CDS, NYU
LLM reasoning · fact verification · chain-of-thought · LLM fine-tuning
LoFiT: Localized Fine-tuning on LLM Representations NeurIPS 2024
MuSR: Chain-of-Thought with Multistep Soft Reasoning ICLR 2024 (Spotlight)
Learning to Refine with Fine-Grained Natural Language Feedback EMNLP 2024
Moved from UT Austin (8 years) to NYU CDS/Courant Fall 2025; Sloan Fellow 2023; NSF CAREER 2022; co-leads ML² group
2016
UC Berkeley
NYU Linguistics / CDS
Associate Professor of Linguistics & Data Science
Linguistics & CDS, NYU; Research Scientist, Google
computational linguistics · LM evaluation · psycholinguistics · syntax in LMs
What Formal Languages Can Transformers Express? TACL 2023
Task Demands Affect LM Evaluation Conclusions NAACL 2024
Moved from JHU to NYU 2020; Director of Graduate Studies, NYU Linguistics
2013
École Normale Supérieure
Princeton University — Computer Science & Princeton Language and Intelligence (PLI)
Princeton CS
Charles C. Fitzmorris Professor; Director, PLI
Computer Science, Princeton
LLM theory · mechanistic interpretability · representation learning
Fine-Tuning Language Models with Just Forward Passes (MeZO) NeurIPS 2023
A Kernel-Based View of Language Model Fine-Tuning ICML 2023
Directs Princeton Language & Intelligence; Gödel Prize; NAS member
1994
UC Berkeley
Princeton CS
Associate Professor
Computer Science, Princeton; Affiliated, PLI
dense retrieval · open-domain QA · in-context learning · long-context LLMs
SimCSE: Simple Contrastive Learning of Sentence Embeddings EMNLP 2021 (highly cited)
Enabling LLMs to Generate Text with Citations EMNLP 2023
How to Train Long-Context Language Models Effectively ICLR 2025
Sloan Fellow 2022; NSF CAREER; promoted to associate professor ~2024
2018
Stanford University
Princeton CS
Associate Professor
Computer Science, Princeton; Affiliated, PLI
LLM agents · RL for language · code agents
ReAct: Synergizing Reasoning and Acting in Language Models ICLR 2023
SWE-agent: Agent-Computer Interfaces for Coding Tasks NeurIPS 2024
Tree of Thoughts: Deliberate Problem Solving with LLMs NeurIPS 2023
NSF CAREER; ReAct and Tree of Thoughts foundational agent papers 2023
2017
MIT
Stanford University — Computer Science, Linguistics & HAI
Stanford CS & HAI
Dieter Schwarz Foundation Professor of CS; Senior Fellow, HAI
Computer Science & Stanford HAI
commonsense AI · small language models · pluralistic alignment · LLM reasoning
WinoGrande: An Adversarial Winograd Schema Challenge AAAI 2020 (highly cited)
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration EMNLP 2024
MacGyver: Are LLMs Creative Problem-Solvers? NAACL 2024
Left UW Aug 2024; joined Stanford CS + HAI June 2025 (Dieter Schwarz Foundation Professor); brief NVIDIA stint; MacArthur Fellow 2022; TIME100 AI 2023 & 2025
2010
Cornell University
Stanford CS
Assistant Professor
Computer Science, Stanford
LLM evaluation · RLHF · watermarking · robustness
AlpacaFarm: A Simulation Framework for RLHF Methods NeurIPS 2023
Robust Distortion-Free Watermarks for Language Models TMLR 2024
Observational Scaling Laws and Predictability of LM Performance NeurIPS 2024
Joined Stanford CS 2019; co-teaches CS336 (Language Modeling from Scratch)
2018
MIT
Stanford Linguistics & CS
Jackson Eli Reynolds Professor (Linguistics & CS)
Linguistics & CS, Stanford
social language · dialogue · computational social science
Speech and Language Processing, 3rd ed. (pre-release) 2024 textbook
Safety-Tuned LLaMAs: Lessons from Improving LLM Safety ICLR 2024
Author of standard NLP textbook; ACL Fellow; NAS member
1992
UC Berkeley
Stanford CS & Linguistics
Thomas M. Siebel Professor of Machine Learning; Director, SAIL
CS & Linguistics, Stanford
neural NLP · parsing · representation learning · universal dependencies
Universal Dependencies 2.13 Release LREC-COLING 2024
The CRINGE Loss: Learning What Language Not to Model ACL 2023
IEEE John von Neumann Medal 2024; ACL Test-of-Time Awards 2023–2025 (3 consecutive); ACL Fellow
1994
Stanford University
Stanford CS
Associate Professor of CS; Director, CRFM
CS, Stanford; Director, Center for Research on Foundation Models
foundation models · benchmarking · robustness · LLM analysis
Holistic Evaluation of Language Models (HELM) NeurIPS 2023
AlpacaFarm: A Simulation Framework for RLHF NeurIPS 2023
Do Foundation Models Narrate Their Own Training Data? COLM 2024
Directs CRFM; led influential Foundation Models report (2021)
2011
UC Berkeley
Stanford Linguistics
Professor of Linguistics; Co-Director, CRFM
Linguistics & CS (by courtesy), Stanford
semantics · pragmatics · NLI · sentiment analysis
Multitask Prompted Training Enables Zero-Shot Task Generalization ICLR 2022 (highly cited)
DynaSent: Dynamic Sentiment Analysis Benchmark ACL 2021
Co-Director CRFM; ACL Fellow; long-time semantics/pragmatics contributor
2003
UC Santa Cruz
Stanford CS
Assistant Professor
Computer Science, Stanford; Affiliate, Stanford HAI
computational social science · dialogue · LLM for social good · mental health NLP
Harnessing LLMs in Practice: Survey on ChatGPT and NLP ACM TKDD 2023
Language Models for Social Media Annotation ACL 2024
Moved from Georgia Tech to Stanford CS 2022; NSF CAREER
2019
Carnegie Mellon University
★ ~2028
University of California, Berkeley — EECS & School of Information
UC Berkeley I School
Associate Professor
School of Information, UC Berkeley
cultural analytics · computational humanities · NLP for literature · entity recognition
Measuring Diversity in Hollywood through Computational Film Analysis PNAS 2024
AboutMe: Documenting Effects of English Pretraining Data Filters ACL 2024
Speak, Memory: Books Known to ChatGPT/GPT-4 EMNLP 2023
NSF CAREER 2019; leads BookNLP & LitBank projects
2012
CMU
UC Berkeley EECS
Associate Teaching Professor
EECS, UC Berkeley
machine translation · NLP education
Compositional Span Embeddings for Document Retrieval ACL 2023
Lead educator for Berkeley CS 188/189; massive open course impact
2010
UC Berkeley
UC Berkeley EECS
Professor of Computer Science
EECS, UC Berkeley
parsing · unsupervised NLP · machine translation · LM fundamentals
Language Model Evaluation Beyond Perplexity ACL 2021 (highly cited)
ACL Fellow; longtime Berkeley NLP group leader
2004
Stanford University
UC Berkeley EECS
Assistant Professor
EECS, UC Berkeley; Research Scientist, AI2
retrieval-augmented generation · LLM factuality · question answering
FActScore: Fine-Grained Atomic Evaluation of Factual Precision EMNLP 2023
RePlug: Retrieval-Augmented Black-Box Language Models NAACL 2024
Rethinking the Role of Demonstrations for In-Context Learning EMNLP 2022 (highly cited)
Joined UC Berkeley EECS 2024; previously postdoc at UW/AI2; NSF CAREER 2025
2023
Univ. of Washington
★ ~2030
UC Berkeley EECS
Assistant Professor
EECS, UC Berkeley
grounded language · collaborative agents · vision-language · situated NLP
NLVR2: Visual Reasoning in Natural Language ACL 2019 (highly cited)
Dynamic Benchmarking of Grounded Language Understanding ICLR 2024
Joined UC Berkeley EECS 2022; previously postdoc at Cornell/AI2
2022
Cornell University
★ ~2029
University of California, Los Angeles — Computer Science
UCLA CS
Associate Professor; Co-Director, UCLA DataX AI Center
Computer Science, UCLA; Amazon Scholar
multimodal LLMs · trustworthy NLP · bias & fairness · constrained generation
SafeWorld: Geo-Diverse Safety Alignment for LLMs NeurIPS 2024
Adaptable Logical Control for Large Language Models NeurIPS 2024
Matryoshka Query Transformer for Large Vision-Language Models NeurIPS 2024
Sloan Fellow 2021; EMNLP Best Paper 2017; ACL Outstanding Paper 2023
2015
Univ. of Illinois (UIUC)
UCLA CS
Associate Professor
Computer Science, UCLA; Amazon Scholar
controllable generation · creative language · multimodal LLMs · evaluation
Adaptable Logical Control for LLMs NeurIPS 2024
MacGyver: Are LLMs Creative Problem-Solvers? NAACL 2024
DiagrammerGPT: Generating Open-Domain Diagrams via LLM Planning COLM 2024
Promoted to associate professor ~2024; NSF CAREER; DARPA EXPMATH PI
2017
Johns Hopkins University
UCLA CS
Professor
Computer Science, UCLA
neurosymbolic AI · probabilistic circuits · LLM reasoning · knowledge integration
Adaptable Logical Control for LLMs NeurIPS 2024
On the Paradox of Learning to Reason from Data IJCAI 2023
Probabilistic Circuits: Representations and Algorithms Foundations & Trends in ML 2023
Runs UCLA StarAI Lab; leading neurosymbolic AI for NLP
2013
KU Leuven
University of Massachusetts Amherst — Manning CICS
UMass Amherst CICS
Distinguished University Professor
CICS, UMass Amherst
information extraction · knowledge bases · statistical NLP · machine learning
Multistage Collaborative Knowledge Distillation from LLMs ACL 2024
Distantly-Supervised Dense Retrieval for Open-Domain QA EMNLP 2021 (highly cited)
ACM Fellow; ACL Fellow; founder of IESL lab; major IE & probabilistic NLP contributions
1995
Carnegie Mellon University
UMass Amherst CICS
Associate Professor
CICS, UMass Amherst
computational social science · social factors in NLP · political NLP
From Narratives to Headlines: LLMs for News Summarization EMNLP 2023
Annotator Social Bias in NLP Datasets ACL 2024
NSF CAREER; active work on measurement of social phenomena in text
2013
CMU
UMass Amherst CICS
Associate Professor
CICS, UMass Amherst
information retrieval · conversational search · RAG · explainability
Large Language Models Are Not Robust Evaluators NAACL 2024
PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents EACL 2024
NSF CAREER; active in conversational search and retrieval-augmented LLMs
2019
UMass Amherst
University of Maryland, College Park — CS & UMIACS (CLIP Lab)
UMD CS
Professor
CS, iSchool & UMIACS, Univ. of Maryland
question answering · adversarial NLP · topic models · human-AI collaboration
LLMs Help Humans Verify Truthfulness—Except When Convincingly Wrong NAACL 2024
KARL: Knowledge-Aware Retrieval and Representations for Student Learning EMNLP 2024
Directs CLIP Lab; NSF CAREER; ACL Fellow
2010
Princeton University
UMD CS
Associate Professor
CS & UMIACS, Univ. of Maryland
machine translation · multilingual NLP · human-centered NLP
Human-Centered Approaches to Trustworthy Machine Translation NAACL 2022 tutorial
Trust and Machine Translation: User Study ACL 2024
On sabbatical 2024–25 (visiting INRIA Paris); NAACL 2022 Program Co-Chair
2008
Hong Kong University of Science and Technology
UMD CS
Professor
CS & UMIACS, Univ. of Maryland; Research Scientist, Microsoft Research
structured prediction · domain adaptation · multilingual NLP · responsible AI
Theory-Grounded Measurement of Social Stereotypes in LMs NAACL 2022 (highly cited)
Deconstructing NLG Evaluation: Practices and Implications ACL 2023
ACL Fellow; extensive work on fairness and structured prediction
2006
Univ. of Southern California
UMD CS
Associate Professor
CS, Univ. of Maryland (moved from UMass Amherst, 2025)
text generation · long-form NLP · factuality · creative language
FABLES: Evaluating Faithfulness in Book-Length Summarization EMNLP 2024
PostMark: A Robust Blackbox Watermark for LLMs EMNLP 2024
VERISCORE: Evaluating Factuality in Long-Form Text Generation EMNLP 2024
Moved from UMass Amherst to UMD 2025; NSF CAREER; Google PhD Fellowship alumnus
2017
Univ. of Maryland
University of North Carolina, Chapel Hill — Computer Science
UNC Chapel Hill CS
John R. & Louise S. Parker Distinguished Professor; Director, MURGe-Lab
Computer Science, UNC Chapel Hill
multimodal AI · language generation · reasoning agents · faithful generation
LACIE: Listener-Aware Finetuning for Confidence Calibration in LLMs NeurIPS 2024
EnvGen: Generating Environments via LLMs for Embodied Agents COLM 2024
DiagrammerGPT: Generating Diagrams via LLM Planning COLM 2024
PECASE; ACL Fellow; AAAI Fellow; Parker Distinguished Professorship permanent 2024
2013
UC Berkeley
UNC Chapel Hill CS
Associate Professor; Associate Chair for Research
Computer Science, UNC Chapel Hill
narrative understanding · social NLP · LLM alignment · dialogue
Aligning LLMs with Human Communication: Empathy & Social Awareness ACL 2024
Modeling Social Norms in LLM Dialogue EMNLP 2023
Associate Chair for Research at UNC CS; NSF CAREER
2015
Univ. of Illinois (UIUC)
UNC Chapel Hill CS
Assistant Professor
Computer Science, UNC Chapel Hill
interactive learning · AI explanation · LLM alignment · socially aware NLP
Improving and Simplifying Pattern Exploiting Training (iPET) EMNLP 2021 (highly cited)
Human Feedback for LLM Long-Context Memory EMNLP 2024
Joined UNC CS ~2020; NSF CAREER 2024
2018
CMU
★ ~2027
University of Southern California — Thomas Lord CS & ISI
USC CS
Assistant Professor
Thomas Lord CS, USC
LLM understanding · in-context learning · data memorization · LLM reliability
Mechanistic Insights into In-Context Learning via Attention Heads NeurIPS 2024
Data Watermarking for Copyright Detection in LLMs ACL Findings 2024
NSF CAREER 2024; Google Research Scholar award 2023; ACL & EMNLP Best Paper alum
2020
Stanford University
★ ~2027
USC CS / ISI
Research Associate Professor
CS, USC; Research Lead, Information Sciences Institute
machine translation · NLP for social good · LLM applications · multilingual
LLM-Assisted Rule-Based MT for Low-Resource Languages NAACL 2024
Can Language Models Moderate Online Discourse? NAACL 2024
Associate Director of NLP Research at ISI; SemEval longtime organizer
2010
USC
USC CS / ISI
Associate Professor
CS, USC; Research Team Lead, ISI
commonsense reasoning · knowledge-aware NLP · LLM alignment · prompt inversion
PILS: Prompt Inversion from Logprob Sequences NeurIPS 2025
Symbolic Working Memory Enhances LLMs for Complex Rule Application EMNLP 2024
ELI-Why: Evaluating Pedagogical Capabilities of LLMs ACL 2024
MIT TR35 Asia-Pacific Innovator; WSDM 2024 Test-of-Time Award
2018
UIUC
USC CS
Assistant Professor
Thomas Lord CS, USC
dataset quality · annotation artifacts · LLM evaluation · data-centric AI
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics EMNLP 2020 (highly cited)
OATH-Frames: Attitudes Towards Homelessness on Social Media EMNLP 2024
Teaching Models to Understand (but not Generate) High-Risk Data EMNLP 2025
Intel Rising Star Faculty Award 2023; AI2 Young Investigators Award; NSF CAREER 2024
2020
CMU
★ ~2027
University of Illinois at Urbana-Champaign — Siebel School of CS & Data Science
UIUC CS
Assistant Professor
Siebel School CS, UIUC
text simplification · science communication · NLP accessibility · writing tools
Generating Scientific Definitions with Controllable Complexity ACL 2022
How Well Can LLMs Negotiate? AgentBench Evaluation EMNLP 2024
Joined UIUC CS 2023; previously postdoc at UW; NSF CAREER 2025
2022
Univ. of Washington
★ ~2030
UIUC CS
Professor (Grainger Engineering)
CS, UIUC
dialogue systems · conversational AI · spoken language understanding · LLM agents
Instruct, Not Assist: LLM Multi-Turn Planning for Socratic Debugging EMNLP 2024
Unsupervised Human Preference Learning EMNLP 2024
Joined UIUC 2023 from Amazon/Google; ACL Fellow; IEEE Fellow
1998
Bilkent University
UIUC CS
Professor (Willett Faculty Scholar)
CS, UIUC
parsing · CCG · grounded language · LLM in-context learning
Tutor-ICL: Guiding LLMs for Improved In-Context Learning EMNLP 2024
Sparse Autoencoders for In-Context Learning EMNLP 2025
Chair of AI area at UIUC CS; co-PI AIFARMS NSF AI Institute
2003
Univ. of Edinburgh
UIUC CS
Professor (Grainger Engineering)
CS & ECE, UIUC
information extraction · knowledge graphs · multimodal NLP · event detection
EVEDIT: Event-based Knowledge Editing for LLMs EMNLP 2024
Mitigating the Alignment Tax of RLHF NeurIPS 2024
LM-Infinite: Zero-Shot Extreme Length Generalization NAACL 2024 (co-author, Outstanding Paper)
ACL Fellow; IEEE "Young Scientist"; extensive IE & multilingual research
2007
New York University
UIUC CS
Assistant Professor
Siebel School CS, UIUC
LLM reasoning · long-context LLMs · RL for language · AI for science
LM-Infinite: Zero-Shot Extreme Length Generalization for LLMs NAACL 2024 (Outstanding Paper)
Eurus: An Open Suite of Reward Models for LLM Alignment ACL 2024
Scaling LLM Test-Time Compute via Inference-Time Intervention ICLR 2025
Joined UIUC CS 2023; previously postdoc at AI2; PhD advised by Noah Smith at UW
2022
Univ. of Washington
★ ~2029
UIUC CS
Donald Biggar Willett Professor in Engineering
CS, UIUC
information retrieval · text mining · LLM-augmented IR
Structured Summarization of Academic Publications EACL 2024
Probabilistic Language Models for IR SIGIR 2021 (Test-of-Time)
ACM/IEEE Fellow; SIGIR Test-of-Time Award recipient
1997
Nanjing Univ. / UIUC
University of Texas at Austin — CS & Linguistics
UT Austin Linguistics
Associate Professor
Linguistics, UT Austin
discourse · text simplification · NLG evaluation · questions under discussion
Salience Prediction of Inquisitive Questions EMNLP 2024 (Outstanding Paper)
Detection and Measurement of Syntactic Templates in LLM Text EMNLP 2024
QUDsim: Quantifying Discourse Similarities in LLM Text COLM 2025
EMNLP 2024 Outstanding Paper Award; NSF CAREER
2016
Univ. of Pennsylvania
UT Austin CS
Professor
Computer Science, UT Austin
semantic parsing · grounded language learning · multimodal NLP
Grounded Language Learning via Image-Paired Instructions ACL 2023
ACL Fellow; AAAI Fellow; pioneer of semantic parsing and ML for NLP
1988
Univ. of Illinois
UT Austin CS
Assistant Professor (incoming Fall 2026)
Computer Science, UT Austin
instruction tuning · synthetic data · open LLMs · RLVR · LLM agents
Super-NaturalInstructions: Generalization via Declarative Instructions EMNLP 2022 (highly cited)
Self-Instruct: Aligning LMs with Self-Generated Instructions ACL 2023 (highly cited)
Tülu 3: Pushing Frontiers in Open Language Model Post-Training NeurIPS 2024
Incoming UT Austin CS Fall 2026; PhD 2025 from UW (Hajishirzi & Smith); currently at ByteDance Seed; Self-Instruct one of most cited NLP papers of 2023
2025
Univ. of Washington
★ ~2032
University of Washington — Paul G. Allen School of CS&E
UW Allen School
Torode Family Professor
CS&E, Univ. of Washington; Senior Research Director, AI2
LLM pretraining · open models · QA & reading comprehension · OLMo
OLMo: Accelerating the Science of Language Models ACL 2024
Dolma: An Open Corpus for Language Model Pretraining Research ACL 2024
TÜLU: Instruction-Tuning LLMs via Diverse Datasets ICLR 2024
Alfred Sloan Fellow 2020; NSF CAREER 2021; leads AI2 open LLM (OLMo) effort
2011
UIUC
UW Allen School
Vice Provost for AI; Simonyi Endowed Chair in AI
CS&E, Univ. of Washington; Senior Director NLP, AI2
statistical NLP · evaluation · computational social science · structured prediction
Revisiting Automated Evaluation of Open-Domain QA EMNLP 2023
Reference-Free Summarization Evaluation ACL 2024
Named UW inaugural VP for AI 2024; ACL Test-of-Time Award 2024
2006
Johns Hopkins University
UW Allen School
Associate Professor
CS&E, Univ. of Washington
multilingual NLP · cross-lingual transfer · low-resource languages · LLM safety
Languages are Rewards: Hindsight Finetuning using Human Feedback EMNLP 2023
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration EMNLP 2024
Moved from CMU to UW 2021; NSF CAREER 2022
2014
CMU
UW Allen School
Assistant Professor
CS&E, Univ. of Washington
robustness · data-centric AI · distribution shift · WILDS benchmark
WILDS: A Benchmark of In-the-Wild Distribution Shifts ICML 2021 (highly cited)
Concept Bottleneck Models ICML 2020 (highly cited)
Stronger Data Poisoning Attacks Break Data Sanitization Defenses MLSys 2022
Joined UW 2022; NSF CAREER 2024
2022
Stanford University
★ ~2029
UW Allen School
Professor
CS&E, Univ. of Washington; Research Scientist, Meta AI
semantic parsing · pre-training · multilingual NLP · LLM alignment
Toolformer: Language Models Can Teach Themselves to Use Tools NeurIPS 2023
LIMA: Less Is More for Alignment NeurIPS 2023
ACL Test-of-Time Award 2024 (with Smith)
2009
MIT
University of Colorado Boulder — CS & Linguistics
CU Boulder CS & Linguistics
Professor
CS & Linguistics, Univ. of Colorado Boulder
computational semantics · dialogue · NLP education
Speech and Language Processing, 3rd ed. 2024 textbook (co-authored with Jurafsky)
Multimodal Cross-Document Event Coreference Resolution ACL 2024
Co-author of the field-defining NLP textbook (Jurafsky & Martin); long career in NLP
1988
UC San Diego
CU Boulder CS
Assistant Professor
Computer Science, Univ. of Colorado Boulder
multilingual NLP · low-resource languages · transfer learning · computational morphology
AmericasNLP: NLP for Indigenous Languages of the Americas NAACL 2024 workshop
Cross-Lingual Transfer for Morphologically Complex Languages EMNLP 2023
Joined CU Boulder CS; also holds position at Johannes Gutenberg Univ. Mainz
2019
LMU Munich
★ ~2026
University of Notre Dame — Computer Science & Engineering
Notre Dame CSE
Associate Professor
Computer Science & Engineering, Univ. of Notre Dame
formal language theory · machine translation · transformer expressivity · low-resource NLP
What Formal Languages Can Transformers Express? A Survey TACL 2024
Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages NeurIPS 2024
Stack Attention: Improving Transformers on Hierarchical Patterns ICLR 2024 (Spotlight)
ACL 2024 Outstanding Paper Award and Social Impact Award; leading work on formal properties of transformers
2004
Univ. of Pennsylvania
Notre Dame CSE
Associate Professor
Computer Science & Engineering, Univ. of Notre Dame
knowledge graphs · LLM reasoning · information extraction · AI for science
Towards Safer LLMs through Machine Unlearning ACL 2024
Large Language Models on Graphs: A Comprehensive Survey IEEE TKDE 2024
Instructing LLMs to Identify and Ignore Irrelevant Conditions AAAI 2024
NSF CAREER; active in knowledge-augmented NLP and AI for science
2015
UIUC
Ohio State University — Computer Science & Engineering
OSU CSE
Associate Professor (CoE Innovation Scholar)
CSE, Ohio State University
LLM agents · web agents · QA · LLM safety
Mind2Web: Towards a Generalist Agent for the Web NeurIPS 2023 (highly cited)
GPT-4V(ision) is a Generalist Web Agent, if Grounded ICML 2024
AmpleGCG: Universal Transferable Generative Adversarial Suffixes for Jailbreaking LLMs COLM 2024
NSF CAREER; Google Faculty Award; Mind2Web foundational agent benchmark
2015
UC Santa Barbara
OSU CSE
Associate Professor
CSE, Ohio State University
LLM agents · GUI agents · knowledge graphs · semantic parsing
Mind2Web: Towards a Generalist Agent for the Web NeurIPS 2023 (co-first author)
HippoRAG: Neurobiologically Inspired Long-Term Memory for LLMs NeurIPS 2024
UGround: Universal Visual Grounding for GUI Agents ICLR 2025 (Oral)
NSF CAREER; ACL 2023 Outstanding Paper; co-leads OSU NLP group
2018
UC Santa Barbara
OSU Linguistics
Professor & Vice Chair
Linguistics, Ohio State University
natural language generation · dialogue systems · computational semantics
When is Tree Search Useful for LLM Planning? It Depends on Discriminators ACL 2024
Automatic Scoring of Open-Ended NLG with LLMs EMNLP 2023
Long career in NLG and dialogue; active in evaluation with LLMs
1994
Univ. of Pennsylvania
Toyota Technological Institute at Chicago (TTI-Chicago) — CS Research Institute
TTI-Chicago
Associate Professor
Toyota Technological Institute at Chicago (TTIC)
representation learning · structured prediction · NLG · robust NLP
SUB2: Substructure Substitution for Data Augmentation in NLP EMNLP 2023
Generative Classifiers for Robust NLP TACL 2023
TTIC faculty since 2012; Amazon Research Award; WMT Best Paper; adviser to many current faculty
2012
CMU LTI
TTI-Chicago
Professor
Toyota Technological Institute at Chicago (TTIC); courtesy faculty, UChicago CS
speech processing · spoken language understanding · self-supervised speech · sign language NLP
Toward Joint Language Modeling for Speech Units and Text EMNLP 2023
CTC-DRO: Robust Optimization for Reducing Language Disparities in ASR ICLR 2026
SLUE Phase-2: Benchmark Suite for Diverse Spoken Language Understanding ACL 2023
EMNLP 2024 Best Paper; 2025 IEEE Fellow; ISCA Fellow; member UChicago/TTIC C&I group
2005
MIT
University of Chicago — Computer Science & Data Science Institute
UChicago CS
Assistant Professor
Computer Science, Univ. of Chicago
text generation · decoding algorithms · complex systems view of LLMs
The Curious Case of Neural Text Degeneration (Nucleus Sampling) ICLR 2020 (highly cited)
Generative Models of Text as Complex Systems NeurIPS 2025
Prompting as Scientific Inquiry NeurIPS 2025
Joined UChicago CS 2024; won first Amazon Alexa Prize (2017); nucleus sampling used in production LLMs worldwide
2022
Univ. of Washington
★ ~2031
UChicago CS
Assistant Professor
Computer Science, Univ. of Chicago
human-AI writing · LLM evaluation · writing assistants · creativity & AI
Evaluating Human-Language Model Interaction TMLR 2023 (highly cited)
Design Space for Intelligent Writing Assistants CHI 2024 (Best Paper Award)
CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities CHI 2022
Joined UChicago CS Summer 2024; PhD from Stanford (Percy Liang); MIT Technology Review Korean Innovators Under 35, 2022
2023
Stanford University
★ ~2031
UChicago CS
Associate Professor
CS & Data Science Institute, Univ. of Chicago
human-centered NLP · persuasion · computational social science · human-AI interaction
Hypothesis Generation with Large Language Models EMNLP 2024 workshop
Decision-Focused Summarization with LLMs EMNLP 2023
Winning Arguments: Interaction Dynamics and Persuasion WWW 2016 (highly cited)
Moved from CU Boulder to UChicago ~2020; NSF CAREER; promoted to associate ~2024
2016
Cornell University
University of California, San Diego — CSE & Halicioglu Data Science Institute
UCSD CSE
Associate Professor
CSE, UC San Diego
unsupervised NLP · historical text · music generation · language diversity
Efficient Unsupervised Methods for Historical Document Understanding TACL 2023
Generative AI for Music and Audio ICML 2024
NSF CAREER; moved from CMU postdoc to UCSD 2018; Economist-featured work on historical text AI
2014
UC Berkeley
UCSD HDSI
Assistant Professor
Halicioglu Data Science Institute, UC San Diego
LLM training · text generation · reinforcement learning · AI agents
RLHF Workflow: From Reward Modeling to Online RLHF TMLR 2024
Open Platypus: A Curated LLM Instruction Dataset COLM 2024
Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons ICML 2023
Joined UCSD HDSI 2020; NSF CAREER; active in LLM post-training
2020
CMU
UCSD CSE
Associate Professor
CSE, UC San Diego
multilingual NLP · knowledge extraction · LLM interpretability · AI for healthcare
Cross-Lingual Consistency of Factual Knowledge in Multilingual LLMs EMNLP 2023
Interpretable Features in LLMs for Healthcare QA ACL 2024
NSF CAREER; Otto Hahn Medal; founded Okalai AI (Africa AI education outreach)
2012
Saarland Univ. / MPI Informatics
UCSD CSE & HDSI
Assistant Professor
CSE & Halicioglu Data Science Institute, UC San Diego
weakly supervised NLP · text mining · biomedical NLP · LLM fine-tuning
AutoProg: Automated Prompt Generation for LLMs ACL 2024
Towards Minimal Supervision NLP via Cross-Domain Knowledge Transfer EMNLP 2023
Joined UCSD 2020; NSF CAREER 2023
2019
UIUC
★ ~2027
University of California, Santa Cruz — CS & Engineering
UCSC CSE
Assistant Professor
Computer Science & Engineering, UC Santa Cruz
semantic parsing · AMR · dialogue state tracking · machine translation
Diverse Retrieval-Augmented In-Context Learning for Dialogue State Tracking ACL 2023
JAMR: Semantic Parsing & Generation for AMR ACL 2014 (highly cited)
Joined UCSC CSE 2019; co-leads UCSC NLP group with Marilyn Walker
2019
CMU LTI
★ ~2026
UCSC CSE
Professor; Director, NLP MS Program
Computer Science & Engineering, UC Santa Cruz
dialogue systems · conversational AI · NLG · social media dialogue
Controllable Neural NLG for Dialogue ACL 2023
Collaborative AI Partner for Student Group Discussion EDM 2024
ACL Fellow; directs UCSC NLP MS program; longtime AT&T Bell Labs researcher before academia
1993
Univ. of Pennsylvania
University of Pennsylvania — Computer and Information Science
Penn CIS
Professor
Computer and Information Science, UPenn
LLMs & copyright · paraphrase · crowdsourcing · text generation
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows ACL 2024
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation COLM 2024
Towards Faithful Model Explanation in NLP: A Survey CL 2024
Testified before US Congress on LLMs and copyright law (2023); Sloan Fellow; 35,000+ citations
2008
Univ. of Edinburgh
Penn CIS
Assistant Professor
CIS, UPenn; part-time Staff Research Scientist, Apple MLR
multimodal generative models · non-autoregressive generation · world models · LLM agents
TARFlow: Improving Flow Matching for Text-to-Image Generation ICML 2025 (Oral)
Beyond the Imitation Game: Quantifying LM Capabilities (BIG-bench) TMLR 2023
Joined UPenn CIS 2025; previously Apple MLR (2022–2025) and Meta FAIR; part-time Apple affiliation continues
2019
Univ. of Hong Kong
★ ~2032
Penn CIS
Eduardo D. Glandt Distinguished Professor
CIS, UPenn; Chief AI Scientist, Oracle
natural language understanding · reasoning · structured prediction · neurosymbolic AI
Reasoning with Language Model Prompting: A Survey ACL 2023
Learning to Decompose and Compose for Question Answering NAACL 2024
ACM/AAAI/ACL Fellow; AAAS Fellow; IJCAI John McCarthy Award 2017; moved from AWS to Oracle as Chief AI Scientist ~2024 while remaining on the Penn faculty
1995
Harvard University
Penn CIS
Assistant Professor
Computer and Information Science, UPenn
vision-language models · multimodal reasoning · bias in vision-language
A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis NeurIPS 2024
PaCE: Parsimonious Concept Engineering for LLMs NeurIPS 2024
Situation Recognition: Visual Semantic Role Labeling CVPR 2016 (highly cited)
Joined UPenn CIS ~2020; previously AI2 Young Investigator; NSF CAREER
2017
Univ. of Washington
★ ~2027
⚠ Important Caveats & Data Quality Notes

Coverage: This listing covers 27 major US institutions with strong NLP programs. It is not exhaustive even within these schools: it focuses on faculty with demonstrable NLP/LM research activity in 2023–2025. Many faculty at these institutions doing adjacent work (computer vision, robotics, general ML) are not listed unless they have clear recent NLP/LM output.

Leave status: Confirmed on-leave faculty have been excluded (e.g., Sam Bowman at NYU, 2025–26; Justine Cassell, Alexander Waibel, and Eric Xing at CMU; Marine Carpuat at UMD, on sabbatical 2024–25). Statuses may have changed since data collection.

Tenure estimates (★): Based on the known hire year plus a typical 6-year US tenure clock. These are estimates only and must be verified independently.
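
The estimate rule above is simple arithmetic and can be sketched as follows. This is a minimal illustration in Python, not the procedure used to build this listing: the function name, the `Hire` record, and the example labels are hypothetical, and the clock length is left as a parameter because actual tenure clocks vary by institution.

```python
from dataclasses import dataclass

# Assumption: a 6-year clock, as stated in the caveat above.
TENURE_CLOCK_YEARS = 6


@dataclass(frozen=True)
class Hire:
    """Hypothetical record for one tenure-track hire."""
    label: str      # placeholder identifier, not a real faculty name
    hire_year: int  # calendar year the appointment started


def estimate_tenure_year(hire_year: int, clock_years: int = TENURE_CLOCK_YEARS) -> int:
    """Return the estimated tenure year: hire year plus the clock length."""
    if clock_years < 0:
        raise ValueError("clock_years must be non-negative")
    return hire_year + clock_years


# Illustrative inputs only.
example_hires = [Hire("example-a", 2023), Hire("example-b", 2026)]
estimates = {h.label: estimate_tenure_year(h.hire_year) for h in example_hires}

for label, year in estimates.items():
    print(f"{label}: ★ ~{year}")
```

Passing a different `clock_years` (for example 7) shifts every estimate by the same amount, which is one reason these values should be treated as approximate.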

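Publications: Papers listed are verified from researcher homepages and web sources; max 3 per faculty, prioritizing 2023–2025, with some highly cited earlier work included. Most venues are top-tier (ACL, EMNLP, NAACL, ICLR, NeurIPS, ICML, COLM, TACL, CL, or equivalent); a few entries are workshop papers, textbooks, or papers at venues in adjacent fields (e.g., SIGIR, CHI, EDM).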

Moves & institutional changes: Yejin Choi moved from UW to Stanford (effective June 2025); Mohit Iyyer moved from UMass to UMD (2025); Smaranda Muresan joined Barnard/Columbia as associate professor (2024); Eunsol Choi moved from UT Austin to NYU CDS/Courant (Fall 2024); Greg Durrett moved from UT Austin to NYU CDS/Courant (Fall 2025); Yizhong Wang is incoming at UT Austin (Fall 2026, not yet started). Changes after mid-2025 are not reflected.

Suggested verification: Faculty homepages · ACL Anthology · CSRankings.org · Semantic Scholar · institutional CS department pages

Additional institutions to be added in subsequent updates.