Research overview

My research is in Natural Language Processing and Machine Learning, with an emphasis on applications in health.

Working in the health domain naturally motivates the methodological problems I have pursued, including model interpretability, learning with limited supervision from diverse sources, human-in-the-loop/hybrid systems, and the trustworthiness of model outputs. For more details, see my recent publications.

On the applications side, one thread of my research concerns developing language technologies to automate (or semi-automate) biomedical evidence synthesis. Here is an episode of the NLP Highlights podcast in which I discuss this work, here is a (brief) talk I gave at SciNLP 2020, and here is an article written for a lay audience about the effort. Elsewhere, I have worked on models for processing notes in Electronic Health Records (EHRs).

A random sample of recentish publications

Jaden Fiotto-Kaufman, Alexander R. Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace and David Bau. NNsight and NDIF: Democratizing Access to Foundation Model Internals. ICLR, 2025.

Alberto Mario Ceballos-Arroyo, Monica Munnangi, Jiuding Sun, Karen Zhang, Jered McInerney, Byron C. Wallace and Silvio Amir. Open (Clinical) LLMs are Sensitive to Instruction Phrasings. BioNLP, 2024.

Somin Wadhwa, Chantal Shaib, Silvio Amir and Byron C. Wallace. Who Taught You That? Tracing Teachers in Model Distillation. ACL (Findings), 2025.

News

09/19/2025 Open Philanthropy grant

Open Philanthropy has awarded me a grant to work on "Mechanistic Interpretability for Healthcare."

09/19/2025 NeurIPS spotlight

Our paper, "Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in Language Models," received a Spotlight designation at NeurIPS 2025 (top ~4% of 21,575 submissions).

01/16/2024 ICLR Spotlight

Our paper, "Evaluating the Zero-shot Robustness of Instruction-tuned Language Models," was accepted as a Spotlight (top 5%) at ICLR 2024.

10/01/2022 Helping radiologists navigate EHRs

We have received a new R01 from the NIH/NLM to work on neural summarization methods to aid diagnosis (collaboration with Dr. Geoffrey Young at Brigham and Women's Hospital).

Support

My work has been supported by grants from the National Institutes of Health, the National Science Foundation (including a CAREER award), the Army Research Office, Seton Hospital, Amazon, and seed funds from Brown University.