Byron Wallace
(he/him)
Sy and Laurie Sternberg Interdisciplinary Associate Professor, Associate Dean of Graduate Programs, Director of the BS in Data Science Program
Research interests
- Data mining
- Machine learning
- Natural language processing
Education
- PhD in Computer Science, Tufts University
- BS in Computer Science, University of Massachusetts at Amherst
Biography
Byron Wallace is an associate dean of graduate programs, director of the undergraduate data science program, and the Sy and Laurie Sternberg Interdisciplinary Associate Professor in the Khoury College of Computer Sciences at Northeastern University, based in Boston.
Wallace’s research areas include artificial intelligence, data science, machine learning, natural language processing, and information retrieval, with an emphasis on applications in health informatics. He is a member of the applied machine learning group and the Data Science and Analytics Lab at Northeastern. Before joining Northeastern, Wallace taught at Brown University and the University of Texas at Austin.
Wallace develops machine learning and natural language processing methods that make synthesizing the vast biomedical evidence base more efficient. He also works on core machine learning and natural language processing methods; his recent work explores convolutional neural network architectures for text. He has also recently been developing hybrid, interactive human–machine learning systems that aim to combine human and machine intelligence.
Wallace's work has been supported by grants from the Army Research Office, the NIH, and the NSF. He won the Tufts University 2012 Outstanding Graduate Researcher award, and his thesis work was recognized as the runner-up for the 2013 ACM Special Interest Group on Knowledge Discovery and Data Mining Dissertation Award. Wallace also co-authored the winning submission for the Health Care Data Analytics Challenge at the 2015 IEEE International Conference on Healthcare Informatics.
Recent publications
- Learning from Natural Language Explanations for Generalizable Entity Matching
  Citation: Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace, Luyang Kong. (2024). Learning from Natural Language Explanations for Generalizable Entity Matching. EMNLP, 6114-6129. https://aclanthology.org/2024.emnlp-main.352
- Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs
  Citation: Sheridan Feucht, David Atkinson, Byron C. Wallace, David Bau. (2024). Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs. EMNLP, 9727-9739. https://aclanthology.org/2024.emnlp-main.543
- On-the-fly Definition Augmentation of LLMs for Biomedical NER
  Citation: Monica Munnangi, Sergey Feldman, Byron C. Wallace, Silvio Amir, Tom Hope, Aakanksha Naik. (2024). On-the-fly Definition Augmentation of LLMs for Biomedical NER. CoRR, abs/2404.00152. https://doi.org/10.48550/arXiv.2404.00152
- Function Vectors in Large Language Models
  Citation: Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau. (2024). Function Vectors in Large Language Models. ICLR. https://openreview.net/forum?id=AwyxtyMwaG
- Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews
  Citation: Hye Sun Yun, Iain James Marshall, Thomas A. Trikalinos, Byron C. Wallace. (2023). Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews. EMNLP, 10122-10139. https://aclanthology.org/2023.emnlp-main.626
- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State
  Citation: Pal, K., Sun, J., Yuan, A., Wallace, B.C., & Bau, D. (2023). Future Lens: Anticipating Subsequent Tokens from a Single Hidden State. ArXiv, abs/2311.04897.
- Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
  Citation: Lucy Lu Wang, Yulia Otmakhova, Jay DeYoung, Thinh Hung Truong, Bailey Kuehl, Erin Bransom, Byron C. Wallace. (2023). Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations. ACL (1), 9871-9889. https://aclanthology.org/2023.acl-long.549
- Summarizing, Simplifying, and Synthesizing Medical Evidence using GPT-3 (with Varying Success)
  Citation: Chantal Shaib, Millicent L. Li, Sebastian Joseph, Iain James Marshall, Junyi Jessy Li, Byron C. Wallace. (2023). Summarizing, Simplifying, and Synthesizing Medical Evidence using GPT-3 (with Varying Success). ACL (2), 1387-1407. https://aclanthology.org/2023.acl-short.119
- Revisiting Relation Extraction in the era of Large Language Models
  Citation: Somin Wadhwa, Silvio Amir, Byron C. Wallace. (2023). Revisiting Relation Extraction in the era of Large Language Models. CoRR, abs/2305.05003. https://doi.org/10.48550/arXiv.2305.05003
- Evaluating Factuality in Text Simplification
  Citation: Ashwin Devaraj, William Sheffield, Byron C. Wallace, Junyi Jessy Li. (2022). Evaluating Factuality in Text Simplification. ACL (1), 7331-7345. https://doi.org/10.18653/v1/2022.acl-long.506
- That’s the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data
  Citation: Jered McInerney, Geoffrey Young, Jan-Willem van de Meent, Byron C. Wallace. (2022). That's the Wrong Lung! Evaluating and Improving the Interpretability of Unsupervised Multimodal Encoders for Medical Data. EMNLP, 3626-3648. https://aclanthology.org/2022.emnlp-main.238
- PHEE: A Dataset for Pharmacovigilance Event Extraction from Text
  Citation: Zhaoyue Sun, Jiazheng Li, Gabriele Pergola, Byron C. Wallace, Bino John, Nigel Greene, Joseph Kim, Yulan He. (2022). PHEE: A Dataset for Pharmacovigilance Event Extraction from Text. EMNLP, 5571-5587. https://aclanthology.org/2022.emnlp-main.376
- Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization
  Citation: Ye Zhang, Matthew Lease, Byron C. Wallace