Hello! I am a second-year Ph.D. student in the Department of Computer Science at Purdue University, advised by Prof. Dan Goldwasser. My work centers on using natural language processing to understand and draw conclusions from large amounts of unstructured data. In particular, I build scalable and interpretable NLP systems that reason about human behavior, beliefs, and values as expressed in noisy real-world corpora—especially social media.
Recently, I developed ConceptCarve, a framework for identifying how abstract social concepts are expressed across communities by combining language model reasoning with scalable retrieval. I also introduced the Splits! dataset, a large Reddit-based dataset with demographic and topical annotations which allows for investigation of how different demographic groups communicate about shared topics. From a technical standpoint, my work has extensively involved large-scale data processing, retrieval, reranking, text clustering, and dataset design, collection, and validation.
Before coming to Purdue, I earned my B.Sc. in Computer Science and Mathematics from the University of Nebraska–Lincoln. There, I worked with Prof. Stephen Scott on continuous-layered neural architectures guided by integral equations, and with Prof. M. R. Hasan on improving classification performance for rare classes.
News