Photo of Me. I am a research associate at the Johns Hopkins University Human Language Technology Center of Excellence (HLTCOE), where I focus on enabling human annotation of natural language data and improving natural language processing software infrastructure. I earned my PhD in computer science from Johns Hopkins University, writing my dissertation on the relationship between theory and practice in topic modeling. I have a bachelor’s in mathematics from Harvey Mudd College, and between college and grad school I performed research and software engineering at Pacific Northwest National Laboratory. My pronouns are she/her.

Software

Selected software I’ve contributed to:

Concrete-python
A Python interface to the Concrete communication protocol for annotated text.

Concrete-js
A JavaScript/Node.js interface to Concrete.

Apache Thrift
A protocol for cross-language, cross-platform services. Used by Concrete.

Turkle
A Python (Django) web interface for crowd work. Largely compatible with Mechanical Turk formats.

Commented word2vec
The word2vec word representation learning C code, commented.

Publications

Selected publications I’ve contributed to:

Topic Modeling in Theory and Practice
Chandler May
Doctoral dissertation, Johns Hopkins University, 2022

You Keep Using That Word: Ways of Thinking about Gender in Computing Research
Os Keyes, Chandler May, and Annabelle Carrell
Proceedings of the ACM on Human-Computer Interaction, 2021

On Measuring Social Biases in Sentence Encoders
Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, and Rachel Rudinger
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, 2019
code

Streaming Word Embeddings with the Space-Saving Algorithm
Chandler May, Kevin Duh, Benjamin Van Durme, and Ashwin Lall
Preprint, 2017
code

Social Bias in Elicited Natural Language Inferences
Rachel Rudinger, Chandler May, and Benjamin Van Durme
Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, 2017
code

An Analysis of Lemmatization on Topic Models of Morphologically Rich Language
Chandler May, Ryan Cotterell, and Benjamin Van Durme
Preprint, 2016
code

Topic Identification and Discovery on Text and Speech
Chandler May, Francis Ferraro, Alan McCree, Jonathan Wintrode, Daniel Garcia-Romero, and Benjamin Van Durme
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Particle Filter Rejuvenation and Latent Dirichlet Allocation
Chandler May, Alex Clemmer, and Benjamin Van Durme
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
code

Miscellaneous

Pseudo-scientific boat physics sim