Photo of Me. I am a postdoctoral researcher at the Johns Hopkins University Human Language Technology Center of Excellence, where I focus on enabling human annotation of natural language data and improving natural language processing software infrastructure. I have a PhD in computer science from Johns Hopkins University and wrote my dissertation on the relationship between theory and practice in topic modeling. I have a BS in mathematics from Harvey Mudd College, and between college and grad school I performed research and software engineering at Pacific Northwest National Laboratory. My pronouns are she/her.

Software

Some software I’ve contributed to:

Concrete-python
A Python interface to the Concrete communication protocol for annotated text.

Apache Thrift
A protocol for cross-language, cross-platform services. Used by Concrete.

Turkle
A Python (Django) web interface for crowd work. Largely compatible with Mechanical Turk formats.

docker-http-api-example
Example code for an HTTP service that runs a machine learning prediction task.

Commented word2vec
The word2vec word representation learning C code, commented.

Publications

Some publications I’ve contributed to:

Topic Modeling in Theory and Practice
Chandler May
Doctoral dissertation, Johns Hopkins University, 2022

You Keep Using That Word: Ways of Thinking about Gender in Computing Research
Os Keyes, Chandler May, and Annabelle Carrell
Proceedings of the ACM on Human-Computer Interaction, 2021
bib

On Measuring Social Biases in Sentence Encoders
Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, and Rachel Rudinger
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, 2019
code bib

Streaming Word Embeddings with the Space-Saving Algorithm
Chandler May, Kevin Duh, Benjamin Van Durme, and Ashwin Lall
Preprint, 2017
code bib

Social Bias in Elicited Natural Language Inferences
Rachel Rudinger, Chandler May, and Benjamin Van Durme
Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, 2017
code bib

An Analysis of Lemmatization on Topic Models of Morphologically Rich Language
Chandler May, Ryan Cotterell, and Benjamin Van Durme
Preprint, 2016
bib

Topic Identification and Discovery on Text and Speech
Chandler May, Francis Ferraro, Alan McCree, Jonathan Wintrode, Daniel Garcia-Romero, and Benjamin Van Durme
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
bib

Particle Filter Rejuvenation and Latent Dirichlet Allocation
Chandler May, Alex Clemmer, and Benjamin Van Durme
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
bib