I am a sixth year Ph.D. student in the Johns Hopkins Computer Science department affiliated with the Center for Language and Speech Processing, where I am co-advised by a pair of ACL fellows, Jason Eisner (@adveisner) and David Yarowsky, which means my academic lineage forms a DAG, rather than a tree (see here). I specialize in Natural Language Processing, Computational Linguistics and Machine Learning, focusing on deep learning and statistical approaches to phonology, morphology, linguistic typology and low-resource languages. I have won best paper awards at ACL 2017 and EACL 2017 after having twice runnered-up (EMNLP 2015, NAACL 2016). Previously, I was a visiting Ph.D. student at the Center for Information and Language Processing at Ludwig-Maximilians-Universität München supported by a Fulbright Fellowship and a DAAD Research Grant under the supervision of Hinrich Schütze, with whom I still actively collaborate. These days, I also collaborate a lot with labmate Tim Vieira on interesting questions in speed-accuracy trade-offs in NLP and with Kat Vylomova on tasteful probability models for derivational morphology, the wild west of linguistics! I am supported by a Facebook Fellowship and an NDSEG graduate fellowship. On a lighter note, I'm a big fan of the passive voice and I insist forewent is a valid inflection of forego. Outside of the university, I spend a lot of time reading modern German-language literature; Thomas Mann, Hermann Hesse and Max Frisch are favorites.
Teaching: Jason Eisner and I are developing and co-instructing a brand new course on advanced machine learning for (linguistic) sequences. We called it seq2class.
Shared Tasking:I am co-organizing the CoNLL 2018 shared task on morphological reinflection, which will feature over 100 unique languages! Checkout a previous version of the shared task here, which was hosted in conjunction with SIGMORPHON.
Thoughts on Publishing: I've started continually updating my papers to include new insights, update the typography and fix small errors. I believe more and more that a paper should be the start of a conversation, rather than the end of one. Thus, it makes sense to see the document as living, rather than an ossified snapshot. The last date, on which I updated the work will appear in gray text in the upper-right-hand corner of the first page.
Thoughts on Reviewing: I will start posting the actual reviews my papers have received on my website, including my author response and, perhaps, additional commentary. I hope other authors follow suit in an effort to create a more open academic discourse about our work. If you have reviewed one of my papers and feel I did not adequately address your criticism, please drop me a line and I will happily add an addendum.Details: Full CV Google Scholar Semantic Scholar Twitter
- 2018/11: Giving a talk at The Friends School of Baltimore (USA)
- 2018/11: Giving a talk at University of Pennsylvania (USA)
- 2018/11: Giving a talk at Stony Brook (USA)
- 2018/04: Giving a talk at Yale University (USA)
- 2018/04: Giving a talk at Carnegie Mellon University (USA)
- 2018/03: Giving a talk at the University College London (UK)
- 2018/03: Giving a talk at the University of Cambridge (UK)
- 2018/02: Giving a talk at the University of Copenhagen (Denmark)
- 2018/01: Giving a talk at the University of Hamburg (Germany)
- 2017/11: Giving a talk at MIT (USA)
- 2017/10: Giving a talk at Yale University (USA)
- 2017/08: Received Best Paper Award at ACL 2017 (1/751)
- 2017/06: Starting an internship at Google Research, NYC
- 2017/05: Giving a talk at Universität Heidelberg hosted by Stefan Riezler
- 2017/03: Received Outstanding Paper Award at EACL 2017
- 2017/01: Attending the workshop "From Characters to Understanding Natural Language" at Schloss Dagstuhl in Germany
Alexander M. Hoyle, Lawrence Wolf-Sonkin, Hanna Wallach, Isabelle Augenstein and Ryan Cotterell. Unsupervised Discovery of Gendered Language through Latent-Variable Modeling. ACL. 2019.
Ran Zmigrod, Sebastian Mielke, Hanna Wallach and Ryan Cotterell. Counterfactual Data Augmentation for Mitigating Gender Bias in Languages with Rich Morphology. ACL. 2019.
Shijie Wu, Ryan Cotterell and Timothy J. O'Donnell. Measuring Morphological Irregularity. ACL. 2019.
Tiago Pimentel, Arya McCarthy, Damian Blasi, Brian Roark and Ryan Cotterell. Meaning to Form: Measuring Systematicity as Information. ACL. 2019.
Shijie Wu and Ryan Cotterell. Exact Hard Monotonic Attention for Character-Level Transduction. ACL. 2019.
Damian Blasi, Ryan Cotterell, Lawrence Wolf-Sonkin, Sabine Stoll, Balthasar Bickel and Marco Baroni. On the distribution of deep clausal embeddings: A large cross-linguistic study. ACL. 2019.
Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell and Isabelle Augenstein. Uncovering Typological Implications with Belief Nets. ACL. 2019.
Sebastian Mielke, Ryan Cotterell, Kyle Gorman, Brian Roark and Jason Eisner. What Kind of Language Is Hard to Language-Model?. ACL. 2019.
Ekaterina Vylomova, Ryan Cotterell, Trevor Cohn, Timothy Baldwin and Jason Eisner. Contextualization of Morphological Inflection. NAACL. 2019.
Jieyu Zhao, Tianlu Wang, Mark Yatskar, Ryan Cotterell, Vicente Ordóñez and Kai-Wei Chang. Gender Bias in Contextualized Word Embeddings. NAACL. 2019.
Alexander Hoyle, Lawrence Wolf-Sonkin, Hanna Wallach, Ryan Cotterell and Isabelle Augenstein. Combining Disparate Sentiment Lexica with a Multi-View Variational Autoencoder. NAACL. 2019.
Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell and Isabelle Augenstein. A Probabilistic Generative Model of Linguistic Typology. NAACL. 2019.
Shijia Liu, Hongyuan Mei, Adina Williams and Ryan Cotterell. On the Idiosyncrasies of the Mandarin Chinese Classifier System. NAACL. 2019.
Chaitanya Malaviya*, Shijie Wu* and Ryan Cotterell. A Simple Joint Model for Improved Contextual Neural Lemmatization. NAACL. 2019.
Christo Kirov and Ryan Cotterell. Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate. TACL. 2019.
Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Arya D. McCarthy, Katharina Kann, Sebastian Mielke, Garrett Nicolai, Miikka Silfverberg, David Yarowsky, Jason Eisner and Mans Hulden. The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection. CoNLL. 2018.
Ryan Cotterell and Julia Kreutzer. Explaining and Generalizing Back-Translation through Wake-Sleep. arXiv. 2018. arXiv
Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sebastian Mielke, Arya McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner and Mans Hulden. UniMorph 2.0: Universal Morphology. LREC. 2018.
Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sandra Kübler, David Yarowsky, Jason Eisner, and Mans Hulden. CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages. CoNLL. 2017. pdf arXiv
Francis Ferraro, Adam Poliak, Ryan Cotterell and Benjamin Van Durme. Frame-Based Continuous Lexical Semantics through Exponential Family Tensor Factorization and Semantic Proto-Roles. *SEM. 2017. pdf arXiv
Christo Kirov, John Sylak-Glassman, Rebecca Knowles, Ryan Cotterell and Matt Post. A Rich Morphological Tagger for English: Exploring the Cross-Linguistic Tradeoff Between Morphology and Syntax. EACL. 2017. pdf slides
Chandler May, Ryan Cotterell and Benjamin Van Durme. Analysis of Morphology in Topic Modeling. arXiv preprint. 2016. arXiv
Gaurav Kumar, Yuan Cao, Ryan Cotterell, Chris Callison-Burch, Daniel Povey and Sanjeev Khudanpur. Translation of the CALLHOME Egyptian Arabic Corpus For Conversational Speech Translation. IWSLT. 2014. pdf
Ryan Cotterell, Adithya Renduchintala, Naomi Saphra, and Chris Callison-Burch. An Algerian Arabic-French Code-Switched Corpus. LREC Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools. 2014. pdf data
David Etter, Francis Ferraro, Ryan Cotterell, Olivia Buzek, and Benjamin Van Durme. Nerit: Named Entity Recognition for Informal Text. Technical Report 11. HLTCOE, Johns Hopkins University. July, 2013. pdf
- Machine Learning: Linguistic and Sequence Modeling (601.765) - Co-Instructor - Spring 2018
- Machine Learning (600.475) - Teaching Assistant - Fall 2016
- Automata and Computation Theory (600.271) - Teaching Assistant - Spring 2014
- Natural Language Processing (600.465) - Teaching Assistant - Fall 2013