publications
Preprints
- Interpreting Transformer’s Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT. [Arxiv]
2023
- [ACL] Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) 2023
- [ACL] Debiasing NLP Models Without Demographic Information. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) 2023 [Arxiv]
- [ACL] Parallel Context Windows Improve In-Context Learning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) 2023
- [ACL] What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) 2023 [Arxiv]
- [ICLR]
- [ICLR]
- [AAAI]
2022
- [EMNLP]
- [NEJLT]
- [NeurIPS]
- [NeurIPS]
- [*SEM]
- [NAACL]
- [ICLR]
- [CL]
- [AAAI]
- Large-Scale Electronic Corpora and the Study of Middle and Mixed Arabic. In Middle and Mixed Arabic over Time and across Written and Oral Genres: From Legal Documents to Television and Internet through Literature. Proceedings of the IVth AIMA International Conference (Emory University, Atlanta, GA, USA, 12–15 October 2013) 2022 [PDF]
2021
- [NeurIPS]
- [EMNLP]
- [ACL] Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models. In Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP) 2021 [Abstract] [PDF] [Code] [Arxiv]
- [ICASSP]
- [ICLR]
- [ICLR]
- [EACL]
2020
- [NeurIPS]
- [EMNLP]
- [EMNLP]
- [WMT]
- [ACL]
- [ACL]
- [ACL]
- [ICLR] A Constructive Prediction of the Generalization Error Across Scales. In International Conference on Learning Representations (ICLR) 2020 [Abstract] [PDF] [Arxiv] [Media: MIT CSAIL News, The Batch]
- [CL]
2019
- [Interspeech]
- [Blackbox]
- [WMT]
- [CogSci]
- [ACL]
- [ACL] Don’t Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL) 2019 [Abstract] [PDF] [Slides] [Code] [Arxiv] [Talk] [Media: Harvard News, TechXplore]
- [NAACL]
- [NAACL]
- [*SEM] On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference. In Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM, Oral presentation) 2019 [Abstract] [PDF] [Slides] [Code] [Arxiv] [Media: Harvard News, TechXplore]
- [ICLR]
- [LRE]
- [SCiL]
- [AAAI]
- [AAAI]
- [TACL]
2018
- [NAACL] On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference. In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT) 2018 [Abstract] [PDF] [Code] [Arxiv]
- On Internal Language Representations in Deep Learning: An Analysis of Machine Translation and Speech Recognition. PhD Thesis, Massachusetts Institute of Technology 2018 [PDF]
- [ICLR] Synthetic and Natural Noise Both Break Neural Machine Translation. In International Conference on Learning Representations (ICLR, Oral presentation) 2018 [Abstract] [PDF] [Code] [Arxiv] [Media: Taiwanese Tech news, The Gradient]
2017
- [IWSLT]
- [NeurIPS]
- [IJCNLP]
- [IJCNLP]
- [Interspeech]
- [IPM]
- [ACL]
- [ACL]
- [ICLR]
2016
- [Coling]
- Improving Sequence to Sequence Learning for Morphological Inflection Generation: The BIU-MIT Systems for the SIGMORPHON 2016 Shared Task for Morphological Reinflection. In Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology (SIGMORPHON at ACL) 2016 [Abstract] [PDF]
- [SemEval]
2015
- [EMNLP]
- [SemEval]
2014
- [TACL]
- The Arabic Dialect of Ǧisir izZarga: Linguistic description and a preliminary classification, with sample texts. Master's Thesis, Tel Aviv University 2014 [PDF]
- Neural Network Architectures for Prepositional Phrase Attachment Disambiguation. Master's Thesis, Massachusetts Institute of Technology 2014 [PDF]
2013
- [ACL]
- arTenTen: a new, vast corpus for Arabic. In Proceedings of the Second Workshop on Arabic Corpus Linguistics (WACL) 2013 [PDF]