This page lists most of our publications and preprints since 2015. (It includes a few non-paper citations, like invited talks or edited volumes.) Papers are also available from the personal websites of lab members.
2023
- ACL. Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling. In Association for Computational Linguistics, 2023
- ACL. Evaluating Zero-Shot Event Structures: Recommendations for Automatic Content Extraction (ACE) Annotations. In Association for Computational Linguistics, 2023
- ACL. A Critical Evaluation of Evaluations for Long-form Question Answering. In Association for Computational Linguistics, 2023
- ACL Findings. Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond. In Findings of the Association for Computational Linguistics, 2023
- ACL Findings. Causal Matching with Text Embeddings: A Case Study in Estimating the Causal Effects of Peer Review Policies. In Findings of the Association for Computational Linguistics, 2023
- EACL. LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization. In European Chapter of the Association for Computational Linguistics, 2023
- EACL Findings. ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution. In Findings of the European Chapter of the Association for Computational Linguistics, 2023
2022
- AAAI. Sublinear Time Approximation of Text Similarity Matrices. In Proceedings of the AAAI Conference on Artificial Intelligence, 2022
- AAAI. An Evaluative Measure of Clustering Methods Incorporating Hyperparameter Sensitivity. In Proceedings of the AAAI Conference on Artificial Intelligence, 2022
- ACL. Softmax Bottleneck Makes Language Models Unable to Represent Multi-mode Word Distributions. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
- ACL. RELiC: Retrieving Evidence for Literary Claims. In Association for Computational Linguistics, 2022
- ACL. Word2Box: Capturing Set-Theoretic Semantics of Words using Box Embeddings. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
- ACL. Event-Event Relation Extraction using Probabilistic Box Embedding. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022
- EMNLP. SLING: Sino Linguistic Evaluation of Large Language Models. In Empirical Methods in Natural Language Processing, 2022
- EMNLP. Efficient Nearest Neighbor Search for Cross-Encoder Models using Matrix Factorization. In Empirical Methods in Natural Language Processing, 2022
- EMNLP. Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation. In Empirical Methods in Natural Language Processing, 2022
- EMNLP. RankGen: Improving Text Generation with Large Ranking Models. In Empirical Methods in Natural Language Processing, 2022
- EMNLP. Exploring Document-Level Literary Machine Translation with Parallel Paragraphs from World Literature. In Empirical Methods in Natural Language Processing, 2022
- EMNLP. DEMETR: Diagnosing Evaluation Metrics for Translation. In Empirical Methods in Natural Language Processing, 2022
- EMNLP-Findings. You can’t pick your neighbors, or can you? When and How to Rely on Retrieval in the KNN-LM. In Empirical Methods in Natural Language Processing, 2022
- Field Matters. Corpus-Guided Contrast Sets for Morphosyntactic Feature Detection in Low-Resource English Varieties. In Proceedings of the 1st Field Matters Workshop on NLP Applications to Field Linguistics, 2022
- ICML. Knowledge base question answering by case-based reasoning over subgraphs. In International Conference on Machine Learning, 2022
- ICML. Interactive Correlation Clustering with Existential Cluster Constraints. In International Conference on Machine Learning, 2022
- NAACL. Entity Linking via Explicit Mention-Mention Coreference Modeling. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
- NAACL. ChapterBreak: A Challenge Dataset for Long-Range Language Models. In North American Association for Computational Linguistics, 2022
- NAACL. DISAPERE: A Dataset for Discourse Structure in Peer Review Discussions. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
- NAACL. Modeling Exemplification in Long-form Question Answering via Retrieval. In North American Association for Computational Linguistics, 2022
- NLP+CSS. Examining Political Rhetoric with Epistemic Stance Detection. In Proceedings of the Fifth Workshop on Natural Language Processing and Computational Social Science, 2022
- Negative Results. How Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge? In Workshop on Insights from Negative Results in NLP @ ACL 2022, 2022
- TIIS. ClioQuery: Interactive Query-Oriented Text Analytics for Comprehensive Investigation of Historical News Archives. ACM Trans. Interact. Intell. Syst., 2022
- W-NUT. Cross-Dialect Social Media Dependency Parsing for Social Scientific Entity Attribute Analysis. In Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022), 2022
2021
- ACL Findings. Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021
- Causal NLP. Text as Causal Mediators: Research Design for Causal Estimates of Differential Treatment of Social Groups via Language Aspects. In Proceedings of the First Workshop on Causal Inference and NLP, 2021
- EMNLP. Open Aspect Target Sentiment Classification with Natural Language Prompts. In forthcoming EMNLP, 2021
- EMNLP. MS-Mentions: Consistently Annotating Entity Mentions in Materials Science Procedural Text. In forthcoming EMNLP, 2021
- EMNLP. Math Word Problem Generation with Mathematical Consistency and Problem Context Constraints. In forthcoming EMNLP, 2021
- EMNLP. Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration. In forthcoming EMNLP, 2021
- EMNLP. IGA: An Intent-Guided Authoring Assistant. In forthcoming EMNLP, 2021
- EMNLP. Improved Latent Tree Induction with Distant Supervision via Span Constraints. In forthcoming EMNLP, 2021
- EMNLP. Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
- EMNLP. The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation. In forthcoming EMNLP, 2021
- EMNLP. Do Long-Range Language Models Actually Use Long-Range Context? In forthcoming EMNLP, 2021
- EMNLP. Making Better Use of Unlabeled Data with Task Augmentation and Self-training. In forthcoming EMNLP, 2021
- EMNLP. Case-based Reasoning for Natural Language Questions over Knowledge Bases. In forthcoming EMNLP, 2021
- EMNLP demo. Box Embeddings: An open-source library for representation learning using geometric structures. In forthcoming EMNLP demo, 2021
- NeurIPS. Capacity and Bias of Learned Geometric Embeddings for Directed Graphs. Advances in Neural Information Processing Systems, 2021
- UAI. Min/max stability and box distributions. In Uncertainty in Artificial Intelligence, 2021
- UAI. Exact and approximate hierarchical clustering using A*. In Uncertainty in Artificial Intelligence, 2021
2020
- AAAI. Simultaneously linking entities and extracting relations from biomedical text without mention-level supervision. In Proceedings of the AAAI Conference on Artificial Intelligence, 2020
- COLING. Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks. In Proceedings of the 28th International Conference on Computational Linguistics, 2020
- EMNLP. Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
2019
- ACL. Optimal Transport-based Alignment of Learned Character Representations for String Similarity. In Association for Computational Linguistics (ACL), 2019
- KDD. Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space. In International Conference on Knowledge Discovery and Data Mining (KDD), 2019
- KDD. Scalable Hierarchical Clustering with Tree Grafting. In International Conference on Knowledge Discovery and Data Mining (KDD), 2019
2018
- COLING
- NAACL. Training Structured Prediction Energy Networks with Indirect Supervision. In Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL) (Oral), 2018
2017
- DISCML. Gradient-based Hierarchical Clustering. In NIPS Workshop on Discrete Structures in Machine Learning (DISCML) (Oral), 2017
2016
- NAACL. Multilingual Relation Extraction using Compositional Universal Schema. In NAACL HLT 2016, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (HLT/NAACL), San Diego, California, USA, June 12-17, 2016 (Oral), 2016
2015
- ACL. Compositional Vector Space Models for Knowledge Base Completion. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL), July 26-31, 2015, Beijing, China, Volume 1: Long Papers, 2015
- ACL. Learning Dynamic Feature Selection for Fast Sequential Prediction. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL), July 26-31, 2015, Beijing, China, Volume 1: Long Papers (Outstanding Paper Award), 2015
- MPSA. A Little Bit of NLP Goes A Long Way: Finding Meaning in Legislative Texts with Phrase Extraction. Midwest Political Science Association (MPSA) 73rd Annual Conference, Chicago (IL), 2015