This page lists some of our publications and preprints since 2015. More complete lists of publications are available from the personal websites of lab members.
2024
- ACLIn Association for Computational Linguistics, 2024
- ACLIn Association for Computational Linguistics, 2024
- ACLMultistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence GenerationIn Association for Computational Linguistics, 2024
- ACLIn Association for Computational Linguistics, 2024
- ACL FindingsIn Findings of the Association for Computational Linguistics, 2024
- ACL FindingsIn Findings of the Association for Computational Linguistics: ACL, 2024
- BEAIn 9th Workshop on Innovative Use of NLP for Building Educational Applications (BEA), 2024
- BEAIn 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA), 2024
- COLMIteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated ImagesIn Conference on Language Modeling, 2024
- COLMFABLES: Evaluating faithfulness and content selection in book-length summarizationIn Conference on Language Modeling, 2024
- EACLPEARL: Prompting Large Language Models to Plan and Execute Actions Over Long DocumentsIn European Chapter of the Association for Computational Linguistics, 2024
- EACL FindingsIn Findings of the Association for Computational Linguistics: EACL, 2024
- ICLRBooookScore: A systematic exploration of book-length summarization in the era of LLMsIn International Conference on Learning Representations, 2024
- J. PoliticsForthcoming, Journal of Politics, 2024
- JLCForthcoming, Journal of Law and Courts, 2024
- ML4ALIn 1st Workshop on Machine Learning for Ancient Languages (ML4AL), 2024
- NAACLTopicGPT: A Prompt-based Topic Modeling FrameworkIn North American Chapter of the Association for Computational Linguistics, 2024
- NAACL FindingsIn Findings of the Association for Computational Linguistics: NAACL, 2024
- NAACL FindingsIn Findings of the Association for Computational Linguistics: NAACL, 2024
- NAACL FindingsGEE! Grammar Error Explanation with Large Language ModelsIn Findings of the Association for Computational Linguistics: NAACL 2024, 2024
- NLP+CSSIn Sixth Workshop on Natural Language Processing and Computational Social Science (NLP+CSS), 2024
- NVSQIn Nonprofit and Voluntary Sector Quarterly, 2024
- PRQ
- PersonalizationIn 1st Workshop on Personalization of Generative AI Systems (PERSONALIZE), 2024
- WSDMIn The 17th ACM International Conference on Web Search and Data Mining (WSDM) , 2024
- WWWIn Proceedings of ACM The Web Conference (Web4Good Track), 2024
2023
- ACLA Critical Evaluation of Evaluations for Long-form Question AnsweringIn Association of Computational Linguistics, 2023
- ACLIn Association for Computational Linguistics, 2023
- ACLMulti-CLS BERT: An Efficient Alternative to Traditional EnsemblingIn Association of Computational Linguistics, 2023
- ACLEvaluating Zero-Shot Event Structures: Recommendations for Automatic Content Extraction (ACE) AnnotationsIn Association of Computational Linguistics, 2023
- ACLIn Association for Computational Linguistics, 2023
- ACLIn Association for Computational Linguistics, 2023
- ACL FindingsRevisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and BeyondIn Findings of Association of Computational Linguistics, 2023
- ACL FindingsIn Findings of the Association for Computational Linguistics: ACL, 2023
- ACL FindingsFreshLLMs: Refreshing Large Language Models with Search Engine AugmentationIn Findings of the Association for Computational Linguistics, 2023
- ACL FindingsCausal Matching with Text Embeddings: A Case Study in Estimating the Causal Effects of Peer Review PoliciesIn Findings of Association of Computational Linguistics, 2023
- BEAImproving Reading Comprehension Question Generation with Data Augmentation and Overgenerate-and-rankIn 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA), 2023
- EACLLongEval: Guidelines for Human Evaluation of Faithfulness in Long-form SummarizationIn European Chapter of the Association for Computational Linguistics, 2023
- EACL FindingsezCoref: Towards Unifying Annotation Guidelines for Coreference ResolutionIn Findings of European Chapter of the Association for Computational Linguistics, 2023
- EMNLPKNN-LM Does Not Improve Open-ended Text GenerationIn EMNLP, 2023
- EMNLP
- EMNLPFActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text GenerationIn EMNLP, 2023
- EMNLP FindingsIn Findings of the Association for Computational Linguistics: EMNLP, 2023
- EMNLP FindingsIn Findings of the Association for Computational Linguistics: EMNLP, 2023
- EMNLP FindingsPaRaDe: Passage Ranking using Demonstrations with LLMsIn Findings of EMNLP, 2023
- EMNLP FindingsDisco Elysium: Exploring Player Perceptions of LLM-Generated Dialogue within a Commercial Video GameIn Findings of EMNLP, 2023
- EMNLP FindingsIn Findings of the Association for Computational Linguistics: EMNLP, 2023
- NeurIPSParaphrasing evades detectors of AI-generated text, but retrieval is an effective defenseIn Conference on Neural Information Processing Systems, 2023
- WMTLarge language models effectively leverage document-level context for literary translation, but critical errors persistIn WMT, 2023
2022
- AAAISublinear Time Approximation of Text Similarity MatricesIn Proceedings of the AAAI Conference on Artificial Intelligence, 2022
- AAAIProceedings of the AAAI Conference on Artificial Intelligence, 2022
- ACLIn Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
- ACLRELiC: Retrieving Evidence for Literary ClaimsIn Association of Computational Linguistics, 2022
- ACLIn Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022
- ACLIn Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
- EMNLPEfficient Nearest Neighbor Search for Cross-Encoder Models using Matrix FactorizationIn Empirical Methods in Natural Language Processing, 2022
- EMNLPRankGen: Improving Text Generation with Large Ranking ModelsIn Empirical Methods in Natural Language Processing, 2022
- EMNLPOvercoming Catastrophic Forgetting in Zero-Shot Cross-Lingual GenerationIn Empirical Methods in Natural Language Processing, 2022
- EMNLPExploring Document-Level Literary Machine Translation with Parallel Paragraphs from World LiteratureIn Empirical Methods in Natural Language Processing, 2022
- EMNLPSLING: Sino Linguistic Evaluation of Large Language ModelsIn Empirical Methods in Natural Language Processing, 2022
- EMNLPDEMETR: Diagnosing Evaluation Metrics for TranslationIn Empirical Methods in Natural Language Processing, 2022
- EMNLP-FindingsYou can’t pick your neighbors, or can you? When and How to Rely on Retrieval in the KNN-LMIn Empirical Methods in Natural Language Processing, 2022
- Field MattersCorpus-Guided Contrast Sets for Morphosyntactic Feature Detection in Low-Resource English VarietiesIn Proceedings of the 1st Field Matters Workshop on NLP Applications to Field Linguistics, 2022
- ICMLKnowledge base question answering by case-based reasoning over subgraphsIn International Conference on Machine Learning, 2022
- ICMLInteractive Correlation Clustering with Existential Cluster ConstraintsIn International Conference on Machine Learning, 2022
- LRECIn Language Resources and Evaluation, 2022
- NAACLIn Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
- NAACLIn Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
- NAACLModeling Exemplification in Long-form Question Answering via RetrievalIn North American Association for Computational Linguistics, 2022
- NAACLChapterBreak: A Challenge Dataset for Long-Range Language ModelsIn North American Association for Computational Linguistics, 2022
- NLP+CSSExamining Political Rhetoric with Epistemic Stance DetectionIn Proceedings of the Fifth Workshop on Natural Language Processing and Computational Social Science, 2022
- Negative ResultsHow Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge?In Workshop on Insights from Negative Results in NLP @ ACL 2022, 2022
- TIISACM Trans. Interact. Intell. Syst., 2022
- W-NUTCross-Dialect Social Media Dependency Parsing for Social Scientific Entity Attribute AnalysisIn Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022), 2022
2021
- AAAIExtending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic ApplicationsIn Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021
- ACLIn Association for Computational Linguistics, 2021
- ACL FindingsIn Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021
- Causal NLPIn Proceedings of the First Workshop on Causal Inference and NLP, 2021
- EMNLPOpen Aspect Target Sentiment Classification with Natural Language PromptsIn forthcoming EMNLP, 2021
- EMNLPDiverse Distributions of Self-Supervised Tasks for Meta-Learning in NLPIn forthcoming EMNLP, 2021
- EMNLPMS-Mentions: Consistently Annotating Entity Mentions in Materials Science Procedural TextIn forthcoming EMNLP, 2021
- EMNLPMath Word Problem Generation with Mathematical Consistency and Problem Context ConstraintsIn forthcoming EMNLP, 2021
- EMNLPPhrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus ExplorationIn forthcoming EMNLP, 2021
- EMNLPIGA: An Intent-Guided Authoring AssistantIn forthcoming EMNLP, 2021
- EMNLPImproved Latent Tree Induction with Distant Supervision via Span ConstraintsIn forthcoming EMNLP, 2021
- EMNLPIn Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
- EMNLPThe Perils of Using Mechanical Turk to Evaluate Open-Ended Text GenerationIn forthcoming EMNLP, 2021
- EMNLPDo Long-Range Language Models Actually Use Long-Range Context?In forthcoming EMNLP, 2021
- EMNLPMaking Better Use of Unlabeled Data with Task Augmentation and Self-trainingIn forthcoming EMNLP, 2021
- EMNLPCase-based Reasoning for Natural Language Questions over Knowledge BasesIn forthcoming EMNLP, 2021
- EMNLP demoBox Embeddings: An open-source library for representation learning using geometric structuresIn forthcoming EMNLP demo, 2021
- Find. of ACLIn Findings of the Association for Computational Linguistics, 2021
- Find. of ACLIn Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021
- NAACLIn North American Association for Computational Linguistics, 2021
- NAACLIn North American Association for Computational Linguistics, 2021
- NAACLIn North American Association for Computational Linguistics, 2021
- NeurIPSCapacity and Bias of Learned Geometric Embeddings for Directed GraphsAdvances in Neural Information Processing Systems, 2021
- UAIMin/max stability and box distributionsIn Uncertainty in Artificial Intelligence, 2021
- UAIExact and approximate hierarchical clustering using AIn Uncertainty in Artificial Intelligence, 2021
- UnImplicitUnpublished abstract presented at UnImplicit: The First Workshop on Understanding Implicit and Underspecified Language at ACL-IJCNLP, 2021
2020
- AAAISimultaneously linking entities and extracting relations from biomedical text without mention-level supervisionIn Proceedings of the AAAI Conference on Artificial Intelligence, 2020
- ACLIn Association for Computational Linguistics, 2020
- ACLIn Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
- CLEF
- COLINGLearning to Few-Shot Learn Across Diverse Natural Language Classification TasksIn Proceedings of the 28th International Conference on Computational Linguistics, 2020
- ECIRIn European Conference on Information Retrieval, 2020
- EMNLPSelf-Supervised Meta-Learning for Few-Shot Natural Language Classification TasksIn Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
- EMNLPIn Empirical Methods in Natural Language Processing, 2020
- EMNLPIn Empirical Methods in Natural Language Processing, 2020
- EMNLPIn Empirical Methods in Natural Language Processing, 2020
- EMNLPIn Empirical Methods in Natural Language Processing, 2020
- ICLRIn International Conference on Learning Representations, 2020
- LRECIn Language Resources and Evaluation Conference, 2020
- NLP+CSSIn Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science, 2020
- NLP+CSSIn Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science, 2020
- SIGIRIn 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2020
- Sci. Adv.Science Advances, 2020
- arXivarXiv preprint arXiv:2010.12626, 2020
2019
- ACLOptimal Transport-based Alignment of Learned Character Representations for String SimilarityIn Association of Computational Linguistics (ACL), 2019
- ACLIn Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019
- ACLIn Association for Computational Linguistics, 2019
- ACLIn Association for Computational Linguistics, 2019
- ACLIn Association for Computational Linguistics, 2019
- AKBCIn Automated Knowledge Base Construction (AKBC), 2019
- CIKMIn Conference on Information and Knowledge Management, 2019
- EMNLP
- EMNLPIn Empirical Methods in Natural Language Processing, 2019
- EMNLPIn Empirical Methods in Natural Language Processing, 2019
- ICLRIn International Conference on Learning Representations (ICLR), 2019
- ICLRIn International Conference on Learning Representations (ICLR), 2019
- ICLRIn International Conference on Learning Representations (ICLR) (Oral), 2019
- ICMLIn International Conference on Machine Learning (ICML), 2019
- JLC
- KDDGradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic SpaceIn International Conference on Knowledge Discovery and Data Mining (KDD), 2019
- KDDIn International Conference on Knowledge Discovery and Data Mining (KDD), 2019
- KDDScalable Hierarchical Clustering with Tree GraftingIn International Conference on Knowledge Discovery and Data Mining (KDD), 2019
- LA+ACLIn Proceedings of the 13th Linguistic Annotation Workshop at ACL, 2019
- NAACLIn North American Association for Computational Linguistics, 2019
- NAACLIn North American Association for Computational Linguistics, 2019
- NAACLIn Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), 2019
- SCiLProceedings of the Society for Computation in Linguistics, 2019
- SIGIRIn 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019
- arXivarXiv preprint arXiv:1902.00489, 2019
2018
- ACLIn Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), 2018
- ACLIn Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2018
- ACLIn Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL) (Oral), 2018
- BlackboxNLPIn Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2018
- COLING
- CoNLLIn Proceedings of the 22nd Conference on Computational Natural Language Learning (CoNLL), 2018
- EMNLPIn Empirical Methods in Natural Language Processing, 2018
- EMNLPIn Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018
- EMNLPIn Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Best paper award), 2018
- EMNLPIn Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Oral), 2018
- EMNLPIn Empirical Methods in Natural Language Processing, 2018
- EMNLPIn Empirical Methods in Natural Language Processing, 2018
- ICLRIn International Conference on Learning Representations (ICLR), 2018
- NAACLIn North American Association for Computational Linguistics, 2018
- NAACLIn Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT/NAACL), 2018
- NAACLTraining Structured Prediction Energy Networks with Indirect SupervisionIn Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT/NAACL) (Oral), 2018
- NAACLIn Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT/NAACL), 2018
- NAACLIn North American Association for Computational Linguistics, 2018
- NAACLIn North American Association for Computational Linguistics, 2018
- NAACLIn Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2018
- NAACLIn Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2018
- SCiLProceedings of the Society for Computation in Linguistics, 2018
- SIGIRIn Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018
- SIGIRIn Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018
- TextGraphsIn TextGraphs-12: the Workshop on Graph-based Methods for Natural Language Processing (NAACL WS), 2018
2017
- ACLIn Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), Vancouver, Canada, July 30 - August 4, Volume 2: Short Papers, 2017
- AKBCIn 6th Workshop on Automated Knowledge Base Construction (AKBC) 2017 at NIPS, 2017
- AKBCIn 6th Workshop on Automated Knowledge Base Construction (AKBC) 2017 at NIPS, 2017
- AKBCIn 6th Workshop on Automated Knowledge Base Construction (AKBC) 2017 at NIPS, 2017
- DISCMLGradient-based Hierarchical ClusteringIn NIPS Workshop on Discrete Structures in Machine Learning (DISCML) (Oral), 2017
- DS+JData Science + Journalism Workshop (DS+J) at KDD, 2017
- EACLIn Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, April 3-7, 2017, Volume 1: Long Papers (Oral), 2017
- EACLIn Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Valencia, Spain, April 3-7, 2017, Volume 1: Long Papers, 2017
- EMNLPIn Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), Copenhagen, Denmark, September 9-11, 2017, 2017
- EMNLPIn Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
- EMNLPIn Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
- FAT/MLarXiv preprint arXiv:1707.00061. Presented at Fairness, Accountability, and Transparency in Machine Learning workshop at KDD, 2017
- ICLRIn International Conference on Learning Representations (ICLR), 2017
- ICMLIn Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, NSW, Australia, 6-11 August 2017, 2017
- ICML WSIn International Conference on Machine Learning Workshop on Deep Structured Prediction (ICML WS), 2017
- ICML WSIn International Conference on Machine Learning Workshop on Deep Structured Prediction (ICML WS), 2017
- NIPSIn Advances in Neural Information Processing Systems (NIPS), 2017
- NIPS WSIn Workshop on Machine Learning for Molecules and Materials at NIPS, 2017
- SIGKDDIn Proceedings of the 23rd ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Halifax, NS, Canada, August 13 - 17, 2017 (Oral), 2017
- SPNLPIn Proceedings of the 2nd Workshop on Structured Prediction for Natural Language Processing (SPNLP at EMNLP), Copenhagen, Denmark, September 2017, 2017
- SemEvalIn Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval at ACL), Vancouver, Canada, August 3-4, 2017, 2017
- WNUTIn Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017
- WWWIn Proceedings of the 26th International Conference on World Wide Web, 2017
2016
- AKBCIn Proceedings of the 5th Workshop on Automated Knowledge Base Construction (AKBC at NAACL-HLT), San Diego, CA, USA, June 17, 2016 (Oral), 2016
- AKBCIn Proceedings of the 5th Workshop on Automated Knowledge Base Construction (AKBC at NAACL-HLT), San Diego, CA, USA, June 17, 2016, 2016
- AKBCIn Proceedings of the 5th Workshop on Automated Knowledge Base Construction (AKBC at NAACL-HLT), San Diego, CA, USA, June 17, 2016, 2016
- CIKM
- EMNLPProceedings of EMNLP, 2016
- ICMLIn Proceedings of the 33rd International Conference on Machine Learning (ICML), New York City, NY, USA, June 19-24, 2016, 2016
- NAACLIn NAACL HLT 2016, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (HLT/NAACL), San Diego California, USA, June 12-17, 2016 (Oral), 2016
- NLP+CSSIn Proceedings of EMNLP: NLP+CSS: Workshop in Natural Language Processing and Computational Social Science, 2016
- RecSysIn Proceedings of the 10th ACM Conference on Recommender Systems (RecSys), Boston, MA, USA, September 15-19, 2016, 2016
- TAC/KBPIn Text Analysis Conference, Knowledge Base Population (TAC/KBP), 2016
- WHIarXiv:1606.06352 at Workshop on Human Interpretability in Machine Learning, 2016
2015
- AAAI-SSIn AAAI Spring Symposium Series (AAAI-SS), 2015
- AAAI-SSIn AAAI Spring Symposium Series (AAAI-SS), 2015
- ACLIn Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL), July 26-31, 2015, Beijing, China, Volume 1: Long Papers, 2015
- ACLIn Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL), July 26-31, 2015, Beijing, China, Volume 1: Long Papers (Outstanding Paper Award), 2015
- EMNLPIn Proceedings of EMNLP, 2015
- ICLRIn International Conference on Learning Representations (ICLR) (Oral), 2015
- ICTIRIn Proceedings of the 2015 International Conference on The Theory of Information Retrieval (ICTIR), Northampton, Massachusetts, USA, September 27-30, 2015, 2015
- MPSAA Little Bit of NLP Goes A Long Way: Finding Meaning in Legislative Texts with Phrase ExtractionMidwest Political Science Association (MPSA) 73rd Annual Conference, Chicago (IL), 2015
- TAC/KBPIn Text Analysis Conference, Knowledge Base Population (TAC/KBP), 2015