Multivers: Improving Scientific Claim Verification With Weak Supervision And Full-document Context | Awesome LLM Papers Add your paper to Awesome LLM Papers

Multivers: Improving Scientific Claim Verification With Weak Supervision And Full-document Context

David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi . Findings of the Association for Computational Linguistics: NAACL 2022 2022 – 45 citations

[Code] [Paper]   Search on Google Scholar   Search on Semantic Scholar
ACL Datasets Few Shot Fine Tuning Has Code Image Text Integration Interdisciplinary Approaches Multimodal Semantic Representation NAACL Training Techniques Visual Contextualization

The scientific claim verification task requires an NLP system to label scientific documents which Support or Refute an input claim, and to select evidentiary sentences (or rationales) justifying each predicted label. In this work, we present MultiVerS, which predicts a fact-checking label and identifies rationales in a multitask fashion based on a shared encoding of the claim and full document context. This approach accomplishes two key modeling goals. First, it ensures that all relevant contextual information is incorporated into each labeling decision. Second, it enables the model to learn from instances annotated with a document-level fact-checking label, but lacking sentence-level rationales. This allows MultiVerS to perform weakly-supervised domain adaptation by training on scientific documents labeled using high-precision heuristics. Our approach outperforms two competitive baselines on three scientific claim verification datasets, with particularly strong performance in zero / few-shot domain adaptation experiments. Our code and data are available at https://github.com/dwadden/multivers.

Similar Work