Bridging Textual And Tabular Data For Cross-domain Text-to-sql Semantic Parsing | Awesome LLM Papers Add your paper to Awesome LLM Papers

Bridging Textual And Tabular Data For Cross-domain Text-to-sql Semantic Parsing

Xi Victoria Lin, Richard Socher, Caiming Xiong . Findings of the Association for Computational Linguistics: EMNLP 2020 2020 – 145 citations

[Code] [Paper]   Search on Google Scholar   Search on Semantic Scholar
EMNLP Efficiency Model Architecture Question Answering Scalability

We present BRIDGE, a powerful sequential architecture for modeling dependencies between natural language questions and relational databases in cross-DB semantic parsing. BRIDGE represents the question and DB schema in a tagged sequence where a subset of the fields are augmented with cell values mentioned in the question. The hybrid sequence is encoded by BERT with minimal subsequent layers and the text-DB contextualization is realized via the fine-tuned deep attention in BERT. Combined with a pointer-generator decoder with schema-consistency driven search space pruning, BRIDGE attained state-of-the-art performance on popular cross-DB text-to-SQL benchmarks, Spider (71.1% dev, 67.5% test with ensemble model) and WikiSQL (92.6% dev, 91.9% test). Our analysis shows that BRIDGE effectively captures the desired cross-modal dependencies and has the potential to generalize to more text-DB related tasks. Our implementation is available at https://github.com/salesforce/TabularSemanticParsing.

Similar Work