-
Reptext: Rendering Visual Text Via Replicating
(2025)
• No Venue
Wang et al.
-
Exploring The Latent Capacity Of Llms For One-step Text Generation
(2025)
• No Venue
Gleb Mezentsev, Ivan Oseledets
-
Surfer-h Meets Holo1: Cost-efficient Web Agent Powered By Open Weights
(2025)
• No Venue
Andreux et al.
-
Rethinking Reflection In Pre-training
(2025)
• No Venue
Ai et al.
-
Deepseek-r1: Incentivizing Reasoning Capability In Llms Via Reinforcement Learning
(2025)
• No Venue
Deepseek-Ai et al.
-
Competitive Programming With Large Reasoning Models
(2025)
• No Venue
Openai et al.
-
Skywork R1V2: Multimodal Hybrid Reinforcement Learning For Reasoning
(2025)
• No Venue
Chris et al.
-
Magistral
(2025)
• No Venue
Mistral-Ai et al.
-
Minimax-01: Scaling Foundation Models With Lightning Attention
(2025)
• No Venue
Minimax et al.
-
Wan: Open And Advanced Large-scale Video Generative Models
(2025)
• No Venue
Wanteam et al.
-
Grokking In The Wild: Data Augmentation For Real-world Multi-hop Reasoning With Transformers
(2025)
• No Venue
Roman Abramov, Felix Steinbauer, Gjergji Kasneci
-
Phi-4-reasoning Technical Report
(2025)
• No Venue
Abdin et al.
-
Phi-4-mini Technical Report: Compact Yet Powerful Multimodal Language Models Via Mixture-of-loras
(2025)
• No Venue
Abouelenin et al.
-
Language Models' Factuality Depends On The Language Of Inquiry
(2025)
• No Venue
Aggarwal et al.
-
Atla Selene Mini: A General Purpose Evaluation Model
(2025)
• No Venue
Alexandru et al.
-
Sadeed: Advancing Arabic Diacritization Through Small Language Model
(2025)
• No Venue
Aldallal et al.
-
When Less Is Enough: Adaptive Token Reduction For Efficient Image Representation
(2025)
• No Venue
Eduard Allakhverdov, Elizaveta Goncharova, Andrey Kuznetsov
-
I-con: A Unifying Framework For Representation Learning
(2025)
• No Venue
Alshammari et al.
-
Open Deep Search: Democratizing Search With Open-source Reasoning Agents
(2025)
• No Venue
Alzubi et al.
-
Tabstar: A Foundation Tabular Model With Semantically Target-aware Representations
(2025)
• No Venue
Alan Arazi, Eilam Shapira, Roi Reichart
-
Sketch-of-thought: Efficient LLM Reasoning With Adaptive Cognitive-inspired Sketching
(2025)
• No Venue
Simon A. Aytes, Jinheon Baek, Sung Ju Hwang
-
Towards Best Practices For Open Datasets For LLM Training
(2025)
• No Venue
Baack et al.
-
Swe-rebench: An Automated Pipeline For Task Collection And Decontaminated Evaluation Of Software Engineering Agents
(2025)
• No Venue
Badertdinov et al.
-
Perception Encoder: The Best Visual Embeddings Are Not At The Output Of The Network
(2025)
• No Venue
Bolya et al.
-
Singlora: Low Rank Adaptation Using A Single Matrix
(2025)
• No Venue
Bensaïd et al.
-
Qwen2.5-vl Technical Report
(2025)
• No Venue
Bai et al.
-
Impossible Videos
(2025)
• No Venue
Zechen Bai, Hai Ci, Mike Zheng Shou
-
Univg-r1: Reasoning Guided Universal Visual Grounding With Reinforcement Learning
(2025)
• No Venue
Bai et al.
-
Reflect, Retry, Reward: Self-improving Llms Via Reinforcement Learning
(2025)
• No Venue
Bensal et al.
-
Eurobert: Scaling Multilingual Encoders For European Languages
(2025)
• No Venue
Boizard et al.
-
Llama-nemotron: Efficient Reasoning Models
(2025)
• No Venue
Bercovich et al.
-
Reasoning Language Models: A Blueprint
(2025)
• No Venue
Besta et al.
-
All Is Not Lost: LLM Recovery Without Checkpoints
(2025)
• No Venue
Nikolay Blagoev, Oğuzhan Ersoy, Lydia Yiyu Chen
-
Riemannlora: A Unified Riemannian Framework For Ambiguity-free Lora Optimization
(2025)
• No Venue
Bogachev et al.
-
A Data-centric Framework For Addressing Phonetic And Prosodic Challenges In Russian Speech Generative Models
(2025)
• No Venue
Borodin et al.
-
Enhancing Vision-language Model Training With Reinforcement Learning In Synthetic Worlds For Real-world Success
(2025)
• No Venue
Bredis et al.
-
Neobert: A Next-generation BERT
(2025)
• No Venue
Breton et al.
-
Video Action Differencing
(2025)
• No Venue
Burgess et al.
-
Distillation Scaling Laws
(2025)
• No Venue
Busbridge et al.
-
Crowdsource, Crawl, Or Generate? Creating SEA-VL, A Multicultural Vision-language Dataset For Southeast Asia
(2025)
• No Venue
Cahyawijaya et al.
-
Reconstructing 4D Spatial Intelligence: A Survey
(2025)
• No Venue
Cao et al.
-
Why Do Multi-agent LLM Systems Fail?
(2025)
• No Venue
Cemri et al.
-
Quartet: Native FP4 Training Can Be Optimal For Large Language Models
(2025)
• No Venue
Castro et al.
-
Web-shepherd: Advancing Prms For Reinforcing Web Agents
(2025)
• No Venue
Chae et al.
-
Worldvla: Towards Autoregressive Action World Model
(2025)
• No Venue
Cen et al.
-
Oneig-bench: Omni-dimensional Nuanced Evaluation For Image Generation
(2025)
• No Venue
Chang et al.
-
GR-3 Technical Report
(2025)
• No Venue
Cheang et al.
-
The Geometry Of LLM Quantization: GPTQ As Babai's Nearest Plane Algorithm
(2025)
• No Venue
Jiale Chen, Torsten Hoefler, Dan Alistarh
-
An Empirical Study Of Gpt-4o Image Generation Capabilities
(2025)
• No Venue
Chen et al.
-
Eagle 2.5: Boosting Long-context Post-training For Frontier Vision-language Models
(2025)
• No Venue
Chen et al.
-
Advancing Multimodal Reasoning: From Optimized Cold Start To Staged Reinforcement Learning
(2025)
• No Venue
Chen et al.
-
Acereason-nemotron: Advancing Math And Code Reasoning Through Reinforcement Learning
(2025)
• No Venue
Chen et al.
-
Comp: Continual Multimodal Pre-training For Vision Foundation Models
(2025)
• No Venue
Chen et al.
-
Blip3-o: A Family Of Fully Open Unified Multimodal Models-architecture, Training And Dataset
(2025)
• No Venue
Chen et al.
-
Browsecomp-plus: A More Fair And Transparent Evaluation Benchmark Of Deep-research Agent
(2025)
• No Venue
Chen et al.
-
Xverify: Efficient Answer Verifier For Reasoning Model Evaluations
(2025)
• No Venue
Chen et al.
-
Parallel Scaling Law For Language Models
(2025)
• No Venue
Chen et al.
-
Minmo: A Multimodal Large Language Model For Seamless Voice Interaction
(2025)
• No Venue
Chen et al.
-
Livecc: Learning Video LLM With Streaming Speech Transcription At Scale
(2025)
• No Venue
Chen et al.
-
MIG: Automatic Data Selection For Instruction Tuning By Maximizing Information Gain In Semantic Space
(2025)
• No Venue
Chen et al.
-
Moca: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings
(2025)
• No Venue
Chen et al.
-
Scaling Law For Quantization-aware Training
(2025)
• No Venue
Chen et al.
-
RM-R1: Reward Modeling As Reasoning
(2025)
• No Venue
Chen et al.
-
Sana-sprint: One-step Diffusion With Continuous-time Consistency Distillation
(2025)
• No Venue
Chen et al.
-
Revisiting Reinforcement Learning For LLM Reasoning From A Cross-domain Perspective
(2025)
• No Venue
Cheng et al.
-
Reasoning With Exploration: An Entropy Perspective
(2025)
• No Venue
Cheng et al.
-
Gold-medalist Performance In Solving Olympiad Geometry With Alphageometry2
(2025)
• No Venue
Chervonyi et al.
-
Selfcite: Self-supervised Alignment For Context Attribution In Large Language Models
(2025)
• No Venue
Chuang et al.
-
System Prompt Optimization With Meta-learning
(2025)
• No Venue
Yumin Choi, Jinheon Baek, Sung Ju Hwang
-
SFT Memorizes, RL Generalizes: A Comparative Study Of Foundation Model Post-training
(2025)
• No Venue
Chu et al.
-
This Time Is Different: An Observability Perspective On Time Series Foundation Models
(2025)
• No Venue
Cohen et al.
-
Gemini 2.5: Pushing The Frontier With Advanced Reasoning, Multimodality, Long Context, And Next Generation Agentic Capabilities
(2025)
• No Venue
Comanici et al.
-
Reinforcement Learning For Reasoning In Small Llms: What Works And What Doesn't
(2025)
• No Venue
Quy-Anh Dang, Chris Ngo
-
The Danger Of Overthinking: Examining The Reasoning-action Dilemma In Agentic Tasks
(2025)
• No Venue
Cuadron et al.
-
The Entropy Mechanism Of Reinforcement Learning For Reasoning Language Models
(2025)
• No Venue
Cui et al.
-
Process Reinforcement Through Implicit Rewards
(2025)
• No Venue
Cui et al.
-
Emerging Properties In Unified Multimodal Pretraining
(2025)
• No Venue
Deng et al.
-
CLIMB: Clustering-based Iterative Data Mixture Bootstrapping For Language Model Pre-training
(2025)
• No Venue
Diao et al.
-
Textcrafter: Accurately Rendering Multiple Texts In Complex Visual Scenes
(2025)
• No Venue
Du et al.
-
Sherlock: Self-correcting Reasoning In Vision-language Models
(2025)
• No Venue
Yi Ding, Ruqi Zhang
-
Mm-ifengine: Towards Multimodal Instruction Following
(2025)
• No Venue
Ding et al.
-
Story2board: A Training-free Approach For Expressive Storyboard Generation
(2025)
• No Venue
Dinkevich et al.
-
Mom: Linear Sequence Modeling With Mixture-of-memories
(2025)
• No Venue
Du et al.
-
Tool-star: Empowering Llm-brained Multi-tool Reasoner Via Reinforcement Learning
(2025)
• No Venue
Dong et al.
-
Mmdocir: Benchmarking Multi-modal Retrieval For Long Documents
(2025)
• No Venue
Dong et al.
-
Reinforcement Pre-training
(2025)
• No Venue
Dong et al.
-
Streaming Diloco With Overlapping Communication: Towards A Distributed Free Lunch
(2025)
• No Venue
Douillard et al.
-
Pre-trained Policy Discriminators Are General Reward Models
(2025)
• No Venue
Dou et al.
-
SONAR-LLM: Autoregressive Transformer That Thinks In Sentence Embeddings And Speaks In Tokens
(2025)
• No Venue
Dragunov et al.
-
Deepresearch Bench: A Comprehensive Benchmark For Deep Research Agents
(2025)
• No Venue
Du et al.
-
MMTEB: Massive Multilingual Text Embedding Benchmark
(2025)
• No Venue
Enevoldsen et al.
-
Virgo: A Preliminary Exploration On Reproducing O1-like MLLM
(2025)
• No Venue
Du et al.
-
Megascience: Pushing The Frontiers Of Post-training Datasets For Science Reasoning
(2025)
• No Venue
Run-Ze Fan, Zengzhi Wang, Pengfei Liu
-
Make Lora Great Again: Boosting Lora With Adaptive Singular Values And Mixture-of-experts Optimization Alignment
(2025)
• No Venue
Fan et al.
-
Skyreels-a2: Compose Anything In Video Diffusion Transformers
(2025)
• No Venue
Fei et al.
-
Got: Unleashing Reasoning Capability Of Multimodal Large Language Model For Visual Generation And Editing
(2025)
• No Venue
Fang et al.
-
Missing Premise Exacerbates Overthinking: Are Reasoning Models Losing Critical Thinking Skill?
(2025)
• No Venue
Fan et al.
-
A Comprehensive Survey Of Self-evolving AI Agents: A New Paradigm Bridging Foundation Models And Lifelong Agentic Systems
(2025)
• No Venue
Fang et al.
-
On Path To Multimodal Generalist: General-level And General-bench
(2025)
• No Venue
Fei et al.
-
Thinkless: LLM Learns When To Think
(2025)
• No Venue
Gongfan Fang, Xinyin Ma, Xinchao Wang
-
VITA-1.5: Towards Gpt-4o Level Real-time Vision And Speech Interaction
(2025)
• No Venue
Fu et al.
-
Multiple Choice Questions: Reasoning Makes Large Language Models (llms) More Self-confident Even When They Are Wrong
(2025)
• No Venue
Fu et al.
-
Scaling Reasoning, Losing Control: Evaluating Instruction Following In Large Reasoning Models
(2025)
• No Venue
Fu et al.
-
Towards General-purpose Model-free Reinforcement Learning
(2025)
• No Venue
Fujimoto et al.
-
I Have Covered All The Bases Here: Interpreting Reasoning Features In Large Language Models Via Sparse Autoencoders
(2025)
• No Venue
Galichin et al.
-
Cognitive Behaviors That Enable Self-improving Reasoners, Or, Four Habits Of Highly Effective Stars
(2025)
• No Venue
Gandhi et al.
-
Exploring Hallucination Of Large Multimodal Models In Video Understanding: Benchmark, Analysis And Mitigation
(2025)
• No Venue
Gao et al.
-
D-AR: Diffusion Via Autoregressive Models
(2025)
• No Venue
Ziteng Gao, Mike Zheng Shou
-
Beyond Ten Turns: Unlocking Long-horizon Agentic Search With Large-scale Asynchronous RL
(2025)
• No Venue
Gao et al.
-
Seedream 3.0 Technical Report
(2025)
• No Venue
Gao et al.
-
Pixels, Patterns, But No Poetry: To See The World Like Humans
(2025)
• No Venue
Gao et al.
-
Seedance 1.0: Exploring The Boundaries Of Video Generation Models
(2025)
• No Venue
Gao et al.
-
Tokenverse: Versatile Multi-concept Personalization In Token Modulation Space
(2025)
• No Venue
Garibi et al.
-
Inside-out: Hidden Factual Knowledge In Llms
(2025)
• No Venue
Gekhman et al.
-
You Do Not Fully Utilize Transformer's Representation Capacity
(2025)
• No Venue
Gerasimov et al.
-
Webwatcher: Breaking New Frontier Of Vision-language Deep Research Agent
(2025)
• No Venue
Geng et al.
-
Guided By Gut: Efficient Test-time Scaling With Reinforced Intrinsic Confidence
(2025)
• No Venue
Ghasemabadi et al.
-
Energy-based Transformers Are Scalable Learners And Thinkers
(2025)
• No Venue
Gladstone et al.
-
Multi-token Attention
(2025)
• No Venue
Golovneva et al.
-
RADLADS: Rapid Attention Distillation To Linear Attention Decoders At Scale
(2025)
• No Venue
Goldstein et al.
-
Training Long-context, Multi-turn Software Engineering Agents With Reinforcement Learning
(2025)
• No Venue
Golubev et al.
-
The Differences Between Direct Alignment Algorithms Are A Blur
(2025)
• No Venue
Gorbatovski et al.
-
Mind2web 2: Evaluating Agentic Search With Agent-as-a-judge
(2025)
• No Venue
Gou et al.
-
Breaking The Modality Barrier: Universal Embedding Learning With Multimodal Llms
(2025)
• No Venue
Gu et al.
-
Long-context Autoregressive Video Modeling With Next-frame Prediction
(2025)
• No Venue
Yuchao Gu, Weijia Mao, Mike Zheng Shou
-
Openthoughts: Data Recipes For Reasoning Models
(2025)
• No Venue
Guha et al.
-
Costaast: Cost-sensitive Toolpath Agent For Multi-turn Image Editing
(2025)
• No Venue
Gupta et al.
-
Seed1.5-vl Technical Report
(2025)
• No Venue
Guo et al.
-
Mineworld: A Real-time And Open-source Interactive World Model On Minecraft
(2025)
• No Venue
Guo et al.
-
Can We Generate Images With Cot? Let's Verify And Reinforce Image Generation Step By Step
(2025)
• No Venue
Guo et al.
-
Reward Reasoning Model
(2025)
• No Venue
Guo et al.
-
Swe-factory: Your Automated Factory For Issue Resolution Training Data And Evaluation Benchmarks
(2025)
• No Venue
Guo et al.
-
Vision As A Dialect: Unifying Visual Understanding And Generation Via Text-aligned Representations
(2025)
• No Venue
Han et al.
-
Trillion 7B Technical Report
(2025)
• No Venue
Han et al.
-
Learnings From Scaling Visual Tokenizers For Reconstruction And Generation
(2025)
• No Venue
Hansen-Estruch et al.
-
CASS: Nvidia To AMD Transpilation With Data, Models, And Benchmark
(2025)
• No Venue
Heakl et al.
-
Skywork Open Reasoner 1 Technical Report
(2025)
• No Venue
He et al.
-
Protoreasoning: Prototypes As The Foundation For Generalizable Reasoning In Llms
(2025)
• No Venue
He et al.
-
Hardtests: Synthesizing High-quality Test Cases For LLM Coding
(2025)
• No Venue
He et al.
-
Pasa: An LLM Agent For Comprehensive Academic Paper Search
(2025)
• No Venue
He et al.
-
Conceptattention: Diffusion Transformers Learn Highly Interpretable Features
(2025)
• No Venue
Helbling et al.
-
Omni-rgpt: Unifying Image And Video Region-level Understanding Via Token Marks
(2025)
• No Venue
Heo et al.
-
Kuwain 1.5B: An Arabic SLM Via Language Injection
(2025)
• No Venue
Hennara et al.
-
Open-reasoner-zero: An Open Source Approach To Scaling Up Reinforcement Learning On The Base Model
(2025)
• No Venue
Hu et al.
-
Glm-4.1v-thinking: Towards Versatile Multimodal Reasoning With Scalable Reinforcement Learning
(2025)
• No Venue
Hong et al.
-
Dita: Scaling Diffusion Transformer For Generalist Vision-language-action Policy
(2025)
• No Venue
Hou et al.
-
Charting And Navigating Hugging Face's Model Atlas
(2025)
• No Venue
Horwitz et al.
-
Xolver: Multi-agent Reasoning With Holistic Experience Learning Just Like An Olympiad Team
(2025)
• No Venue
Hosain et al.
-
Hunyuancustom: A Multimodal-driven Architecture For Customized Video Generation
(2025)
• No Venue
Hu et al.
-
Beyond 'aha!': Toward Systematic Meta-abilities Alignment In Large Reasoning Models
(2025)
• No Venue
Hu et al.
-
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability Of LLM Reasoning
(2025)
• No Venue
Huan et al.
-
O1 Replication Journey -- Part 3: Inference-time Scaling For Medical Reasoning
(2025)
• No Venue
Huang et al.
-
Benchmax: A Comprehensive Multilingual Evaluation Suite For Large Language Models
(2025)
• No Venue
Huang et al.
-
On The Trustworthiness Of Generative Foundation Models: Guideline, Assessment, And Perspective
(2025)
• No Venue
Huang et al.
-
Over-tokenized Transformer: Vocabulary Is Generally Worth Scaling
(2025)
• No Venue
Huang et al.
-
Thinkact: Vision-language-action Reasoning Via Reinforced Visual Latent Planning
(2025)
• No Venue
Huang et al.
-
Multi-granular Spatio-temporal Token Merging For Training-free Acceleration Of Video Llms
(2025)
• No Venue
Hyun et al.
-
Upsample What Matters: Region-adaptive Latent Sampling For Accelerated Diffusion Transformers
(2025)
• No Venue
Jeong et al.
-
Ambik: Dataset Of Ambiguous Tasks In Kitchen Environment
(2025)
• No Venue
Ivanova et al.
-
Reangle-a-video: 4D Video Generation As Video-to-video Translation
(2025)
• No Venue
Hyeonho Jeong, Suhyeon Lee, Jong Chul Ye
-
Multi-turn Code Generation Through Single-step Rewards
(2025)
• No Venue
Jain et al.
-
Silent Branding Attack: Trigger-free Data Poisoning Attack On Text-to-image Diffusion Models
(2025)
• No Venue
Jang et al.
-
Omnispatial: Towards Comprehensive Spatial Reasoning Benchmark For Vision Language Models
(2025)
• No Venue
Jia et al.
-
CSVQA: A Chinese Multimodal Benchmark For Evaluating STEM Reasoning Capabilities Of Vlms
(2025)
• No Venue
Jian et al.
-
Infiniteyou: Flexible Photo Recrafting While Preserving Your Identity
(2025)
• No Venue
Jiang et al.
-
Feedback Friction: Llms Struggle To Fully Incorporate External Feedback
(2025)
• No Venue
Jiang et al.
-
Token-efficient Long Video Understanding For Multimodal Llms
(2025)
• No Venue
Jiang et al.
-
S2s-arena, Evaluating Speech2speech Protocols On Instruction Following With Paralinguistic Information
(2025)
• No Venue
Jiang et al.
-
T2I-R1: Reinforcing Image Generation With Collaborative Semantic-level And Token-level Cot
(2025)
• No Venue
Jiang et al.
-
VACE: All-in-one Video Creation And Editing
(2025)
• No Venue
Jiang et al.
-
Continuous Diffusion Model For Language Modeling
(2025)
• No Venue
Jaehyeong Jo, Sung Ju Hwang
-
Can This Model Also Recognize Dogs? Zero-shot Model Search From Weights
(2025)
• No Venue
Kahana et al.
-
Expect The Unexpected: Failsafe Long Context QA For Finance
(2025)
• No Venue
Kamble et al.
-
The Common Pile V0.1: An 8TB Dataset Of Public Domain And Openly Licensed Text
(2025)
• No Venue
Kandpal et al.
-
T1: Tool-integrated Self-verification For Test-time Compute Scaling In Small Language Models
(2025)
• No Venue
Minki Kang, Jongwon Jeong, Jaewoong Cho
-
Distilling LLM Agent Into Small Models With Retrieval And Code Tools
(2025)
• No Venue
Kang et al.
-
LM2: Large Memory Models
(2025)
• No Venue
Kang et al.
-
VIKI-R: Coordinating Embodied Multi-agent Cooperation Via Reinforcement Learning
(2025)
• No Venue
Kang et al.
-
Inference-time Scaling For Flow Models Via Stochastic Generation And Rollover Budget Forcing
(2025)
• No Venue
Kim et al.
-
Chain-of-zoom: Extreme Super-resolution Via Scale Autoregression And Preference Alignment
(2025)
• No Venue
Bryan Sangwoo Kim, Jeongsol Kim, Jong Chul Ye
-
Temporal In-context Fine-tuning For Versatile Control Of Video Diffusion Models
(2025)
• No Venue
Kinam Kim, Junha Hyung, Jaegul Choo
-
Model Already Knows The Best Noise: Bayesian Active Noise Selection Via Attention In Video Diffusion Model
(2025)
• No Venue
Kwanyoung Kim, Sanghyun Kim
-
Mol-llama: Towards General Understanding Of Molecules In Large Molecular Language Model
(2025)
• No Venue
Dongki Kim, Wonbin Lee, Sung Ju Hwang
-
Heeding The Inner Voice: Aligning Controlnet Training Via Intermediate Features Feedback
(2025)
• No Venue
Konovalova et al.
-
Neurosymbolic Diffusion Models
(2025)
• No Venue
Krieken et al.
-
Theoremexplainagent: Towards Multimodal Explanations For LLM Theorem Understanding
(2025)
• No Venue
Ku et al.
-
Nohumansrequired: Autonomous High-quality Image Editing Triplet Mining
(2025)
• No Venue
Kuprashevich et al.
-
Zclip: Adaptive Spike Mitigation For LLM Pre-training
(2025)
• No Venue
Kumar et al.
-
Cramming 1568 Tokens Into A Single Vector And Back Again: Exploring The Limits Of Embedding Space Capacity
(2025)
• No Venue
Kuratov et al.
-
Infinitehip: Extending Language Model Context Up To 3 Million Tokens On A Single GPU
(2025)
• No Venue
Lee et al.
-
Evolving Deeper LLM Thinking
(2025)
• No Venue
Lee et al.
-
Genrecal: Generation After Recalibration From Large To Small Vision-language Models
(2025)
• No Venue
Lee et al.
-
Molmoact: Action Reasoning Models That Can Reason In Space
(2025)
• No Venue
Lee et al.
-
FUSION: Fully Integration Of Vision-language Representations For Deep Cross-modal Understanding
(2025)
• No Venue
Liu et al.
-
SEAP: Training-free Sparse Expert Activation Pruning Unlock The Brainpower Of Large Language Models
(2025)
• No Venue
Liang et al.
-
Llms Can Easily Learn To Reason From Demonstrations Structure, Not Content, Is What Matters!
(2025)
• No Venue
Li et al.
-
Langsplatv2: High-dimensional 3D Language Gaussian Splatting With 450+ FPS
(2025)
• No Venue
Li et al.
-
4D Langsplat: 4D Language Gaussian Splatting Via Multimodal Large Language Models
(2025)
• No Venue
Li et al.
-
JARVIS-VLA: Post-training Large-scale Vision Language Models To Play Visual Games With Keyboards And Mouse
(2025)
• No Venue
Li et al.
-
Confidence Is All You Need: Few-shot RL Fine-tuning Of Language Models
(2025)
• No Venue
Li et al.
-
Can One Domain Help Others? A Data-centric Study On Multi-domain Reasoning Via Reinforcement Learning
(2025)
• No Venue
Li et al.
-
Baichuan-omni-1.5 Technical Report
(2025)
• No Venue
Li et al.
-
C3PO: Critical-layer, Core-expert, Collaborative Pathway Optimization For Test-time Expert Re-mixing
(2025)
• No Venue
Zhongyang Li, Ziyue Li, Tianyi Zhou
-
Codei/o: Condensing Reasoning Patterns Via Code Input-output Prediction
(2025)
• No Venue
Li et al.
-
How Instruction And Reasoning Data Shape Post-training: Data Quality Through The Lens Of Layer-wise Gradients
(2025)
• No Venue
Li et al.
-
Deepsolution: Boosting Complex Engineering Solution Design Via Tree-based Exploration And Bi-point Thinking
(2025)
• No Venue
Li et al.
-
Have We Unified Image Generation And Understanding Yet? An Empirical Study Of Gpt-4o's Image Generation Ability
(2025)
• No Venue
Ning Li, Jingran Zhang, Justin Cui
-
Saferag: Benchmarking Security In Retrieval-augmented Generation Of Large Language Model
(2025)
• No Venue
Liang et al.
-
Small Models Struggle To Learn From Strong Reasoners
(2025)
• No Venue
Li et al.
-
Preference Leakage: A Contamination Problem In Llm-as-a-judge
(2025)
• No Venue
Li et al.
-
Model Merging In Pre-training Of Large Language Models
(2025)
• No Venue
Li et al.
-
Migician: Revealing The Magic Of Free-form Multi-image Grounding In Multimodal Large Language Models
(2025)
• No Venue
Li et al.
-
Memos: A Memory OS For AI System
(2025)
• No Venue
Li et al.
-
Mergevq: A Unified Framework For Visual Generation And Representation With Disentangled Token Merging And Quantization
(2025)
• No Venue
Li et al.
-
Miromind-m1: An Open-source Advancement In Mathematical Reasoning Via Context-aware Multi-stage Policy Optimization
(2025)
• No Venue
Li et al.
-
Perception, Reason, Think, And Plan: A Survey On Large Multimodal Reasoning Models
(2025)
• No Venue
Li et al.
-
Mol-r1: Towards Explicit Long-cot Reasoning In Molecule Discovery
(2025)
• No Venue
Li et al.
-
Ovo-bench: How Far Is Your Video-llms From Real-world Online Video Understanding?
(2025)
• No Venue
Li et al.
-
Skip A Layer Or Loop It? Test-time Depth Adaptation Of Pretrained Llms
(2025)
• No Venue
Ziyue Li, Yang Li, Tianyi Zhou
-
Radial Attention: O(nlog N) Sparse Attention With Energy Decay For Long Video Generation
(2025)
• No Venue
Li et al.
-
PRIMA.CPP: Speeding Up 70b-scale LLM Inference On Low-resource Everyday Home Clusters
(2025)
• No Venue
Li et al.
-
R2-T2: Re-routing In Test-time For Multimodal Mixture-of-experts
(2025)
• No Venue
Zhongyang Li, Ziyue Li, Tianyi Zhou
-
S*: Test Time Scaling For Code Generation
(2025)
• No Venue
Li et al.
-
Describe Anything: Detailed Localized Image And Video Captioning
(2025)
• No Venue
Lian et al.
-
Websailor: Navigating Super-human Reasoning For Web Agent
(2025)
• No Venue
Li et al.
-
Truth In The Few: High-value Data Selection For Efficient Multi-modal Reasoning
(2025)
• No Venue
Li et al.
-
START: Self-taught Reasoner With Tools
(2025)
• No Venue
Li et al.
-
Test-time Preference Optimization: On-the-fly Alignment Via Iterative Textual Feedback
(2025)
• No Venue
Li et al.
-
Veripo: Cultivating Long Reasoning In Video-llms Via Verifier-gudied Iterative Policy Optimization
(2025)
• No Venue
Li et al.
-
Webthinker: Empowering Large Reasoning Models With Deep Research Capability
(2025)
• No Venue
Li et al.
-
Zebra-cot: A Dataset For Interleaved Vision Language Reasoning
(2025)
• No Venue
Li et al.
-
Drag-and-drop Llms: Zero-shot Prompt-to-weights
(2025)
• No Venue
Liang et al.
-
Multimodal Mamba: Decoder-only Multimodal State Space Model Via Quadratic To Linear Distillation
(2025)
• No Venue
Liao et al.
-
Surveyx: Academic Survey Automation Via Large Language Models
(2025)
• No Venue
Liang et al.
-
Improved Visual-spatial Reasoning Via R1-zero-like Training
(2025)
• No Venue
Liao et al.
-
Reward-guided Speculative Decoding For Efficient LLM Reasoning
(2025)
• No Venue
Liao et al.
-
Sigma: Differential Rescaling Of Query, Key And Value For Efficient Language Models
(2025)
• No Venue
Lin et al.
-
Jarvisart: Liberating Human Artistic Creativity Via An Intelligent Photo Retouching Agent
(2025)
• No Venue
Lin et al.
-
Autoregressive Adversarial Post-training For Real-time Interactive Video Generation
(2025)
• No Venue
Lin et al.
-
Forgetting Transformer: Softmax Attention With A Forget Gate
(2025)
• No Venue
Lin et al.
-
Partcrafter: Structured 3D Mesh Generation Via Compositional Latent Diffusion Transformers
(2025)
• No Venue
Lin et al.
-
Omnihuman-1: Rethinking The Scaling-up Of One-stage Conditioned Human Animation Models
(2025)
• No Venue
Lin et al.
-
Ost-bench: Evaluating The Capabilities Of Mllms In Online Spatio-temporal Scene Understanding
(2025)
• No Venue
Lin et al.
-
Uniworld: High-resolution Semantic Encoders For Unified Visual Understanding And Generation
(2025)
• No Venue
Lin et al.
-
Efficient Medical VIE Via Reinforcement Learning
(2025)
• No Venue
Liu et al.
-
Advances And Challenges In Foundation Agents: From Brain-inspired Intelligence To Evolutionary, Collaborative, And Safe Systems
(2025)
• No Venue
Liu et al.
-
Synlogic: Synthesizing Verifiable Reasoning Data At Scale For Learning Logical Reasoning And Beyond
(2025)
• No Venue
Liu et al.
-
Region-adaptive Sampling For Diffusion Transformers
(2025)
• No Venue
Liu et al.
-
Olmotrace: Tracing Language Model Outputs Back To Trillions Of Training Tokens
(2025)
• No Venue
Liu et al.
-
Langscene-x: Reconstruct Generalizable 3D Language-embedded Scenes With Trimap Video Diffusion
(2025)
• No Venue
Liu et al.
-
Inference-time Scaling For Generalist Reward Modeling
(2025)
• No Venue
Liu et al.
-
Javisdit: Joint Audio-video Diffusion Transformer With Hierarchical Spatio-temporal Prior Synchronization
(2025)
• No Venue
Liu et al.
-
Learn To Reason Efficiently With Adaptive Length-based Reward Shaping
(2025)
• No Venue
Liu et al.
-
Reasonrank: Empowering Passage Ranking With Strong Reasoning Ability
(2025)
• No Venue
Liu et al.
-
Part I: Tricks Or Traps? A Deep Dive Into RL For LLM Reasoning
(2025)
• No Venue
Liu et al.
-
Phantom: Subject-consistent Video Generation Via Cross-modal Alignment
(2025)
• No Venue
Liu et al.
-
Step1x-edit: A Practical Framework For General Image Editing
(2025)
• No Venue
Liu et al.
-
Songgen: A Single Stage Auto-regressive Transformer For Text-to-song Generation
(2025)
• No Venue
Liu et al.
-
Rstar-coder: Scaling Competitive Code Reasoning With A Large-scale Verified Dataset
(2025)
• No Venue
Liu et al.
-
Skywork-reward-v2: Scaling Preference Data Curation Via Human-ai Synergy
(2025)
• No Venue
Liu et al.
-
SPIRAL: Self-play On Zero-sum Games Incentivizes Reasoning Via Multi-agent Multi-turn Reinforcement Learning
(2025)
• No Venue
Liu et al.
-
Understanding R1-zero-like Training: A Critical Perspective
(2025)
• No Venue
Liu et al.
-
Taking Notes Brings Focus? Towards Multi-turn Multimodal Dialogue Learning
(2025)
• No Venue
Liu et al.
-
Thus Spake Long-context Large Language Model
(2025)
• No Venue
Liu et al.
-
Visual Agentic Reinforcement Fine-tuning
(2025)
• No Venue
Liu et al.
-
BIOMEDICA: An Open Biomedical Image-caption Archive, Dataset, And Vision-language Models Derived From Scientific Literature
(2025)
• No Venue
Lozano et al.
-
Seeing, Listening, Remembering, And Reasoning: A Multimodal Agent With Long-term Memory
(2025)
• No Venue
Long et al.
-
Adacot: Pareto-optimal Adaptive Chain-of-thought Triggering Via Reinforcement Learning
(2025)
• No Venue
Lou et al.
-
UI-R1: Enhancing Action Prediction Of GUI Agents By Reinforcement Learning
(2025)
• No Venue
Lu et al.
-
Exploring The Limit Of Outcome Reward For Learning Mathematical Reasoning
(2025)
• No Venue
Lyu et al.
-
Beyond Context Limits: Subconscious Threads For Long-horizon Reasoning
(2025)
• No Venue
Luo et al.
-
Being-h0: Vision-language-action Pretraining From Large-scale Human Videos
(2025)
• No Venue
Luo et al.
-
Finmme: Benchmark Dataset For Financial Multi-modal Reasoning Evaluation
(2025)
• No Venue
Luo et al.
-
Rethinking Diverse Human Preference Learning Through Principal Component Analysis
(2025)
• No Venue
Luo et al.
-
Autonomy-of-experts Models
(2025)
• No Venue
Lv et al.
-
SQL-R1: Training Natural Language To SQL Reasoning Model By Reinforcement Learning
(2025)
• No Venue
Ma et al.
-
Rethinking RL Scaling For Vision Language Models: A Transparent, From-scratch Framework And Comprehensive Evaluation Scheme
(2025)
• No Venue
Ma et al.
-
Inference-time Scaling For Diffusion Models Beyond Scaling Denoising Steps
(2025)
• No Venue
Ma et al.
-
Calligrapher: Freestyle Text Image Customization
(2025)
• No Venue
Ma et al.
-
One RL To See Them All: Visual Triple Unified Reinforcement Learning
(2025)
• No Venue
Ma et al.
-
Step-video-t2v Technical Report: The Practice, Challenges, And Future Of Video Foundation Model
(2025)
• No Venue
Ma et al.
-
Unitok: A Unified Tokenizer For Visual Generation And Understanding
(2025)
• No Venue
Ma et al.
-
Scaling Analysis Of Interleaved Speech-text Language Models
(2025)
• No Venue
Maimon et al.
-
Slamming: Training A Speech Language Model On One GPU In A Day
(2025)
• No Venue
Gallil Maimon, Avishai Elmakies, Yossi Adi
-
Deepseek-r1 Thoughtology: Let's About LLM Reasoning
(2025)
• No Venue
Marjanović et al.
-
Smolvlm: Redefining Small And Efficient Multimodal Models
(2025)
• No Venue
Marafioti et al.
-
Spatiallm: Training Large Language Models For Structured Indoor Modeling
(2025)
• No Venue
Mao et al.
-
Wikivideo: Article Generation From Multiple Videos
(2025)
• No Venue
Martin et al.
-
Alignvlm: Bridging Vision And Language Latent Spaces For Multimodal Understanding
(2025)
• No Venue
Masry et al.
-
A Survey On Inference Engines For Large Language Models: Perspectives On Optimization And Efficiency
(2025)
• No Venue
Park et al.
-
A Survey Of Context Engineering For Large Language Models
(2025)
• No Venue
Mei et al.
-
Transmla: Multi-head Latent Attention Is All You Need
(2025)
• No Venue
Fanxu Meng, Zengwei Yao, Muhan Zhang
-
Mm-eureka: Exploring Visual Aha Moment With Rule-based Large-scale Reinforcement Learning
(2025)
• No Venue
Meng et al.
-
Exploring The Latent Capacity Of Llms For One-step Text Generation
(2025)
• No Venue
Gleb Mezentsev, Ivan Oseledets
-
I Think, Therefore I Diffuse: Enabling Multimodal In-context Reasoning In Diffusion Models
(2025)
• No Venue
Mi et al.
-
Easy Dataset: A Unified And Extensible Framework For Synthesizing LLM Fine-tuning Data From Unstructured Documents
(2025)
• No Venue
Miao et al.
-
Nablanabla: Neighborhood Adaptive Block-level Attention
(2025)
• No Venue
Mikhailov et al.
-
Text-aware Image Restoration With Diffusion Models
(2025)
• No Venue
Min et al.
-
Swe-lancer: Can Frontier Llms Earn $1 Million From Real-world Freelance Software Engineering?
(2025)
• No Venue
Miserendino et al.
-
Synthdetoxm: Modern Llms Are Few-shot Parallel Detoxification Data Annotators
(2025)
• No Venue
Moskovskiy et al.
-
Ruccod: Towards Automated ICD Coding In Russian
(2025)
• No Venue
Nesterov et al.
-
Adaptivocab: Enhancing LLM Efficiency In Focused Domains Through Lightweight Vocabulary Adaptation
(2025)
• No Venue
Nakash et al.
-
Do Generative Video Models Learn Physical Principles From Watching Videos?
(2025)
• No Venue
Motamed et al.
-
Discrete Audio Tokens: More Than A Survey!
(2025)
• No Venue
Mousavi et al.
-
S1: Simple Test-time Scaling
(2025)
• No Venue
Muennighoff et al.
-
Matryoshka Quantization
(2025)
• No Venue
Nair et al.
-
Effective Red-teaming Of Policy-adherent Agents
(2025)
• No Venue
Nakash et al.
-
Mlgym: A New Framework And Benchmark For Advancing AI Research Agents
(2025)
• No Venue
Nathani et al.
-
Fedrand: Enhancing Privacy In Federated Learning With Randomized Lora Subparameter Updates
(2025)
• No Venue
Park et al.
-
Hot: Highlighted Chain Of Thought For Referencing Supporting Facts From Inputs
(2025)
• No Venue
Nguyen et al.
-
Large Language Diffusion Models
(2025)
• No Venue
Nie et al.
-
Bielik 11B V2 Technical Report
(2025)
• No Venue
Ociepa et al.
-
Quest: Stable Training Of Llms With 1-bit Weights And Activations
(2025)
• No Venue
Panferov et al.
-
Tokenhsi: Unified Synthesis Of Physical Human-scene Interactions Through Task Tokenization
(2025)
• No Venue
Pan et al.
-
Omnimanip: Towards General Robotic Manipulation Via Object-centric Interaction Primitives As Spatial Constraints
(2025)
• No Venue
Pan et al.
-
Learning Adaptive Parallel Reasoning With Language Models
(2025)
• No Venue
Pan et al.
-
Medvlm-r1: Incentivizing Medical Reasoning Capability Of Vision-language Models (vlms) Via Reinforcement Learning
(2025)
• No Venue
Pan et al.
-
Sweeval: Do Llms Really Swear? A Safety Benchmark For Testing Limits For Enterprise Use
(2025)
• No Venue
Patel et al.
-
Fineweb2: One Pipeline To Scale Them All -- Adapting Pre-training Data Processing To Every Language
(2025)
• No Venue
Penedo et al.
-
Plutus: Benchmarking Large Language Models In Low-resource Greek Finance
(2025)
• No Venue
Peng et al.
-
Skywork R1V: Pioneering Multimodal Reasoning With Chain-of-thought
(2025)
• No Venue
Peng et al.
-
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification To Improve Trustworthy QA
(2025)
• No Venue
Pletenev et al.
-
Humanity's Last Exam
(2025)
• No Venue
Phan et al.
-
An Open Recipe: Adapting Language-specific Llms To A Reasoning Model In One Day Via Model Merging
(2025)
• No Venue
Pipatanakul et al.
-
How Much Knowledge Can You Pack Into A Lora Adapter Without Harming LLM?
(2025)
• No Venue
Pletenev et al.
-
Fino1: On The Transferability Of Reasoning Enhanced Llms To Finance
(2025)
• No Venue
Qian et al.
-
ART: Anonymous Region Transformer For Variable Multi-layer Transparent Image Generation
(2025)
• No Venue
Pu et al.
-
Dispider: Enabling Video Llms With Active Real-time Interaction Via Disentangled Perception, Decision, And Reaction
(2025)
• No Venue
Qian et al.
-
Vcr-bench: A Comprehensive Evaluation Framework For Video Chain-of-thought Reasoning
(2025)
• No Venue
Qi et al.
-
Sofar: Language-grounded Orientation Bridges Spatial Reasoning And Object Manipulation
(2025)
• No Venue
Qi et al.
-
We-math 2.0: A Versatile Mathbook System For Incentivizing Visual Mathematical Reasoning
(2025)
• No Venue
Qiao et al.
-
Toolrl: Reward Is All Tool Learning Needs
(2025)
• No Venue
Qian et al.
-
Phybench: Holistic Evaluation Of Physical Perception And Reasoning In Large Language Models
(2025)
• No Venue
Qiu et al.
-
Demons In The Detail: On Implementing Load Balancing Loss For Training Specialized Mixture-of-expert Models
(2025)
• No Venue
Qiu et al.
-
LHM: Large Animatable Human Reconstruction Model From A Single Image In Seconds
(2025)
• No Venue
Qiu et al.
-
Saffron-1: Towards An Inference Scaling Paradigm For LLM Safety Assurance
(2025)
• No Venue
Qiu et al.
-
A Survey Of Efficient Reasoning For Large Reasoning Models: Language, Multimodality, And Beyond
(2025)
• No Venue
Qu et al.
-
Codeelo: Benchmarking Competition-level Code Generation Of Llms With Human-comparable Elo Ratings
(2025)
• No Venue
Quan et al.
-
X-teaming: Multi-turn Jailbreaks And Defenses With Adaptive Multi-agents
(2025)
• No Venue
Rahman et al.
-
How Well Does Gpt-4o Understand Vision? Evaluating Multimodal Foundation Models On Standard Computer Vision Tasks
(2025)
• No Venue
Ramachandran et al.
-
Anycap Project: A Unified Framework, Dataset, And Benchmark For Controllable Omni-modal Captioning
(2025)
• No Venue
Ren et al.
-
Llm-microscope: Uncovering The Hidden Role Of Punctuation In Context Memory Of Transformers
(2025)
• No Venue
Razzhigaev et al.
-
Hogwild! Inference: Parallel LLM Generation Via Concurrent Attention
(2025)
• No Venue
Rodionov et al.
-
Zerobench: An Impossible Visual Benchmark For Contemporary Large Multimodal Models
(2025)
• No Venue
Roberts et al.
-
Dota-rag: Dynamic Of Thought Aggregation RAG
(2025)
• No Venue
Ruangtanusak et al.
-
SRMT: Shared Memory For Multi-agent Lifelong Pathfinding
(2025)
• No Venue
Alsu Sagirova, Yuri Kuratov, Mikhail Burtsev
-
The Diffusion Duality
(2025)
• No Venue
Sahoo et al.
-
Geopolitical Biases In Llms: What Are The "good" And The "bad" Countries According To Contemporary Language Models
(2025)
• No Venue
Salnikov et al.
-
Training Language Models For Social Deduction With Multi-agent Reinforcement Learning
(2025)
• No Venue
Sarkar et al.
-
Antidistillation Sampling
(2025)
• No Venue
Savani et al.
-
Quickvideo: Real-time Long Video Understanding With System Algorithm Co-design
(2025)
• No Venue
Schneider et al.
-
Seaweed-7b: Cost-effective Training Of Video Generation Foundation Model
(2025)
• No Venue
Seawead et al.
-
Skrr: Skip And Re-use Text Encoder Layers For Memory Efficient Text-to-image Generation
(2025)
• No Venue
Seo et al.
-
Paper2code: Automating Code Generation From Scientific Papers In Machine Learning
(2025)
• No Venue
Seo et al.
-
Longrope2: Near-lossless LLM Context Window Scaling
(2025)
• No Venue
Shang et al.
-
Reasonir: Training Retrievers For Reasoning Tasks
(2025)
• No Venue
Shao et al.
-
Core^2: Collect, Reflect And Refine To Generate Better And Faster
(2025)
• No Venue
Shao et al.
-
Phyx: Does Your Model Have The "wits" For Physical Reasoning?
(2025)
• No Venue
Shen et al.
-
Skywork-r1v3 Technical Report
(2025)
• No Venue
Shen et al.
-
VLM-R1: A Stable And Generalizable R1-style Large Vision-language Model
(2025)
• No Venue
Shen et al.
-
Longwriter-zero: Mastering Ultra-long Text Generation Via Reinforcement Learning
(2025)
• No Venue
Wu et al.
-
GLM-4.5: Agentic, Reasoning, And Coding (ARC) Foundation Models
(2025)
• No Venue
Team et al.
-
Voila: Voice-language Foundation Models For Real-time Autonomous Interaction And Voice Role-play
(2025)
• No Venue
Shi et al.
-
Mavors: Multi-granularity Video Representation For Multimodal Large Language Model
(2025)
• No Venue
Shi et al.
-
Heimdall: Test-time Scaling On The Generative Verification
(2025)
• No Venue
Wenlei Shi, Xing Jin
-
Taskcraft: Automated Generation Of Agentic Tasks
(2025)
• No Venue
Shi et al.
-
Mme-videoocr: Evaluating Ocr-based Capabilities Of Multimodal Llms In Video Scenarios
(2025)
• No Venue
Shi et al.
-
Scaling Vision Pre-training To 4K Resolution
(2025)
• No Venue
Shi et al.
-
Llmvox: Autoregressive Streaming Text-to-speech Model For Any LLM
(2025)
• No Venue
Shikhar et al.
-
Predictive Data Selection: The Data That Predicts Is The Data That Teaches
(2025)
• No Venue
Shum et al.
-
Smolvla: A Vision-language-action Model For Affordable And Efficient Robotics
(2025)
• No Venue
Shukor et al.
-
Agentic Reasoning And Tool Integration For Llms Via Reinforcement Learning
(2025)
• No Venue
Singh et al.
-
Diagonal Batching Unlocks Parallelism In Recurrent Memory Transformers For Long Contexts
(2025)
• No Venue
Sivtsov et al.
-
Refvnli: Towards Scalable Evaluation Of Subject-driven Text-to-image Generation
(2025)
• No Venue
Slobodkin et al.
-
T-lora: Single Image Diffusion Model Customization Without Overfitting
(2025)
• No Venue
Soboleva et al.
-
Omniconsistency: Learning Style-agnostic Consistency From Paired Stylization Data
(2025)
• No Venue
Yiren Song, Cheng Liu, Mike Zheng Shou
-
Vf-eval: Evaluating Multimodal Llms For Generating Feedback On AIGC Videos
(2025)
• No Venue
Song et al.
-
Alchemist: Turning Public Text-to-image Data Into Generative Gold
(2025)
• No Venue
Startsev et al.
-
Paperbench: Evaluating Ai's Ability To Replicate AI Research
(2025)
• No Venue
Starace et al.
-
Scale-wise Distillation Of Diffusion Models
(2025)
• No Venue
Starodubcev et al.
-
Stop Overthinking: A Survey On Efficient Reasoning For Large Language Models
(2025)
• No Venue
Sui et al.
-
Thinking With Images For Multimodal Reasoning: Foundations, Methods, And Future Frontiers
(2025)
• No Venue
Su et al.
-
Klear-reasoner: Advancing Reasoning Capability Via Gradient-preserving Clipping Policy Optimization
(2025)
• No Venue
Su et al.
-
Pixel Reasoner: Incentivizing Pixel-space Reasoning With Curiosity-driven Reinforcement Learning
(2025)
• No Venue
Su et al.
-
Gemma 3 Technical Report
(2025)
• No Venue
Team et al.
-
Zerosearch: Incentivize The Search Capability Of Llms Without Searching
(2025)
• No Venue
Sun et al.
-
Transformer^2: Self-adaptive Llms
(2025)
• No Venue
Qi Sun, Edoardo Cetin, Yujin Tang
-
Scienceboard: Evaluating Multimodal Autonomous Agents In Realistic Scientific Workflows
(2025)
• No Venue
Sun et al.
-
The Curse Of Depth In Large Language Models
(2025)
• No Venue
Sun et al.
-
Reasonmed: A 370K Multi-agent Generated Dataset For Advancing Medical Reasoning
(2025)
• No Venue
Sun et al.
-
DINGO: Constrained Inference For Diffusion Llms
(2025)
• No Venue
Suresh et al.
-
Auto-regressive Vs Flow-matching: A Comparative Study Of Modeling Paradigms For Text-to-music Generation
(2025)
• No Venue
Or Tal, Felix Kreuk, Yossi Adi
-
Cube: A Roblox View Of 3D Intelligence
(2025)
• No Venue
Team et al.
-
Lego-puzzles: How Good Are Mllms At Multi-step Spatial Reasoning?
(2025)
• No Venue
Tang et al.
-
Agent KB: Leveraging Cross-domain Experience For Agentic Problem Solving
(2025)
• No Venue
Tang et al.
-
Webshaper: Agentically Data Synthesizing Via Information-seeking Formalization
(2025)
• No Venue
Tao et al.
-
Realcritic: Towards Effectiveness-driven Evaluation Of Language Model Critiques
(2025)
• No Venue
Tang et al.
-
COIG-P: A High-quality And Large-scale Chinese Preference Dataset For Alignment With Human Values
(2025)
• No Venue
Team et al.
-
Supergpqa: Scaling LLM Evaluation Across 285 Graduate Disciplines
(2025)
• No Venue
Team et al.
-
Mimo: Unlocking The Reasoning Potential Of Language Model -- From Pretraining To Posttraining
(2025)
• No Venue
Team et al.
-
Kwai Keye-vl Technical Report
(2025)
• No Venue
Team et al.
-
Kanana: Compute-efficient Bilingual Language Models
(2025)
• No Venue
Team et al.
-
Kimi K1.5: Scaling Reinforcement Learning With Llms
(2025)
• No Venue
Team et al.
-
Lingshu: A Generalist Foundation Model For Unified Multimodal Medical Understanding And Reasoning
(2025)
• No Venue
Team et al.
-
Robobrain 2.0 Technical Report
(2025)
• No Venue
Team et al.
-
Minicpm4: Ultra-efficient Llms On End Devices
(2025)
• No Venue
Team et al.
-
Nextstep-1: Toward Autoregressive Image Generation With Continuous Tokens At Scale
(2025)
• No Venue
Team et al.
-
MMMR: Benchmarking Massive Multi-modal Reasoning Tasks
(2025)
• No Venue
Tie et al.
-
Ego-r1: Chain-of-tool-thought For Ultra-long Egocentric Video Reasoning
(2025)
• No Venue
Tian et al.
-
Padding Tone: A Mechanistic Analysis Of Padding Tokens In T2I Models
(2025)
• No Venue
Toker et al.
-
Vision-guided Chunking Is All You Need: Enhancing RAG With Multimodal Document Understanding
(2025)
• No Venue
Tripathi et al.
-
Franca: Nested Matryoshka Clustering For Scalable Visual Representation Learning
(2025)
• No Venue
Venkataramanan et al.
-
Siglip 2: Multilingual Vision-language Encoders With Improved Semantic Understanding, Localization, And Dense Features
(2025)
• No Venue
Tschannen et al.
-
How To Train Your LLM Web Agent: A Statistical Diagnosis
(2025)
• No Venue
Vattikonda et al.
-
Qwenlong-l1: Towards Long-context Large Reasoning Models With Reinforcement Learning
(2025)
• No Venue
Wan et al.
-
Less-to-more Generalization: Unlocking More Controllability By In-context Generation
(2025)
• No Venue
Wu et al.
-
Scholarcopilot: Training Large Language Models For Academic Writing With Accurate Citations
(2025)
• No Venue
Wang et al.
-
Octothinker: Mid-training Incentivizes Reinforcement Learning Scaling
(2025)
• No Venue
Wang et al.
-
Critique Fine-tuning: Learning To Critique Is More Effective Than Learning To Imitate
(2025)
• No Venue
Yubo Wang, Xiang Yue, Wenhu Chen
-
Bitnet V2: Native 4-bit Activations With Hadamard Transformation For 1-bit Llms
(2025)
• No Venue
Hongyu Wang, Shuming Ma, Furu Wei
-
Coser: Coordinating Llm-based Persona Simulation Of Established Roles
(2025)
• No Venue
Wang et al.
-
Cinemaster: A 3d-aware And Controllable Framework For Cinematic Text-to-video Generation
(2025)
• No Venue
Wang et al.
-
Clift: Compressive Light-field Tokens For Compute-efficient And Adaptive Neural Rendering
(2025)
• No Venue
Wang et al.
-
MIRIX: Multi-agent Memory System For Llm-based Agents
(2025)
• No Venue
Yu Wang, Xi Chen
-
Fantasytalking: Realistic Talking Portrait Generation Via Coherent Motion Synthesis
(2025)
• No Venue
Wang et al.
-
DDT: Decoupled Diffusion Transformer
(2025)
• No Venue
Wang et al.
-
Declip: Decoupled Learning For Open-vocabulary Dense Perception
(2025)
• No Venue
Wang et al.
-
Mathcoder-vl: Bridging Vision And Code For Enhanced Multimodal Mathematical Reasoning
(2025)
• No Venue
Wang et al.
-
RLVER: Reinforcement Learning With Verifiable Emotion Rewards For Empathetic Agents
(2025)
• No Venue
Wang et al.
-
Perception-aware Policy Optimization For Multimodal Reasoning
(2025)
• No Venue
Wang et al.
-
OTC: Optimal Tool Calls Via Reinforcement Learning
(2025)
• No Venue
Wang et al.
-
Open-qwen2vl: Compute-efficient Pre-training Of Fully-open Multimodal Llms On Academic Resources
(2025)
• No Venue
Wang et al.
-
Optimizing Large Language Model Training Using FP4 Quantization
(2025)
• No Venue
Wang et al.
-
Ovis-u1 Technical Report
(2025)
• No Venue
Wang et al.
-
Reptext: Rendering Visual Text Via Replicating
(2025)
• No Venue
Wang et al.
-
Pixnerd: Pixel Neural Field Diffusion
(2025)
• No Venue
Wang et al.
-
Rep-mtl: Unleashing The Power Of Representation-level Task Saliency For Multi-task Learning
(2025)
• No Venue
Zedong Wang, Siyuan Li, Dan Xu
-
Worldpm: Scaling Human Preference Modeling
(2025)
• No Venue
Wang et al.
-
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation And Methodology
(2025)
• No Venue
Wang et al.
-
Thoughts Are All Over The Place: On The Underthinking Of O1-like Llms
(2025)
• No Venue
Wang et al.
-
Test-time Scaling With Reflective Generative Model
(2025)
• No Venue
Wang et al.
-
Textatlas5m: A Large-scale Dataset For Dense Text Image Generation
(2025)
• No Venue
Wang et al.
-
Time Is A Feature: Exploiting Temporal Dynamics In Diffusion Language Models
(2025)
• No Venue
Wang et al.
-
Wait, We Don't Need To "wait"! Removing Thinking Tokens Improves Reasoning Efficiency
(2025)
• No Venue
Wang et al.
-
Visualprm: An Effective Process Reward Model For Multimodal Reasoning
(2025)
• No Venue
Wang et al.
-
Vl-rethinker: Incentivizing Self-reflection Of Vision-language Models With Reinforcement Learning
(2025)
• No Venue
Wang et al.
-
Mocha: Towards Movie-grade Talking Character Synthesis
(2025)
• No Venue
Wei et al.
-
Advancing Multimodal Reasoning Via Reinforcement Learning With Cold Start
(2025)
• No Venue
Wei et al.
-
Videorope: What Makes For Good Video Rotary Position Embedding?
(2025)
• No Venue
Wei et al.
-
SWE-RL: Advancing LLM Reasoning Via Reinforcement Learning On Open Software Evolution
(2025)
• No Venue
Wei et al.
-
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior For Visual Reasoning
(2025)
• No Venue
Wei et al.
-
Streamvln: Streaming Vision-and-language Navigation Via Slowfast Context Modeling
(2025)
• No Venue
Wei et al.
-
Unsupervised Post-training For Multi-modal LLM Reasoning Via GRPO
(2025)
• No Venue
Wei et al.
-
LAPO: Internalizing Reasoning Efficiency Via Length-adaptive Policy Optimization
(2025)
• No Venue
Wu et al.
-
Reinforcement Learning With Verifiable Rewards Implicitly Incentivizes Correct Reasoning In Base Llms
(2025)
• No Venue
Wen et al.
-
Delta Attention: Fast And Accurate Sparse Attention Inference By Delta Correction
(2025)
• No Venue
Jeffrey Willette, Heejun Lee, Sung Ju Hwang
-
Widesearch: Benchmarking Agentic Broad Info-seeking
(2025)
• No Venue
Wong et al.
-
Kris-bench: Benchmarking Next-level Intelligent Image Editing Models
(2025)
• No Venue
Wu et al.
-
The Invisible Leash: Why RLVR May Not Escape Its Origin
(2025)
• No Venue
Wu et al.
-
Automated Movie Generation Via Multi-agent Cot Planning
(2025)
• No Venue
Weijia Wu, Zeyu Zhu, Mike Zheng Shou
-
From Hours To Minutes: Lossless Acceleration Of Ultra Long Sequence Generation Up To 100K Tokens
(2025)
• No Venue
Wu et al.
-
Generate, But Verify: Reducing Hallucination In Vision-language Models With Retrospective Resampling
(2025)
• No Venue
Wu et al.
-
The Bitter Lesson Learned From 2,000+ Multilingual Benchmarks
(2025)
• No Venue
Wu et al.
-
Difix3d+: Improving 3D Reconstructions With Single-step Diffusion Models
(2025)
• No Venue
Wu et al.
-
Gui-actor: Coordinate-free Visual Grounding For GUI Agents
(2025)
• No Venue
Wu et al.
-
Vmoba: Mixture-of-block Attention For Video Diffusion Models
(2025)
• No Venue
Wu et al.
-
Superwriter: Reflection-driven Long-form Generation With Large Language Models
(2025)
• No Venue
Wu et al.
-
Spatial-mllm: Boosting MLLM Capabilities In Visual-based Spatial Intelligence
(2025)
• No Venue
Wu et al.
-
Mmsearch-r1: Incentivizing Lmms To Search
(2025)
• No Venue
Wu et al.
-
Omnigen2: Exploration To Advanced Multimodal Generation
(2025)
• No Venue
Wu et al.
-
Step-audio 2 Technical Report
(2025)
• No Venue
Wu et al.
-
Visualquality-r1: Reasoning-induced Image Quality Assessment Via Reinforcement Learning To Rank
(2025)
• No Venue
Wu et al.
-
Synthrl: Scaling Visual Reasoning With Verifiable Data Synthesis
(2025)
• No Venue
Wu et al.
-
Video World Models With Long-term Spatial Memory
(2025)
• No Venue
Wu et al.
-
Omnithink: Expanding Knowledge Boundaries In Machine Writing Through Thinking
(2025)
• No Venue
Xi et al.
-
Retrieval-augmented Large Language Models For Financial Time Series Forecasting
(2025)
• No Venue
Xiao et al.
-
Captain Cinema: Towards Short Movie Generation
(2025)
• No Venue
Xiao et al.
-
Videoauteur: Towards Long Narrative Video Generation
(2025)
• No Venue
Xiao et al.
-
Logic-rl: Unleashing LLM Reasoning With Rule-based Reinforcement Learning
(2025)
• No Venue
Xie et al.
-
STAR: Spatial-temporal Augmentation With Text-to-video Models For Real-world Video Super-resolution
(2025)
• No Venue
Xie et al.
-
Surrogate Signals From Format And Length: Reinforcement Learning For Solving Mathematical Problems Without Ground Truth Answers
(2025)
• No Venue
Xin et al.
-
Self-rewarding Correction For Mathematical Reasoning
(2025)
• No Venue
Xiong et al.
-
Flag-trader: Fusion Llm-agent With Gradient-based Reinforcement Learning For Financial Trading
(2025)
• No Venue
Xiong et al.
-
Gigatok: Scaling Visual Tokenizers To 3 Billion Parameters For Autoregressive Image Generation
(2025)
• No Venue
Xiong et al.
-
Kodcode: A Diverse, Challenging, And Verifiable Synthetic Dataset For Coding
(2025)
• No Venue
Xu et al.
-
Genius: A Generalizable And Purely Unsupervised Self-training Framework For Advanced Reasoning
(2025)
• No Venue
Xu et al.
-
Comfyui-copilot: An Intelligent Assistant For Automated Workflow Development
(2025)
• No Venue
Xu et al.
-
Chain Of Draft: Thinking Faster By Writing Less
(2025)
• No Venue
Xu et al.
-
Filmagent: A Multi-agent Framework For End-to-end Film Automation In Virtual 3D Spaces
(2025)
• No Venue
Xu et al.
-
Φ-decoding: Adaptive Foresight Sampling For Balanced Inference-time Exploration And Exploitation
(2025)
• No Venue
Xu et al.
-
Towards Large Reasoning Models: A Survey Of Reinforced Reasoning With Large Language Models
(2025)
• No Venue
Xu et al.
-
Qwen2.5-omni Technical Report
(2025)
• No Venue
Xu et al.
-
Noderag: Structuring Graph-based RAG With Heterogeneous Nodes
(2025)
• No Venue
Xu et al.
-
Phi-4-mini-reasoning: Exploring The Limits Of Small Reasoning Language Models In Math
(2025)
• No Venue
Xu et al.
-
Relearn: Unlearning Via Learning For Large Language Models
(2025)
• No Venue
Xu et al.
-
Vs-bench: Evaluating Vlms For Strategic Reasoning And Decision-making In Multi-agent Environments
(2025)
• No Venue
Xu et al.
-
Visual Planning: Let's Think Only With Images
(2025)
• No Venue
Xu et al.
-
Visulogic: A Benchmark For Evaluating Visual Reasoning In Multi-modal Large Language Models
(2025)
• No Venue
Xu et al.
-
Dancegrpo: Unleashing GRPO On Visual Generation
(2025)
• No Venue
Xue et al.
-
Audio-flan: A Preliminary Release
(2025)
• No Venue
Xue et al.
-
Learning To Reason Under Off-policy Guidance
(2025)
• No Venue
Yan et al.
-
Gpt-imgeval: A Comprehensive Benchmark For Diagnosing Gpt4o In Image Generation
(2025)
• No Venue
Yan et al.
-
MUR: Momentum Uncertainty Guided Reasoning For Large Language Models
(2025)
• No Venue
Yan et al.
-
Sparse Videogen2: Accelerate Video Generation With Sparse Attention Via Semantic-aware Permutation
(2025)
• No Venue
Yang et al.
-
Magma: A Foundation Model For Multimodal AI Agents
(2025)
• No Venue
Yang et al.
-
A Controllable Examination For Long-context Language Models
(2025)
• No Venue
Yang et al.
-
Lohovla: A Unified Vision-language-action Model For Long-horizon Embodied Tasks
(2025)
• No Venue
Yang et al.
-
Deepcritic: Deliberate Critique With Large Language Models
(2025)
• No Venue
Yang et al.
-
Egolife: Towards Egocentric Life Assistant
(2025)
• No Venue
Yang et al.
-
Qwen3 Technical Report
(2025)
• No Venue
Yang et al.
-
Omnisvg: A Unified Scalable Vector Graphics Generation Model
(2025)
• No Venue
Yang et al.
-
Mmada: Multimodal Large Diffusion Language Models
(2025)
• No Venue
Yang et al.
-
Multiverse: Your Language Models Secretly Decide How To Parallelize And Merge Generation
(2025)
• No Venue
Yang et al.
-
Qwen2.5-1m Technical Report
(2025)
• No Venue
Yang et al.
-
Videograin: Modulating Space-time Attention For Multi-grained Video Editing
(2025)
• No Venue
Yang et al.
-
Evolving Deeper LLM Thinking
(2025)
• No Venue
Lee et al.
-
Feedback Friction: Llms Struggle To Fully Incorporate External Feedback
(2025)
• No Venue
Jiang et al.
-
Complex Logical Instruction Generation
(2025)
• No Venue
Zhang et al.
-
BANG: Dividing 3D Assets Via Generative Exploded Dynamics
(2025)
• No Venue
Zhang et al.
-
Adaptthink: Reasoning Models Can Learn When To Think
(2025)
• No Venue
Zhang et al.
-
70% Size, 100% Accuracy: Lossless LLM Compression For Efficient GPU Inference Via Dynamic-length Float
(2025)
• No Venue
Zhang et al.
-
100 Days After Deepseek-r1: A Survey On Replication Studies And More Directions For Reasoning Language Models
(2025)
• No Venue
Zhang et al.
-
2.5 Years In Class: A Multimodal Textbook For Vision-language Pretraining
(2025)
• No Venue
Zhang et al.
-
Sageattention2++: A More Efficient Implementation Of Sageattention2
(2025)
• No Venue
Zhang et al.
-
Othink-r1: Intrinsic Fast/slow Thinking Mode Switching For Over-reasoning Mitigation
(2025)
• No Venue
Zhang et al.
-
Lightthinker: Thinking Step-by-step Compression
(2025)
• No Venue
Zhang et al.
-
Faster Video Diffusion With Trainable Sparse Attention
(2025)
• No Venue
Zhang et al.
-
Diffusion Vs. Autoregressive Language Models: A Text Embedding Perspective
(2025)
• No Venue
Zhang et al.
-
Dreamvla: A Vision-language-action Model Dreamed With Comprehensive World Knowledge
(2025)
• No Venue
Zhang et al.
-
The Lessons Of Developing Process Reward Models In Mathematical Reasoning
(2025)
• No Venue
Zhang et al.
-
MM-RLHF: The Next Step Forward In Multimodal LLM Alignment
(2025)
• No Venue
Zhang et al.
-
MARS: A Multi-agent Framework Incorporating Socratic Guidance For Automated Prompt Optimization
(2025)
• No Venue
Zhang et al.
-
Minimax-speech: Intrinsic Zero-shot Text-to-speech With A Learnable Speaker Encoder
(2025)
• No Venue
Zhang et al.
-
Redundancy Principles For Mllms Benchmarks
(2025)
• No Venue
Zhang et al.
-
Process-based Self-rewarding Language Models
(2025)
• No Venue
Zhang et al.
-
Packing Input Frame Context In Next-frame Prediction Models For Video Generation
(2025)
• No Venue
Lvmin Zhang, Maneesh Agrawala
-
Phi-ground Tech Report: Advancing Perception In GUI Grounding
(2025)
• No Venue
Zhang et al.
-
Qwen3 Embedding: Advancing Text Embedding And Reranking Through Foundation Models
(2025)
• No Venue
Zhang et al.
-
Tensor Product Attention Is All You Need
(2025)
• No Venue
Zhang et al.
-
Soundwave: Less Is More For Speech-text Alignment In Llms
(2025)
• No Venue
Zhang et al.
-
Sageattention3: Microscaling FP4 Attention For Inference And An Exploration Of 8-bit Training
(2025)
• No Venue
Zhang et al.
-
Sec: Advancing Complex Video Object Segmentation Via Progressive Concept Construction
(2025)
• No Venue
Zhang et al.
-
Spargeattn: Accurate Sparse Attention Accelerating Any Model Inference
(2025)
• No Venue
Zhang et al.
-
What, How, Where, And How Well? A Survey On Test-time Scaling In Large Language Models
(2025)
• No Venue
Zhang et al.
-
Videollama 3: Frontier Multimodal Foundation Models For Image And Video Understanding
(2025)
• No Venue
Zhang et al.
-
Vlm^2-bench: A Closer Look At How Well Vlms Implicitly Link Explicit Matching Visual Cues
(2025)
• No Venue
Zhang et al.
-
Envisioning Beyond The Pixels: Benchmarking Reasoning-informed Visual Editing
(2025)
• No Venue
Zhao et al.
-
DICEPTION: A Generalist Diffusion Model For Visual Perceptual Tasks
(2025)
• No Venue
Zhao et al.
-
Absolute Zero: Reinforced Self-play Reasoning With Zero Data
(2025)
• No Venue
Zhao et al.
-
Babel: Open Multilingual Large Language Models Serving Over 90% Of Global Speakers
(2025)
• No Venue
Zhao et al.
-
Paroattention: Pattern-aware Reordering For Efficient Sparse And Quantized Attention In Visual Generation Models
(2025)
• No Venue
Zhao et al.
-
MMVU: Measuring Expert-level Multi-discipline Video Understanding
(2025)
• No Venue
Zhao et al.
-
Hunyuan3d 2.0: Scaling Diffusion Models For High Resolution Textured 3D Assets Generation
(2025)
• No Venue
Zhao et al.
-
Insights Into Deepseek-v3: Scaling Challenges And Reflections On Hardware For AI Architectures
(2025)
• No Venue
Zhao et al.
-
Omnialign-v: Towards Enhanced Alignment Of Mllms With Human Preference
(2025)
• No Venue
Zhao et al.
-
Marrying Autoregressive Transformer And Diffusion With Multi-reference Autoregression
(2025)
• No Venue
Zhen et al.
-
Pyvision: Agentic Vision With Dynamic Tooling
(2025)
• No Venue
Zhao et al.
-
R1-omni: Explainable Omni-multimodal Emotion Recognition With Reinforcing Learning
(2025)
• No Venue
Jiaxing Zhao, Xihan Wei, Liefeng Bo
-
Scaling Diffusion Transformers Efficiently Via Μp
(2025)
• No Venue
Zheng et al.
-
Group Sequence Policy Optimization
(2025)
• No Venue
Zheng et al.
-
Vbench-2.0: Advancing Video Generation Benchmark Suite For Intrinsic Faithfulness
(2025)
• No Venue
Zheng et al.
-
A Survey On Vision-language-action Models: An Action Tokenization Perspective
(2025)
• No Venue
Zhong et al.
-
Roborefer: Towards Spatial Referring With Reasoning In Vision-language Models For Robotics
(2025)
• No Venue
Zhou et al.
-
3DIS-FLUX: Simple And Efficient Multi-instance Generation With Dit Rendering
(2025)
• No Venue
Zhou et al.
-
R1-zero's "aha Moment" In Visual Reasoning On A 2B Non-sft Model
(2025)
• No Venue
Zhou et al.
-
Scientists' First Exam: Probing Cognitive Abilities Of MLLM Via Perception, Understanding, And Reasoning
(2025)
• No Venue
Zhou et al.
-
Internvl3: Exploring Advanced Training And Test-time Recipes For Open-source Multimodal Models
(2025)
• No Venue
Zhu et al.
-
Transformers Without Normalization
(2025)
• No Venue
Zhu et al.
-
Scaling Test-time Compute For LLM Agents
(2025)
• No Venue
Zhu et al.
-
A Survey On Latent Reasoning
(2025)
• No Venue
Zhu et al.
-
Softpick: No Attention Sink, No Massive Activations With Rectified Softmax
(2025)
• No Venue
Zayd M. K. Zuhri, Erland Hilman Fuadi, Alham Fikri Aji
-
TTRL: Test-time Reinforcement Learning
(2025)
• No Venue
Zuo et al.
-
Emerging Properties In Unified Multimodal Pretraining
(2025)
• No Venue
Deng et al.
-
Towards Physically Plausible Video Generation Via VLM Planning
(2025)
• No Venue
Yang et al.
-
Twinmarket: A Scalable Behavioral And Social Simulation For Financial Markets
(2025)
• No Venue
Yang et al.
-
Zerogui: Automating Online GUI Learning At Zero Human Cost
(2025)
• No Venue
Yang et al.
-
Reconstruction Vs. Generation: Taming Optimization Dilemma In Latent Diffusion Models
(2025)
• No Venue
Jingfeng Yao, Xinggang Wang
-
Shapellm-omni: A Native Multimodal LLM For 3D Generation And Understanding
(2025)
• No Venue
Ye et al.
-
Survey On Evaluation Of Llm-based Agents
(2025)
• No Venue
Yehudai et al.
-
Llada-v: Large Language Diffusion Models With Visual Instruction Tuning
(2025)
• No Venue
You et al.
-
Unicorn: Text-only Data Synthesis For Vision Language Model Training
(2025)
• No Venue
Yu et al.
-
The Stochastic Parrot On Llm's Shoulder: A Summative Assessment Of Physical Concept Understanding
(2025)
• No Venue
Yu et al.
-
Formalmath: Benchmarking Formal Mathematical Reasoning Of Large Language Models
(2025)
• No Venue
Yu et al.
-
Discrete Diffusion In Large Language And Multimodal Models: A Survey
(2025)
• No Venue
Runpeng Yu, Qi Li, Xinchao Wang
-
PRELUDE: A Benchmark Designed To Require Global Comprehension And Reasoning Over Long Contexts
(2025)
• No Venue
Yu et al.
-
Vrbench: A Benchmark For Multi-step Reasoning In Long Narrative Videos
(2025)
• No Venue
Yu et al.
-
Agent-r: Training Language Model Agents To Reflect Via Iterative Self-training
(2025)
• No Venue
Yuan et al.
-
Being-0: A Humanoid Robotic Agent With Vision-language Models And Modular Skills
(2025)
• No Venue
Yuan et al.
-
Opens2v-nexus: A Detailed Benchmark And Million-scale Dataset For Subject-to-video Generation
(2025)
• No Venue
Yuan et al.
-
Does Reinforcement Learning Really Incentivize Reasoning Capacity In Llms Beyond The Base Model?
(2025)
• No Venue
Yue et al.
-
Designlab: Designing Slides Through Iterative Detection And Correction
(2025)
• No Venue
Yun et al.
-
Internlm-xcomposer2.5-reward: A Simple Yet Effective Multi-modal Reward Model
(2025)
• No Venue
Zang et al.
-
Skywork-swe: Unveiling Data Scaling Laws For Software Engineering In Llms
(2025)
• No Venue
Zeng et al.
-
Renderformer: Transformer-based Neural Rendering Of Triangle Meshes With Global Illumination
(2025)
• No Venue
Zeng et al.
-
SIFT: Grounding LLM Reasoning In Contexts Via Stickers
(2025)
• No Venue
Zeng et al.
-
Humaneval-v: Benchmarking High-level Visual Reasoning With Complex Diagrams In Coding Tasks
(2024)
• No Venue
Zhang et al.
-
HARE: Human Priors, A Key To Small Language Model Efficiency
(2024)
• No Venue
Zhang et al.
-
Document Parsing Unveiled: Techniques, Challenges, And Prospects For Structured Information Extraction
(2024)
• No Venue
Zhang et al.
-
Sharegpt4video: Improving Video Understanding And Generation With Better Captions
(2024)
• No Venue
Chen et al.
-
Diversity Empowers Intelligence: Integrating Expertise Of Software Engineering Agents
(2024)
• No Venue
Zhang et al.
-
A Careful Examination Of Large Language Model Performance On Grade School Arithmetic
(2024)
• No Venue
Zhang et al.
-
Critic-v: VLM Critics Help Catch VLM Errors In Multimodal Reasoning
(2024)
• No Venue
Zhang et al.
-
GRAPE: Generalizing Robot Policy Via Preference Alignment
(2024)
• No Venue
Zhang et al.
-
Extending Llama-3's Context Ten-fold Overnight
(2024)
• No Venue
Zhang et al.
-
Euclid: Supercharging Multimodal Llms With Synthetic High-fidelity Visual Descriptions
(2024)
• No Venue
Zhang et al.
-
Evaluation Agent: Efficient And Promptable Evaluation Framework For Visual Generative Models
(2024)
• No Venue
Zhang et al.
-
Ferret-v2: An Improved Baseline For Referring And Grounding With Large Language Models
(2024)
• No Venue
Zhang et al.
-
RAFT: Adapting Language Model To Domain Specific RAG
(2024)
• No Venue
Zhang et al.
-
Multi-dimensional Insights: Benchmarking Real-world Personalization In Large Multimodal Models
(2024)
• No Venue
Zhang et al.
-
Long Context Transfer From Language To Vision
(2024)
• No Venue
Zhang et al.
-
Llama-berry: Pairwise Optimization For O1-like Olympiad-level Mathematical Reasoning
(2024)
• No Venue
Zhang et al.
-
Internlm-xcomposer2.5-omnilive: A Comprehensive Multimodal System For Long-term Streaming Video And Audio Interactions
(2024)
• No Venue
Zhang et al.
-
Large Language Model-brained GUI Agents: A Survey
(2024)
• No Venue
Zhang et al.
-
Lmms-eval: Reality Check On The Evaluation Of Large Multimodal Models
(2024)
• No Venue
Zhang et al.
-
Mm-llms: Recent Advances In Multimodal Large Language Models
(2024)
• No Venue
Zhang et al.
-
Longcite: Enabling Llms To Generate Fine-grained Citations In Long-context QA
(2024)
• No Venue
Zhang et al.
-
Mathverse: Does Your Multi-modal LLM Truly See The Diagrams In Visual Math Problems?
(2024)
• No Venue
Zhang et al.
-
Q-galore: Quantized Galore With INT4 Projection And Layer-adaptive Low-rank Gradients
(2024)
• No Venue
Zhang et al.
-
Onegen: Efficient One-pass Unified Generation And Retrieval For Llms
(2024)
• No Venue
Zhang et al.
-
Multimodal Self-instruct: Synthetic Abstract Image And Visual Reasoning Instruction Using Language Model
(2024)
• No Venue
Zhang et al.
-
O1-coder: An O1 Replication For Coding
(2024)
• No Venue
Zhang et al.
-
Personalization Of Large Language Models: A Survey
(2024)
• No Venue
Zhang et al.
-
Video Instruction Tuning With Synthetic Data
(2024)
• No Venue
Zhang et al.
-
SPAR: Personalized Content-based Recommendation Via Long Engagement Attention
(2024)
• No Venue
Zhang et al.
-
Sageattention: Accurate 8-bit Attention For Plug-and-play Inference Acceleration
(2024)
• No Venue
Zhang et al.
-
Seallms 3: Open Foundation And Chat Multilingual Large Language Models For Southeast Asian Languages
(2024)
• No Venue
Zhang et al.
-
Tinyllama: An Open-source Small Language Model
(2024)
• No Venue
Zhang et al.
-
WALL-E: World Alignment By Rule Learning Improves World Model-based LLM Agents
(2024)
• No Venue
Zhou et al.
-
Transfusion: Predict The Next Token And Diffuse Images With One Multi-modal Model
(2024)
• No Venue
Zhou et al.
-
Code-as-monitor: Constraint-aware Visual Programming For Reactive And Proactive Robotic Failure Detection
(2024)
• No Venue
Zhou et al.
-
Self-discover: Large Language Models Self-compose Reasoning Structures
(2024)
• No Venue
Zhou et al.
-
Llmtimesmapreduce: Simplified Long-sequence Processing Using Large Language Models
(2024)
• No Venue
Zhou et al.
-
Megapairs: Massive Data Synthesis For Universal Multimodal Retrieval
(2024)
• No Venue
Zhou et al.
-
Apollo: An Exploration Of Video Understanding In Large Multimodal Models
(2024)
• No Venue
Zohar et al.
-
Masked Audio Generation Using A Single Non-autoregressive Transformer
(2024)
• No Venue
Ziv et al.
-
Vision Mamba: Efficient Visual Representation Learning With Bidirectional State Space Model
(2024)
• No Venue
Zhu et al.
-
How To Synthesize Text Data Without Model Collapse?
(2024)
• No Venue
Zhu et al.
-
APOLLO: Sgd-like Memory, Adamw-level Performance
(2024)
• No Venue
Zhu et al.
-
Reflections From The 2024 Large Language Model (LLM) Hackathon For Applications In Materials Science And Chemistry
(2024)
• No Venue
Zimmermann et al.
-
Structlm: Towards Building Generalist Models For Structured Knowledge Grounding
(2024)
• No Venue
Zhuang et al.
-
Moba: A Two-level Agent System For Efficient Mobile Task Automation
(2024)
• No Venue
Zhu et al.
-
Marco-o1: Towards Open Reasoning Models For Open-ended Solutions
(2024)
• No Venue
Zhao et al.
-
Cobra: Extending Mamba To Multi-modal Large Language Model For Efficient Inference
(2024)
• No Venue
Zhao et al.
-
Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report
(2024)
• No Venue
Zhao et al.
-
Galore: Memory-efficient LLM Training By Gradient Low-rank Projection
(2024)
• No Venue
Zhao et al.
-
Llama Beyond English: An Empirical Study On Language Capability Transfer
(2024)
• No Venue
Zhao et al.
-
Moviedreamer: Hierarchical Generation For Coherent Long Visual Sequence
(2024)
• No Venue
Zhao et al.
-
Wildchat: 1M Chatgpt Interaction Logs In The Wild
(2024)
• No Venue
Zhao et al.
-
Processbench: Identifying Process Errors In Mathematical Reasoning
(2024)
• No Venue
Zheng et al.
-
Openresearcher: Unleashing AI For Accelerated Scientific Research
(2024)
• No Venue
Zheng et al.
-
Opencodeinterpreter: Integrating Code Generation With Execution And Refinement
(2024)
• No Venue
Zheng et al.
-
Efficiently Democratizing Medical Llms For 50 Languages Via A Mixture Of Language Family Experts
(2024)
• No Venue
Zheng et al.
-
Attention Heads Of Large Language Models: A Survey
(2024)
• No Venue
Zheng et al.
-
Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models
(2024)
• No Venue
Zheng et al.
-
Videogen-of-thought: A Collaborative Framework For Multi-shot Video Generation
(2024)
• No Venue
Zheng et al.
-
Lyra: An Efficient And Speech-centric Framework For Omni-cognition
(2024)
• No Venue
Zhong et al.
-
Multi-lora Composition For Image Generation
(2024)
• No Venue
Zhong et al.
-
Eagle: Exploring The Design Space For Multimodal Llms With Mixture Of Encoders
(2024)
• No Venue
Shi et al.
-
Chartmimic: Evaluating Lmm's Cross-modal Reasoning Capability Via Chart-to-code Generation
(2024)
• No Venue
Shi et al.
-
From Code To Correctness: Closing The Last Mile Of Code Generation With Hierarchical Debugging
(2024)
• No Venue
Shi et al.
-
PERL: Parameter Efficient Reinforcement Learning From Human Feedback
(2024)
• No Venue
Sidahmed et al.
-
Large-scale Text-to-image Model With Inpainting Is A Zero-shot Subject-driven Image Generator
(2024)
• No Venue
Shin et al.
-
Design2code: How Far Are We From Automating Front-end Engineering?
(2024)
• No Venue
Si et al.
-
Can Llms Generate Novel Research Ideas? A Large-scale Human Study With 100+ NLP Researchers
(2024)
• No Venue
Chenglei Si, Diyi Yang, Tatsunori Hashimoto
-
Aya Dataset: An Open-access Collection For Multilingual Instruction Tuning
(2024)
• No Venue
Singh et al.
-
Moviellm: Enhancing Long Video Understanding With Ai-generated Movies
(2024)
• No Venue
Song et al.
-
Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters
(2024)
• No Venue
Snell et al.
-
Dolma: An Open Corpus Of Three Trillion Tokens For Language Model Pretraining Research
(2024)
• No Venue
Soldaini et al.
-
Both Text And Images Leaked! A Systematic Analysis Of Multimodal LLM Data Contamination
(2024)
• No Venue
Song et al.
-
LLM Pruning And Distillation In Practice: The Minitron Approach
(2024)
• No Venue
Sreenivas et al.
-
Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms
(2024)
• No Venue
Tongyi Speechteam
-
To Cot Or Not To Cot? Chain-of-thought Helps Mainly On Math And Symbolic Reasoning
(2024)
• No Venue
Sprague et al.
-
Paligemma 2: A Family Of Versatile Vlms For Transfer
(2024)
• No Venue
Steiner et al.
-
Jina-embeddings-v3: Multilingual Embeddings With Task Lora
(2024)
• No Venue
Sturua et al.
-
Bitsfusion: 1.99 Bits Weight Quantization Of Diffusion Model
(2024)
• No Venue
Sui et al.
-
Branch-train-mix: Mixing Expert Llms Into A Mixture-of-experts LLM
(2024)
• No Venue
Sukhbaatar et al.
-
Multimodal Latent Language Modeling With Next-token Diffusion
(2024)
• No Venue
Sun et al.
-
Learning To (learn At Test Time): Rnns With Expressive Hidden States
(2024)
• No Venue
Sun et al.
-
Autoregressive Model Beats Diffusion: Llama For Scalable Image Generation
(2024)
• No Venue
Sun et al.
-
LAMBDA: A Large Model Based Data Agent
(2024)
• No Venue
Sun et al.
-
X-prompt: Towards Universal In-context Image Generation In Auto-regressive Vision Language Foundation Models
(2024)
• No Venue
Sun et al.
-
Parrot: Multilingual Visual Instruction Tuning
(2024)
• No Venue
Sun et al.
-
Os-genesis: Automating GUI Agent Trajectory Construction Via Reverse Task Synthesis
(2024)
• No Venue
Sun et al.
-
Outfitanyone: Ultra-high Quality Virtual Try-on For Any Clothing And Any Person
(2024)
• No Venue
Sun et al.
-
Trustllm: Trustworthiness In Large Language Models
(2024)
• No Venue
Sun et al.
-
Meta-prompting: Enhancing Language Models With Task-agnostic Scaffolding
(2024)
• No Venue
Mirac Suzgun, Adam Tauman Kalai
-
Unpacking SDXL Turbo: Interpreting Text-to-image Models With Sparse Autoencoders
(2024)
• No Venue
Surkov et al.
-
A Framework For Human Evaluation Of Large Language Models In Healthcare Derived From Literature Review
(2024)
• npj Digital Medicine
• 64 citations
Tam et al.
-
Htmlrag: HTML Is Better Than Plain Text For Modeling Retrieved Knowledge In RAG Systems
(2024)
• No Venue
Tan et al.
-
Video-infinity: Distributed Long Video Generation
(2024)
• No Venue
Tan et al.
-
Judgebench: A Benchmark For Evaluating Llm-based Judges
(2024)
• No Venue
Tan et al.
-
Ominicontrol: Minimal And Universal Control For Diffusion Transformer
(2024)
• No Venue
Tan et al.
-
Textsquare: Scaling Up Text-centric Visual Instruction Tuning
(2024)
• No Venue
Tang et al.
-
Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies
(2024)
• No Venue
Tao et al.
-
Octo: An Open-source Generalist Robot Policy
(2024)
• No Venue
Team et al.
-
Gemma 2: Improving Open Language Models At A Practical Size
(2024)
• No Venue
Team et al.
-
Chameleon: Mixed-modal Early-fusion Foundation Models
(2024)
• No Venue
Chameleon Team
-
Jamba-1.5: Hybrid Transformer-mamba Models At Scale
(2024)
• No Venue
Team et al.
-
Hermes 3 Technical Report
(2024)
• No Venue
Ryan Teknium, Jeffrey Quesnelle, Chen Guang
-
Add-it: Training-free Object Insertion In Images With Pretrained Diffusion Models
(2024)
• No Venue
Tewel et al.
-
Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges
(2024)
• No Venue
Thakur et al.
-
Training-free Consistent Text-to-image Generation
(2024)
• No Venue
Tewel et al.
-
Spreadsheetllm: Encoding Spreadsheets For Large Language Models
(2024)
• No Venue
Tian et al.
-
Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing
(2024)
• No Venue
Tian et al.
-
A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models
(2024)
• Arxiv
• 66 citations
Tonmoy et al.
-
Continuous Speech Synthesis Using Per-token Latent Diffusion
(2024)
• No Venue
Turetzky et al.
-
No "zero-shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
(2024)
• No Venue
Udandarao et al.
-
Diffusion Models Are Real-time Game Engines
(2024)
• No Venue
Valevski et al.
-
Fastvlm: Efficient Vision Encoding For Vision Language Models
(2024)
• No Venue
Vasu et al.
-
Replacing Judges With Juries: Evaluating LLM Generations With A Panel Of Diverse Models
(2024)
• No Venue
Verga et al.
-
Switti: Designing Scale-wise Transformers For Text-to-image Synthesis
(2024)
• No Venue
Voronov et al.
-
The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions
(2024)
• No Venue
Wallace et al.
-
Fusechat: Knowledge Fusion Of Chat Models
(2024)
• No Venue
Wan et al.
-
Opendevin: An Open Platform For AI Software Developers As Generalist Agents
(2024)
• No Venue
Wang et al.
-
Omnieval: An Omnidirectional And Automatic RAG Evaluation Benchmark In Financial Domain
(2024)
• No Venue
Wang et al.
-
Litesearch: Efficacious Tree Search For LLM
(2024)
• No Venue
Wang et al.
-
Agent Workflow Memory
(2024)
• No Venue
Wang et al.
-
Lift: Leveraging Human Feedback For Text-to-video Model Alignment
(2024)
• No Venue
Wang et al.
-
Instantid: Zero-shot Identity-preserving Generation In Seconds
(2024)
• No Venue
Wang et al.
-
Generative Inbetweening: Adapting Image-to-video Models For Keyframe Interpolation
(2024)
• No Venue
Wang et al.
-
Bitnet A4.8: 4-bit Activations For 1-bit Llms
(2024)
• No Venue
Hongyu Wang, Shuming Ma, Furu Wei
-
Docgraphlm: Documental Graph Language Model For Information Extraction
(2024)
• No Venue
Wang et al.
-
How Do Your Code Llms Perform? Empowering Code Instruction Tuning With High-quality Data
(2024)
• No Venue
Wang et al.
-
Let The Expert Stick To His Last: Expert-specialized Fine-tuning For Sparse Architectural Large Language Models
(2024)
• No Venue
Wang et al.
-
Knowledge Mechanisms In Large Language Models: A Survey And Perspective
(2024)
• No Venue
Wang et al.
-
Large Action Models: From Inception To Implementation
(2024)
• No Venue
Wang et al.
-
Offline Reinforcement Learning For LLM Multi-step Reasoning
(2024)
• No Venue
Wang et al.
-
Mllm-as-a-judge For Image Safety Without Human Labeling
(2024)
• No Venue
Wang et al.
-
The Mamba In The Llama: Distilling And Accelerating Hybrid Models
(2024)
• No Venue
Wang et al.
-
Loong: Generating Minute-level Long Videos With Autoregressive Language Models
(2024)
• No Venue
Wang et al.
-
Llama-mesh: Unifying 3D Mesh Generation With Language Models
(2024)
• No Venue
Wang et al.
-
Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture
(2024)
• No Venue
Wang et al.
-
Magicvideo-v2: Multi-stage High-aesthetic Video Generation
(2024)
• No Venue
Wang et al.
-
Mixture-of-agents Enhances Large Language Model Capabilities
(2024)
• No Venue
Wang et al.
-
Mambabyte: Token-free Selective State Space Model
(2024)
• No Venue
Wang et al.
-
Mdpo: Conditional Preference Optimization For Multimodal Large Language Models
(2024)
• No Venue
Wang et al.
-
Neural Network Diffusion
(2024)
• No Venue
Wang et al.
-
Multimodal Needle In A Haystack: Benchmarking Long-context Capability Of Multimodal Large Language Models
(2024)
• No Venue
Wang et al.
-
Mmlu-pro: A More Robust And Challenging Multi-task Language Understanding Benchmark
(2024)
• No Venue
Wang et al.
-
Mobile-agent-v2: Mobile Device Operation Assistant With Effective Navigation Via Multi-agent Collaboration
(2024)
• No Venue
Wang et al.
-
Needle In A Multimodal Haystack
(2024)
• No Venue
Wang et al.
-
Weaver: Foundation Models For Creative Writing
(2024)
• No Venue
Wang et al.
-
Qwen2-vl: Enhancing Vision-language Model's Perception Of The World At Any Resolution
(2024)
• No Venue
Wang et al.
-
Videoagent: Long-form Video Understanding With Large Language Model As Agent
(2024)
• No Venue
Wang et al.
-
Yolov9: Learning What You Want To Learn Using Programmable Gradient Information
(2024)
• No Venue
Chien-Yao Wang, I-Hau Yeh, Hong-Yuan Mark Liao
-
Paint By Inpaint: Learning To Add Image Objects By Removing Them First
(2024)
• No Venue
Wasserman et al.
-
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder For Fast, Memory Efficient, And Long Context Finetuning And Inference
(2024)
• No Venue
Warner et al.
-
Omniedit: Building Image Editing Generalist Models Through Specialist Supervision
(2024)
• No Venue
Wei et al.
-
Small Language Model Meets With Reinforced Vision Vocabulary
(2024)
• No Venue
Wei et al.
-
Worldcuisines: A Massive-scale Benchmark For Multilingual And Multicultural Visual Question Answering On Global Cuisines
(2024)
• No Venue
Winata et al.
-
Cut Your Losses In Large-vocabulary Language Models
(2024)
• No Venue
Wijmans et al.
-
Janus: Decoupling Visual Encoding For Unified Multimodal Understanding And Generation
(2024)
• No Venue
Wu et al.
-
Beyond Examples: High-level Automated Reasoning Paradigm In In-context Learning Via MCTS
(2024)
• No Venue
Wu et al.
-
Diffsensei: Bridging Multi-modal Llms And Diffusion Models For Customized Manga Generation
(2024)
• No Venue
Wu et al.
-
Tablebench: A Comprehensive And Complex Benchmark For Table Question Answering
(2024)
• No Venue
Wu et al.
-
Llama Pro: Progressive Llama With Block Expansion
(2024)
• No Venue
Wu et al.
-
Multi-head Mixture-of-experts
(2024)
• No Venue
Wu et al.
-
Mini-omni: Language Models Can Hear, Talk While Thinking In Streaming
(2024)
• No Venue
Zhifei Xie, Changqiao Wu
-
Llava-critic: Learning To Evaluate Multimodal Models
(2024)
• No Venue
Xiong et al.
-
MMIE: Massive Multimodal Interleaved Comprehension Benchmark For Large Vision-language Models
(2024)
• No Venue
Xia et al.
-
Agentless: Demystifying Llm-based Software Engineering Agents
(2024)
• No Venue
Xia et al.
-
Medtrinity-25m: A Large-scale Multimodal Dataset With Multigranular Annotations For Medicine
(2024)
• No Venue
Xie et al.
-
Configurable Foundation Models: Building Llms From A Modular Perspective
(2024)
• No Venue
Xiao et al.
-
Benchmarking Retrieval-augmented Generation For Medicine
(2024)
• Findings of the Association for Computational Linguistics ACL 2024
• 56 citations
Xiong et al.
-
Travelplanner: A Benchmark For Real-world Planning With Language Agents
(2024)
• No Venue
Xie et al.
-
A Preliminary Study Of O1 In Medicine: Are We Closer To An AI Doctor?
(2024)
• No Venue
Xie et al.
-
Open-finllms: Open Multimodal Large Language Models For Financial Applications
(2024)
• No Venue
Xie et al.
-
Osworld: Benchmarking Multimodal Agents For Open-ended Tasks In Real Computer Environments
(2024)
• No Venue
Xie et al.
-
Show-o: One Single Transformer To Unify Multimodal Understanding And Generation
(2024)
• No Venue
Xie et al.
-
Deepseek-prover-v1.5: Harnessing Proof Assistant Feedback For Reinforcement Learning And Monte-carlo Tree Search
(2024)
• No Venue
Xin et al.
-
Flowmind: Automatic Workflow Generation With Llms
(2024)
• No Venue
Zeng et al.
-
B-star: Monitoring And Balancing Exploration And Exploitation In Self-taught Reasoners
(2024)
• No Venue
Zeng et al.
-
Anygpt: Unified Multimodal LLM With Discrete Sequence Modeling
(2024)
• No Venue
Zhan et al.
-
An Image Is Worth 32 Tokens For Reconstruction And Generation
(2024)
• No Venue
Yu et al.
-
Scaling Up To Excellence: Practicing Model Scaling For Photo-realistic Image Restoration In The Wild
(2024)
• No Venue
Yu et al.
-
Free Process Rewards Without Process Labels
(2024)
• No Venue
Yuan et al.
-
Chatmusician: Understanding And Generating Music Intrinsically With LLM
(2024)
• No Venue
Yuan et al.
-
Videorefer Suite: Advancing Spatial-temporal Object Understanding With Video LLM
(2024)
• No Venue
Yuan et al.
-
Identity-preserving Text-to-video Generation By Frequency Decomposition
(2024)
• No Venue
Yuan et al.
-
Mora: Enabling Generalist Video Generation Via A Multi-agent Framework
(2024)
• No Venue
Yuan et al.
-
Mmmu-pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
(2024)
• No Venue
Yue et al.
-
Textgrad: Automatic "differentiation" Via Text
(2024)
• No Venue
Yuksekgonul et al.
-
Swe-bench-java: A Github Issue Resolving Benchmark For Java
(2024)
• No Venue
Zan et al.
-
Quiet-star: Language Models Can Teach Themselves To Think Before Speaking
(2024)
• No Venue
Zelikman et al.
-
3dgraphllm: Combining Semantic Graphs And Large Language Models For 3D Scene Understanding
(2024)
• No Venue
Tatiana Zemskova, Dmitry Yudin
-
Stronger Models Are NOT Stronger Teachers For Instruction Tuning
(2024)
• No Venue
Xu et al.
-
Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing
(2024)
• No Venue
Xu et al.
-
Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation
(2024)
• No Venue
Xu et al.
-
Agenttrek: Agent Trajectory Synthesis Via Guiding Replay With Web Tutorials
(2024)
• No Venue
Xu et al.
-
Androidlab: Training And Systematic Benchmarking Of Android Autonomous Agents
(2024)
• No Venue
Xu et al.
-
Hallucination Is Inevitable: An Innate Limitation Of Large Language Models
(2024)
• Arxiv
• 79 citations
Ziwei Xu, Sanjay Jain, Mohan Kankanhalli
-
Slowfast-llava: A Strong Training-free Baseline For Video Large Language Models
(2024)
• No Venue
Xu et al.
-
No More Adam: Learning Rate Scaling At Initialization Is All You Need
(2024)
• No Venue
Xu et al.
-
Pllava : Parameter-free Llava Extension From Images To Videos For Video Dense Captioning
(2024)
• No Venue
Xu et al.
-
Theagentcompany: Benchmarking LLM Agents On Consequential Real World Tasks
(2024)
• No Venue
Xu et al.
-
Powerinfer-2: Fast Large Language Model Inference On A Smartphone
(2024)
• No Venue
Xue et al.
-
Longvila: Scaling Long-context Visual Language Models For Long Videos
(2024)
• No Venue
Xue et al.
-
Xgen-mm (BLIP-3): A Family Of Open Large Multimodal Models
(2024)
• No Venue
Xue et al.
-
To Believe Or Not To Believe Your LLM
(2024)
• No Venue
Yadkori et al.
-
Promises And Challenges Of Generative Artificial Intelligence For Human Learning
(2024)
• Nature Human Behaviour
• 86 citations
Yan et al.
-
An Object Is Worth 64x64 Pixels: Generating 3D Object Via Image Diffusion
(2024)
• No Venue
Yan et al.
-
Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms
(2024)
• No Venue
Yang et al.
-
Law Of Vision Representation In Mllms
(2024)
• No Venue
Yang et al.
-
CRAG -- Comprehensive RAG Benchmark
(2024)
• No Venue
Yang et al.
-
1.58-bit FLUX
(2024)
• No Venue
Yang et al.
-
Cogvideox: Text-to-video Diffusion Models With An Expert Transformer
(2024)
• No Venue
Yang et al.
-
3D-GRAND: A Million-scale Dataset For 3d-llms With Better Grounding And Less Hallucination
(2024)
• No Venue
Yang et al.
-
Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models
(2024)
• No Venue
Yang et al.
-
Kolmogorov-arnold Transformer
(2024)
• No Venue
Xingyi Yang, Xinchao Wang
-
Evaluating And Aligning Codellms On Human Preference
(2024)
• No Venue
Yang et al.
-
Denoising Vision Transformers
(2024)
• No Venue
Yang et al.
-
Do Large Language Models Latently Perform Multi-hop Reasoning?
(2024)
• No Venue
Yang et al.
-
Fuzzcoder: Byte-level Fuzzing Test Via Large Language Model
(2024)
• No Venue
Yang et al.
-
Visionzip: Longer Is Better But Not Necessary In Vision Language Models
(2024)
• No Venue
Yang et al.
-
Qwen2 Technical Report
(2024)
• No Venue
Yang et al.
-
UCFE: A User-centric Financial Expertise Benchmark For Large Language Models
(2024)
• No Venue
Yang et al.
-
Vript: A Video Is Worth Thousands Of Words
(2024)
• No Venue
Yang et al.
-
More Agents Is All You Need
(2024)
• No Venue
Li et al.
-
Dreamreward: Text-to-3d Generation With Human Preference
(2024)
• No Venue
Ye et al.
-
Mulberry: Empowering MLLM With O1-like Reasoning And Reflection Via Collective Monte Carlo Tree Search
(2024)
• No Venue
Yao et al.
-
Minicpm-v: A GPT-4V Level MLLM On Your Phone
(2024)
• No Venue
Yao et al.
-
Differential Transformer
(2024)
• No Venue
Ye et al.
-
Mplug-owl3: Towards Long Image-sequence Understanding In Multi-modal Large Language Models
(2024)
• No Venue
Ye et al.
-
Flashspeech: Efficient Zero-shot Speech Synthesis
(2024)
• No Venue
Ye et al.
-
LOKI: A Comprehensive Synthetic Data Detection Benchmark Using Large Multimodal Models
(2024)
• No Venue
Ye et al.
-
Voco-llama: Towards Vision Compression With Large Language Models
(2024)
• No Venue
Ye et al.
-
Hermes 3 Technical Report
(2024)
• No Venue
Ryan Teknium, Jeffrey Quesnelle, Chen Guang
-
Imagen 3
(2024)
• No Venue
Imagen-Team-Google et al.
-
Deepseek-coder-v2: Breaking The Barrier Of Closed-source Models In Code Intelligence
(2024)
• No Venue
Deepseek-Ai et al.
-
Gpt-4o System Card
(2024)
• No Venue
Openai et al.
-
Star Attention: Efficient LLM Inference Over Long Sequences
(2024)
• No Venue
Shantanu Acharya, Fei Jia, Boris Ginsburg
-
Openai O1 System Card
(2024)
• No Venue
Openai et al.
-
Qwen2.5 Technical Report
(2024)
• No Venue
Qwen et al.
-
Yi: Open Foundation Models By 01.AI
(2024)
• No Venue
Ai et al.
-
Phi-4 Technical Report
(2024)
• No Venue
Abdin et al.
-
Alignment Studio: Aligning Large Language Models To Particular Contextual Regulations
(2024)
• No Venue
Achintalwar et al.
-
Evolutionary Optimization Of Model Merging Recipes
(2024)
• No Venue
Akiba et al.
-
Linear Transformers With Learnable Kernel Functions Are Better In-context Models
(2024)
• No Venue
Aksenov et al.
-
Make Your LLM Fully Utilize The Context
(2024)
• No Venue
An et al.
-
Seed-tts: A Family Of High-quality Versatile Speech Generation Models
(2024)
• No Venue
Anastassiou et al.
-
Homogenization Effects Of Large Language Models On Human Creative Ideation
(2024)
• Creativity and Cognition
• 57 citations
Barrett R. Anderson, Jash Hemant Shah, Max Kreminski
-
Chronos: Learning The Language Of Time Series
(2024)
• No Venue
Ansari et al.
-
PALP: Prompt Aligned Personalization Of Text-to-image Models
(2024)
• No Venue
Arar et al.
-
Scenescript: Reconstructing Scenes With An Autoregressive Structured Language Model
(2024)
• No Venue
Avetisyan et al.
-
To Code, Or Not To Code? Exploring Impact Of Code In Pre-training
(2024)
• No Venue
Aryabumi et al.
-
Slicegpt: Compress Large Language Models By Deleting Rows And Columns
(2024)
• No Venue
Ashkboos et al.
-
Screenai: A Vision-language Model For UI And Infographics Understanding
(2024)
• No Venue
Baechler et al.
-
Revisiting In-context Learning With Long Context Language Models
(2024)
• No Venue
Baek et al.
-
Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms
(2024)
• No Venue
Bai et al.
-
From Generalist To Specialist: Adapting Vision Language Models Via Task-specific Visual Instruction Tuning
(2024)
• No Venue
Bai et al.
-
Longbench V2: Towards Deeper Understanding And Reasoning On Realistic Long-context Multitasks
(2024)
• No Venue
Bai et al.
-
Meissonic: Revitalizing Masked Generative Transformers For Efficient High-resolution Text-to-image Synthesis
(2024)
• No Venue
Bai et al.
-
Seed-music: A Unified Framework For High Quality And Controlled Music Generation
(2024)
• No Venue
Bai et al.
-
LLM Augmented Llms: Expanding Capabilities Through Composition
(2024)
• No Venue
Bansal et al.
-
Lumiere: A Space-time Diffusion Model For Video Generation
(2024)
• No Venue
Bar-Tal et al.
-
SUTRA: Scalable Multilingual Language Model Architecture
(2024)
• No Venue
Bendale et al.
-
Speculative Streaming: Fast LLM Inference Without Auxiliary Models
(2024)
• No Venue
Bhendawade et al.
-
Paligemma: A Versatile 3B VLM For Transfer
(2024)
• No Venue
Beyer et al.
-
INDUS: Effective And Efficient Language Models For Scientific Applications
(2024)
• No Venue
Bhattacharjee et al.
-
Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models
(2024)
• No Venue
Bhatia et al.
-
Lora Learns Less And Forgets Less
(2024)
• No Venue
Biderman et al.
-
Make It Count: Text-to-image Generation With An Accurate Number Of Objects
(2024)
• No Venue
Binyamin et al.
-
Intelligent Clinical Documentation: Harnessing Generative AI For Patient-centric Clinical Note Generation
(2024)
• International Journal of Innovative Science and Research Technology (IJISRT)
• 988 citations
Anjanava Biswas, Wrick Talukdar
-
Windows Agent Arena: Evaluating Multi-modal OS Agents At Scale
(2024)
• No Venue
Bonatti et al.
-
Transformers Meet Neural Algorithmic Reasoners
(2024)
• No Venue
Bounsi et al.
-
Recurrentgemma: Moving Past Transformers For Efficient Open Language Models
(2024)
• No Venue
Botev et al.
-
Reducing Transformer Key-value Cache Size With Cross-layer Attention
(2024)
• No Venue
Brandon et al.
-
Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads
(2024)
• No Venue
Cai et al.
-
Roadmap Towards Superhuman Speech Understanding Using Large Language Models
(2024)
• No Venue
Bu et al.
-
Internlm2 Technical Report
(2024)
• No Venue
Cai et al.
-
On The Compositional Generalization Of Multimodal Llms For Medical Imaging
(2024)
• No Venue
Cai et al.
-
Uni-smart: Universal Science Multimodal Analysis And Research Transformer
(2024)
• No Venue
Cai et al.
-
Stealing Part Of A Production Language Model
(2024)
• No Venue
Carlini et al.
-
Mceval: Massively Multilingual Code Evaluation
(2024)
• No Venue
Chai et al.
-
Web Agents With World Models: Learning And Leveraging Environment Dynamics In Web Navigation
(2024)
• No Venue
Chae et al.
-
Scaling Synthetic Data Creation With 1,000,000,000 Personas
(2024)
• No Venue
Chan et al.
-
Getting It Right: Improving Spatial Consistency In Text-to-image Models
(2024)
• No Venue
Chatterjee et al.
-
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
(2024)
• No Venue
Chang et al.
-
Do NOT Think That Much For 2+3=? On The Overthinking Of O1-like Llms
(2024)
• No Venue
Chen et al.
-
Diffusion Forcing: Next-token Prediction Meets Full-sequence Diffusion
(2024)
• No Venue
Chen et al.
-
Contrastive Localized Language-image Pre-training
(2024)
• No Venue
Chen et al.
-
Bootstrapping Language Models With DPO Implicit Rewards
(2024)
• No Venue
Chen et al.
-
Agentpoison: Red-teaming LLM Agents Via Poisoning Memory Or Knowledge Bases
(2024)
• No Venue
Chen et al.
-
Cod, Towards An Interpretable Medical Agent Using Chain Of Diagnosis
(2024)
• No Venue
Chen et al.
-
Sharegpt4video: Improving Video Understanding And Generation With Better Captions
(2024)
• No Venue
Chen et al.
-
Mega-bench: Scaling Multimodal Evaluation To Over 500 Real-world Tasks
(2024)
• No Venue
Chen et al.
-
Florence-vl: Enhancing Vision-language Models With Generative Vision Encoder And Depth-breadth Fusion
(2024)
• No Venue
Chen et al.
-
Expanding Performance Boundaries Of Open-source Multimodal Models With Model, Data, And Test-time Scaling
(2024)
• No Venue
Chen et al.
-
Dolphin: Long Context As A New Modality For Energy-efficient On-device Language Models
(2024)
• No Venue
Chen et al.
-
EVLM: An Efficient Vision-language Model For Visual Understanding
(2024)
• No Venue
Chen et al.
-
F5-TTS: A Fairytaler That Fakes Fluent And Faithful Speech With Flow Matching
(2024)
• No Venue
Chen et al.
-
Language Models Are Hidden Reasoners: Unlocking Latent Reasoning Capabilities Via Self-rewarding
(2024)
• No Venue
Chen et al.
-
Gmai-mmbench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI
(2024)
• No Venue
Chen et al.
-
How Far Are We To GPT-4V? Closing The Gap To Commercial Multimodal Models With Open-source Suites
(2024)
• No Venue
Chen et al.
-
Self-play Fine-tuning Converts Weak Language Models To Strong Language Models
(2024)
• No Venue
Chen et al.
-
Panda-70m: Captioning 70M Videos With Multiple Cross-modality Teachers
(2024)
• No Venue
Chen et al.
-
Meshanything V2: Artist-created Mesh Generation With Adjacent Mesh Tokenization
(2024)
• No Venue
Chen et al.
-
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
(2024)
• No Venue
Chen et al.
-
Pixart-δ: Fast And Controllable Image Generation With Latent Consistency Models
(2024)
• No Venue
Chen et al.
-
Visionts: Visual Masked Autoencoders Are Free-lunch Zero-shot Time Series Forecasters
(2024)
• No Venue
Chen et al.
-
Spatialvlm: Endowing Vision-language Models With Spatial Reasoning Capabilities
(2024)
• No Venue
Chen et al.
-
V3D: Video Diffusion Models Are Effective 3D Generators
(2024)
• No Venue
Chen et al.
-
Chatbot Arena: An Open Platform For Evaluating Llms By Human Preference
(2024)
• No Venue
Chiang et al.
-
Videollama 2: Advancing Spatial-temporal Modeling And Audio Understanding In Video-llms
(2024)
• No Venue
Cheng et al.
-
Videgothink: Assessing Egocentric Video Understanding Capabilities For Embodied AI
(2024)
• No Venue
Cheng et al.
-
Compressed Chain Of Thought: Efficient Reasoning Through Dense Representations
(2024)
• No Venue
Jeffrey Cheng, Benjamin van Durme
-
Breaking The Memory Barrier: Near Infinite Batch Size Scaling For Contrastive Loss
(2024)
• No Venue
Cheng et al.
-
Instruction Pre-training: Language Models Are Supervised Multitask Learners
(2024)
• No Venue
Cheng et al.
-
CORAL: Benchmarking Multi-turn Conversational Retrieval-augmentation Generation
(2024)
• No Venue
Cheng et al.
-
On Domain-specific Post-training For Multimodal Large Language Models
(2024)
• No Venue
Cheng et al.
-
Yolo-world: Real-time Open-vocabulary Object Detection
(2024)
• No Venue
Cheng et al.
-
M-longdoc: A Benchmark For Multimodal Super-long Document Understanding And A Retrieval-aware Tuning Framework
(2024)
• No Venue
Chia et al.
-
Transformer Explainer: Interactive Learning Of Text-generative Models
(2024)
• No Venue
Cho et al.
-
M3docrag: Multi-modal Retrieval Is What You Need For Multi-page Multi-document Understanding
(2024)
• No Venue
Cho et al.
-
Qwen2-audio Technical Report
(2024)
• No Venue
Chu et al.
-
Med42-v2: A Suite Of Clinical Llms
(2024)
• No Venue
Christophe et al.
-
Visionllama: A Unified Llama Interface For Vision Tasks
(2024)
• No Venue
Chu et al.
-
Toto: Time Series Optimized Transformer For Observability
(2024)
• No Venue
Cohen et al.
-
VLOGGER: Multimodal Diffusion For Embodied Avatar Synthesis
(2024)
• No Venue
Corona et al.
-
Large Legal Fictions: Profiling Legal Hallucinations In Large Language Models
(2024)
• Journal of Legal Analysis
• 66 citations
Dahl et al.
-
The Power Of Noise: Redefining Retrieval For RAG Systems
(2024)
• Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 65 citations
Cuconasu et al.
-
NVLM: Open Frontier-class Multimodal Llms
(2024)
• No Venue
Dai et al.
-
Deepseekmoe: Towards Ultimate Expert Specialization In Mixture-of-experts Language Models
(2024)
• No Venue
Dai et al.
-
RACER: Rich Language-guided Failure Recovery Policies For Imitation Learning
(2024)
• No Venue
Dai et al.
-
Swiftbrush V2: Make Your One-step Diffusion Model Better Than Its Teacher
(2024)
• No Venue
Dao et al.
-
Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality
(2024)
• No Venue
Tri Dao, Albert Gu
-
Larimar: Large Language Models With Episodic Memory Control
(2024)
• No Venue
Das et al.
-
Griffin: Mixing Gated Linear Recurrences With Local Attention For Efficient Language Models
(2024)
• No Venue
de et al.
-
A Silver Bullet Or A Compromise For Full Attention? A Comprehensive Study Of Gist Token-based Context Compression
(2024)
• No Venue
Deng et al.
-
Coconut: Modernizing COCO Segmentation
(2024)
• No Venue
Deng et al.
-
Unveiling Encoder-free Vision-language Models
(2024)
• No Venue
Diao et al.
-
Longrope: Extending LLM Context Window Beyond 2 Million Tokens
(2024)
• No Venue
Ding et al.
-
Not All Language Model Features Are Linear
(2024)
• No Venue
Engels et al.
-
Toward General Instruction-following Alignment For Retrieval-augmented Generation
(2024)
• No Venue
Dong et al.
-
Sam2long: Enhancing SAM 2 For Long Video Segmentation With A Training-free Memory Tree
(2024)
• No Venue
Ding et al.
-
RLHF Workflow: From Reward Modeling To Online RLHF
(2024)
• No Venue
Dong et al.
-
Progressive Multimodal Reasoning Via Active Retrieval
(2024)
• No Venue
Dong et al.
-
Baichuanseed: Sharing The Potential Of Extensive Data Collection And Deduplication By Introducing A Competitive Large Language Model Baseline
(2024)
• No Venue
Dong et al.
-
CLEAR: Character Unlearning In Textual And Visual Modalities
(2024)
• No Venue
Dontsov et al.
-
Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback
(2024)
• No Venue
Dou et al.
-
The Llama 3 Herd Of Models
(2024)
• No Venue
Dubey et al.
-
An Interactive Agent Foundation Model
(2024)
• No Venue
Durante et al.
-
Layerskip: Enabling Early Exit Inference And Self-speculative Decoding
(2024)
• No Venue
Elhoushi et al.
-
Scalable Pre-training Of Large Autoregressive Image Models
(2024)
• No Venue
El-Nouby et al.
-
Scaling Rectified Flow Transformers For High-resolution Image Synthesis
(2024)
• No Venue
Esser et al.
-
Vitar: Vision Transformer With Any Resolution
(2024)
• No Venue
Fan et al.
-
Fluid: Scaling Autoregressive Text-to-image Generative Models With Continuous Tokens
(2024)
• No Venue
Fan et al.
-
A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models
(2024)
• Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
• 110 citations
Fan et al.
-
Mmbench-video: A Long-form Multi-shot Benchmark For Holistic Video Understanding
(2024)
• No Venue
Fang et al.
-
Llama-omni: Seamless Speech Interaction With Large Language Models
(2024)
• No Venue
Fang et al.
-
VILA^2: VILA Augmented VILA
(2024)
• No Venue
Fang et al.
-
Colpali: Efficient Document Retrieval With Vision Language Models
(2024)
• No Venue
Faysse et al.
-
FLUX That Plays Music
(2024)
• No Venue
Fei et al.
-
Efficiently Serving LLM Reasoning Programs With Certaindex
(2024)
• No Venue
Fu et al.
-
Natural Language Reinforcement Learning
(2024)
• No Venue
Feng et al.
-
Nnsight And NDIF: Democratizing Access To Foundation Model Internals
(2024)
• No Venue
Fiotto-Kaufman et al.
-
RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation
(2024)
• No Venue
Fleischer et al.
-
Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference
(2024)
• No Venue
Fu et al.
-
VITA: Towards Open-source Interactive Omni Multimodal LLM
(2024)
• No Venue
Fu et al.
-
Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools
(2024)
• No Venue
Glm et al.
-
Seerattention: Learning Intrinsic Sparse Attention In Your Llms
(2024)
• No Venue
Gao et al.
-
Omni-math: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
(2024)
• No Venue
Gao et al.
-
Towards A Unified View Of Preference Learning For Large Language Models: A Survey
(2024)
• No Venue
Gao et al.
-
Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context
(2024)
• Arxiv
• 123 citations
Team et al.
-
Kvasir-vqa: A Text-image Pair GI Tract Dataset
(2024)
• No Venue
Gautam et al.
-
Are We Done With MMLU?
(2024)
• No Venue
Gema et al.
-
Gemma: Open Models Based On Gemini Research And Technology
(2024)
• Arxiv
• 109 citations
Team et al.
-
AI And Memory Wall
(2024)
• IEEE Micro
• 58 citations
Gholami et al.
-
Better & Faster Large Language Models Via Multi-token Prediction
(2024)
• No Venue
Gloeckle et al.
-
Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression
(2024)
• No Venue
Goldstein et al.
-
Learn Your Reference Model For Real Good Alignment
(2024)
• No Venue
Gorbatovski et al.
-
Specialized Language Models With Cheap Inference From Limited Domain Data
(2024)
• No Venue
Grangier et al.
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
(2024)
• No Venue
Grosnit et al.
-
Olmo: Accelerating The Science Of Language Models
(2024)
• No Venue
Groeneveld et al.
-
The Unreasonable Ineffectiveness Of The Deeper Layers
(2024)
• No Venue
Gromov et al.
-
Roictrl: Boosting Instance Control For Visual Generation
(2024)
• No Venue
Gu et al.
-
Direct Language Model Alignment From Online AI Feedback
(2024)
• No Venue
Guo et al.
-
Deepseek-coder: When The Large Language Model Meets Programming -- The Rise Of Code Intelligence
(2024)
• No Venue
Guo et al.
-
Mammoth-vl: Eliciting Multimodal Reasoning With Instruction Tuning At Scale
(2024)
• No Venue
Guo et al.
-
Small Language Model Meets With Reinforced Vision Vocabulary
(2024)
• No Venue
Wei et al.
-
Pingpong: A Benchmark For Role-playing Language Models With User Emulation And Multi-model Evaluation
(2024)
• No Venue
Ilya Gusev
-
Ltx-video: Realtime Video Latent Diffusion
(2024)
• No Venue
Hacohen et al.
-
Model Merging And Safety Alignment: One Bad Model Spoils The Bunch
(2024)
• No Venue
Hammoud et al.
-
JPEG-LM: Llms As Image Generators With Canonical Codec Representations
(2024)
• No Venue
Han et al.
-
Infimm-webmath-40b: Advancing Multimodal Pre-training For Enhanced Mathematical Reasoning
(2024)
• No Venue
Han et al.
-
Token-budget-aware LLM Reasoning
(2024)
• No Venue
Han et al.
-
Training Large Language Models To Reason In A Continuous Latent Space
(2024)
• No Venue
Hao et al.
-
Spotting Llms With Binoculars: Zero-shot Detection Of Machine-generated Text
(2024)
• No Venue
Hans et al.
-
Mambavision: A Hybrid Mamba-transformer Vision Backbone
(2024)
• No Venue
Ali Hatamizadeh, Jan Kautz
-
Teaching Large Language Models To Reason With Reinforcement Learning
(2024)
• No Venue
Havrilla et al.
-
What Matters In Transformers? Not All Attention Is Needed
(2024)
• No Venue
He et al.
-
MLP-KAN: Unifying Deep Representation And Function Learning
(2024)
• No Venue
He et al.
-
Chinese Simpleqa: A Chinese Factuality Evaluation For Large Language Models
(2024)
• No Venue
He et al.
-
Inference Performance Optimization For Large Language Models On Cpus
(2024)
• No Venue
He et al.
-
Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models
(2024)
• No Venue
He et al.
-
Block Transformer: Global-to-local Language Modeling For Fast Inference
(2024)
• No Venue
Ho et al.
-
Cogvlm2: Visual Language Models For Image And Video Understanding
(2024)
• No Venue
Hong et al.
-
Not All LLM Reasoners Are Created Equal
(2024)
• No Venue
Hosseini et al.
-
RULER: What's The Real Context Size Of Your Long-context Language Models?
(2024)
• No Venue
Hsieh et al.
-
Acdit: Interpolating Autoregressive Conditional Modeling And Diffusion Transformer
(2024)
• No Venue
Hu et al.
-
Mplug-docowl 1.5: Unified Structure Learning For Ocr-free Document Understanding
(2024)
• No Venue
Hu et al.
-
Instruct-imagen: Image Generation With Multi-modal Instruction
(2024)
• No Venue
Hu et al.
-
Automated Design Of Agentic Systems
(2024)
• No Venue
Shengran Hu, Cong Lu, Jeff Clune
-
ELLA: Equip Diffusion Models With LLM For Enhanced Semantic Alignment
(2024)
• No Venue
Hu et al.
-
Longrecipe: Recipe For Efficient Long Context Generalization In Large Languge Models
(2024)
• No Venue
Hu et al.
-
Yulan-mini: An Open Data-efficient Language Model
(2024)
• No Venue
Hu et al.
-
Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework
(2024)
• No Venue
Hu et al.
-
Pokéllmon: A Human-parity Agent For Pokémon Battles With Large Language Models
(2024)
• No Venue
Sihao Hu, Tiansheng Huang, Ling Liu
-
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
(2024)
• No Venue
Huang et al.
-
Fourier Position Embedding: Enhancing Attention's Periodic Extension For Length Generalization
(2024)
• No Venue
Hua et al.
-
How Good Are Low-bit Quantized Llama3 Models? An Empirical Study
(2024)
• No Venue
Huang et al.
-
Billm: Pushing The Limit Of Post-training Quantization For Llms
(2024)
• No Venue
Huang et al.
-
Autocrawler: A Progressive Understanding Web Agent For Web Crawler Generation
(2024)
• No Venue
Huang et al.
-
Opencoder: The Open Cookbook For Top-tier Code Large Language Models
(2024)
• No Venue
Huang et al.
-
Mmevalpro: Calibrating Multimodal Benchmarks Towards Trustworthy And Efficient Evaluation
(2024)
• No Venue
Huang et al.
-
O1 Replication Journey -- Part 2: Surpassing O1-preview Through Simple Distillation, Big Progress Or Bitter Lesson?
(2024)
• No Venue
Huang et al.
-
Sleeper Agents: Training Deceptive Llms That Persist Through Safety Training
(2024)
• No Venue
Hubinger et al.
-
Qwen2.5-coder Technical Report
(2024)
• No Venue
Hui et al.
-
Transformerfam: Feedback Attention Is Working Memory
(2024)
• No Venue
Hwang et al.
-
Simple And Scalable Strategies To Continually Pre-train Large Language Models
(2024)
• No Venue
Ibrahim et al.
-
Wavtokenizer: An Efficient Acoustic Discrete Codec Tokenizer For Audio Language Modeling
(2024)
• No Venue
Ji et al.
-
E5-V: Universal Embeddings With Multimodal Large Language Models
(2024)
• No Venue
Jiang et al.
-
Mmsearch: Benchmarking The Potential Of Large Models As Multi-modal Search Engines
(2024)
• No Venue
Jiang et al.
-
Megascale: Scaling Large Language Model Training To More Than 10,000 Gpus
(2024)
• No Venue
Jiang et al.
-
Longrag: Enhancing Retrieval-augmented Generation With Long-context Llms
(2024)
• No Venue
Ziyan Jiang, Xueguang Ma, Wenhu Chen
-
Many-shot In-context Learning In Multimodal Foundation Models
(2024)
• No Venue
Jiang et al.
-
Mixtral Of Experts
(2024)
• No Venue
Jiang et al.
-
Mora: High-rank Updating For Parameter-efficient Fine-tuning
(2024)
• No Venue
Jiang et al.
-
RATIONALYST: Pre-training Process-supervision For Improving Reasoning
(2024)
• No Venue
Jiang et al.
-
Pyramidal Flow Matching For Efficient Video Generative Modeling
(2024)
• No Venue
Jin et al.
-
Dsbench: How Far Are Data Science Agents To Becoming Data Science Experts?
(2024)
• No Venue
Jing et al.
-
Naturalspeech 3: Zero-shot Speech Synthesis With Factorized Codec And Diffusion Models
(2024)
• No Venue
Ju et al.
-
Pegasus-v1 Technical Report
(2024)
• No Venue
Jung et al.
-
Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs
(2024)
• Proceedings of the CHI Conference on Human Factors in Computing Systems
• 80 citations
Kazemitabaar et al.
-
MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications
(2024)
• No Venue
Kanithi et al.
-
Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models
(2024)
• No Venue
Kaushal et al.
-
Video Depth Without Video Models
(2024)
• No Venue
Ke et al.
-
Law Of Vision Representation In Mllms
(2024)
• No Venue
Yang et al.
-
Openvla: An Open-source Vision-language-action Model
(2024)
• No Venue
Kim et al.
-
FLOAT: Generative Motion Latent Flow Matching For Audio-driven Talking Portrait
(2024)
• No Venue
Taekyung Ki, Dongchan Min, Gyoungsu Chae
-
Husky: A Unified, Open-source Language Agent For Multi-step Reasoning
(2024)
• No Venue
Kim et al.
-
Fifo-diffusion: Generating Infinite Videos From Text Without Training
(2024)
• No Venue
Kim et al.
-
Evaluating Language Models As Synthetic Data Generators
(2024)
• No Venue
Kim et al.
-
THEANINE: Revisiting Memory Management In Long-term Conversations With Timeline-augmented Response Generation
(2024)
• No Venue
Kim et al.
-
Prometheus 2: An Open Source Language Model Specialized In Evaluating Other Language Models
(2024)
• No Venue
Kim et al.
-
Sdpo: Don't Use Your Data All At Once
(2024)
• No Venue
Kim et al.
-
Training Language Models To Self-correct Via Reinforcement Learning
(2024)
• No Venue
Kumar et al.
-
Can Large Language Models Explore In-context?
(2024)
• No Venue
Krishnamurthy et al.
-
Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack
(2024)
• No Venue
Kuratov et al.
-
In Search Of Needles In A 10M Haystack: Recurrent Memory Finds What Llms Miss
(2024)
• No Venue
Kuratov et al.
-
"give Me BF16 Or Give Me Death"? Accuracy-performance Trade-offs In LLM Quantization
(2024)
• No Venue
Kurtic et al.
-
Summary Of A Haystack: A Challenge To Long-context Llms And RAG Systems
(2024)
• No Venue
Laban et al.
-
Biomistral: A Collection Of Open-source Pretrained Large Language Models For Medical Domains
(2024)
• Findings of the Association for Computational Linguistics ACL 2024
• 68 citations
Labrak et al.
-
Revisit Large-scale Image-caption Data In Pre-training Multimodal Foundation Models
(2024)
• No Venue
Lai et al.
-
TÜLU 3: Pushing Frontiers In Open Language Model Post-training
(2024)
• No Venue
Lambert et al.
-
What Matters When Building Vision-language Models?
(2024)
• No Venue
Laurençon et al.
-
Building And Better Understanding Vision-language Models: Insights And Future Directions
(2024)
• No Venue
Laurençon et al.
-
Unlocking The Conversion Of Web Screenshots Into HTML Code With The Websight Dataset
(2024)
• No Venue
Hugo Laurençon, Léo Tronchon, Victor Sanh
-
One Diffusion To Generate Them All
(2024)
• No Venue
Le et al.
-
Phantom Of Latent For Large Language And Vision Models
(2024)
• No Venue
Lee et al.
-
Moai: Mixture Of All Intelligence For Large Language And Vision Models
(2024)
• No Venue
Lee et al.
-
Gecko: Versatile Text Embeddings Distilled From Large Language Models
(2024)
• No Venue
Lee et al.
-
Trol: Traversal Of Layers For Large Language And Vision Models
(2024)
• No Venue
Lee et al.
-
Videoguide: Improving Video Diffusion Models Without Training Through A Teacher's Guide
(2024)
• No Venue
Lee et al.
-
Beyond A*: Better Planning With Transformers Via Search Dynamics Bootstrapping
(2024)
• No Venue
Lehnert et al.
-
The Curse Of Multi-modalities: Evaluating Hallucinations Of Large Multimodal Models Across Language, Visual, And Audio
(2024)
• No Venue
Leng et al.
-
More Agents Is All You Need
(2024)
• No Venue
Li et al.
-
Mini-gemini: Mining The Potential Of Multi-modality Vision Language Models
(2024)
• No Venue
Li et al.
-
Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models
(2024)
• No Venue
Li et al.
-
Controlnet++: Improving Conditional Controls With Efficient Consistency Feedback
(2024)
• No Venue
Li et al.
-
Baichuan-omni Technical Report
(2024)
• No Venue
Li et al.
-
Brushedit: All-in-one Image Inpainting And Editing
(2024)
• No Venue
Li et al.
-
Making Text Embedders Few-shot Learners
(2024)
• No Venue
Li et al.
-
K-sort Arena: Efficient And Reliable Benchmarking For Generative Models Via K-wise Human Preferences
(2024)
• No Venue
Li et al.
-
From Generation To Judgment: Opportunities And Challenges Of Llm-as-a-judge
(2024)
• No Venue
Li et al.
-
GMAI-VL & GMAI-VL-5.5M: A Large Vision-language Model And A Comprehensive Multimodal Dataset Towards General Medical AI
(2024)
• No Venue
Li et al.
-
Llava-next-interleave: Tackling Multi-image, Video, And 3D In Large Multimodal Models
(2024)
• No Venue
Li et al.
-
Synergen-vl: Towards Synergistic Image Understanding And Generation With Vision Experts And Token Folding
(2024)
• No Venue
Li et al.
-
Retrollm: Empowering Large Language Models To Retrieve Fine-grained Evidence Within Generation
(2024)
• No Venue
Li et al.
-
Omnicorpus: A Unified Multimodal Corpus Of 10 Billion-level Images Interleaved With Text
(2024)
• No Venue
Li et al.
-
Naturalbench: Evaluating Vision-language Models On Natural Adversarial Samples
(2024)
• No Venue
Li et al.
-
Needlebench: Can Llms Do Retrieval And Reasoning In 1 Million Context Window?
(2024)
• No Venue
Li et al.
-
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel In Long-horizon Tasks
(2024)
• No Venue
Li et al.
-
Structrag: Boosting Knowledge Intensive Reasoning Of Llms Via Inference-time Hybrid Information Structurization
(2024)
• No Venue
Li et al.
-
Scaling (down) CLIP: A Comprehensive Analysis Of Data, Architecture, And Training Strategies
(2024)
• No Venue
Zichao Li, Cihang Xie, Ekin Dogus Cubuk
-
Scilitllm: How To Adapt Llms For Scientific Literature Understanding
(2024)
• No Venue
Li et al.
-
Your Mixture-of-experts LLM Is Secretly An Embedding Model For Free
(2024)
• No Venue
Ziyue Li, Tianyi Zhou
-
Videomamba: State Space Model For Efficient Video Understanding
(2024)
• No Venue
Li et al.
-
Synthetic Data (almost) From Scratch: Generalized Instruction Tuning For Language Models
(2024)
• No Venue
Li et al.
-
Transformer-lite: High-efficiency Deployment Of Large Language Models On Mobile Phone Gpus
(2024)
• No Venue
Li et al.
-
What Happened In Llms Layers When Trained For Fast Vs. Slow Thinking: A Gradient Perspective
(2024)
• No Venue
Ming Li, Yanhong Li, Tianyi Zhou
-
Step-aware Preference Optimization: Aligning Preference With Denoising Performance At Each Step
(2024)
• No Venue
Liang et al.
-
Internal Consistency And Self-feedback In Large Language Models: A Survey
(2024)
• No Venue
Liang et al.
-
Controllable Text Generation For Large Language Models: A Survey
(2024)
• No Venue
Liang et al.
-
I-SHEEP: Self-alignment Of LLM From Scratch Through An Iterative Self-enhancement Paradigm
(2024)
• No Venue
Liang et al.
-
Mixture-of-transformers: A Sparse And Scalable Architecture For Multi-modal Foundation Models
(2024)
• No Venue
Liang et al.
-
Adding Nvme Ssds To Enable And Accelerate 100B Model Fine-tuning On A Single GPU
(2024)
• No Venue
Liao et al.
-
Showui: One Vision-language-action Model For GUI Visual Agent
(2024)
• No Venue
Lin et al.
-
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once On Gemma 2
(2024)
• No Venue
Lieberum et al.
-
Jamba: A Hybrid Transformer-mamba Language Model
(2024)
• No Venue
Lieber et al.
-
Rho-1: Not All Tokens Are What You Need
(2024)
• No Venue
Lin et al.
-
Moe-llava: Mixture Of Experts For Large Vision-language Models
(2024)
• No Venue
Lin et al.
-
Baichuan Alignment Technical Report
(2024)
• No Venue
Lin et al.
-
Critical Tokens Matter: Token-level Contrastive Estimation Enhence Llm's Reasoning Capability
(2024)
• No Venue
Lin et al.
-
STIV: Scalable Text And Image Conditioned Video Generation
(2024)
• No Venue
Lin et al.
-
Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild
(2024)
• No Venue
Lin et al.
-
Motionclone: Training-free Motion Cloning For Controllable Video Generation
(2024)
• No Venue
Ling et al.
-
Infini-gram: Scaling Unbounded N-gram Language Models To A Trillion Tokens
(2024)
• No Venue
Liu et al.
-
Harnessing Webpage Uis For Text-rich Visual Understanding
(2024)
• No Venue
Liu et al.
-
Chatqa: Building GPT-4 Level Conversational QA Models
(2024)
• No Venue
Liu et al.
-
Are Your Llms Capable Of Stable Reasoning?
(2024)
• No Venue
Liu et al.
-
Alleviating Distortion In Image Generation Via Multi-resolution Diffusion Models
(2024)
• No Venue
Liu et al.
-
Best Practices And Lessons Learned On Synthetic Data For Language Models
(2024)
• No Venue
Liu et al.
-
Diving Into Self-evolving Training For Multimodal Reasoning
(2024)
• No Venue
Liu et al.
-
Deliberation In Latent Space Via Differentiable Cache Augmentation
(2024)
• No Venue
Liu et al.
-
Distilled Decoding 1: One-step Sampling Of Image Auto-regressive Models With Flow Matching
(2024)
• No Venue
Liu et al.
-
Understanding Llms: A Comprehensive Overview From Training To Inference
(2024)
• No Venue
Liu et al.
-
NVILA: Efficient Frontier Visual Language Models
(2024)
• No Venue
Liu et al.
-
Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining
(2024)
• No Venue
Liu et al.
-
Linfusion: 1 GPU, 1 Minute, 16K Image
(2024)
• No Venue
Liu et al.
-
KAN: Kolmogorov-arnold Networks
(2024)
• No Venue
Liu et al.
-
Kangaroo: Lossless Self-speculative Decoding Via Double Early Exiting
(2024)
• No Venue
Liu et al.
-
Llms + Persona-plug = Personalized Llms
(2024)
• No Venue
Liu et al.
-
Mobilellm: Optimizing Sub-billion Parameter Language Models For On-device Use Cases
(2024)
• No Venue
Liu et al.
-
Magicquill: An Intelligent Interactive Image Editing System
(2024)
• No Venue
Liu et al.
-
MMDU: A Multi-turn Multi-image Dialog Understanding Benchmark And Instruction-tuning Dataset For Lvlms
(2024)
• No Venue
Liu et al.
-
Sora: A Review On Background, Technology, Limitations, And Opportunities Of Large Vision Models
(2024)
• No Venue
Liu et al.
-
Regmix: Data Mixture As Regression For Language Model Pre-training
(2024)
• No Venue
Liu et al.
-
POINTS1.5: Building A Vision-language Model Towards Real World Applications
(2024)
• No Venue
Liu et al.
-
Reconx: Reconstruct Any Scene From Sparse Views With Video Diffusion Model
(2024)
• No Venue
Liu et al.
-
Retrievalattention: Accelerating Long-context LLM Inference Via Vector Retrieval
(2024)
• No Venue
Liu et al.
-
Starcoder 2 And The Stack V2: The Next Generation
(2024)
• No Venue
Lozhkov et al.
-
Seacrowd: A Multilingual Multimodal Data Hub And Benchmark Suite For Southeast Asian Languages
(2024)
• No Venue
Lovenia et al.
-
Blending Is All You Need: Cheaper, Better Alternative To Trillion-parameters LLM
(2024)
• No Venue
Lu et al.
-
The AI Scientist: Towards Fully Automated Open-ended Scientific Discovery
(2024)
• No Venue
Lu et al.
-
From GPT-4 To Gemini And Beyond: Assessing The Landscape Of Mllms On Generalizability, Trustworthiness And Causality Through Four Modalities
(2024)
• No Venue
Lu et al.
-
Fit: Flexible Vision Transformer For Diffusion Model
(2024)
• No Venue
Lu et al.
-
A Controlled Study On Long Context Extension And Generalization In Llms
(2024)
• No Venue
Lu et al.
-
Deepseek-vl: Towards Real-world Vision-language Understanding
(2024)
• No Venue
Lu et al.
-
Genex: Generating An Explorable World
(2024)
• No Venue
Lu et al.
-
Large Language Models Are Superpositions Of All Characters: Attaining Arbitrary Role-play Via Self-alignment
(2024)
• No Venue
Lu et al.
-
Mathcoder2: Better Math Reasoning From Continued Pretraining On Model-translated Mathematical Code
(2024)
• No Venue
Lu et al.
-
Addition Is All You Need For Energy-efficient Language Models
(2024)
• No Venue
Hongyin Luo, Wei Sun
-
Robustft: Robust Supervised Fine-tuning For Large Language Models Under Noisy Response
(2024)
• No Venue
Luo et al.
-
Improve Mathematical Reasoning In Language Models By Automated Process Supervision
(2024)
• No Venue
Luo et al.
-
Mmevol: Empowering Multimodal Large Language Models With Evol-instruct
(2024)
• No Venue
Luo et al.
-
Semievol: Semi-supervised Fine-tuning For LLM Adaptation
(2024)
• No Venue
Luo et al.
-
Reft: Reasoning With Reinforced Fine-tuning
(2024)
• No Venue
Luong et al.
-
Aria Everyday Activities Dataset
(2024)
• No Venue
Lv et al.
-
Weblinx: Real-world Website Navigation With Multi-turn Dialogue
(2024)
• No Venue
Xing Han Lù, Zdeněk Kasner, Siva Reddy
-
Magic-me: Identity-specific Video Customized Diffusion
(2024)
• No Venue
Ma et al.
-
Foundation Models For Music: A Survey
(2024)
• No Venue
Ma et al.
-
The Era Of 1-bit Llms: All Large Language Models Are In 1.58 Bits
(2024)
• No Venue
Ma et al.
-
Language Model Can Listen While Speaking
(2024)
• No Venue
Ma et al.
-
Groma: Localized Visual Tokenization For Grounding Multimodal Large Language Models
(2024)
• No Venue
Ma et al.
-
Janusflow: Harmonizing Autoregression And Rectified Flow For Unified Multimodal Understanding And Generation
(2024)
• No Venue
Ma et al.
-
Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length
(2024)
• No Venue
Ma et al.
-
Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling
(2024)
• No Venue
Maini et al.
-
Wavelets Are All You Need For Autoregressive Image Generation
(2024)
• No Venue
Mattar et al.
-
MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training
(2024)
• No Venue
McKinzie et al.
-
LLM Agent Operating System
(2024)
• No Venue
Mei et al.
-
Openelm: An Efficient Language Model Family With Open-source Training And Inference Framework
(2024)
• No Venue
Mehta et al.
-
Shortgpt: Layers In Large Language Models Are More Redundant Than You Expect
(2024)
• No Venue
Men et al.
-
Towards World Simulator: Crafting Physical Commonsense-based Benchmark For Video Generation
(2024)
• No Venue
Meng et al.
-
Anidoc: Animation Creation Made Easier
(2024)
• No Venue
Meng et al.
-
MMIU: Multimodal Multi-image Understanding For Evaluating Large Vision-language Models
(2024)
• No Venue
Meng et al.
-
Large Language Models: A Survey
(2024)
• Arxiv
• 132 citations
Minaee et al.
-
MALT: Improving Reasoning With Multi-agent LLM Training
(2024)
• No Venue
Motwani et al.
-
ROS-LLM: A ROS Framework For Embodied AI With Task Feedback And Structured Reasoning
(2024)
• No Venue
Mower et al.
-
Olmoe: Open Mixture-of-experts Language Models
(2024)
• No Venue
Muennighoff et al.
-
Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering
(2024)
• No Venue
Muller et al.
-
Bimedix2: Bio-medical Expert LMM For Diverse Medical Modalities
(2024)
• No Venue
Mullappilly et al.
-
Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention
(2024)
• No Venue
Tsendsuren Munkhdalai, Manaal Faruqui, Siddharth Gopal
-
Compact Language Models Via Pruning And Knowledge Distillation
(2024)
• No Venue
Muralidharan et al.
-
Yesbut: A High-quality Annotated Multimodal Dataset For Evaluating Satire Comprehension Capability Of Vision-language Models
(2024)
• No Venue
Nandy et al.
-
Openvid-1m: A Large-scale High-quality Dataset For Text-to-video Generation
(2024)
• No Venue
Nan et al.
-
Swiftedit: Lightning Fast Text-guided Image Editing Via One-step Diffusion
(2024)
• No Venue
Nguyen et al.
-
A Survey Of Small Language Models
(2024)
• No Venue
Nguyen et al.
-
GUI Agents: A Survey
(2024)
• No Venue
Nguyen et al.
-
Dynasaur: Large Language Agents Beyond Predefined Actions
(2024)
• No Venue
Nguyen et al.
-
SNOOPI: Supercharged One-step Diffusion Distillation With Proper Guidance
(2024)
• No Venue
Nguyen et al.
-
Transformers Are Multi-state Rnns
(2024)
• No Venue
Oren et al.
-
Xland-100b: A Large-scale Multi-task Dataset For In-context Reinforcement Learning
(2024)
• No Venue
Nikulin et al.
-
Beyond Scaling Laws: Understanding Transformer Performance With Associative Memory
(2024)
• No Venue
Niu et al.
-
Llms Know More Than They Show: On The Intrinsic Representation Of LLM Hallucinations
(2024)
• No Venue
Orgad et al.
-
Reka Core, Flash, And Edge: A Series Of Powerful Multimodal Language Models
(2024)
• No Venue
Ormazabal et al.
-
Towards Modular Llms By Building And Reusing A Library Of Loras
(2024)
• No Venue
Ostapenko et al.
-
Byte Latent Transformer: Patches Scale Better Than Tokens
(2024)
• No Venue
Pagnoni et al.
-
Can Mamba Learn How To Learn? A Comparative Study On In-context Learning Tasks
(2024)
• No Venue
Park et al.
-
Iterative Reasoning Preference Optimization
(2024)
• No Venue
Pang et al.
-
Nemotron-4 15B Technical Report
(2024)
• No Venue
Parmar et al.
-
Advprompter: Fast Adaptive Adversarial Prompting For Llms
(2024)
• No Venue
Paulus et al.
-
Datadreamer: A Tool For Synthetic Data Generation And Reproducible LLM Workflows
(2024)
• No Venue
Ajay Patel, Colin Raffel, Chris Callison-Burch
-
Eagle And Finch: RWKV With Matrix-valued States And Dynamic Recurrence
(2024)
• No Venue
Peng et al.
-
Dreambench++: A Human-aligned Benchmark For Personalized Image Generation
(2024)
• No Venue
Peng et al.
-
Controlnext: Powerful And Efficient Control For Image And Video Generation
(2024)
• No Venue
Peng et al.
-
Moe-mamba: Efficient Selective State Space Models With Mixture Of Experts
(2024)
• No Venue
Pióro et al.
-
Movie Gen: A Cast Of Media Foundation Models
(2024)
• No Venue
Polyak et al.
-
Video Diffusion Alignment Via Reward Gradients
(2024)
• No Venue
Prabhudesai et al.
-
The Widening Gap: The Benefits And Harms Of Generative AI For Novice Programmers
(2024)
• Proceedings of the 2024 ACM Conference on International Computing Education Research - Volume 1
• 60 citations
Prather et al.
-
Mutual Reasoning Makes Smaller Llms Stronger Problem-solvers
(2024)
• No Venue
Qi et al.
-
Webrl: Training LLM Web Agents Via Self-evolving Online Curriculum Reinforcement Learning
(2024)
• No Venue
Qi et al.
-
Memorag: Moving Towards Next-gen RAG Via Memory-inspired Knowledge Discovery
(2024)
• No Venue
Qian et al.
-
We-math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
(2024)
• No Venue
Qiao et al.
-
Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms
(2024)
• No Venue
Qiao et al.
-
Layerwise Recurrent Router For Mixture-of-experts
(2024)
• No Venue
Qiu et al.
-
Diffusiongpt: Llm-driven Text-to-image Generation System
(2024)
• No Venue
Qin et al.
-
Xgen-videosyn-1: High-fidelity Text-to-video Synthesis With Compressed Representations
(2024)
• No Venue
Qin et al.
-
Tokenflow: Unified Image Tokenizer For Multimodal Understanding And Generation
(2024)
• No Venue
Qu et al.
-
Hellobench: Evaluating Long Text Generation Capabilities Of Large Language Models
(2024)
• No Venue
Que et al.
-
EXAONE 3.0 7.8B Instruction Tuned Language Model
(2024)
• No Venue
Research et al.
-
Bringing Objects To Life: 4D Generation From 3D Objects
(2024)
• No Venue
Rahamim et al.
-
SAM 2: Segment Anything In Images And Videos
(2024)
• No Venue
Ravi et al.
-
Your Transformer Is Secretly Linear
(2024)
• No Venue
Razzhigaev et al.
-
Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling
(2024)
• No Venue
Ren et al.
-
Grandmaster-level Chess Without Search
(2024)
• No Venue
Ruoss et al.
-
Writing In The Margins: Better Inference Pattern For Long Context Retrieval
(2024)
• No Venue
Russak et al.
-
Eliminating Oversaturation And Artifacts Of High Guidance Scales In Diffusion Models
(2024)
• No Venue
Seyedmorteza Sadat, Otmar Hilliges, Romann M. Weber
-
A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications
(2024)
• Arxiv
• 92 citations
Sahoo et al.
-
Pre-training Small Base Lms With Fewer Tokens
(2024)
• No Venue
Sunny Sanyal, Sujay Sanghavi, Alexandros G. Dimakis
-
Fast High-resolution Image Synthesis With Latent Adversarial Diffusion Distillation
(2024)
• No Venue
Sauer et al.
-
RAPTOR: Recursive Abstractive Processing For Tree-organized Retrieval
(2024)
• No Venue
Sarthi et al.
-
Prithvi Wxc: Foundation Model For Weather And Climate
(2024)
• No Venue
Schmude et al.
-
The Prompt Report: A Systematic Survey Of Prompting Techniques
(2024)
• No Venue
Schulhoff et al.
-
Show, Don't Tell: Aligning Language Models With Demonstrated Feedback
(2024)
• No Venue
Shaikh et al.
-
Diffuse To Choose: Enriching Image Conditioned Inpainting In Latent Diffusion Models For Virtual Try-all
(2024)
• No Venue
Seyfioglu et al.
-
Inserf: Text-driven Generative Object Insertion In Neural 3D Scenes
(2024)
• No Venue
Shahbazi et al.
-
Talking About Large Language Models
(2024)
• Communications of the ACM
• 135 citations
Murray Shanahan
-
Imp: Highly Capable Large Multimodal Models For Mobile Devices
(2024)
• No Venue
Shao et al.
-
Deepseekmath: Pushing The Limits Of Mathematical Reasoning In Open Language Models
(2024)
• No Venue
Shao et al.
-
Scaling Retrieval-based Language Models With A Trillion-token Datastore
(2024)
• No Venue
Shao et al.
-
Nemo-aligner: Scalable Toolkit For Efficient Model Alignment
(2024)
• No Venue
Shen et al.
-
Explanatory Instructions: Towards Unified Vision Tasks Understanding And Zero-shot Generalization
(2024)
• No Venue
Shen et al.
-
Jetmoe: Reaching Llama2 Performance With 0.1M Dollars
(2024)
• No Venue
Shen et al.
-
LEDITS: Real Image Editing With DDPM Inversion And Semantic Guidance
(2023)
• No Venue
Linoy Tsaban, Apolinário Passos
-
Key-locked Rank One Editing For Text-to-image Personalization
(2023)
• Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Proceedings
• 68 citations
Tewel et al.
-
Pytorch FSDP: Experiences On Scaling Fully Sharded Data Parallel
(2023)
• Proceedings of the VLDB Endowment
• 75 citations
Zhao et al.
-
Fine-tuning Language Models For Factuality
(2023)
• No Venue
Tian et al.
-
Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models
(2023)
• Arxiv
• 374 citations
Zhu et al.
-
Is Chatgpt The Ultimate Programming Assistant -- How Far Is It?
(2023)
• Arxiv
• 92 citations
Tian et al.
-
Opportunities And Challenges For Chatgpt And Large Language Models In Biomedicine And Health
(2023)
• Briefings in Bioinformatics
• 205 citations
Tian et al.
-
Unleashing Text-to-image Diffusion Models For Visual Perception
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 78 citations
Zhao et al.
-
Enhancing STEM Learning With Chatgpt And Bing Chat As Objects To Think With: A Case Study
(2023)
• Eurasia Journal of Mathematics, Science and Technology Education
• 63 citations
Marco Antonio Rodrigues Vasconcelos, Renato P. Dos Santos
-
A Survey Of Large Language Models
(2023)
• Arxiv
• 1150 citations
Zhao et al.
-
Large Language Models Fail On Trivial Alterations To Theory-of-mind Tasks
(2023)
• Arxiv
• 56 citations
Tomer Ullman
-
Chatclimate: Grounding Conversational AI In Climate Science
(2023)
• Communications Earth & Environment
• 63 citations
Vaghefi et al.
-
Llama 2: Open Foundation And Fine-tuned Chat Models
(2023)
• No Venue
Touvron et al.
-
Reflexion: Language Agents With Verbal Reinforcement Learning
(2023)
• Arxiv
• 170 citations
Shinn et al.
-
Llasm: Large Language And Speech Model
(2023)
• No Venue
Shu et al.
-
The Curse Of Recursion: Training On Generated Data Makes Models Forget
(2023)
• Arxiv
• 101 citations
Shumailov et al.
-
Codefusion: A Pre-trained Diffusion Model For Code Generation
(2023)
• No Venue
Singh et al.
-
An Analysis Of The Automatic Bug Fixing Performance Of Chatgpt
(2023)
• 2023 IEEE/ACM International Workshop on Automated Program Repair (APR)
• 226 citations
Sobania et al.
-
Powerinfer: Fast Large Language Model Serving With A Consumer-grade GPU
(2023)
• No Venue
Song et al.
-
Agents: An Open-source Framework For Autonomous Language Agents
(2023)
• No Venue
Zhou et al.
-
How Far Are Large Language Models From Agents With Theory-of-mind?
(2023)
• No Venue
Zhou et al.
-
Sensecape: Enabling Multilevel Exploration And Sensemaking With Large Language Models
(2023)
• Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
• 65 citations
Suh et al.
-
Large Language Models For Information Retrieval: A Survey
(2023)
• Arxiv
• 63 citations
Zhu et al.
-
Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 97 citations
Sun et al.
-
Aligning Large Multimodal Models With Factually Augmented RLHF
(2023)
• No Venue
Sun et al.
-
3D-GPT: Procedural 3D Modeling With Large Language Models
(2023)
• No Venue
Sun et al.
-
Generative Multimodal Models Are In-context Learners
(2023)
• No Venue
Sun et al.
-
All In One: Multi-task Prompting For Graph Neural Networks
(2023)
• Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
• 85 citations
Sun et al.
-
EVA-CLIP: Improved Training Techniques For CLIP At Scale
(2023)
• Arxiv
• 64 citations
Sun et al.
-
Retentive Network: A Successor To Transformer For Large Language Models
(2023)
• No Venue
Sun et al.
-
Text Classification Via Large Language Models
(2023)
• Findings of the Association for Computational Linguistics: EMNLP 2023
• 88 citations
Sun et al.
-
Segment Everything Everywhere All At Once
(2023)
• Arxiv
• 137 citations
Zou et al.
-
Universal And Transferable Adversarial Attacks On Aligned Language Models
(2023)
• Arxiv
• 109 citations
Zou et al.
-
Can Chatgpt Understand Too? A Comparative Study On Chatgpt And Fine-tuned BERT
(2023)
• Arxiv
• 108 citations
Zhong et al.
-
Vipergpt: Visual Inference Via Python Execution For Reasoning
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 101 citations
Dídac Surís, Sachit Menon, Carl Vondrick
-
Judging Llm-as-a-judge With Mt-bench And Chatbot Arena
(2023)
• No Venue
Zheng et al.
-
Does Synthetic Data Generation Of Llms Help Clinical Text Mining?
(2023)
• Arxiv
• 72 citations
Tang et al.
-
Exploring Large Language Models' Cognitive Moral Development Through Defining Issues Test
(2023)
• No Venue
Tanmay et al.
-
GALIP: Generative Adversarial Clips For Text-to-image Synthesis
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 91 citations
Tao et al.
-
Codegeex: A Pre-trained Model For Code Generation With Multilingual Benchmarking On Humaneval-x
(2023)
• Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
• 88 citations
Zheng et al.
-
Judgelm: Fine-tuned Large Language Models Are Scalable Judges
(2023)
• No Venue
Lianghui Zhu, Xinggang Wang, Xinlong Wang
-
Exploring The Limits Of Chatgpt For Query Or Aspect-based Text Summarization
(2023)
• Arxiv
• 76 citations
Yang et al.
-
Deepspeed-chat: Easy, Fast And Affordable RLHF Training Of Chatgpt-like Models At All Scales
(2023)
• No Venue
Yao et al.
-
Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study
(2023)
• Proceedings of the 4th New Frontiers in Summarization Workshop
• 138 citations
Wang et al.
-
Large Language Models Can Be Easily Distracted By Irrelevant Context
(2023)
• Arxiv
• 58 citations
Shi et al.
-
Wavecoder: Widespread And Versatile Enhanced Instruction Tuning With Refined Data Generation
(2023)
• No Venue
Yu et al.
-
Towards Human-bot Collaborative Software Architecting With Chatgpt
(2023)
• Proceedings of the 27th International Conference on Evaluation and Assessment in Software Engineering
• 108 citations
Ahmad et al.
-
MEGA: Multilingual Evaluation Of Generative AI
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 60 citations
Ahuja et al.
-
GQA: Training Generalized Multi-query Transformer Models From Multi-head Checkpoints
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 117 citations
Ainslie et al.
-
Can We Trust The Evaluation On Chatgpt?
(2023)
• Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023)
• 59 citations
Aiyappa et al.
-
Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent
(2023)
• No Venue
Aksitov et al.
-
The Falcon Series Of Open Language Models
(2023)
• Arxiv
• 88 citations
Almazrouei et al.
-
LLM In A Flash: Efficient Large Language Model Inference With Limited Memory
(2023)
• No Venue
Alizadeh et al.
-
Large-scale Automatic Audiobook Creation
(2023)
• No Venue
Walsh et al.
-
Fusionframes: Efficient Architectural Aspects For Text-to-video Generation Pipeline
(2023)
• No Venue
Arkhipkin et al.
-
Kandinsky 3.0 Technical Report
(2023)
• No Venue
Arkhipkin et al.
-
Self-rag: Learning To Retrieve, Generate, And Critique Through Self-reflection
(2023)
• No Venue
Asai et al.
-
Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models
(2023)
• No Venue
Awadalla et al.
-
The Chosen One: Consistent Characters In Text-to-image Diffusion Models
(2023)
• No Venue
Avrahami et al.
-
Llemma: An Open Language Model For Mathematics
(2023)
• No Venue
Azerbayev et al.
-
Dreamdiffusion: Generating High-quality Images From Brain EEG Signals
(2023)
• No Venue
Bai et al.
-
Qwen Technical Report
(2023)
• No Venue
Bai et al.
-
Codeplan: Repository-level Coding Using Llms And Planning
(2023)
• No Venue
Bairi et al.
-
Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation
(2023)
• Proceedings of the 17th ACM Conference on Recommender Systems
• 161 citations
Bao et al.
-
Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling
(2023)
• Arxiv
• 101 citations
Biderman et al.
-
Emergent Autonomous Scientific Research Capabilities Of Large Language Models
(2023)
• Arxiv
• 61 citations
Daniil A. Boiko, Robert MacKnight, Gabe Gomes
-
Chip-chat: Challenges And Opportunities In Conversational Hardware Design
(2023)
• 2023 ACM/IEEE 5th Workshop on Machine Learning for CAD (MLCAD)
• 68 citations
Blocklove et al.
-
Nougat: Neural Optical Understanding For Academic Documents
(2023)
• No Venue
Blecher et al.
-
Align Your Latents: High-resolution Video Synthesis With Latent Diffusion Models
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 290 citations
Blattmann et al.
-
Visual-language Prompt Tuning With Knowledge-guided Context Optimization
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 82 citations
Hantao Yao, Rui Zhang, Changsheng Xu
-
A Categorical Archive Of Chatgpt Failures
(2023)
• Arxiv
• 364 citations
Ali Borji
-
Eight Things To Know About Large Language Models
(2023)
• Arxiv
• 76 citations
Samuel R. Bowman
-
Promptify: Text-to-image Generation Through Interactive Prompt Exploration With Large Language Models
(2023)
• Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
• 80 citations
Brade et al.
-
Chemcrow: Augmenting Large-language Models With Chemistry Tools
(2023)
• Arxiv
• 106 citations
Bran et al.
-
RT-2: Vision-language-action Models Transfer Web Knowledge To Robotic Control
(2023)
• No Venue
Brohan et al.
-
Principled Instructions Are All You Need For Questioning Llama-1/2, GPT-3.5/4
(2023)
• No Venue
Sondos Mahmoud Bsharat, Aidar Myrzakhan, Zhiqiang Shen
-
Sparks Of Artificial General Intelligence: Early Experiments With GPT-4
(2023)
• Arxiv
• 1098 citations
Bubeck et al.
-
Weak-to-strong Generalization: Eliciting Strong Capabilities With Weak Supervision
(2023)
• No Venue
Burns et al.
-
A Comprehensive Survey Of Ai-generated Content (AIGC): A History Of Generative AI From GAN To Chatgpt
(2023)
• Arxiv
• 286 citations
Cao et al.
-
Assessing Cross-cultural Alignment Between Chatgpt And Human Societies: An Empirical Study
(2023)
• Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP)
• 67 citations
Cao et al.
-
Open Problems And Fundamental Limitations Of Reinforcement Learning From Human Feedback
(2023)
• No Venue
Casper et al.
-
Spanish Pre-trained BERT Model And Evaluation Data
(2023)
• Arxiv
• 242 citations
Cañete et al.
-
The Dawn Of Lmms: Preliminary Explorations With Gpt-4v(ision)
(2023)
• Arxiv
• 131 citations
Yang et al.
-
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
(2023)
• Arxiv
• 97 citations
Zhang et al.
-
Magicdance: Realistic Human Dance Video Generation With Motions & Facial Expressions Transfer
(2023)
• No Venue
Chang et al.
-
Could A Large Language Model Be Conscious?
(2023)
• Boston Review August 9 2023
• 99 citations
David J. Chalmers
-
The AI Generation Gap: Are Gen Z Students More Interested In Adopting Generative AI Such As Chatgpt In Teaching And Learning Than Their Gen X And Millennial Generation Teachers?
(2023)
• Smart Learning Environments
• 268 citations
Cecilia Ka Yuk Chan, Katherine K. W. Lee
-
A Survey On Evaluation Of Large Language Models
(2023)
• No Venue
Chang et al.
-
Muse: Text-to-image Generation Via Masked Generative Transformers
(2023)
• Arxiv
• 96 citations
Chang et al.
-
Pepnet: Parameter And Embedding Personalized Network For Infusing With Personalized Prior Information
(2023)
• Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
• 60 citations
Chang et al.
-
Attend-and-excite: Attention-based Semantic Guidance For Text-to-image Diffusion Models
(2023)
• ACM Transactions on Graphics
• 215 citations
Chefer et al.
-
Extending Context Window Of Large Language Models Via Positional Interpolation
(2023)
• No Venue
Chen et al.
-
Pixart-α: Fast Training Of Diffusion Transformer For Photorealistic Text-to-image Synthesis
(2023)
• No Venue
Chen et al.
-
MEDITRON-70B: Scaling Medical Pretraining For Large Language Models
(2023)
• Arxiv
• 64 citations
Chen et al.
-
Llava-interactive: An All-in-one Demo For Image Chat, Segmentation, Generation And Editing
(2023)
• No Venue
Chen et al.
-
Longlora: Efficient Fine-tuning Of Long-context Large Language Models
(2023)
• No Venue
Chen et al.
-
Photoverse: Tuning-free Image Customization With Text-to-image Diffusion Models
(2023)
• No Venue
Chen et al.
-
Teaching Large Language Models To Self-debug
(2023)
• Arxiv
• 61 citations
Chen et al.
-
Schrodinger Bridges Beat Diffusion Models On Text-to-speech Synthesis
(2023)
• No Venue
Chen et al.
-
Shikra: Unleashing Multimodal Llm's Referential Dialogue Magic
(2023)
• Arxiv
• 60 citations
Chen et al.
-
Xtrimopglm: Unified 100b-scale Pre-trained Transformer For Deciphering The Language Of Protein
(2023)
• Arxiv
• 66 citations
Chen et al.
-
A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models
(2023)
• Arxiv
• 158 citations
Ye et al.
-
Ip-adapter: Text Compatible Image Prompt Adapter For Text-to-image Diffusion Models
(2023)
• No Venue
Ye et al.
-
Adapting Large Language Models Via Reading Comprehension
(2023)
• No Venue
Daixuan Cheng, Shaohan Huang, Furu Wei
-
Can Large Language Models Be An Alternative To Human Evaluations?
(2023)
• Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 141 citations
Cheng-Han Chiang, Hung-Yi Lee
-
Contrastive Chain-of-thought Prompting
(2023)
• No Venue
Chia et al.
-
Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models
(2023)
• No Venue
Chuang et al.
-
Simple And Controllable Music Generation
(2023)
• No Venue
Copet et al.
-
Rerender A Video: Zero-shot Text-guided Video-to-video Translation
(2023)
• No Venue
Yang et al.
-
Switchhead: Accelerating Transformers With Mixture-of-experts Attention
(2023)
• No Venue
Csordás et al.
-
Efficient And Effective Text Encoding For Chinese Llama And Alpaca
(2023)
• Arxiv
• 56 citations
Yiming Cui, Ziqing Yang, Xin Yao
-
Where To Go Next For Recommender Systems? ID- Vs. Modality-based Recommender Models Revisited
(2023)
• Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 91 citations
Yuan et al.
-
Auggpt: Leveraging Chatgpt For Text Data Augmentation
(2023)
• Arxiv
• 85 citations
Dai et al.
-
Emu: Enhancing Image Generation Models Using Photogenic Needles In A Haystack
(2023)
• No Venue
Dai et al.
-
Large Language Models As Optimizers
(2023)
• No Venue
Yang et al.
-
Tinygpt-v: Efficient Multimodal Large Language Model Via Small Backbones
(2023)
• No Venue
Zhengqing Yuan, Zhaoxu Li, Lichao Sun
-
Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning
(2023)
• Arxiv
• 90 citations
Tri Dao
-
Vision Transformers Need Registers
(2023)
• No Venue
Darcet et al.
-
Generative AI In Computing Education: Perspectives Of Students And Instructors
(2023)
• 2023 IEEE Frontiers in Education Conference (FIE)
• 66 citations
Zastudil et al.
-
Patch N' Pack: Navit, A Vision Transformer For Any Aspect Ratio And Resolution
(2023)
• No Venue
Dehghani et al.
-
Scaling Vision Transformers To 22 Billion Parameters
(2023)
• Arxiv
• 60 citations
Dehghani et al.
-
Language Modeling Is Compression
(2023)
• No Venue
Delétang et al.
-
MM-REACT: Prompting Chatgpt For Multimodal Reasoning And Action
(2023)
• Arxiv
• 61 citations
Yang et al.
-
Brain2music: Reconstructing Music From Human Brain Activity
(2023)
• No Venue
Denk et al.
-
Lumos: Learning Agents With Unified Data, Modular Design, And Open-source Llms
(2023)
• No Venue
Yin et al.
-
Vid2seq: Large-scale Pretraining Of A Visual Language Model For Dense Video Captioning
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 122 citations
Yang et al.
-
Qlora: Efficient Finetuning Of Quantized Llms
(2023)
• No Venue
Dettmers et al.
-
Toxicity In Chatgpt: Analyzing Persona-assigned Language Models
(2023)
• Findings of the Association for Computational Linguistics: EMNLP 2023
• 94 citations
Deshpande et al.
-
Octopus: Embodied Vision-language Programmer From Environmental Feedback
(2023)
• No Venue
Yang et al.
-
Chain-of-verification Reduces Hallucination In Large Language Models
(2023)
• No Venue
Dhuliawala et al.
-
Imagereward: Learning And Evaluating Human Preferences For Text-to-image Generation
(2023)
• Arxiv
• 55 citations
Xu et al.
-
Fingpt: Open-source Financial Large Language Models
(2023)
• SSRN Electronic Journal
• 133 citations
Hongyang Yang, Xiao-Yang Liu, Christina Dan Wang
-
Longnet: Scaling Transformers To 1,000,000,000 Tokens
(2023)
• No Venue
Ding et al.
-
Lp-musiccaps: Llm-based Pseudo Music Captioning
(2023)
• No Venue
Doh et al.
-
Dreamllm: Synergistic Multimodal Comprehension And Creation
(2023)
• No Venue
Dong et al.
-
Palm-e: An Embodied Multimodal Language Model
(2023)
• Arxiv
• 309 citations
Driess et al.
-
Improving Factuality And Reasoning In Language Models Through Multiagent Debate
(2023)
• Arxiv
• 56 citations
Du et al.
-
Gpts Are Gpts: An Early Look At The Labor Market Impact Potential Of Large Language Models
(2023)
• Arxiv
• 430 citations
Eloundou et al.
-
Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review
(2023)
• British Journal of Educational Technology
• 329 citations
Yan et al.
-
Perspectives On Large Language Models For Relevance Judgment
(2023)
• Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval
• 96 citations
Faggioli et al.
-
Agenttuning: Enabling Generalized Agent Abilities For Llms
(2023)
• No Venue
Zeng et al.
-
LARP: Language-agent Role Play For Open-world Games
(2023)
• No Venue
Yan et al.
-
Large Language Models For Software Engineering: Survey And Open Problems
(2023)
• 2023 IEEE/ACM International Conference on Software Engineering: Future of Software Engineering (ICSE-FoSE)
• 117 citations
Fan et al.
-
RMT: Retentive Networks Meet Vision Transformers
(2023)
• No Venue
Fan et al.
-
Magicbrush: A Manually Annotated Dataset For Instruction-guided Image Editing
(2023)
• No Venue
Zhang et al.
-
Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models
(2023)
• First Monday
• 146 citations
Emilio Ferrara
-
Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records
(2023)
• No Venue
Fleming et al.
-
MME: A Comprehensive Evaluation Benchmark For Multimodal Large Language Models
(2023)
• Arxiv
• 70 citations
Fu et al.
-
Gptscore: Evaluate As You Desire
(2023)
• Arxiv
• 75 citations
Fu et al.
-
Encoder-based Domain Tuning For Fast Personalization Of Text-to-image Models
(2023)
• ACM Transactions on Graphics
• 98 citations
Gal et al.
-
Tablegpt: Towards Unifying Tables, Nature Language And Commands Into One GPT
(2023)
• No Venue
Zha et al.
-
Distil-whisper: Robust Knowledge Distillation Via Large-scale Pseudo Labelling
(2023)
• No Venue
Sanchit Gandhi, Patrick von Platen, Alexander M. Rush
-
Enabling Large Language Models To Generate Text With Citations
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 66 citations
Gao et al.
-
Neural Approaches To Conversational Information Retrieval
(2023)
• The Information Retrieval Series
• 57 citations
Gao et al.
-
Retrieval-augmented Generation For Large Language Models: A Survey
(2023)
• Arxiv
• 326 citations
Gao et al.
-
Llama-adapter V2: Parameter-efficient Visual Instruction Model
(2023)
• Arxiv
• 103 citations
Gao et al.
-
On The Origin Of Llms: An Evolutionary Tree And Graph For 15,821 Large Language Models
(2023)
• No Venue
Sarah Gao, Andrew Kean Gao
-
Regulating Chatgpt And Other Large Generative AI Models
(2023)
• 2023 ACM Conference on Fairness Accountability and Transparency
• 302 citations
Philipp Hacker, Andreas Engel, Marco Mauer
-
Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages
(2023)
• Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
• 77 citations
Sourojit Ghosh, Aylin Caliskan
-
Openagi: When LLM Meets Domain Experts
(2023)
• Arxiv
• 64 citations
Ge et al.
-
Gemini: A Family Of Highly Capable Multimodal Models
(2023)
• Arxiv
• 478 citations
Team et al.
-
Sigmoid Loss For Language Image Pre-training
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 138 citations
Zhai et al.
-
Tokenflow: Consistent Diffusion Features For Consistent Video Editing
(2023)
• No Venue
Geyer et al.
-
Prompt Cache: Modular Attention Reuse For Low-latency Inference
(2023)
• No Venue
Gim et al.
-
Chatgpt Outperforms Crowd-workers For Text-annotation Tasks
(2023)
• Proceedings of the National Academy of Sciences
• 546 citations
Fabrizio Gilardi, Meysam Alizadeh, Maël Kubli
-
Transformative Effects Of Chatgpt On Modern Education: Emerging Era Of AI Chatbots
(2023)
• Internet of Things and Cyber-Physical Systems
• 320 citations
Gill et al.
-
Imagebind: One Embedding Space To Bind Them All
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 348 citations
Girdhar et al.
-
Commoncanvas: An Open Diffusion Model Trained With Creative-commons Images
(2023)
• No Venue
Gokaslan et al.
-
Chatgpt Is Not All You Need. A State Of The Art Review Of Large Generative AI Models
(2023)
• Arxiv
• 179 citations
Roberto Gozalo-Brizuela, Eduardo C. Garrido-Merchan
-
PIPPA: A Partially Synthetic Conversational Dataset
(2023)
• No Venue
Tear Gosling, Alpin Dale, Yinhe Zheng
-
Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection
(2023)
• Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security
• 114 citations
Greshake et al.
-
A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models
(2023)
• No Venue
Xu et al.
-
Hallucinations In Large Multilingual Translation Models
(2023)
• Transactions of the Association for Computational Linguistics
• 63 citations
Guerreiro et al.
-
Legalbench: A Collaboratively Built Benchmark For Measuring Legal Reasoning In Large Language Models
(2023)
• SSRN Electronic Journal
• 69 citations
Guha et al.
-
Textbooks Are All You Need
(2023)
• No Venue
Gunasekar et al.
-
Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers
(2023)
• No Venue
Guo et al.
-
Animatediff: Animate Your Personalized Text-to-image Diffusion Models Without Specific Tuning
(2023)
• No Venue
Guo et al.
-
Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 72 citations
Xu et al.
-
Qa-lora: Quantization-aware Low-rank Adaptation Of Large Language Models
(2023)
• No Venue
Xu et al.
-
Multimodal Chain-of-thought Reasoning In Language Models
(2023)
• Arxiv
• 68 citations
Zhang et al.
-
Lemur: Harmonizing Natural Language And Code For Language Agents
(2023)
• No Venue
Xu et al.
-
Artificial Muses: Generative Artificial Intelligence Chatbots Have Risen To Human-level Creativity
(2023)
• Journal of Creativity
• 106 citations
Jennifer Haase, Paul H. P. Hanel
-
A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis
(2023)
• No Venue
Gur et al.
-
Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data
(2023)
• Arxiv
• 71 citations
Han et al.
-
Lm-infinite: Simple On-the-fly Length Generalization For Large Language Models
(2023)
• No Venue
Han et al.
-
Svdiff: Compact Parameter Space For Diffusion Fine-tuning
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 82 citations
Han et al.
-
Leveraging Large Language Models For Sequential Recommendation
(2023)
• Proceedings of the 17th ACM Conference on Recommender Systems
• 55 citations
Harte et al.
-
Fastervit: Fast Vision Transformers With Hierarchical Attention
(2023)
• No Venue
Hatamizadeh et al.
-
Exploring The Responses Of Large Language Models To Beginner Programmers' Help Requests
(2023)
• Proceedings of the 2023 ACM Conference on International Computing Education Research V.1
• 90 citations
Hellas et al.
-
In-context Learning Creates Task Vectors
(2023)
• No Venue
Roee Hendel, Mor Geva, Amir Globerson
-
How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation
(2023)
• Arxiv
• 146 citations
Hendy et al.
-
Metagpt: Meta Programming For A Multi-agent Collaborative Framework
(2023)
• Arxiv
• 71 citations
Hong et al.
-
LRM: Large Reconstruction Model For Single Image To 3D
(2023)
• No Venue
Hong et al.
-
3D-LLM: Injecting The 3D World Into Large Language Models
(2023)
• No Venue
Hong et al.
-
Flashdecoding++: Faster Large Language Model Inference On Gpus
(2023)
• No Venue
Hong et al.
-
Tool Documentation Enables Zero-shot Tool-usage With Large Language Models
(2023)
• No Venue
Hsieh et al.
-
Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes
(2023)
• Findings of the Association for Computational Linguistics: ACL 2023
• 117 citations
Hsieh et al.
-
Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task
(2023)
• Arxiv
• 62 citations
Xiong et al.
-
Opportunities And Challenges Of Chatgpt For Design Knowledge Management
(2023)
• Procedia CIRP
• 65 citations
Hu et al.
-
Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 99 citations
Hu et al.
-
Effective Long-context Scaling Of Foundation Models
(2023)
• No Venue
Xiong et al.
-
Language Is Not All You Need: Aligning Perception With Language Models
(2023)
• Arxiv
• 135 citations
Huang et al.
-
Is Chatgpt Better Than Human Annotators? Potential And Limitations Of Chatgpt In Explaining Implicit Hate Speech
(2023)
• Companion Proceedings of the ACM Web Conference 2023
• 152 citations
Fan Huang, Haewoon Kwak, Jisun An
-
C-eval: A Multi-level Multi-discipline Chinese Evaluation Suite For Foundation Models
(2023)
• Arxiv
• 55 citations
Huang et al.
-
Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model
(2023)
• International Journal of Oral Science
• 182 citations
Huang et al.
-
Tech: Text-guided Reconstruction Of Lifelike Clothed Humans
(2023)
• No Venue
Huang et al.
-
Large Language Models Cannot Self-correct Reasoning Yet
(2023)
• No Venue
Huang et al.
-
Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition
(2023)
• No Venue
Huang et al.
-
Mathprompter: Mathematical Reasoning Using Large Language Models
(2023)
• Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)
• 71 citations
Shima Imani, Liang Du, Harsh Shrivastava
-
Text2room: Extracting Textured 3D Meshes From 2D Text-to-image Models
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 56 citations
Höllein et al.
-
Instruction Tuning For Large Language Models: A Survey
(2023)
• Arxiv
• 76 citations
Zhang et al.
-
14 Examples Of How Llms Can Transform Materials Science And Chemistry: A Reflection On A Large Language Model Hackathon
(2023)
• Digital Discovery
• 143 citations
Jablonka et al.
-
Designing Participatory AI: Creative Professionals' Worries And Expectations About Generative AI
(2023)
• Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems
• 78 citations
Nanna Inie, Jeanette Falk, Steven Tanimoto
-
Appagent: Multimodal Agents As Smartphone Users
(2023)
• No Venue
Zhang et al.
-
Ufogen: You Forward Once Large Scale Text-to-image Generation Via Diffusion Gans
(2023)
• No Venue
Xu et al.
-
Co-writing With Opinionated Language Models Affects Users' Views
(2023)
• Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
• 140 citations
Jakesch et al.
-
Human Heuristics For Ai-generated Language Are Flawed
(2023)
• Proceedings of the National Academy of Sciences
• 161 citations
Maurice Jakesch, Jeffrey Hancock, Mor Naaman
-
Chatgpt And Software Testing Education: Promises & Perils
(2023)
• 2023 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW)
• 166 citations
Jalil et al.
-
VMC: Video Motion Customization Using Temporal Attention Adaption For Text-to-video Diffusion Models
(2023)
• No Venue
Hyeonho Jeong, Geon Yeong Park, Jong Chul Ye
-
Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention
(2023)
• Arxiv
• 139 citations
Zhang et al.
-
A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4
(2023)
• Natural Language Processing Journal
• 146 citations
Katikapalli Subramanyam Kalyan
-
Structgpt: A General Framework For Large Language Model To Reason Over Structured Data
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 75 citations
Jiang et al.
-
Mistral 7B
(2023)
• Arxiv
• 159 citations
Jiang et al.
-
Cross-modal Implicit Relation Reasoning And Aligning For Text-to-image Person Retrieval
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 127 citations
Ding Jiang, Mang Ye
-
Active Retrieval Augmented Generation
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 129 citations
Jiang et al.
-
Graphologue: Exploring Large Language Model Responses With Interactive Diagrams
(2023)
• Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
• 61 citations
Jiang et al.
-
Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation
(2023)
• Proceedings of the 17th ACM Conference on Recommender Systems
• 71 citations
Zhang et al.
-
Florence-2: Advancing A Unified Representation For A Variety Of Vision Tasks
(2023)
• No Venue
Xiao et al.
-
Inferfix: End-to-end Program Repair With Llms
(2023)
• Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering
• 72 citations
Jin et al.
-
A Complete Survey On Generative AI (AIGC): Is Chatgpt From GPT-4 To GPT-5 All You Need?
(2023)
• Arxiv
• 82 citations
Zhang et al.
-
Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding
(2023)
• 28th International Conference on Intelligent User Interfaces
• 110 citations
Xiao et al.
-
Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines
(2023)
• No Venue
Khattab et al.
-
Is Chatgpt A Good Translator? Yes With GPT-4 As The Engine
(2023)
• Arxiv
• 247 citations
Jiao et al.
-
Time-llm: Time Series Forecasting By Reprogramming Large Language Models
(2023)
• Arxiv
• 69 citations
Jin et al.
-
Challenges And Applications Of Large Language Models
(2023)
• No Venue
Kaddour et al.
-
Scaling Up Gans For Text-to-image Synthesis
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 241 citations
Kang et al.
-
The Rise And Potential Of Large Language Model Based Agents: A Survey
(2023)
• Arxiv
• 185 citations
Xi et al.
-
In Conversation With Artificial Intelligence: Aligning Language Models With Human Values
(2023)
• Philosophy & Technology
• 76 citations
Atoosa Kasirzadeh, Iason Gabriel
-
Analyzing And Improving The Training Dynamics Of Diffusion Models
(2023)
• No Venue
Karras et al.
-
Studying The Effect Of AI Code Generators On Supporting Novice Learners In Introductory Programming
(2023)
• Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
• 183 citations
Kazemitabaar et al.
-
How Novices Use Llm-based Code Generators To Solve CS1 Coding Tasks In A Self-paced Learning Environment
(2023)
• Proceedings of the 23rd Koli Calling International Conference on Computing Education Research
• 64 citations
Kazemitabaar et al.
-
LERF: Language Embedded Radiance Fields
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 124 citations
Kerr et al.
-
Will Chatgpt Get You Caught? Rethinking Of Plagiarism Detection
(2023)
• Arxiv
• 194 citations
Mohammad Khalil, Erkan Er
-
Text2video-zero: Text-to-image Diffusion Models Are Zero-shot Video Generators
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 169 citations
Khachatryan et al.
-
Slime: Segment Like Me
(2023)
• No Venue
Khani et al.
-
Autogen: Enabling Next-gen LLM Applications Via Multi-agent Conversation
(2023)
• Arxiv
• 69 citations
Wu et al.
-
How Secure Is Code Generated By Chatgpt?
(2023)
• 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
• 62 citations
Khoury et al.
-
SOLAR 10.7B: Scaling Large Language Models With Simple Yet Effective Depth Up-scaling
(2023)
• No Venue
Kim et al.
-
Language Models Can Solve Computer Tasks
(2023)
• Arxiv
• 60 citations
Geunwoo Kim, Pierre Baldi, Stephen McAleer
-
Prometheus: Inducing Fine-grained Evaluation Capability In Language Models
(2023)
• No Venue
Kim et al.
-
Meta-transformer: A Unified Framework For Multimodal Learning
(2023)
• No Venue
Zhang et al.
-
Siren's Song In The AI Ocean: A Survey On Hallucination In Large Language Models
(2023)
• Arxiv
• 169 citations
Zhang et al.
-
A Watermark For Large Language Models
(2023)
• Arxiv
• 99 citations
Kirchenbauer et al.
-
Large Language Models Are State-of-the-art Evaluators Of Translation Quality
(2023)
• Arxiv
• 90 citations
Tom Kocmi, Christian Federmann
-
Gender Bias And Stereotypes In Large Language Models
(2023)
• Proceedings of The ACM Collective Intelligence Conference
• 167 citations
Hadas Kotek, Rikker Dockum, David Q. Sun
-
Videopoet: A Large Language Model For Zero-shot Video Generation
(2023)
• No Venue
Kondratyuk et al.
-
Vera: Vector-based Random Matrix Adaptation
(2023)
• No Venue
Dawid Jan Kopiczko, Tijmen Blankevoort, Yuki Markus Asano
-
Ai-generated Content (AIGC): A Survey
(2023)
• Arxiv
• 72 citations
Wu et al.
-
Bloomberggpt: A Large Language Model For Finance
(2023)
• Arxiv
• 258 citations
Wu et al.
-
Chatgpt: Beginning Of An End Of Manual Linguistic Data Annotation? Use Case Of Automatic Genre Identification
(2023)
• Arxiv
• 58 citations
Taja Kuzman, Igor Mozetič, Nikola Ljubešić
-
Openassistant Conversations -- Democratizing Large Language Model Alignment
(2023)
• Arxiv
• 83 citations
Köpf et al.
-
In-context Pretraining: Language Modeling Beyond Document Boundaries
(2023)
• No Venue
Shi et al.
-
Can Ai-generated Text Be Reliably Detected?
(2023)
• Arxiv
• 122 citations
Sadasivan et al.
-
In Chatgpt We Trust? Measuring And Characterizing The Reliability Of Chatgpt
(2023)
• Arxiv
• 55 citations
Shen et al.
-
S-lora: Serving Thousands Of Concurrent Lora Adapters
(2023)
• No Venue
Sheng et al.
-
Hugginggpt: Solving AI Tasks With Chatgpt And Its Friends In Hugging Face
(2023)
• Arxiv
• 208 citations
Shen et al.
-
Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback
(2023)
• No Venue
Shen et al.
-
From Words To Watts: Benchmarking The Energy Costs Of Large Language Model Inference
(2023)
• 2023 IEEE High Performance Extreme Computing Conference (HPEC)
• 59 citations
Samsi et al.
-
Whose Opinions Do Language Models Reflect?
(2023)
• Arxiv
• 74 citations
Santurkar et al.
-
Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws
(2023)
• No Venue
Nikhil Sardana, Jonathan Frankle
-
Diffusion Model Alignment Using Direct Preference Optimization
(2023)
• No Venue
Wallace et al.
-
Are Emergent Abilities Of Large Language Models A Mirage?
(2023)
• Arxiv
• 99 citations
Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
-
Thrilled By Your Progress! Large Language Models (GPT-4) No Longer Struggle To Pass Assessments In Higher Education Programming Courses
(2023)
• Proceedings of the 2023 ACM Conference on International Computing Education Research V.1
• 90 citations
Savelka et al.
-
Can Generative Pre-trained Transformers (GPT) Pass Assessments In Higher Education Programming Courses?
(2023)
• Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1
• 83 citations
Savelka et al.
-
Toolformer: Language Models Can Teach Themselves To Use Tools
(2023)
• Arxiv
• 245 citations
Schick et al.
-
GPT-RE: In-context Learning For Relation Extraction Using Large Language Models
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 89 citations
Wan et al.
-
An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation
(2023)
• IEEE Transactions on Software Engineering
• 100 citations
Schäfer et al.
-
A Picture Is Worth A Thousand Words: Principled Recaptioning Improves Image Generation
(2023)
• No Venue
Segalis et al.
-
Personality Traits In Large Language Models
(2023)
• Arxiv
• 66 citations
Serapio-García et al.
-
Automatic Prompt Optimization With "gradient Descent" And Beam Search
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 68 citations
Pryzant et al.
-
Gorilla: Large Language Model Connected With Massive Apis
(2023)
• Arxiv
• 60 citations
Patil et al.
-
The Refinedweb Dataset For Falcon LLM: Outperforming Curated Corpora With Web Data, And Web Data Only
(2023)
• No Venue
Penedo et al.
-
Instruction Tuning With GPT-4
(2023)
• Arxiv
• 166 citations
Peng et al.
-
The Impact Of AI On Developer Productivity: Evidence From Github Copilot
(2023)
• Arxiv
• 179 citations
Peng et al.
-
Check Your Facts And Try Again: Improving Large Language Models With External Knowledge And Automated Feedback
(2023)
• Arxiv
• 124 citations
Peng et al.
-
FP8-LM: Training FP8 Large Language Models
(2023)
• No Venue
Peng et al.
-
Towards Making The Most Of Chatgpt For Machine Translation
(2023)
• SSRN Electronic Journal
• 77 citations
Peng et al.
-
Kosmos-2: Grounding Multimodal Large Language Models To The World
(2023)
• No Venue
Peng et al.
-
A Study Of Generative Large Language Model For Medical Research And Healthcare
(2023)
• npj Digital Medicine
• 201 citations
Peng et al.
-
Yarn: Efficient Context Window Extension Of Large Language Models
(2023)
• No Venue
Peng et al.
-
LMDX: Language Model-based Document Information Extraction And Localization
(2023)
• No Venue
Perot et al.
-
One Wide Feedforward Is All You Need
(2023)
• No Venue
Pires et al.
-
SDXL: Improving Latent Diffusion Models For High-resolution Image Synthesis
(2023)
• No Venue
Podell et al.
-
The Robots Are Here: Navigating The Generative AI Revolution In Computing Education
(2023)
• Proceedings of the 2023 Working Group Reports on Innovation and Technology in Computer Science Education
• 182 citations
Prather et al.
-
Performance Of Chatgpt On The US Fundamentals Of Engineering Exam: Comprehensive Assessment Of Proficiency And Potential Implications For Professional Environmental Engineering Practice
(2023)
• Computers and Education: Artificial Intelligence
• 71 citations
Vinay Pursnani, Yusuf Sermet, Ibrahim Demir
-
Is Chatgpt A General-purpose Natural Language Processing Task Solver?
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 349 citations
Qin et al.
-
Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis
(2023)
• No Venue
Qin et al.
-
Revisiting Relation Extraction In The Era Of Large Language Models
(2023)
• Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 98 citations
Somin Wadhwa, Silvio Amir, Byron C. Wallace
-
Direct Preference Optimization: Your Language Model Is Secretly A Reward Model
(2023)
• Arxiv
• 147 citations
Rafailov et al.
-
Dreambooth3d: Subject-driven Text-to-3d Generation
(2023)
• 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
• 93 citations
Raj et al.
-
In-context Retrieval-augmented Language Models
(2023)
• Transactions of the Association for Computational Linguistics
• 150 citations
Ram et al.
-
Glamm: Pixel Grounding Large Multimodal Model
(2023)
• No Venue
Rasheed et al.
-
Hardware-aware Training For Large-scale And Diverse Deep Learning Inference Workloads Using In-memory Computing-based Accelerators
(2023)
• Nature Communications
• 66 citations
Rasch et al.
-
A Survey Of Hallucination In Large Foundation Models
(2023)
• Arxiv
• 66 citations
Vipula Rawte, Amit Sheth, Amitava Das
-
Kandinsky: An Improved Text-to-image Synthesis With Image Prior And Latent Diffusion
(2023)
• No Venue
Razzhigaev et al.
-
Testing The Reliability Of Chatgpt For Text Annotation And Classification: A Cautionary Remark
(2023)
• Arxiv
• 67 citations
Michael V. Reiss
-
Pdftriage: Question Answering Over Long, Structured Documents
(2023)
• No Venue
Saad-Falcon et al.
-
Sparq Attention: Bandwidth-efficient LLM Inference
(2023)
• No Venue
Ribar et al.
-
Code Llama: Open Foundation Models For Code
(2023)
• Arxiv
• 269 citations
Rozière et al.
-
Starvector: Generating Scalable Vector Graphics Code From Images
(2023)
• No Venue
Rodriguez et al.
-
FABRIC: Personalizing Diffusion Models With Iterative Feedback
(2023)
• No Venue
Rütte et al.
-
Audiopalm: A Large Language Model That Can Speak And Listen
(2023)
• No Venue
Rubenstein et al.
-
Hyperdreambooth: Hypernetworks For Fast Personalization Of Text-to-image Models
(2023)
• No Venue
Ruiz et al.
-
Larger Language Models Do In-context Learning Differently
(2023)
• Arxiv
• 87 citations
Wei et al.
-
Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review
(2023)
• Entropy
• 61 citations
Wong et al.
-
Multimodal Foundation Models: From Specialists To General-purpose Assistants
(2023)
• No Venue
Li et al.
-
BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models
(2023)
• Arxiv
• 644 citations
Li et al.
-
Multi-step Jailbreaking Privacy Attacks On Chatgpt
(2023)
• Findings of the Association for Computational Linguistics: EMNLP 2023
• 86 citations
Li et al.
-
Halueval: A Large-scale Hallucination Evaluation Benchmark For Large Language Models
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 119 citations
Li et al.
-
FLM-101B: An Open LLM And How To Train It With $100K Budget
(2023)
• No Venue
Li et al.
-
CAMEL: Communicative Agents For "mind" Exploration Of Large Language Model Society
(2023)
• Arxiv
• 60 citations
Li et al.
-
Chatdoctor: A Medical Chat Model Fine-tuned On A Large Language Model Meta-ai (llama) Using Medical Domain Knowledge
(2023)
• Cureus
• 276 citations
Li et al.
-
GLIGEN: Open-set Grounded Text-to-image Generation
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 261 citations
Li et al.
-
Making AI Less "thirsty": Uncovering And Addressing The Secret Water Footprint Of AI Models
(2023)
• Arxiv
• 88 citations
Li et al.
-
Instant3d: Fast Text-to-3d With Sparse-view Generation And Large Reconstruction Model
(2023)
• No Venue
Li et al.
-
JEN-1: Text-guided Universal Music Generation With Omnidirectional Diffusion Models
(2023)
• No Venue
Li et al.
-
Text Is All You Need: Learning Language Representations For Sequential Recommendation
(2023)
• Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
• 64 citations
Li et al.
-
Self-alignment With Instruction Backtranslation
(2023)
• No Venue
Li et al.
-
Otterhd: A High-resolution Multi-modality Model
(2023)
• No Venue
Li et al.
-
Photomaker: Customizing Realistic Human Photos Via Stacked ID Embedding
(2023)
• No Venue
Li et al.
-
Table-gpt: Table-tuned GPT For Diverse Table Tasks
(2023)
• No Venue
Li et al.
-
Stemgen: A Music Generation Model That Listens
(2023)
• No Venue
Parker et al.
-
GPT Detectors Are Biased Against Non-native English Writers
(2023)
• Patterns
• 246 citations
Liang et al.
-
A Prompt Pattern Catalog To Enhance Prompt Engineering With Chatgpt
(2023)
• Arxiv
• 518 citations
White et al.
-
System 2 Attention (is Something You Might Need Too)
(2023)
• No Venue
Jason Weston, Sainbayar Sukhbaatar
-
PMC-CLIP: Contrastive Language-image Pre-training Using Biomedical Documents
(2023)
• Lecture Notes in Computer Science
• 64 citations
Lin et al.
-
Learning To Model The World With Language
(2023)
• No Venue
Lin et al.
-
Text2motion: From Natural Language Instructions To Feasible Plans
(2023)
• Autonomous Robots
• 86 citations
Lin et al.
-
Videodirectorgpt: Consistent Multi-scene Video Generation Via Llm-guided Planning
(2023)
• No Venue
Lin et al.
-
Chatanything: Facetime Chat With Llm-enhanced Personas
(2023)
• No Venue
Zhao et al.
-
A Comprehensive Evaluation Of Chatgpt's Zero-shot Text-to-sql Capability
(2023)
• Arxiv
• 56 citations
Liu et al.
-
Audioldm 2: Learning Holistic Audio Generation With Self-supervised Pretraining
(2023)
• No Venue
Liu et al.
-
Lost In The Middle: How Language Models Use Long Contexts
(2023)
• No Venue
Liu et al.
-
Instaflow: One Step Is Enough For High-quality Diffusion-based Text-to-image Generation
(2023)
• No Venue
Liu et al.
-
Graphprompt: Unifying Pre-training And Downstream Tasks For Graph Neural Networks
(2023)
• Proceedings of the ACM Web Conference 2023
• 94 citations
Liu et al.
-
Evaluating The Logical Reasoning Ability Of Chatgpt And GPT-4
(2023)
• Arxiv
• 82 citations
Liu et al.
-
G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 265 citations
Liu et al.
-
Improved Baselines With Visual Instruction Tuning
(2023)
• No Venue
Liu et al.
-
LLM360: Towards Fully Transparent Open-source Llms
(2023)
• No Venue
Liu et al.
-
Jailbreaking Chatgpt Via Prompt Engineering: An Empirical Study
(2023)
• Arxiv
• 71 citations
Liu et al.
-
Llava-plus: Learning To Use Tools For Creating Multimodal Agents
(2023)
• No Venue
Liu et al.
-
Wavjourney: Compositional Audio Creation With Large Language Models
(2023)
• No Venue
Liu et al.
-
Visual Instruction Tuning
(2023)
• Arxiv
• 567 citations
Liu et al.
-
Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models
(2023)
• Meta-Radiology
• 482 citations
Liu et al.
-
One-2-3-45++: Fast Single Image To 3D Objects With Consistent Multi-view Generation And 3D Diffusion
(2023)
• No Venue
Liu et al.
-
Tinygsm: Achieving >80% On Gsm8k With Small Language Models
(2023)
• No Venue
Liu et al.
-
"what It Wants Me To Say": Bridging The Abstraction Gap Between End-user Programmers And Code-generating Large Language Models
(2023)
• Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
• 71 citations
Liu et al.
-
Chain-of-verification Reduces Hallucination In Large Language Models
(2023)
• No Venue
Dhuliawala et al.
-
The Flan Collection: Designing Data And Methods For Effective Instruction Tuning
(2023)
• Arxiv
• 76 citations
Longpre et al.
-
Can Chatgpt Forecast Stock Price Movements? Return Predictability And Large Language Models
(2023)
• SSRN Electronic Journal
• 269 citations
Alejandro Lopez-Lira, Yuehua Tang
-
Testing Of Detection Tools For Ai-generated Text
(2023)
• International Journal for Educational Integrity
• 179 citations
Weber-Wulff et al.
-
Chameleon: Plug-and-play Compositional Reasoning With Large Language Models
(2023)
• Arxiv
• 59 citations
Lu et al.
-
Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing
(2023)
• Journal of the Association for Information Science and Technology
• 524 citations
Lund et al.
-
Lcm-lora: A Universal Stable-diffusion Acceleration Module
(2023)
• No Venue
Luo et al.
-
Wavjourney: Compositional Audio Creation With Large Language Models
(2023)
• No Venue
Liu et al.
-
Full Parameter Fine-tuning For Large Language Models With Limited Resources
(2023)
• No Venue
Lv et al.
-
Fingpt: Large Generative Models For A Small Language
(2023)
• No Venue
Luukkonen et al.
-
Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding
(2023)
• No Venue
Wang et al.
-
Kosmos-2.5: A Multimodal Literate Model
(2023)
• No Venue
Lv et al.
-
SOLAR 10.7B: Scaling Large Language Models With Simple Yet Effective Depth Up-scaling
(2023)
• No Venue
Kim et al.
-
Towards Local Visual Modeling For Image Captioning
(2023)
• Pattern Recognition
• 74 citations
Ma et al.
-
Llm-pruner: On The Structural Pruning Of Large Language Models
(2023)
• Arxiv
• 61 citations
Xinyin Ma, Gongfan Fang, Xinchao Wang
-
Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning
(2023)
• Arxiv
• 230 citations
Dai et al.
-
Self-refine: Iterative Refinement With Self-feedback
(2023)
• Arxiv
• 162 citations
Madaan et al.
-
Neural Codec Language Models Are Zero-shot Text To Speech Synthesizers
(2023)
• Arxiv
• 144 citations
Wang et al.
-
Selfcheckgpt: Zero-resource Black-box Hallucination Detection For Generative Large Language Models
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 149 citations
Potsawee Manakul, Adian Liusie, Mark J. F. Gales
-
On The Robustness Of Code Generation Techniques: An Empirical Study On Github Copilot
(2023)
• 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE)
• 66 citations
Mastropaolo et al.
-
Document-level Machine Translation With Large Language Models
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 84 citations
Wang et al.
-
Huatuo: Tuning Llama Model With Chinese Medical Knowledge
(2023)
• Arxiv
• 74 citations
Wang et al.
-
On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective
(2023)
• Arxiv
• 73 citations
Wang et al.
-
Sources Of Hallucination By Large Language Models On Inference Tasks
(2023)
• Findings of the Association for Computational Linguistics: EMNLP 2023
• 79 citations
McKenna et al.
-
Robogen: Towards Unleashing Infinite Data For Automated Robot Learning Via Generative Simulation
(2023)
• No Venue
Wang et al.
-
Shepherd: A Critic For Language Model Generation
(2023)
• No Venue
Wang et al.
-
Improving Text Embeddings With Large Language Models
(2023)
• No Venue
Wang et al.
-
Reprompt: Automatic Prompt Editing To Refine Ai-generative Art Towards Precise Expressions
(2023)
• Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
• 68 citations
Yunlong Wang, Shuyuan Shen, Brian Y. Lim
-
Query2doc: Query Expansion With Large Language Models
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 74 citations
Liang Wang, Nan Yang, Furu Wei
-
Augmented Language Models: A Survey
(2023)
• Arxiv
• 126 citations
Mialon et al.
-
Anymal: An Efficient And Scalable Any-modality Augmented Language Model
(2023)
• No Venue
Moon et al.
-
JARVIS-1: Open-world Multi-task Agents With Memory-augmented Multimodal Language Models
(2023)
• No Venue
Wang et al.
-
Is Chatgpt A Good NLG Evaluator? A Preliminary Study
(2023)
• Proceedings of the 4th New Frontiers in Summarization Workshop
• 144 citations
Wang et al.
-
Factscore: Fine-grained Atomic Evaluation Of Factual Precision In Long Form Text Generation
(2023)
• Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
• 85 citations
Min et al.
-
LAVIE: High-quality Video Generation With Cascaded Latent Diffusion Models
(2023)
• No Venue
Wang et al.
-
Chatgpt Or Human? Detect And Explain. Explaining Decisions Of Machine Learning Model For Detecting Short Chatgpt-generated Text
(2023)
• Arxiv
• 66 citations
Sandra Mitrović, Davide Andreoletti, Omran Ayoub
-
Anymal: An Efficient And Scalable Any-modality Augmented Language Model
(2023)
• No Venue
Moon et al.
-
Query-dependent Video Representation For Moment Retrieval And Highlight Detection
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 55 citations
Moon et al.
-
GPT-NER: Named Entity Recognition Via Large Language Models
(2023)
• Arxiv
• 107 citations
Wang et al.
-
Dragondiffusion: Enabling Drag-style Manipulation On Diffusion Models
(2023)
• No Venue
Mou et al.
-
Imagedream: Image-prompt Multi-view Diffusion For 3D Generation
(2023)
• No Venue
Peng Wang, Yichun Shi
-
Bitnet: Scaling 1-bit Transformers For Large Language Models
(2023)
• No Venue
Wang et al.
-
Octopack: Instruction Tuning Code Large Language Models
(2023)
• No Venue
Muennighoff et al.
-
Orca: Progressive Learning From Complex Explanation Traces Of GPT-4
(2023)
• No Venue
Mukherjee et al.
-
Can Chatgpt Write A Good Boolean Query For Systematic Review Literature Search?
(2023)
• Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 143 citations
Wang et al.
-
Auditing Large Language Models: A Three-layered Approach
(2023)
• AI and Ethics
• 104 citations
Mökander et al.
-
A Comprehensive Overview Of Large Language Models
(2023)
• Arxiv
• 248 citations
Naveed et al.
-
Culturax: A Cleaned, Enormous, And Multilingual Dataset For Large Language Models In 167 Languages
(2023)
• No Venue
Nguyen et al.
-
Skeleton-of-thought: Large Language Models Can Do Parallel Decoding
(2023)
• No Venue
Ning et al.
-
Contrastive Decoding Improves Reasoning In Large Language Models
(2023)
• No Venue
Sean O'Brien, Mike Lewis
-
Can Generalist Foundation Models Outcompete Special-purpose Tuning? Case Study In Medicine
(2023)
• Arxiv
• 91 citations
Nori et al.
-
Capabilities Of GPT-4 On Medical Challenge Problems
(2023)
• Arxiv
• 396 citations
Nori et al.
-
Generative Agents: Interactive Simulacra Of Human Behavior
(2023)
• Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
• 637 citations
Park et al.
-
Learning Gain Differences Between Chatgpt And Human Tutor Generated Algebra Hints
(2023)
• Arxiv
• 64 citations
Zachary A. Pardos, Shreya Bhandari
-
Chatgpt Beyond English: Towards A Comprehensive Evaluation Of Large Language Models In Multilingual Learning
(2023)
• Findings of the Association for Computational Linguistics: EMNLP 2023
• 117 citations
Lai et al.
-
A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets
(2023)
• Findings of the Association for Computational Linguistics: ACL 2023
• 58 citations
Laskar et al.
-
Copy Is All You Need
(2023)
• No Venue
Lan et al.
-
Multimodal Large Language Models: A Survey
(2023)
• 2023 IEEE International Conference on Big Data (BigData)
• 69 citations
Wu et al.
-
OBELICS: An Open Web-scale Filtered Dataset Of Interleaved Image-text Documents
(2023)
• No Venue
Laurençon et al.
-
RLAIF: Scaling Reinforcement Learning From Human Feedback With AI Feedback
(2023)
• No Venue
Lee et al.
-
Hierspeech++: Bridging The Gap Between Semantic And Acoustic Representation Of Speech By Hierarchical Variational Inference For Zero-shot Speech Synthesis
(2023)
• No Venue
Lee et al.
-
Multimodal Prompting With Missing Modalities For Visual Recognition
(2023)
• 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 67 citations
Lee et al.
-
Red Teaming Language Models With Language Models
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 149 citations
Perez et al.
-
Ignore Previous Prompt: Attack Techniques For Language Models
(2022)
• Arxiv
• 55 citations
Fábio Perez, Ian Ribeiro
-
Lifting The Curse Of Multilinguality By Pre-training Modular Transformers
(2022)
• Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 56 citations
Pfeiffer et al.
-
Synchromesh: Reliable Code Generation From Pre-trained Language Models
(2022)
• Arxiv
• 57 citations
Poesia et al.
-
Diffusion-lm Improves Controllable Text Generation
(2022)
• Arxiv
• 208 citations
Li et al.
-
BLIP: Bootstrapping Language-image Pre-training For Unified Vision-language Understanding And Generation
(2022)
• Arxiv
• 559 citations
Li et al.
-
Automating Code Review Activities By Large-scale Pre-training
(2022)
• Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering
• 99 citations
Li et al.
-
Comprehending And Ordering Semantics For Image Captioning
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 89 citations
Li et al.
-
A Comparative Study Of Pretrained Language Models For Long Clinical Text
(2022)
• Journal of the American Medical Informatics Association
• 82 citations
Li et al.
-
Competition-level Code Generation With Alphacode
(2022)
• Science
• 531 citations
Li et al.
-
A Survey On Retrieval-augmented Text Generation
(2022)
• Arxiv
• 61 citations
Li et al.
-
Invariant Grounding For Video Question Answering
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 83 citations
Li et al.
-
Mplug: Effective And Efficient Vision-language Learning By Cross-modal Skip-connections
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 103 citations
Li et al.
-
BLOOM: A 176b-parameter Open-access Multilingual Language Model
(2022)
• Arxiv
• 632 citations
Workshop et al.
-
Model Soups: Averaging Weights Of Multiple Fine-tuned Models Improves Accuracy Without Increasing Inference Time
(2022)
• Arxiv
• 119 citations
Wortsman et al.
-
On-device Training Under 256KB Memory
(2022)
• Arxiv
• 69 citations
Lin et al.
-
Frozen CLIP Models Are Efficient Video Learners
(2022)
• Lecture Notes in Computer Science
• 107 citations
Lin et al.
-
Sequence-to-sequence Knowledge Graph Completion And Question Answering
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 90 citations
Apoorv Saxena, Adrian Kochsiek, Rainer Gemulla
-
Photorealistic Text-to-image Diffusion Models With Deep Language Understanding
(2022)
• Arxiv
• 1503 citations
Saharia et al.
-
LAION-5B: An Open Large-scale Dataset For Training Next Generation Image-text Models
(2022)
• Arxiv
• 651 citations
Schuhmann et al.
-
Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 78 citations
Yan Ling, Jianfei Yu, Rui Xia
-
Few-shot Parameter-efficient Fine-tuning Is Better And Cheaper Than In-context Learning
(2022)
• Arxiv
• 222 citations
Liu et al.
-
Compositional Visual Generation With Composable Diffusion Models
(2022)
• Lecture Notes in Computer Science
• 172 citations
Liu et al.
-
BRIO: Bringing Order To Abstractive Summarization
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 168 citations
Liu et al.
-
WANLI: Worker And AI Collaboration For Natural Language Inference Dataset Creation
(2022)
• Findings of the Association for Computational Linguistics: EMNLP 2022
• 99 citations
Liu et al.
-
Swin Transformer V2: Scaling Up Capacity And Resolution
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 1352 citations
Liu et al.
-
Opal: Multimodal Image Generation For News Illustration
(2022)
• Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology
• 71 citations
Vivian Liu, Han Qiao, Lydia Chilton
-
Automatic Generation Of Programming Exercises And Code Explanations Using Large Language Models
(2022)
• Proceedings of the 2022 ACM Conference on International Computing Education Research - Volume 1
• 316 citations
Sarsa et al.
-
Neural Theory-of-mind? On The Limits Of Social Intelligence In Large Lms
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 72 citations
Sap et al.
-
Chain-of-thought Prompting Elicits Reasoning In Large Language Models
(2022)
• Arxiv
• 2622 citations
Wei et al.
-
A Survey On In-context Learning
(2022)
• ACM Computing Surveys
• 101 citations
Dong et al.
-
Emergent Abilities Of Large Language Models
(2022)
• Arxiv
• 836 citations
Wei et al.
-
What Artificial Neural Networks Can Tell Us About Human Language Acquisition
(2022)
• Algebraic Structures in Natural Language
• 84 citations
Alex Warstadt, Samuel R. Bowman
-
Unified Structure Generation For Universal Information Extraction
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 275 citations
Lu et al.
-
Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering
(2022)
• Arxiv
• 121 citations
Lu et al.
-
Prompt Distribution Learning
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 136 citations
Lu et al.
-
A-OKVQA: A Benchmark For Visual Question Answering Using World Knowledge
(2022)
• Lecture Notes in Computer Science
• 108 citations
Schwenk et al.
-
Hierarchical Text-conditional Image Generation With CLIP Latents
(2022)
• Arxiv
• 1981 citations
Ramesh et al.
-
Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining
(2022)
• Briefings in Bioinformatics
• 585 citations
Luo et al.
-
End-to-end Generative Pretraining For Multimodal Video Captioning
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 119 citations
Seo et al.
-
Phenaki: Variable Length Video Generation From Open Domain Textual Description
(2022)
• Arxiv
• 72 citations
Villegas et al.
-
Prompt For Extraction? PAIE: Prompting Argument Interaction For Event Argument Extraction
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 88 citations
Ma et al.
-
Language Models Of Code Are Few-shot Commonsense Learners
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 67 citations
Madaan et al.
-
Multiconer: A Large-scale Multilingual Dataset For Complex Named Entity Recognition
(2022)
• Arxiv
• 78 citations
Malmasi et al.
-
A Very Preliminary Analysis Of DALL-E 2
(2022)
• Arxiv
• 80 citations
Gary Marcus, Ernest Davis, Scott Aaronson
-
Chartqa: A Benchmark For Question Answering About Charts With Visual And Logical Reasoning
(2022)
• Findings of the Association for Computational Linguistics: ACL 2022
• 87 citations
Masry et al.
-
End-to-end Transformer Based Model For Image Captioning
(2022)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 107 citations
Yiyu Wang, Jungang Xu, Yingfei Sun
-
Locating And Editing Factual Associations In GPT
(2022)
• Arxiv
• 160 citations
Meng et al.
-
Generating Training Data With Language Models: Towards Zero-shot Language Understanding
(2022)
• Arxiv
• 72 citations
Meng et al.
-
Rethinking The Role Of Demonstrations: What Makes In-context Learning Work?
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 432 citations
Min et al.
-
Numglue: A Suite Of Fundamental Yet Challenging Mathematical Reasoning Tasks
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 113 citations
Mishra et al.
-
Dualprompt: Complementary Prompting For Rehearsal-free Continual Learning
(2022)
• Lecture Notes in Computer Science
• 187 citations
Wang et al.
-
Pretraining Is All You Need For Image-to-image Translation
(2022)
• Arxiv
• 78 citations
Wang et al.
-
Self-consistency Improves Chain Of Thought Reasoning In Language Models
(2022)
• Arxiv
• 490 citations
Wang et al.
-
Text Embeddings By Weakly-supervised Contrastive Pre-training
(2022)
• Arxiv
• 77 citations
Wang et al.
-
Training Data Is More Valuable Than You Think: A Simple And Effective Method By Retrieving From Training Data
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 67 citations
Wang et al.
-
Super-naturalinstructions: Generalization Via Declarative Instructions On 1600+ NLP Tasks
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 174 citations
Wang et al.
-
Summareranker: A Multi-task Mixture-of-experts Re-ranking Framework For Abstractive Summarization
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 60 citations
Mathieu Ravaut, Shafiq Joty, Nancy F. Chen
-
No More Fine-tuning? An Experimental Evaluation Of Prompt Tuning In Code Intelligence
(2022)
• Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering
• 89 citations
Wang et al.
-
Multimodal Token Fusion For Vision Transformers
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 111 citations
Wang et al.
-
Promda: Prompt-based Data Augmentation For Low-resource NLU Tasks
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 55 citations
Wang et al.
-
Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks
(2022)
• Arxiv
• 142 citations
Wang et al.
-
GIT: A Generative Image-to-text Transformer For Vision And Language
(2022)
• Arxiv
• 178 citations
Wang et al.
-
Text And Code Embeddings By Contrastive Pre-training
(2022)
• Arxiv
• 111 citations
Neelakantan et al.
-
Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 94 citations
Jiapeng Wang, Lianwen Jin, Kai Ding
-
Expanding Language-image Pretrained Models For General Video Recognition
(2022)
• Lecture Notes in Computer Science
• 156 citations
Ni et al.
-
Codegen: An Open Large Language Model For Code With Multi-turn Program Synthesis
(2022)
• Arxiv
• 188 citations
Nijkamp et al.
-
In-context Learning And Induction Heads
(2022)
• Arxiv
• 62 citations
Olsson et al.
-
The Creativity Of Text-to-image Generation
(2022)
• Proceedings of the 25th International Academic Mindtrek Conference
• 179 citations
Jonas Oppenlaender
-
Training Language Models To Follow Instructions With Human Feedback
(2022)
• Arxiv
• 3008 citations
Ouyang et al.
-
UX Research On Conversational Human-ai Interaction: A Literature Review Of The ACM Digital Library
(2022)
• CHI Conference on Human Factors in Computing Systems
• 59 citations
Zheng et al.
-
Large Language Models Are Few-shot Clinical Information Extractors
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 199 citations
Agrawal et al.
-
Training A Helpful And Harmless Assistant With Reinforcement Learning From Human Feedback
(2022)
• Arxiv
• 257 citations
Bai et al.
-
GPT Takes The Bar Exam
(2022)
• SSRN Electronic Journal
• 96 citations
Michael Bommarito, Daniel Martin Katz
-
Lamda: Language Models For Dialog Applications
(2022)
• Arxiv
• 616 citations
Thoppilan et al.
-
Using Large Language Models To Simulate Multiple Humans And Replicate Human Subject Studies
(2022)
• Arxiv
• 74 citations
Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai
-
Chemberta-2: Towards Chemical Foundation Models
(2022)
• Arxiv
• 85 citations
Ahmad et al.
-
Few-shot Training Llms For Project-specific Code-summarization
(2022)
• Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering
• 122 citations
Toufique Ahmed, Premkumar Devanbu
-
Winoground: Probing Vision And Language Models For Visio-linguistic Compositionality
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 119 citations
Thrush et al.
-
Flamingo: A Visual Language Model For Few-shot Learning
(2022)
• Arxiv
• 798 citations
Alayrac et al.
-
Language Models As Agent Models
(2022)
• Findings of the Association for Computational Linguistics: EMNLP 2022
• 56 citations
Jacob Andreas
-
Coca: Contrastive Captioners Are Image-text Foundation Models
(2022)
• Arxiv
• 466 citations
Yu et al.
-
Motionclip: Exposing Human Motion Generation To CLIP Space
(2022)
• Lecture Notes in Computer Science
• 160 citations
Tevet et al.
-
Promptsource: An Integrated Development Environment And Repository For Natural Language Prompts
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 133 citations
Bach et al.
-
Generate Rather Than Retrieve: Large Language Models Are Strong Context Generators
(2022)
• Arxiv
• 63 citations
Yu et al.
-
Scaling Autoregressive Models For Content-rich Text-to-image Generation
(2022)
• Arxiv
• 282 citations
Yu et al.
-
Exploring Visual Prompts For Adapting Large-scale Models
(2022)
• Arxiv
• 100 citations
Bahng et al.
-
Ediff-i: Text-to-image Diffusion Models With An Ensemble Of Expert Denoisers
(2022)
• Arxiv
• 204 citations
Balaji et al.
-
Fine-tuning Language Models To Find Agreement Among Humans With Diverse Preferences
(2022)
• Arxiv
• 79 citations
Bakker et al.
-
Transformer-based Language Models For Software Vulnerability Detection
(2022)
• Proceedings of the 38th Annual Computer Security Applications Conference
• 61 citations
Thapa et al.
-
Text2live: Text-driven Layered Image And Video Editing
(2022)
• Lecture Notes in Computer Science
• 134 citations
Bar-Tal et al.
-
Mslam: Massively Multilingual Joint Pre-training For Speech And Text
(2022)
• Arxiv
• 57 citations
Bapna et al.
-
Scene Text Recognition With Permuted Autoregressive Sequence Models
(2022)
• Lecture Notes in Computer Science
• 141 citations
Darwin Bautista, Rowel Atienza
-
Gpt-neox-20b: An Open-source Autoregressive Language Model
(2022)
• Proceedings of BigScience Episode #5 -- Workshop on Challenges & Perspectives in Creating Large Language Models
• 297 citations
Black et al.
-
Making The Most Of Text Semantics To Improve Biomedical Vision--language Processing
(2022)
• Lecture Notes in Computer Science
• 111 citations
Boecking et al.
-
UL2: Unifying Language Learning Paradigms
(2022)
• Arxiv
• 83 citations
Tay et al.
-
Investigating Explainability Of Generative AI For Code Through Scenario-based Design
(2022)
• 27th International Conference on Intelligent User Interfaces
• 144 citations
Sun et al.
-
On The Explainability Of Natural Language Processing Deep Models
(2022)
• ACM Computing Surveys
• 71 citations
Julia El Zini, Mariette Awad
-
Efficient Few-shot Learning Without Prompts
(2022)
• Arxiv
• 82 citations
Tunstall et al.
-
Revisiting The "video" In Video-language Understanding
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 84 citations
Buch et al.
-
Natgen: Generative Pre-training By "naturalizing" Source Code
(2022)
• Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering
• 69 citations
Chakraborty et al.
-
Tweetnlp: Cutting-edge Natural Language Processing For Social Media
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
• 78 citations
Camacho-Collados et al.
-
Quantifying Memorization Across Neural Language Models
(2022)
• Arxiv
• 126 citations
Carlini et al.
-
Galactica: A Large Language Model For Science
(2022)
• Arxiv
• 214 citations
Taylor et al.
-
MAESTRO: Matched Speech Text Representations Through Modality Matching
(2022)
• Interspeech 2022
• 58 citations
Chen et al.
-
Codet: Code Generation With Generated Tests
(2022)
• Arxiv
• 62 citations
Chen et al.
-
Hybrid Transformer With Multi-level Fusion For Multimodal Knowledge Graph Completion
(2022)
• Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 121 citations
Chen et al.
-
Linkbert: Pretraining Language Models With Document Links
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 200 citations
Michihiro Yasunaga, Jure Leskovec, Percy Liang
-
Program Of Thoughts Prompting: Disentangling Computation From Reasoning For Numerical Reasoning Tasks
(2022)
• Arxiv
• 94 citations
Chen et al.
-
Pali: A Jointly-scaled Multilingual Language-image Model
(2022)
• Arxiv
• 157 citations
Chen et al.
-
Re3: Generating Longer Stories With Recursive Reprompting And Revision
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 57 citations
Yang et al.
-
Zerogen: Efficient Zero-shot Learning Via Dataset Generation
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 71 citations
Ye et al.
-
Deep Bidirectional Language-knowledge Graph Pretraining
(2022)
• Arxiv
• 65 citations
Yasunaga et al.
-
React: Synergizing Reasoning And Acting In Language Models
(2022)
• Arxiv
• 285 citations
Yao et al.
-
Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning
(2022)
• Arxiv
• 96 citations
Antonia Creswell, Murray Shanahan, Irina Higgins
-
Vista: Vision And Scene Text Aggregation For Cross-modal Retrieval
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 66 citations
Cheng et al.
-
Star: Bootstrapping Reasoning With Reasoning
(2022)
• Arxiv
• 100 citations
Zelikman et al.
-
Black-box Tuning For Language-model-as-a-service
(2022)
• Arxiv
• 55 citations
Sun et al.
-
Relationprompt: Leveraging Prompts To Generate Synthetic Data For Zero-shot Relation Triplet Extraction
(2022)
• Findings of the Association for Computational Linguistics: ACL 2022
• 69 citations
Chia et al.
-
Palm: Scaling Language Modeling With Pathways
(2022)
• Arxiv
• 1924 citations
Chowdhery et al.
-
Scaling Instruction-finetuned Language Models
(2022)
• Arxiv
• 1060 citations
Chung et al.
-
Biobart: Pretraining And Evaluation Of A Biomedical Generative Language Model
(2022)
• Proceedings of the 21st Workshop on Biomedical Language Processing
• 89 citations
Yuan et al.
-
VQGAN-CLIP: Open Domain Image Generation And Editing With Natural Language Guidance
(2022)
• Lecture Notes in Computer Science
• 201 citations
Crowson et al.
-
Interactive Model Cards: A Human-centered Approach To Model Documentation
(2022)
• 2022 ACM Conference on Fairness Accountability and Transparency
• 63 citations
Crisan et al.
-
Prototypical Verbalizer For Prompt-based Few-shot Tuning
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 60 citations
Cui et al.
-
Beyond Text Generation: Supporting Writers With Continuous Automatic Text Summaries
(2022)
• Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology
• 62 citations
Dang et al.
-
Flashattention: Fast And Memory-efficient Exact Attention With Io-awareness
(2022)
• Arxiv
• 310 citations
Dao et al.
-
MERLOT Reserve: Neural Script Knowledge Through Vision And Language And Sound
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 106 citations
Zellers et al.
-
Procthor: Large-scale Embodied AI Using Procedural Generation
(2022)
• Arxiv
• 63 citations
Deitke et al.
-
COLD: A Benchmark For Chinese Offensive Language Detection
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 57 citations
Deng et al.
-
Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 92 citations
Deng et al.
-
LST: Ladder Side-tuning For Parameter And Memory Efficient Transfer Learning
(2022)
• Arxiv
• 64 citations
Yi-Lin Sung, Jaemin Cho, Mohit Bansal
-
Chatgpt: The End Of Online Exam Integrity?
(2022)
• Arxiv
• 290 citations
Teo Susnjak
-
Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale
(2022)
• Arxiv
• 89 citations
Dettmers et al.
-
Delta Tuning: A Comprehensive Study Of Parameter Efficient Methods For Pre-trained Language Models
(2022)
• Arxiv
• 100 citations
Ding et al.
-
Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers
(2022)
• Arxiv
• 107 citations
Ding et al.
-
Zero-shot Video Question Answering Via Frozen Bidirectional Language Models
(2022)
• Arxiv
• 59 citations
Yang et al.
-
Mukea: Multimodal Knowledge Extraction And Accumulation For Knowledge-based Visual Question Answering
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 83 citations
Ding et al.
-
VLT: Vision-language Transformer And Query Generation For Referring Segmentation
(2022)
• IEEE Transactions on Pattern Analysis and Machine Intelligence
• 84 citations
Ding et al.
-
Vision-language Pre-training With Triple Contrastive Learning
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 196 citations
Yang et al.
-
Tableformer: Robust Transformer Modeling For Table-text Encoding
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 60 citations
Yang et al.
-
Improving Visual Grounding With Visual-linguistic Verification And Iterative Reasoning
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 90 citations
Yang et al.
-
A Survey Of Vision-language Pre-trained Models
(2022)
• Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence
• 80 citations
Du et al.
-
Translation Between Molecules And Natural Language
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 66 citations
Edwards et al.
-
On The Origin Of Hallucinations In Conversational Models: Is It The Datasets Or The Models?
(2022)
• Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 83 citations
Dziri et al.
-
Sequential Recommendation Via Stochastic Self-attention
(2022)
• Proceedings of the ACM Web Conference 2022
• 126 citations
Fan et al.
-
Socratic Models: Composing Zero-shot Multimodal Reasoning With Language
(2022)
• Arxiv
• 143 citations
Zeng et al.
-
STEMM: Self-learning With Speech-text Manifold Mixup For Speech Translation
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 67 citations
Fang et al.
-
Promptdet: Towards Open-vocabulary Detection Using Uncurated Images
(2022)
• Lecture Notes in Computer Science
• 91 citations
Feng et al.
-
GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers
(2022)
• Arxiv
• 97 citations
Frantar et al.
-
Incoder: A Generative Model For Code Infilling And Synthesis
(2022)
• Arxiv
• 115 citations
Fried et al.
-
Complexity-based Prompting For Multi-step Reasoning
(2022)
• Arxiv
• 65 citations
Fu et al.
-
An Image Is Worth One Word: Personalizing Text-to-image Generation Using Textual Inversion
(2022)
• Arxiv
• 333 citations
Gal et al.
-
Make-a-scene: Scene-based Text-to-image Generation With Human Priors
(2022)
• Lecture Notes in Computer Science
• 215 citations
Gafni et al.
-
PAL: Program-aided Language Models
(2022)
• Arxiv
• 97 citations
Gao et al.
-
GLM-130B: An Open Bilingual Pre-trained Model
(2022)
• Arxiv
• 256 citations
Zeng et al.
-
Transformer Feed-forward Layers Build Predictions By Promoting Concepts In The Vocabulary Space
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 70 citations
Geva et al.
-
Improving Alignment Of Dialogue Agents Via Targeted Human Judgements
(2022)
• Arxiv
• 114 citations
Glaese et al.
-
Diffuseq: Sequence To Sequence Text Generation With Diffusion Models
(2022)
• Arxiv
• 74 citations
Gong et al.
-
A Contrastive Framework For Neural Text Generation
(2022)
• Arxiv
• 81 citations
Su et al.
-
X-pool: Cross-modal Language-video Attention For Text-video Retrieval
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 128 citations
Gorti et al.
-
Neural Machine Translation For Low Resource Languages
(2022)
• ACM Computing Surveys
• 175 citations
Goyle et al.
-
News Summarization And Evaluation In The Era Of GPT-3
(2022)
• Arxiv
• 173 citations
Tanya Goyal, Junyi Jessy Li, Greg Durrett
-
Interactive And Visual Prompt Engineering For Ad-hoc Task Adaptation With Large Language Models
(2022)
• IEEE Transactions on Visualization and Computer Graphics
• 120 citations
Strobelt et al.
-
Xylayoutlm: Towards Layout-aware Multimodal Networks For Visually-rich Document Understanding
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 66 citations
Gu et al.
-
Unixcoder: Unified Cross-modal Pre-training For Code Representation
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 299 citations
Guo et al.
-
TM2T: Stochastic And Tokenized Modeling For The Reciprocal Generation Of 3D Human Motions And Texts
(2022)
• Lecture Notes in Computer Science
• 90 citations
Guo et al.
-
Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again
(2022)
• Findings of the Association for Computational Linguistics: EMNLP 2022
• 83 citations
Gutiérrez et al.
-
"I Think This Is The Most Disruptive Technology": Exploring Sentiments Of Chatgpt Early Adopters Using Twitter Data
(2022)
• Arxiv
• 167 citations
Haque et al.
-
A Systematic Evaluation Of Large Language Models Of Code
(2022)
• Proceedings of the 6th ACM SIGPLAN International Symposium on Machine Programming
• 320 citations
Xu et al.
-
Decoupled Side Information Fusion For Sequential Recommendation
(2022)
• Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 87 citations
Yueqi Xie, Peilin Zhou, Sunghun Kim
-
Toxigen: A Large-scale Machine-generated Dataset For Adversarial And Implicit Hate Speech Detection
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 130 citations
Hartvigsen et al.
-
Prompt-to-prompt Image Editing With Cross Attention Control
(2022)
• Arxiv
• 319 citations
Hertz et al.
-
Training Compute-optimal Large Language Models
(2022)
• Arxiv
• 470 citations
Hoffmann et al.
-
Conditional Prompt Learning For Vision-language Models
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 770 citations
Zhou et al.
-
Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers
(2022)
• Arxiv
• 102 citations
Hong et al.
-
Test-time Prompt Tuning For Zero-shot Generalization In Vision-language Models
(2022)
• Arxiv
• 76 citations
Shu et al.
-
Least-to-most Prompting Enables Complex Reasoning In Large Language Models
(2022)
• Arxiv
• 241 citations
Zhou et al.
-
An Information-theoretic Approach To Prompt Engineering Without Ground Truth Labels
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 83 citations
Sorensen et al.
-
Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models
(2022)
• Transactions on Machine Learning Research May/2022 https://openreview.net/forum?id=uyTL5Bvosj
• 441 citations
Srivastava et al.
-
Glipv2: Unifying Localization And Vision-language Understanding
(2022)
• Arxiv
• 91 citations
Zhang et al.
-
Are Large Pre-trained Language Models Leaking Your Personal Information?
(2022)
• Findings of the Association for Computational Linguistics: EMNLP 2022
• 56 citations
Jie Huang, Hanyin Shao, Kevin Chen-Chuan Chang
-
Subgraph Retrieval Enhanced Model For Multi-hop Knowledge Base Question Answering
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 59 citations
Zhang et al.
-
Inner Monologue: Embodied Reasoning Through Planning With Language Models
(2022)
• Arxiv
• 155 citations
Huang et al.
-
Language Models As Zero-shot Planners: Extracting Actionable Knowledge For Embodied Agents
(2022)
• Arxiv
• 119 citations
Huang et al.
-
Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking
(2022)
• Proceedings of the 30th ACM International Conference on Multimedia
• 279 citations
Huang et al.
-
Unifiedskg: Unifying And Multi-tasking Structured Knowledge Grounding With Text-to-text Language Models
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 186 citations
Xie et al.
-
OPT: Open Pre-trained Transformer Language Models
(2022)
• Arxiv
• 832 citations
Zhang et al.
-
Storybuddy: A Human-ai Collaborative Chatbot For Parent-child Interactive Storytelling With Flexible Parental Involvement
(2022)
• CHI Conference on Human Factors in Computing Systems
• 101 citations
Zhang et al.
-
CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 71 citations
Song et al.
-
OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization
(2022)
• Arxiv
• 78 citations
Iyer et al.
-
Atlas: Few-shot Learning With Retrieval Augmented Language Models
(2022)
• Arxiv
• 163 citations
Izacard et al.
-
Prompting GPT-3 To Be Reliable
(2022)
• Arxiv
• 62 citations
Si et al.
-
Achieving Reliable Human Assessment Of Open-domain Dialogue Systems
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 57 citations
Ji et al.
-
Survey Of Hallucination In Natural Language Generation
(2022)
• ACM Computing Surveys
• 1698 citations
Ji et al.
-
From Discrimination To Generation: Knowledge Graph Completion With Generative Transformer
(2022)
• Companion Proceedings of the Web Conference 2022
• 62 citations
Xie et al.
-
Visual Prompt Tuning
(2022)
• Lecture Notes in Computer Science
• 649 citations
Jia et al.
-
Promptbert: Improving BERT Sentence Embeddings With Prompts
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 117 citations
Jiang et al.
-
Wenet 2.0: More Productive End-to-end Speech Recognition Toolkit
(2022)
• Interspeech 2022
• 66 citations
Zhang et al.
-
Bailando: 3D Dance Generation By Actor-critic GPT With Choreographic Memory
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 106 citations
Siyao et al.
-
Using Pre-trained Models To Boost Code Review Automation
(2022)
• Proceedings of the 44th International Conference on Software Engineering
• 101 citations
Tufano et al.
-
Tip-adapter: Training-free Adaption Of CLIP For Few-shot Classification
(2022)
• Lecture Notes in Computer Science
• 148 citations
Zhang et al.
-
Make-a-video: Text-to-video Generation Without Text-video Data
(2022)
• Arxiv
• 244 citations
Singer et al.
-
Structured Pruning Learns Compact And Accurate Models
(2022)
• Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 80 citations
Mengzhou Xia, Zexuan Zhong, Danqi Chen
-
Smoothquant: Accurate And Efficient Post-training Quantization For Large Language Models
(2022)
• Arxiv
• 56 citations
Xiao et al.
-
Maieutic Prompting: Logically Consistent Reasoning With Recursive Explanations
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 56 citations
Jung et al.
-
Language Models (mostly) Know What They Know
(2022)
• Arxiv
• 129 citations
Kadavath et al.
-
What To Hide From Your Students: Attention-guided Masked Image Modeling
(2022)
• Lecture Notes in Computer Science
• 59 citations
Kakogeorgiou et al.
-
Large Language Models Struggle To Learn Long-tail Knowledge
(2022)
• Arxiv
• 56 citations
Kandpal et al.
-
Large Language Models Are Human-level Prompt Engineers
(2022)
• Arxiv
• 200 citations
Zhou et al.
-
Blenderbot 3: A Deployed Conversational Agent That Continually Learns To Responsibly Engage
(2022)
• Arxiv
• 87 citations
Shuster et al.
-
Active Example Selection For In-context Learning
(2022)
• Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
• 55 citations
Yiming Zhang, Shi Feng, Chenhao Tan
-
Automatic Code Documentation Generation Using GPT-3
(2022)
• Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering
• 61 citations
Junaed Younus Khan, Gias Uddin
-
Clip-mesh: Generating Textured Meshes From Text Using Pretrained Image-text Models
(2022)
• SIGGRAPH Asia 2022 Conference Papers
• 133 citations
Khalid et al.
-
Decomposed Prompting: A Modular Approach For Solving Complex Tasks
(2022)
• Arxiv
• 64 citations
Khot et al.
-
Large Language Models Are Zero-shot Reasoners
(2022)
• Arxiv
• 939 citations
Kojima et al.
-
Beyond A Pre-trained Object Detector: Cross-modal Textual And Visual Context For Image Captioning
(2022)
• 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 64 citations
Chia-Wen Kuo, Zsolt Kira
-
Can Language Models Learn From Explanations In Context?
(2022)
• Findings of the Association for Computational Linguistics: EMNLP 2022
• 91 citations
Lampinen et al.
-
An Efficiency Study For SPLADE Models
(2022)
• Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 57 citations
Carlos Lassance, Stéphane Clinchant
-
Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering
(2022)
• Arxiv
• 60 citations
Lazaridou et al.
-
Coderl: Mastering Code Generation Through Pretrained Models And Deep Reinforcement Learning
(2022)
• Arxiv
• 76 citations
Le et al.
-
Coauthor: Designing A Human-ai Collaborative Writing Dataset For Exploring Language Model Capabilities
(2022)
• CHI Conference on Human Factors in Computing Systems
• 241 citations
Mina Lee, Percy Liang, Qian Yang
-
Promptchainer: Chaining Large Language Model Prompts Through Visual Programming
(2022)
• CHI Conference on Human Factors in Computing Systems Extended Abstracts
• 122 citations
Wu et al.
-
Wav2clip: Learning Robust Audio Representations From CLIP
(2022)
• ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 114 citations
Wu et al.
-
A New Generation Of Perspective API: Efficient Multilingual Character-level Transformers
(2022)
• Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
• 97 citations
Lees et al.
-
Solving Quantitative Reasoning Problems With Language Models
(2022)
• Arxiv
• 238 citations
Lewkowycz et al.
-
Going Full-tilt Boogie On Document Understanding With Text-image-layout Transformer
(2021)
• Lecture Notes in Computer Science
• 117 citations
Powalski et al.
-
MAUVE: Measuring The Gap Between Neural Text And Human Text Using Divergence Frontiers
(2021)
• Arxiv
• 90 citations
Pillutla et al.
-
Fast Model Editing At Scale
(2021)
• Arxiv
• 74 citations
Mitchell et al.
-
Rocketqav2: A Joint Training Method For Dense Passage Retrieval And Passage Re-ranking
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 133 citations
Ren et al.
-
What To Pre-train On? Efficient Intermediate Task Selection
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 62 citations
Poth et al.
-
Clipcap: CLIP Prefix For Image Captioning
(2021)
• Arxiv
• 281 citations
Ron Mokady, Amir Hertz, Amit H. Bermano
-
Retrieval Augmented Code Generation And Summarization
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 57 citations
Parvez et al.
-
Mind The Style Of Text! Adversarial And Backdoor Attacks Based On Text Style Transfer
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 75 citations
Qi et al.
-
PICARD: Parsing Incrementally For Constrained Auto-regressive Decoding From Language Models
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 168 citations
Torsten Scholak, Nathan Schucher, Dzmitry Bahdanau
-
Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm
(2021)
• Arxiv
• 94 citations
Li et al.
-
Structext: Structured Text Understanding With Multi-modal Transformers
(2021)
• Proceedings of the 29th ACM International Conference on Multimedia
• 94 citations
Li et al.
-
Structurallm: Structural Pre-training For Form Understanding
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 76 citations
Li et al.
-
Are NLP Models Really Able To Solve Simple Math Word Problems?
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 129 citations
Arkil Patel, Satwik Bhattamishra, Navin Goyal
-
Systematic Review For Ai-based Language Learning Tools
(2021)
• Journal of Digital Contents Society
• 55 citations
Jin Ha Woo, Heeyoul Choi
-
Learning How To Ask: Querying Lms With Mixtures Of Soft Prompts
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 293 citations
Guanghui Qin, Jason Eisner
-
Questeval: Summarization Asks For Fact-based Evaluation
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 119 citations
Scialom et al.
-
Language Models Are Few-shot Multilingual Learners
(2021)
• Proceedings of the 1st Workshop on Multilingual Representation Learning
• 67 citations
Winata et al.
-
Simple Entity-centric Questions Challenge Dense Retrievers
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 70 citations
Sciavolino et al.
-
R-drop: Regularized Dropout For Neural Networks
(2021)
• Arxiv
• 233 citations
Liang et al.
-
Learning Transferable Visual Models From Natural Language Supervision
(2021)
• Arxiv
• 4204 citations
Radford et al.
-
Few-shot Learning With Multilingual Language Models
(2021)
• Arxiv
• 73 citations
Lin et al.
-
Traceability Transformed: Generating More Accurate Links With Pre-trained BERT Models
(2021)
• 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)
• 88 citations
Lin et al.
-
LAION-400M: Open Dataset Of Clip-filtered 400 Million Image-text Pairs
(2021)
• Arxiv
• 318 citations
Schuhmann et al.
-
CPTR: Full Transformer Network For Image Captioning
(2021)
• Arxiv
• 107 citations
Liu et al.
-
Competence-based Multimodal Curriculum Learning For Medical Report Generation
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 83 citations
Liu et al.
-
Dexperts: Decoding-time Controlled Text Generation With Experts And Anti-experts
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 118 citations
Liu et al.
-
P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks
(2021)
• Arxiv
• 252 citations
Liu et al.
-
Image Retrieval On Real-life Images With Pre-trained Vision-and-language Models
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 115 citations
Liu et al.
-
Visually Grounded Reasoning Across Languages And Cultures
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 71 citations
Liu et al.
-
Simcls: A Simple Framework For Contrastive Learning Of Abstractive Summarization
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
• 189 citations
Yixin Liu, Pengfei Liu
-
SLAKE: A Semantically-labeled Knowledge-enhanced Dataset For Medical Visual Question Answering
(2021)
• 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)
• 121 citations
Liu et al.
-
TAPEX: Table Pre-training Via Learning A Neural SQL Executor
(2021)
• Arxiv
• 82 citations
Liu et al.
-
Generating Datasets With Pretrained Language Models
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 100 citations
Timo Schick, Hinrich Schütze
-
Gender Bias In Machine Translation
(2021)
• Transactions of the Association for Computational Linguistics
• 126 citations
Savoldi et al.
-
How Many Data Points Is A Prompt Worth?
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 166 citations
Teven Le Scao, Alexander M. Rush
-
Ethical And Social Risks Of Harm From Language Models
(2021)
• Arxiv
• 303 citations
Weidinger et al.
-
Perfection Not Required? Human-ai Partnerships In Code Translation
(2021)
• 26th International Conference on Intelligent User Interfaces
• 93 citations
Weisz et al.
-
Finetuned Language Models Are Zero-shot Learners
(2021)
• Arxiv
• 722 citations
Wei et al.
-
Get Your Vitamin C! Robust Fact Verification With Contrastive Evidence
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 104 citations
Tal Schuster, Adam Fisch, Regina Barzilay
-
Multitask Prompted Training Enables Zero-shot Task Generalization
(2021)
• Arxiv
• 470 citations
Sanh et al.
-
Societal Biases In Language Generation: Progress And Challenges
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 89 citations
Sheng et al.
-
End-to-end Training Of Neural Retrievers For Open-domain Question Answering
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 55 citations
Sachan et al.
-
UNICORN On RAINBOW: A Universal Commonsense Reasoning Model On A New Multitask Benchmark
(2021)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 77 citations
Lourie et al.
-
Wangchanberta: Pretraining Transformer-based Thai Language Models
(2021)
• Arxiv
• 65 citations
Lowphansirikul et al.
-
Pretrained Transformers As Universal Computation Engines
(2021)
• Arxiv
• 93 citations
Lu et al.
-
Codexglue: A Machine Learning Benchmark Dataset For Code Understanding And Generation
(2021)
• Arxiv
• 384 citations
Lu et al.
-
Pre-train, Prompt, And Predict: A Systematic Survey Of Prompting Methods In Natural Language Processing
(2021)
• Proceedings of the 29th ACM International Conference on Multimedia
• 62 citations
Liu et al.
-
How Much Can CLIP Benefit Vision-and-language Tasks?
(2021)
• Arxiv
• 151 citations
Shen et al.
-
End-to-end Audio-visual Speech Recognition With Conformers
(2021)
• ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 166 citations
Pingchuan Ma, Stavros Petridis, Maja Pantic
-
Challenges In Detoxifying Language Models
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 70 citations
Welbl et al.
-
XTREME-R: Towards More Challenging And Nuanced Multilingual Evaluation
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 100 citations
Ruder et al.
-
DAE-GAN: Dynamic Aspect-aware GAN For Text-to-image Synthesis
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 92 citations
Ruan et al.
-
A Simple Recipe For Multilingual Grammatical Error Correction
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
• 99 citations
Rothe et al.
-
Compacter: Efficient Low-rank Hypercomplex Adapter Layers
(2021)
• Arxiv
• 159 citations
Rabeeh Karimi Mahabadi, James Henderson, Sebastian Ruder
-
Parameter-efficient Multi-task Fine-tuning For Transformers Via Shared Hypernetworks
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 115 citations
Mahabadi et al.
-
Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm
(2021)
• Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems
• 550 citations
Laria Reynolds, Kyle McDonell
-
Scientific Credibility Of Machine Translation Research: A Meta-evaluation Of 769 Papers
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 62 citations
Benjamin Marie, Atsushi Fujita, Raphael Rubino
-
Studying The Usage Of Text-to-text Transfer Transformer To Support Code-related Tasks
(2021)
• 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)
• 182 citations
Mastropaolo et al.
-
Applying Codebert For Automated Program Repair Of Java Simple Bugs
(2021)
• 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)
• 91 citations
Ehsan Mashhadi, Hadi Hemmati
-
Calibrate Before Use: Improving Few-shot Performance Of Language Models
(2021)
• Arxiv
• 343 citations
Zhao et al.
-
Entailment As Few-shot Learner
(2021)
• Arxiv
• 105 citations
Wang et al.
-
Adaptsum: Towards Low-resource Domain Adaptation For Abstractive Summarization
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 64 citations
Tiezheng Yu, Zihan Liu, Pascale Fung
-
Referring Transformer: A One-step Approach To Multi-task Visual Grounding
(2021)
• Arxiv
• 66 citations
Muchen Li, Leonid Sigal
-
Persistent Anti-muslim Bias In Large Language Models
(2021)
• Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society
• 327 citations
Abubakar Abid, Maheen Farooqi, James Zou
-
Muppet: Massive Multi-task Representations With Pre-finetuning
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 166 citations
Aghajanyan et al.
-
True Few-shot Learning With Language Models
(2021)
• Arxiv
• 184 citations
Ethan Perez, Douwe Kiela, Kyunghyun Cho
-
Unified Pre-training For Program Understanding And Generation
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 435 citations
Ahmad et al.
-
Episodic Transformer For Vision-and-language Navigation
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 98 citations
Alexander Pashevich, Cordelia Schmid, Chen Sun
-
Cotext: Multi-task Learning With Code-text Transformer
(2021)
• Proceedings of the 1st Workshop on Natural Language Processing for Programming (NLP4Prog 2021)
• 76 citations
Phan et al.
-
Bartscore: Evaluating Generated Text As Text Generation
(2021)
• Arxiv
• 291 citations
Weizhe Yuan, Graham Neubig, Pengfei Liu
-
BEIR: A Heterogenous Benchmark For Zero-shot Evaluation Of Information Retrieval Models
(2021)
• Arxiv
• 97 citations
Thakur et al.
-
Multi-grained Vision Language Pre-training: Aligning Texts With Visual Concepts
(2021)
• Arxiv
• 90 citations
Yan Zeng, Xinsong Zhang, Hang Li
-
Are Transformers More Robust Than Cnns?
(2021)
• Arxiv
• 88 citations
Bai et al.
-
Docformer: End-to-end Transformer For Document Understanding
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 197 citations
Appalaraju et al.
-
Ext5: Towards Extreme Multi-task Scaling For Transfer Learning
(2021)
• Arxiv
• 73 citations
Aribandi et al.
-
A General Language Assistant As A Laboratory For Alignment
(2021)
• Arxiv
• 89 citations
Askell et al.
-
Incorporating Convolution Designs Into Visual Transformers
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 453 citations
Yuan et al.
-
Program Synthesis With Large Language Models
(2021)
• Arxiv
• 225 citations
Austin et al.
-
Representing Numbers In NLP: A Survey And A Vision
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 70 citations
Thawani et al.
-
Vision Guided Generative Pre-trained Language Models For Multimodal Abstractive Summarization
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 58 citations
Yu et al.
-
Towards Facilitating Empathic Conversations In Online Mental Health Support: A Reinforcement Learning Approach
(2021)
• Proceedings of the Web Conference 2021
• 120 citations
Sharma et al.
-
Styleclip: Text-driven Manipulation Of Stylegan Imagery
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 754 citations
Patashnik et al.
-
Tokens-to-token Vit: Training Vision Transformers From Scratch On Imagenet
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 1749 citations
Yuan et al.
-
Vector-quantized Image Modeling With Improved VQGAN
(2021)
• Arxiv
• 88 citations
Yu et al.
-
Vlmo: Unified Vision-language Pre-training With Mixture-of-modality-experts
(2021)
• Arxiv
• 188 citations
Bao et al.
-
XLM-T: Multilingual Language Models In Twitter For Sentiment Analysis And Beyond
(2021)
• Arxiv
• 105 citations
Francesco Barbieri, Luis Espinosa Anke, Jose Camacho-Collados
-
Redditbias: A Real-world Resource For Bias Evaluation And Debiasing Of Conversational Language Models
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 83 citations
Barikeri et al.
-
Improving Question Answering Model Robustness With Synthetic Adversarial Data Generation
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 62 citations
Bartolo et al.
-
Data Expansion Using Back Translation And Paraphrasing For Hate Speech Detection
(2021)
• Online Social Networks and Media
• 63 citations
Djamila Romaissa Beddiar, Md Saroar Jahan, Mourad Oussalah
-
Keyword Transformer: A Self-attention Model For Keyword Spotting
(2021)
• Interspeech 2021
• 98 citations
Axel Berg, Mark O'Connor, Miguel Tairum Cruz
-
Joint Visual Semantic Reasoning: Multi-stage Decoder For Text Recognition
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 57 citations
Bhunia et al.
-
Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 111 citations
Yoo et al.
-
Topic-driven And Knowledge-aware Transformer For Dialogue Emotion Detection
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 109 citations
Zhu et al.
-
Random Feature Attention
(2021)
• Arxiv
• 132 citations
Peng et al.
-
Towards Improving Adversarial Training Of NLP Models
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 74 citations
Jin Yong Yoo, Yanjun Qi
-
Improving Language Models By Retrieving From Trillions Of Tokens
(2021)
• Arxiv
• 187 citations
Borgeaud et al.
-
What Will It Take To Fix Benchmarking In Natural Language Understanding?
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 86 citations
Samuel R. Bowman, George E. Dahl
-
Aligntransformer: Hierarchical Alignment Of Visual Regions And Disease Tags For Medical Report Generation
(2021)
• Lecture Notes in Computer Science
• 75 citations
You et al.
-
Indonlg: Benchmark And Resources For Evaluating Indonesian Natural Language Generation
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 61 citations
Cahyawijaya et al.
-
Knowledgeable Or Educated Guess? Revisiting Language Models As Knowledge Bases
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 68 citations
Cao et al.
-
Clip4caption: CLIP For Video Caption
(2021)
• Proceedings of the 29th ACM International Conference on Multimedia
• 95 citations
Tang et al.
-
Multieurlex -- A Multi-lingual And Multi-label Legal Document Classification Dataset For Zero-shot Cross-lingual Transfer
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 59 citations
Ilias Chalkidis, Manos Fergadiotis, Ion Androutsopoulos
-
Charformer: Fast Character Transformers Via Gradient-based Subword Tokenization
(2021)
• Arxiv
• 71 citations
Tay et al.
-
Speechstew: Simply Mix All Available Speech Recognition Data To Train One Large Neural Network
(2021)
• Arxiv
• 70 citations
Chan et al.
-
Conceptual 12M: Pushing Web-scale Image-text Pre-training To Recognize Long-tail Visual Concepts
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 428 citations
Changpinyo et al.
-
Retrieving And Reading: A Comprehensive Survey On Open-domain Question Answering
(2021)
• Arxiv
• 147 citations
Zhu et al.
-
Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 182 citations
Hila Chefer, Shir Gur, Lior Wolf
-
Syncobert: Syntax-guided Multi-modal Contrastive Pre-training For Code Representation
(2021)
• Arxiv
• 60 citations
Wang et al.
-
Dialogsum: A Real-life Scenario Dialogue Summarization Dataset
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 105 citations
Chen et al.
-
Decision Transformer: Reinforcement Learning Via Sequence Modeling
(2021)
• Arxiv
• 351 citations
Chen et al.
-
Semantic And Syntactic Enhanced Aspect Sentiment Triplet Extraction
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 55 citations
Chen et al.
-
Geoqa: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 56 citations
Chen et al.
-
Evaluating Large Language Models Trained On Code
(2021)
• Arxiv
• 1258 citations
Chen et al.
-
Industry Scale Semi-supervised Learning For Natural Language Understanding
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers
• 65 citations
Chen et al.
-
Factual Probing Is [MASK]: Learning Vs. Learning To Recall
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 216 citations
Zexuan Zhong, Dan Friedman, Danqi Chen
-
Temporal Meta-path Guided Explainable Recommendation
(2021)
• Proceedings of the 14th ACM International Conference on Web Search and Data Mining
• 83 citations
Chen et al.
-
Improving Speech Translation By Understanding And Learning From The Auxiliary Text Translation Task
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 67 citations
Tang et al.
-
Scifive: A Text-to-text Transformer Model For Biomedical Literature
(2021)
• Arxiv
• 80 citations
Phan et al.
-
Crossfit: A Few-shot Learning Challenge For Cross-task Generalization In NLP
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 96 citations
Qinyuan Ye, Bill Yuchen Lin, Xiang Ren
-
Slot Self-attentive Dialogue State Tracking
(2021)
• Proceedings of the Web Conference 2021
• 62 citations
Ye et al.
-
QA-GNN: Reasoning With Language Models And Knowledge Graphs For Question Answering
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 355 citations
Yasunaga et al.
-
Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 89 citations
Zhong et al.
-
Visualmrc: Machine Reading Comprehension On Document Images
(2021)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 57 citations
Ryota Tanaka, Kyosuke Nishida, Sen Yoshida
-
Improving And Simplifying Pattern Exploiting Training
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 91 citations
Tam et al.
-
Structured Scene Memory For Vision-language Navigation
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 71 citations
Wang et al.
-
The Curious Case Of Hallucinations In Neural Machine Translation
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 91 citations
Vikas Raunak, Arul Menezes, Marcin Junczys-Dowmunt
-
Medically Aware GPT-3 As A Data Generator For Medical Dialogue Summarization
(2021)
• Proceedings of the Second Workshop on Natural Language Processing for Medical Conversations
• 102 citations
Chintagunta et al.
-
Unifying Vision-and-language Tasks Via Text Generation
(2021)
• Arxiv
• 149 citations
Cho et al.
-
Evaluation Of BERT And ALBERT Sentence Embedding Performance On Downstream NLP Tasks
(2021)
• 2020 25th International Conference on Pattern Recognition (ICPR)
• 95 citations
Choi et al.
-
FILIP: Fine-grained Interactive Language-image Pre-training
(2021)
• Arxiv
• 168 citations
Yao et al.
-
CPT: Colorful Prompt Tuning For Pre-trained Vision-language Models
(2021)
• Arxiv
• 86 citations
Yao et al.
-
W2v-bert: Combining Contrastive Learning And Masked Language Modeling For Self-supervised Speech Pre-training
(2021)
• 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 191 citations
Chung et al.
-
An Empirical Study On The Usage Of BERT Models For Code Completion
(2021)
• 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)
• 59 citations
Ciniselli et al.
-
Training Verifiers To Solve Math Word Problems
(2021)
• Arxiv
• 420 citations
Cobbe et al.
-
Overview Of The TREC 2020 Deep Learning Track
(2021)
• Arxiv
• 129 citations
Craswell et al.
-
Structured Prediction As Translation Between Augmented Natural Languages
(2021)
• International Conference on Learning Representations (ICLR) 2021
• 143 citations
Paolini et al.
-
TEACHTEXT: Crossmodal Generalized Distillation For Text-video Retrieval
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 109 citations
Croitoru et al.
-
Semeval-2021 Task 6: Detection Of Persuasion Techniques In Texts And Images
(2021)
• Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)
• 74 citations
Dimitrov et al.
-
Explaining Answers With Entailment Trees
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 66 citations
Dalvi et al.
-
Editing Factual Knowledge In Language Models
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 109 citations
Nicola de Cao, Wilker Aziz, Ivan Titov
-
MERLOT: Multimodal Neural Script Knowledge Models
(2021)
• Arxiv
• 149 citations
Zellers et al.
-
Unified Conversational Recommendation Policy Learning Via Graph-based Reinforcement Learning
(2021)
• Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 106 citations
Deng et al.
-
CLINE: Contrastive Learning With Semantic Negative Examples For Natural Language Understanding
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 80 citations
Wang et al.
-
Cogview: Mastering Text-to-image Generation Via Transformers
(2021)
• Arxiv
• 271 citations
Ding et al.
-
Trankit: A Light-weight Transformer-based Toolkit For Multilingual Natural Language Processing
(2021)
• Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
• 90 citations
Nguyen et al.
-
Contrastive Learning For Many-to-many Multilingual Neural Machine Translation
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 122 citations
Pan et al.
-
Multiple Meta-model Quantifying For Medical Visual Question Answering
(2021)
• Lecture Notes in Computer Science
• 84 citations
Do et al.
-
Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 734 citations
Wang et al.
-
Rpbert: A Text-image Relation Propagation-based BERT Model For Multimodal NER
(2021)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 118 citations
Sun et al.
-
Taco: Token-aware Cascade Contrastive Learning For Video-text Alignment
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 87 citations
Jianwei Yang, Yonatan Bisk, Jianfeng Gao
-
Glam: Efficient Scaling Of Language Models With Mixture-of-experts
(2021)
• Arxiv
• 97 citations
Du et al.
-
Towards Interpreting And Mitigating Shortcut Learning Behavior Of NLU Models
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 60 citations
Du et al.
-
SUPERB: Speech Processing Universal Performance Benchmark
(2021)
• Interspeech 2021
• 474 citations
Yang et al.
-
Neural Path Hunter: Reducing Hallucination In Dialogue Systems Via Path Grounding
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 61 citations
Dziri et al.
-
FUDGE: Controlled Text Generation With Future Discriminators
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 108 citations
Kevin Yang, Dan Klein
-
Causal Attention For Vision-language Tasks
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 109 citations
Yang et al.
-
Measuring And Improving Consistency In Pretrained Language Models
(2021)
• Transactions of the Association for Computational Linguistics
• 144 citations
Elazar et al.
-
Show Your Work: Scratchpads For Intermediate Computation With Language Models
(2021)
• Arxiv
• 123 citations
Nye et al.
-
Continuous-time Sequential Recommendation With Temporal Graph Collaborative Transformer
(2021)
• Proceedings of the 30th ACM International Conference on Information & Knowledge Management
• 142 citations
Fan et al.
-
Mediasum: A Large-scale Media Interview Dataset For Dialogue Summarization
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 83 citations
Zhu et al.
-
Progressive Transformer-based Generation Of Radiology Reports
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 73 citations
Nooralahzadeh et al.
-
Switch Transformers: Scaling To Trillion Parameter Models With Simple And Efficient Sparsity
(2021)
• Arxiv
• 656 citations
William Fedus, Barret Zoph, Noam Shazeer
-
Language Model As An Annotator: Exploring Dialogpt For Dialogue Summarization
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 63 citations
Feng et al.
-
A Survey Of Data Augmentation Approaches For NLP
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 411 citations
Feng et al.
-
Clipdraw: Exploring Text-to-drawing Synthesis Through Language-image Encoders
(2021)
• Arxiv
• 79 citations
Kevin Frans, L. B. Soros, Olaf Witkowski
-
Adversarial Text-to-image Synthesis: A Review
(2021)
• Neural Networks
• 162 citations
Frolov et al.
-
VIOLET : End-to-end Video-language Transformers With Masked Visual-token Modeling
(2021)
• Arxiv
• 88 citations
Fu et al.
-
Chinesebert: Chinese Pretraining Enhanced By Glyph And Pinyin Information
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 152 citations
Sun et al.
-
Consert: A Contrastive Framework For Self-supervised Sentence Representation Transfer
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 424 citations
Yan et al.
-
Stylegan-nada: Clip-guided Domain Adaptation Of Image Generators
(2021)
• Arxiv
• 57 citations
Gal et al.
-
Condenser: A Pre-training Architecture For Dense Retrieval
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 133 citations
Luyu Gao, Jamie Callan
-
Advances And Challenges In Conversational Recommender Systems: A Survey
(2021)
• AI Open
• 231 citations
Gao et al.
-
Rethink Training Of BERT Rerankers In Multi-stage Retrieval Pipeline
(2021)
• Lecture Notes in Computer Science
• 67 citations
Luyu Gao, Zhuyun Dai, Jamie Callan
-
Videogpt: Video Generation Using VQ-VAE And Transformers
(2021)
• Arxiv
• 139 citations
Yan et al.
-
Did Aristotle Use A Laptop? A Question Answering Benchmark With Implicit Reasoning Strategies
(2021)
• Arxiv
• 63 citations
Geva et al.
-
Synthesis Of Compositional Animations From Textual Descriptions
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 112 citations
Ghosh et al.
-
Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 66 citations
Mozhdeh Gheini, Xiang Ren, Jonathan May
-
Screen2words: Automatic Mobile UI Summarization With Multimodal Learning
(2021)
• The 34th Annual ACM Symposium on User Interface Software and Technology
• 70 citations
Wang et al.
-
Want To Reduce Labeling Cost? GPT-3 Can Help
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 101 citations
Wang et al.
-
Larger-scale Transformers For Multilingual Masked Language Modeling
(2021)
• Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021)
• 65 citations
Goyal et al.
-
The FLORES-101 Evaluation Benchmark For Low-resource And Multilingual Machine Translation
(2021)
• Arxiv
• 83 citations
Goyal et al.
-
Kaleido-bert: Vision-language Pre-training On Fashion Domain
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 99 citations
Zhuge et al.
-
Simvlm: Simple Visual Language Model Pretraining With Weak Supervision
(2021)
• Arxiv
• 310 citations
Wang et al.
-
Textflint: Unified Multilingual Robustness Evaluation Toolkit For Natural Language Processing
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations
• 76 citations
Gui et al.
-
Airbert: In-domain Pretraining For Vision-and-language Navigation
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 91 citations
Guhur et al.
-
Gradient-based Adversarial Attacks Against Text Transformers
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 55 citations
Guo et al.
-
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation
(2021)
• Arxiv
• 170 citations
Sun et al.
-
Maria: Spanish Language Models
(2021)
• Procesamiento del Lenguaje Natural v. 68 p. 39-60 mar. 2022. ISSN 1989-7553
• 56 citations
Gutiérrez-Fandiño et al.
-
Webgpt: Browser-assisted Question-answering With Human Feedback
(2021)
• Arxiv
• 193 citations
Nakano et al.
-
Retrieval Augmentation Reduces Hallucination In Conversation
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 290 citations
Shuster et al.
-
CPM-2: Large-scale Cost-effective Pre-trained Language Models
(2021)
• AI Open
• 85 citations
Zhang et al.
-
Bertese: Learning To Speak To BERT
(2021)
• Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
• 66 citations
Adi Haviv, Jonathan Berant, Amir Globerson
-
WARP: Word-level Adversarial Reprogramming
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 219 citations
Karen Hambardzumyan, Hrant Khachatrian, Jonathan May
-
Stacked Acoustic-and-textual Encoding: Integrating The Pre-trained Models Into Speech Translation Encoders
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 60 citations
Xu et al.
-
Greedy Gradient Ensemble For Robust Visual Question Answering
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 65 citations
Han et al.
-
VLM: Task-agnostic Video-language Model Pre-training For Video Understanding
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 70 citations
Xu et al.
-
Multi-task Pre-training For Plug-and-play Task-oriented Dialogue System
(2021)
• Arxiv
• 57 citations
Su et al.
-
Xl-sum: Large-scale Multilingual Abstractive Summarization For 44 Languages
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 158 citations
Hasan et al.
-
Escaping The Big Data Paradigm With Compact Transformers
(2021)
• Arxiv
• 272 citations
Hassani et al.
-
Raise A Child In Large Language Model: Towards Effective And Generalizable Fine-tuning
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 107 citations
Xu et al.
-
Towards A Unified View Of Parameter-efficient Transfer Learning
(2021)
• Arxiv
• 257 citations
He et al.
-
The Stem Cell Hypothesis: Dilemma Behind Multi-task Learning With Transformer Encoders
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 67 citations
Han He, Jinho D. Choi
-
Debertav3: Improving Deberta Using Electra-style Pre-training With Gradient-disentangled Embedding Sharing
(2021)
• Arxiv
• 350 citations
Pengcheng He, Jianfeng Gao, Weizhu Chen
-
On The Effectiveness Of Adapter-based Tuning For Pretrained Language Model Adaptation
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 98 citations
He et al.
-
E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 75 citations
Xu et al.
-
Entity Structure Within And Throughout: Modeling Mention Dependencies For Document-level Relation Extraction
(2021)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 144 citations
Xu et al.
-
Pretrained Language Models For Text Generation: A Survey
(2021)
• Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
• 102 citations
Li et al.
-
Probing Image-language Transformers For Verb Understanding
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 56 citations
Lisa Anne Hendricks, Aida Nematzadeh
-
CUAD: An Expert-annotated NLP Dataset For Legal Contract Review
(2021)
• Arxiv
• 91 citations
Hendrycks et al.
-
Open Domain Question Answering Over Tables Via Dense Retrieval
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 57 citations
Herzig et al.
-
Clipscore: A Reference-free Evaluation Metric For Image Captioning
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 502 citations
Hessel et al.
-
Efficiently Teaching An Effective Dense Retriever With Balanced Topic Aware Sampling
(2021)
• Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 241 citations
Hofstätter et al.
-
Compound Word Transformer: Learning To Compose Full-song Music Over Dynamic Directed Hypergraphs
(2021)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 109 citations
Hsiao et al.
-
Retagnn: Relational Temporal Attentive Graph Neural Networks For Holistic Sequential Recommendation
(2021)
• Proceedings of the Web Conference 2021
• 74 citations
Cheng Hsu, Cheng-Te Li
-
Lora: Low-rank Adaptation Of Large Language Models
(2021)
• Arxiv
• 1780 citations
Hu et al.
-
Unit: Multimodal Multitask Learning With A Unified Transformer
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 213 citations
Ronghang Hu, Amanpreet Singh
-
WIT: Wikipedia-based Image Text Dataset For Multimodal Multilingual Machine Learning
(2021)
• Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 134 citations
Srinivasan et al.
-
Increasing Faithfulness In Knowledge-grounded Dialogue With Controllable Features
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 63 citations
Rashkin et al.
-
Seeing Out Of The Box: End-to-end Pre-training For Vision-language Representation Learning
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 186 citations
Huang et al.
-
Efficient Attentions For Long Document Summarization
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 108 citations
Huang et al.
-
Multimodal Few-shot Learning With Frozen Language Models
(2021)
• Arxiv
• 252 citations
Tsimpoukelli et al.
-
Machine Translationese: Effects Of Algorithmic Bias On Linguistic Complexity In Machine Translation
(2021)
• Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
• 61 citations
Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam
-
Vinvl: Revisiting Visual Representations In Vision-language Models
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 690 citations
Zhang et al.
-
Tip-adapter: Training-free Clip-adapter For Better Vision-language Modeling
(2021)
• Arxiv
• 113 citations
Zhang et al.
-
Bob: BERT Over BERT For Training Persona-based Dialogue Models From Limited Personalized Data
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 82 citations
Song et al.
-
Wenlan: Bridging Vision And Language By Large-scale Multi-modal Pre-training
(2021)
• Arxiv
• 78 citations
Huo et al.
-
Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
(2021)
• Arxiv
• 65 citations
Zhang et al.
-
TABBIE: Pretrained Representations Of Tabular Data
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 100 citations
Iida et al.
-
COCO-LM: Correcting And Contrasting Text Sequences For Language Model Pretraining
(2021)
• Arxiv
• 121 citations
Meng et al.
-
Cross-modal Contrastive Learning For Text-to-image Generation
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 261 citations
Zhang et al.
-
How To Train BERT With An Academic Budget
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 70 citations
Peter Izsak, Moshe Berchansky, Omer Levy
-
Perceiver IO: A General Architecture For Structured Inputs & Outputs
(2021)
• Arxiv
• 172 citations
Jaegle et al.
-
Process For Adapting Language Models To Society (PALMS) With Values-targeted Datasets
(2021)
• Arxiv
• 74 citations
Irene Solaiman, Christy Dennison
-
Do Transformer Modifications Transfer Across Implementations And Applications?
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 69 citations
Narang et al.
-
Masked Language Modeling And The Distributional Hypothesis: Order Word Matters Pre-training For Little
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 166 citations
Sinha et al.
-
An Explanation Of In-context Learning As Implicit Bayesian Inference
(2021)
• Arxiv
• 93 citations
Xie et al.
-
Mentalbert: Publicly Available Pretrained Language Models For Mental Healthcare
(2021)
• Proceedings of the Language Resources and Evaluation Conference (LREC) 2022
• 103 citations
Ji et al.
-
Does The Magic Of BERT Apply To Medical Code Assignment? A Quantitative Study
(2021)
• Computers in Biology and Medicine
• 72 citations
Shaoxiong Ji, Matti Hölttä, Pekka Marttinen
-
Efficient Large-scale Language Model Training On GPU Clusters Using Megatron-lm
(2021)
• Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
• 270 citations
Narayanan et al.
-
Scaling Up Visual And Vision-language Representation Learning With Noisy Text Supervision
(2021)
• International Conference on Machine Learning 2021
• 723 citations
Jia et al.
-
Complex Temporal Question Answering On Knowledge Graphs
(2021)
• Proceedings of the 30th ACM International Conference on Information & Knowledge Management
• 73 citations
Jia et al.
-
CURE: Code-aware Neural Machine Translation For Automatic Program Repair
(2021)
• 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)
• 204 citations
Nan Jiang, Thibaud Lutellier, Lin Tan
-
Planning With Learned Entity Prompts For Abstractive Summarization
(2021)
• Transactions of the Association for Computational Linguistics
• 80 citations
Narayan et al.
-
Lawformer: A Pre-trained Language Model For Chinese Legal Long Documents
(2021)
• AI Open
• 167 citations
Xiao et al.
-
AMMUS : A Survey Of Transformer-based Pretrained Models In Natural Language Processing
(2021)
• Arxiv
• 140 citations
Katikapalli Subramanyam Kalyan, Ajit Rajasekharan, Sivanesan Sangeetha
-
Clip-it! Language-guided Video Summarization
(2021)
• Thirty-Fifth Conference on Neural Information Processing Systems. 2021
• 66 citations
Medhini Narasimhan, Anna Rohrbach, Trevor Darrell
-
AMMU : A Survey Of Transformer-based Biomedical Pretrained Language Models
(2021)
• Journal of Biomedical Informatics
• 213 citations
Katikapalli Subramanyam Kalyan, Ajit Rajasekharan, Sivanesan Sangeetha
-
MDETR -- Modulated Detection For End-to-end Multi-modal Understanding
(2021)
• 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
• 481 citations
Kamath et al.
-
Debiasing Pre-trained Contextualised Embeddings
(2021)
• Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
• 82 citations
Masahiro Kaneko, Danushka Bollegala
-
The NLP Cookbook: Modern Recipes For Transformer Based Deep Learning Architectures
(2021)
• IEEE Access
• 117 citations
Sushant Singh, Ausif Mahmood
-
Textocr: Towards Large-scale End-to-end Reasoning For Arbitrary-shaped Scene Text
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 99 citations
Singh et al.
-
Multilingual LAMA: Investigating Knowledge In Multilingual Pretrained Language Models
(2021)
• Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
• 73 citations
Nora Kassner, Philipp Dufter, Hinrich Schütze
-
Jointgt: Graph-text Joint Representation Learning For Text Generation From Knowledge Graphs
(2021)
• Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
• 65 citations
Ke et al.
-
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation
(2021)
• Proceedings of the 29th ACM International Conference on Multimedia
• 114 citations
Zaid Khan, Yun Fu
-
Muril: Multilingual Representations For Indian Languages
(2021)
• Arxiv
• 139 citations
Khanuja et al.
-
MMBERT: Multimodal BERT Pretraining For Improved Medical VQA
(2021)
• 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)
• 96 citations
Khare et al.
-
Zero-shot Text-to-image Generation
(2021)
• Arxiv
• 1083 citations
Ramesh et al.
-
Dynabench: Rethinking Benchmarking In NLP
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 187 citations
Kiela et al.
-
I-BERT: Integer-only BERT Quantization
(2021)
• ICML 2021 (Oral)
• 91 citations
Kim et al.
-
What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 62 citations
Kim et al.
-
Self-guided Contrastive Learning For BERT Sentence Representations
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 153 citations
Taeuk Kim, Kang Min Yoo, Sang-Goo Lee
-
Vilt: Vision-and-language Transformer Without Convolution Or Region Supervision
(2021)
• Arxiv
• 425 citations
Wonjae Kim, Bokyung Son, Ildoo Kim
-
Prefix-tuning: Optimizing Continuous Prompts For Generation
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 1671 citations
Xiang Lisa Li, Percy Liang
-
Coreference Resolution Without Span Representations
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)
• 61 citations
Yuval Kirstain, Ori Ram, Omer Levy
-
Spark NLP: Natural Language Understanding At Scale
(2021)
• Software Impacts
• 57 citations
Veysel Kocaman, David Talby
-
Hurdles To Progress In Long-form Question Answering
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 103 citations
Kalpesh Krishna, Aurko Roy, Mohit Iyyer
-
Block Pruning For Faster Transformers
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 84 citations
Lagunas et al.
-
Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 73 citations
Sun et al.
-
Generative Spoken Language Modeling From Raw Audio
(2021)
• Arxiv
• 128 citations
Lakhotia et al.
-
Constrained Language Models Yield Few-shot Semantic Parsers
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 120 citations
Shin et al.
-
Few-shot Question Answering By Pretraining Span Selection
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 70 citations
Ram et al.
-
Sustainable Modular Debiasing Of Language Models
(2021)
• Findings of the Association for Computational Linguistics: EMNLP 2021
• 64 citations
Anne Lauscher, Tobias Lüken, Goran Glavaš
-
Mind The Gap: Assessing Temporal Generalization In Neural Language Models
(2021)
• Arxiv
• 70 citations
Lazaridou et al.
-
Dialogue State Tracking With A Language Model Using Schema-driven Prompting
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 78 citations
Chia-Hsuan Lee, Hao Cheng, Mari Ostendorf
-
Towards Few-shot Fact-checking Via Perplexity
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 63 citations
Lee et al.
-
Curriculum-meta Learning For Order-robust Continual Relation Extraction
(2021)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 56 citations
Wu et al.
-
Recursively Summarizing Books With Human Feedback
(2021)
• Arxiv
• 64 citations
Wu et al.
-
Personalized Transformer For Explainable Recommendation
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 91 citations
Lei Li, Yongfeng Zhang, Li Chen
-
Fastformer: Additive Attention Can Be All You Need
(2021)
• Arxiv
• 71 citations
Wu et al.
-
Less Is More: Clipbert For Video-and-language Learning Via Sparse Sampling
(2021)
• 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 450 citations
Lei et al.
-
Empowering News Recommendation With Pre-trained Language Models
(2021)
• Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 127 citations
Wu et al.
-
The Power Of Scale For Parameter-efficient Prompt Tuning
(2021)
• Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
• 1764 citations
Brian Lester, Rami Al-Rfou, Noah Constant
-
PAQ: 65 Million Probably-asked Questions And What You Can Do With Them
(2021)
• Transactions of the Association for Computational Linguistics
• 128 citations
Lewis et al.
-
Lightweight Self-attentive Sequential Recommendation
(2021)
• Proceedings of the 30th ACM International Conference on Information & Knowledge Management
• 86 citations
Li et al.
-
Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation
(2021)
• Arxiv
• 688 citations
Li et al.
-
MST: Masked Self-supervised Transformer For Visual Representation
(2021)
• Arxiv
• 57 citations
Li et al.
-
Implicit Representations Of Meaning In Neural Language Models
(2021)
• Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
• 65 citations
Belinda Z. Li, Maxwell Nye, Jacob Andreas
-
Document-level Event Argument Extraction By Conditional Generation
(2021)
• Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 202 citations
Sha Li, Heng Ji, Jiawei Han
-
Hidden Backdoors In Human-centric Language Models
(2021)
• Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications Security
• 86 citations
Li et al.
-
A Taxonomy Of Empathetic Response Intents In Human Social Conversations
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 77 citations
Anuradha Welivita, Pearl Pu
-
Exploring Versatile Generative Language Model Via Parameter-efficient Transfer Learning
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 74 citations
Zhaojiang Lin, Andrea Madotto, Pascale Fung
-
Sequential Recommendation With Self-attentive Multi-adversarial Network
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 92 citations
Ren et al.
-
Pre-training Multilingual Neural Machine Translation By Leveraging Alignment Information
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 81 citations
Lin et al.
-
Open-retrieval Conversational Question Answering
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 139 citations
Qu et al.
-
Birds Have Four Legs?! Numersense: Probing Numerical Commonsense Knowledge Of Pre-trained Language Models
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 119 citations
Lin et al.
-
Cosda-ml: Multi-lingual Code-switching Data Augmentation For Zero-shot Cross-lingual NLP
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 118 citations
Qin et al.
-
End-to-end Neural Transformer Based Spoken Language Understanding
(2020)
• Interspeech 2020
• 56 citations
Martin Radfar, Athanasios Mouchtaris, Siegfried Kunzmann
-
Low-resource Knowledge-grounded Dialogue Generation
(2020)
• Arxiv
• 86 citations
Zhao et al.
-
BLEURT: Learning Robust Metrics For Text Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 813 citations
Thibault Sellam, Dipanjan Das, Ankur P. Parikh
-
Beyond Accuracy: Behavioral Testing Of NLP Models With Checklist
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 757 citations
Ribeiro et al.
-
Language Models Are Open Knowledge Graphs
(2020)
• Arxiv
• 81 citations
Chenguang Wang, Xiao Liu, Dawn Song
-
CDL: Curriculum Dual Learning For Emotion-controllable Response Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 80 citations
Lei Shen, Yang Feng
-
Are All Languages Created Equal In Multilingual BERT?
(2020)
• Proceedings of the 5th Workshop on Representation Learning for NLP
• 203 citations
Shijie Wu, Mark Dredze
-
CLEAR: Contrastive Learning For Sentence Representation
(2020)
• Arxiv
• 230 citations
Wu et al.
-
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
(2020)
• Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
• 73 citations
Vylomova et al.
-
Multi-task Learning For Natural Language Processing In The 2020s: Where Are We Going?
(2020)
• Pattern Recognition Letters
• 70 citations
Joseph Worsham, Jugal Kalita
-
Ambigqa: Answering Ambiguous Open-domain Questions
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 157 citations
Min et al.
-
UHH-LT At Semeval-2020 Task 12: Fine-tuning Of Pre-trained Transformer Networks For Offensive Language Detection
(2020)
• Proceedings of the Fourteenth Workshop on Semantic Evaluation
• 65 citations
Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann
-
Towards Transparent And Explainable Attention Models
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 71 citations
Mohankumar et al.
-
On The Predictive Power Of Neural Language Models For Human Real-time Comprehension Behavior
(2020)
• Arxiv
• 88 citations
Wilcox et al.
-
Fast Transformers With Clustered Attention
(2020)
• Arxiv
• 67 citations
Apoorv Vyas, Angelos Katharopoulos, François Fleuret
-
FLERT: Document-level Features For Named Entity Recognition
(2020)
• Arxiv
• 59 citations
Stefan Schweter, Alan Akbik
-
USR: An Unsupervised And Reference Free Evaluation Metric For Dialog Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 135 citations
Shikib Mehri, Maxine Eskenazi
-
Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue
(2020)
• Arxiv
• 104 citations
Shikib Mehri, Mihail Eric, Dilek Hakkani-Tur
-
Hybridqa: A Dataset Of Multi-hop Question Answering Over Tabular And Textual Data
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 183 citations
Chen et al.
-
ARBERT & MARBERT: Deep Bidirectional Transformers For Arabic
(2020)
• ACL-2021 camera ready version
• 133 citations
Muhammad Abdul-Mageed, Abdelrahim Elmadany, El Moatez Billah Nagoudi
-
Adapterhub: A Framework For Adapting Transformers
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
• 403 citations
Pfeiffer et al.
-
Towards A Human-like Open-domain Chatbot
(2020)
• Arxiv
• 445 citations
Adiwardana et al.
-
Debugging Tests For Model Explanations
(2020)
• Arxiv
• 68 citations
Adebayo et al.
-
History For Visual Dialog: Do We Really Need It?
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 59 citations
Agarwal et al.
-
Better Fine-tuning By Reducing Representational Collapse
(2020)
• Arxiv
• 104 citations
Aghajanyan et al.
-
Unsupervised Evaluation Of Interactive Dialog With Dialogpt
(2020)
• Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 84 citations
Shikib Mehri, Maxine Eskenazi
-
ETC: Encoding Long And Structured Inputs In Transformers
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 247 citations
Ainslie et al.
-
Unsupervised Domain Clusters In Pretrained Language Models
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 174 citations
Roee Aharoni, Yoav Goldberg
-
A Transformer-based Approach For Source Code Summarization
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 319 citations
Ahmad et al.
-
Does Syntax Need To Grow On Trees? Sources Of Hierarchical Inductive Bias In Sequence-to-sequence Networks
(2020)
• Transactions of the Association for Computational Linguistics
• 68 citations
R. Thomas McCoy, Robert Frank, Tal Linzen
-
The Radicalization Risks Of GPT-3 And Advanced Neural Language Models
(2020)
• Arxiv
• 81 citations
Kris McGuffie, Alex Newhouse
-
STORIUM: A Dataset And Evaluation Platform For Machine-in-the-loop Story Generation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 74 citations
Akoury et al.
-
Exploring And Predicting Transferability Across NLP Tasks
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 110 citations
Vu et al.
-
Parameter-efficient Transfer From Sequential Behaviors For User Modeling And Recommendation
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 141 citations
Yuan et al.
-
Self-supervised Multimodal Versatile Networks
(2020)
• Arxiv
• 178 citations
Alayrac et al.
-
Automatic Machine Translation Evaluation In Many Languages Via Zero-shot Paraphrasing
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 129 citations
Brian Thompson, Matt Post
-
On Faithfulness And Factuality In Abstractive Summarization
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 717 citations
Maynez et al.
-
Translation Artifacts In Cross-lingual Transfer Learning
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 68 citations
Mikel Artetxe, Gorka Labaka, Eneko Agirre
-
Araelectra: Pre-training Text Discriminators For Arabic Language Understanding
(2020)
• Arxiv
• 63 citations
Wissam Antoun, Fady Baly, Hazem Hajj
-
Arabert: Transformer-based Model For Arabic Language Understanding
(2020)
• Arxiv
• 519 citations
Wissam Antoun, Fady Baly, Hazem Hajj
-
How Context Affects Language Models' Factual Predictions
(2020)
• Arxiv
• 85 citations
Petroni et al.
-
Optimizing Transformer For Low-resource Neural Machine Translation
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 60 citations
Ali Araabi, Christof Monz
-
Re-translation Versus Streaming For Simultaneous Translation
(2020)
• Proceedings of the 17th International Conference on Spoken Language Translation
• 61 citations
Arivazhagan et al.
-
Contextual Embeddings: When Are They Worth It?
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 68 citations
Arora et al.
-
Inltk: Natural Language Toolkit For Indic Languages
(2020)
• Proceedings of Second Workshop for NLP Open Source Software (NLP-OSS)
• 65 citations
Gaurav Arora
-
A Study Of Non-autoregressive Model For Sequence Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 59 citations
Ren et al.
-
Evaluating Conversational Recommender Systems Via User Simulation
(2020)
• Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
• 85 citations
Shuo Zhang, Krisztian Balog
-
Logic-guided Data Augmentation And Regularization For Consistent Question Answering
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 63 citations
Akari Asai, Hannaneh Hajishirzi
-
POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 66 citations
Zhang et al.
-
Transition-based Parsing With Stack-transformers
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 57 citations
Astudillo et al.
-
How Much Knowledge Can You Pack Into The Parameters Of A Language Model?
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 548 citations
Adam Roberts, Colin Raffel, Noam Shazeer
-
Reducing Quantity Hallucinations In Abstractive Summarization
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 87 citations
Zheng Zhao, Shay B. Cohen, Bonnie Webber
-
MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 406 citations
Pfeiffer et al.
-
Discriminative Nearest Neighbor Few-shot Intent Detection By Transferring Natural Language Inference
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 61 citations
Zhang et al.
-
Unsupervised Question Decomposition For Question Answering
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 116 citations
Perez et al.
-
COMET: A Neural Framework For MT Evaluation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 455 citations
Rei et al.
-
Reclor: A Reading Comprehension Dataset Requiring Logical Reasoning
(2020)
• Arxiv
• 128 citations
Yu et al.
-
Dialogue-based Relation Extraction
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 103 citations
Yu et al.
-
Document Modeling With Graph Attention Networks For Multi-grained Machine Reading Comprehension
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 57 citations
Zheng et al.
-
Deep Multimodal Neural Architecture Search
(2020)
• Proceedings of the 28th ACM International Conference on Multimedia
• 78 citations
Yu et al.
-
Stanza: A Python Natural Language Processing Toolkit For Many Human Languages
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 1281 citations
Qi et al.
-
Self-supervised Meta-learning For Few-shot Natural Language Classification Tasks
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 70 citations
Bansal et al.
-
Knowledge-grounded Dialogue Generation With Pre-trained Language Models
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 150 citations
Zhao et al.
-
Grappa: Grammar-augmented Pre-training For Table Semantic Parsing
(2020)
• Arxiv
• 90 citations
Yu et al.
-
HHH: An Online Medical Chatbot System Based On Knowledge Graph And Hierarchical Bi-directional Attention
(2020)
• Proceedings of the Australasian Computer Science Week Multiconference
• 55 citations
Qiming Bao, Lin Ni, Jiamou Liu
-
Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training
(2020)
• Arxiv
• 172 citations
Bao et al.
-
State-of-the-art Augmented NLP Transformer Models For Direct And Single-step Retrosynthesis
(2020)
• Nature Communications
• 335 citations
Tetko et al.
-
A Primer In Bertology: What We Know About How BERT Works
(2020)
• Transactions of the Association for Computational Linguistics
• 1285 citations
Anna Rogers, Olga Kovaleva, Anna Rumshisky
-
Beat The AI: Investigating Adversarial Human Annotation For Reading Comprehension
(2020)
• Transactions of the Association for Computational Linguistics
• 120 citations
Bartolo et al.
-
SLURP: A Spoken Language Understanding Resource Package
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 105 citations
Bastianelli et al.
-
The Elephant In The Interpretability Room: Why Use Attention As Explanation When We Have Saliency Methods?
(2020)
• Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP
• 139 citations
Jasmijn Bastings, Katja Filippova
-
Few-shot Generative Conversational Query Rewriting
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 116 citations
Yu et al.
-
Longformer: The Long-document Transformer
(2020)
• Arxiv
• 2184 citations
Iz Beltagy, Matthew E. Peters, Arman Cohan
-
Simultaneous Translation Policies: From Fixed To Adaptive
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Zheng et al.
-
Cross-modal Knowledge Reasoning For Knowledge-based Visual Question Answering
(2020)
• Pattern Recognition
• 98 citations
Yu et al.
-
Assessing Phrasal Representation And Composition In Transformers
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 57 citations
Lang Yu, Allyson Ettinger
-
Explainable Machine Learning In Deployment
(2020)
• Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency
• 519 citations
Bhatt et al.
-
Towards Making The Most Of Context In Neural Machine Translation
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 69 citations
Zheng et al.
-
Experience Grounds Language
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 224 citations
Bisk et al.
-
Language (technology) Is Power: A Critical Survey Of "bias" In NLP
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 719 citations
Blodgett et al.
-
On The Limitations Of Cross-lingual Encoders As Exposed By Reference-free Machine Translation Evaluation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 59 citations
Zhao et al.
-
Causal Mediation Analysis For Interpreting Neural NLP: The Case Of Gender Bias
(2020)
• Arxiv
• 78 citations
Vig et al.
-
Non-attentive Tacotron: Robust And Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
(2020)
• Arxiv
• 73 citations
Shen et al.
-
Few-shot Learning For Opinion Summarization
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 61 citations
Arthur Bražinskas, Mirella Lapata, Ivan Titov
-
Generating Hierarchical Explanations On Text Classification Via Feature Interaction Detection
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 77 citations
Hanjie Chen, Guangtao Zheng, Yangfeng Ji
-
Byte Pair Encoding Is Suboptimal For Language Model Pretraining
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 132 citations
Kaj Bostrom, Greg Durrett
-
Toxicity Detection: Does Context Really Matter?
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 93 citations
Pavlopoulos et al.
-
Adversarial Filters Of Dataset Biases
(2020)
• Arxiv
• 146 citations
Bras et al.
-
A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation
(2020)
• Arxiv
• 94 citations
Shen et al.
-
Language Models Are Few-shot Learners
(2020)
• Arxiv
• 13449 citations
Brown et al.
-
On The Inference Calibration Of Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 63 citations
Wang et al.
-
Hard-coded Gaussian Attention For Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Weiqiu You, Simeng Sun, Mohit Iyyer
-
DIET: Lightweight Language Understanding For Dialogue Systems
(2020)
• Arxiv
• 105 citations
Bunk et al.
-
TLDR: Extreme Summarization Of Scientific Documents
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 146 citations
Cachola et al.
-
AMR Parsing Via Graph-sequence Iterative Inference
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 92 citations
Deng Cai, Wai Lam
-
Like Hiking? You Probably Enjoy Nature: Persona-grounded Dialog With Commonsense Expansions
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 60 citations
Majumder et al.
-
Learning From Context Or Names? An Empirical Study On Neural Relation Extraction
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 170 citations
Peng et al.
-
Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models
(2020)
• Lecture Notes in Computer Science
• 100 citations
Cao et al.
-
Sign Language Transformers: Joint End-to-end Sign Language Recognition And Translation
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 344 citations
Camgoz et al.
-
Zero-shot Transfer Learning With Synthesized Data For Multi-domain Dialogue State Tracking
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 77 citations
Campagna et al.
-
SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching
(2020)
• Arxiv
• 100 citations
Peng et al.
-
Pre-trained Language Model For Biomedical Question Answering
(2020)
• Communications in Computer and Information Science
• 67 citations
Yoon et al.
-
Deformer: Decomposing Pre-trained Transformers For Faster Question Answering
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Cao et al.
-
Multilingual Alignment Of Contextual Word Representations
(2020)
• Arxiv
• 161 citations
Steven Cao, Nikita Kitaev, Dan Klein
-
Efficient Intent Detection With Dual Sentence Encoders
(2020)
• Proceedings of the 2nd Workshop on Natural Language Processing for Conversational AI
• 294 citations
Casanueva et al.
-
With Little Power Comes Great Responsibility
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 81 citations
Card et al.
-
Gpt-too: A Language-model-first Approach For Amr-to-text Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 80 citations
Mager et al.
-
Few-shot Natural Language Generation For Task-oriented Dialog
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 164 citations
Peng et al.
-
REL: An Entity Linker Standing On The Shoulders Of Giants
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 90 citations
Hulst et al.
-
Evaluation Of Text Generation: A Survey
(2020)
• Arxiv
• 200 citations
Asli Celikyilmaz, Elizabeth Clark, Jianfeng Gao
-
Training Question Answering Models From Synthetic Data
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 115 citations
Puri et al.
-
Coach: A Coarse-to-fine Approach For Cross-domain Slot Filling
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 98 citations
Liu et al.
-
GLUCOSE: Generalized And Contextualized Story Explanations
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 85 citations
Mostafazadeh et al.
-
Crosswoz: A Large-scale Chinese Cross-domain Task-oriented Dialogue Dataset
(2020)
• Transactions of the Association for Computational Linguistics
• 91 citations
Zhu et al.
-
Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work?
(2020)
• Arxiv
• 56 citations
Pruksachatkun et al.
-
Molweni: A Challenge Multiparty Dialogues-based Machine Reading Comprehension Dataset With Discourse Structure
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 73 citations
Li et al.
-
Mapping Natural Language Instructions To Mobile UI Action Sequences
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 66 citations
Li et al.
-
Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 126 citations
Li et al.
-
Oscar: Object-semantics Aligned Pre-training For Vision-language Tasks
(2020)
• Lecture Notes in Computer Science
• 1301 citations
Li et al.
-
HERO: Hierarchical Encoder For Video+language Omni-representation Pre-training
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 342 citations
Li et al.
-
Does Multi-encoder Help? A Case Study On Context-aware Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 59 citations
Li et al.
-
A Comparison Of Pre-trained Vision-and-language Models For Multimodal Representation Learning Across Medical Images And Reports
(2020)
• 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
• 69 citations
Yikuan Li, Hanyin Wang, Yuan Luo
-
Developing RNN-T Models Surpassing High-performance Hybrid Models With Customization Capability
(2020)
• Interspeech 2020
• 98 citations
Li et al.
-
Making Monolingual Sentence Embeddings Multilingual Using Knowledge Distillation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 624 citations
Nils Reimers, Iryna Gurevych
-
Streaming Transformer-based Acoustic Models Using Self-attention With Augmented Memory
(2020)
• Interspeech 2020
• 57 citations
Wu et al.
-
Compositional Explanations Of Neurons
(2020)
• Arxiv
• 55 citations
Jesse Mu, Jacob Andreas
-
Mintl: Minimalist Transfer Learning For Task-oriented Dialogue Systems
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 141 citations
Lin et al.
-
How Can We Accelerate Progress Towards Human-like Linguistic Generalization?
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 143 citations
Tal Linzen
-
Estimation-action-reflection: Towards Deep Interaction Between Conversational And Recommender Systems
(2020)
• Proceedings of the 13th International Conference on Web Search and Data Mining
• 102 citations
Lei et al.
-
MART: Memory-augmented Recurrent Transformer For Coherent Video Paragraph Captioning
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 159 citations
Lei et al.
-
Learning From Task Descriptions
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 61 citations
Weller et al.
-
Pre-training Via Paraphrasing
(2020)
• Arxiv
• 105 citations
Lewis et al.
-
Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding
(2020)
• Arxiv
• 296 citations
Lepikhin et al.
-
Syntactic Structure From Deep Learning
(2020)
• Annual Review of Linguistics
• 188 citations
Tal Linzen, Marco Baroni
-
Retrieval-augmented Generation For Knowledge-intensive NLP Tasks
(2020)
• Arxiv
• 1826 citations
Lewis et al.
-
Are Pre-trained Language Models Aware Of Phrases? Simple But Strong Baselines For Grammar Induction
(2020)
• Arxiv
• 56 citations
Kim et al.
-
Query Resolution For Conversational Search With Limited Supervision
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 57 citations
Voskarides et al.
-
Beyond Domain Apis: Task-oriented Conversational Modeling With Unstructured Knowledge Access
(2020)
• Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 56 citations
Kim et al.
-
Sequential Latent Knowledge Selection For Knowledge-grounded Dialogue
(2020)
• Arxiv
• 110 citations
Byeongchang Kim, Jaewoo Ahn, Gunhee Kim
-
Unsupervised Commonsense Question Answering With Self-talk
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 178 citations
Shwartz et al.
-
Lite Transformer With Long-short Range Attention
(2020)
• Arxiv
• 124 citations
Wu et al.
-
Imojie: Iterative Memory-based Joint Open Information Extraction
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 68 citations
Kolluru et al.
-
Reformer: The Efficient Transformer
(2020)
• Arxiv
• 918 citations
Nikita Kitaev, Łukasz Kaiser, Anselm Levskaya
-
Pre-trained Summarization Distillation
(2020)
• Arxiv
• 58 citations
Sam Shleifer, Alexander M. Rush
-
Indolem And Indobert: A Benchmark Dataset And Pre-trained Language Model For Indonesian NLP
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 169 citations
Koto et al.
-
Syntax-guided Controlled Generation Of Paraphrases
(2020)
• Transactions of the Association for Computational Linguistics
• 71 citations
Kumar et al.
-
Noisy Text Data: Achilles' Heel Of BERT
(2020)
• Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020)
• 56 citations
Ankit Kumar, Piyush Makhija, Anuj Gupta
-
Room-across-room: Multilingual Vision-and-language Navigation With Dense Spatiotemporal Grounding
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 159 citations
Ku et al.
-
NILE : Natural Language Inference With Faithful Natural Language Explanations
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 109 citations
Sawan Kumar, Partha Talukdar
-
Data Augmentation Using Pre-trained Transformer Models
(2020)
• Proceedings of the 2nd Workshop on Life-long Learning for Spoken Language Systems
• 130 citations
Varun Kumar, Ashutosh Choudhary, Eunah Cho
-
Autoprompt: Eliciting Knowledge From Language Models With Automatically Generated Prompts
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 938 citations
Shin et al.
-
Biomegatron: Larger Biomedical Domain Language Model
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 104 citations
Shin et al.
-
Artificial Intelligence Versus Maya Angelou: Experimental Evidence That People Cannot Differentiate Ai-generated From Human-written Poetry
(2020)
• Computers in Human Behavior
• 293 citations
Nils Köbis, Luca Mossink
-
Unsupervised Translation Of Programming Languages
(2020)
• Arxiv
• 166 citations
Lachaux et al.
-
Babywalk: Going Farther In Vision-and-language Navigation By Taking Baby Steps
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 57 citations
Zhu et al.
-
An Empirical Study Of Pre-trained Transformers For Arabic Information Extraction
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 62 citations
Lan et al.
-
A Computational Approach To Understanding Empathy Expressed In Text-based Mental Health Support
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 184 citations
Sharma et al.
-
Incorporating BERT Into Neural Machine Translation
(2020)
• Arxiv
• 223 citations
Zhu et al.
-
Explaining Question Answering Models Through Text Generation
(2020)
• Arxiv
• 55 citations
Veronica Latcinnik, Jonathan Berant
-
Fixed Encoder Self-attention Patterns In Transformer-based Machine Translation
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 66 citations
Alessandro Raganato, Yves Scherrer, Jörg Tiedemann
-
Very Deep Transformers For Neural Machine Translation
(2020)
• Arxiv
• 73 citations
Liu et al.
-
Learning-to-rank With BERT In Tf-ranking
(2020)
• Arxiv
• 69 citations
Han et al.
-
Explaining Black Box Predictions And Unveiling Data Artifacts Through Influence Functions
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 93 citations
Xiaochuang Han, Byron C. Wallace, Yulia Tsvetkov
-
Fairseq S2T: Fast Speech-to-text Modeling With Fairseq
(2020)
• Arxiv
• 96 citations
Wang et al.
-
Plotmachines: Outline-conditioned Generation With Dynamic Plot State Tracking
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 99 citations
Rashkin et al.
-
TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 191 citations
Wu et al.
-
MEGATRON-CNTRL: Controllable Story Generation With External Knowledge Using Large-scale Language Models
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 105 citations
Xu et al.
-
Trippy: A Triple Copy Strategy For Value Independent Neural Dialog State Tracking
(2020)
• Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 191 citations
Heck et al.
-
Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 196 citations
Hao et al.
-
Streaming Automatic Speech Recognition With The Transformer Model
(2020)
• ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 200 citations
Niko Moritz, Takaaki Hori, Jonathan Le Roux
-
Towards Debiasing NLU Models From Unknown Biases
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 103 citations
Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych
-
Have Your Text And Use It Too! End-to-end Neural Data-to-text Generation With Semantic Fidelity
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 66 citations
Hamza Harkous, Isabel Groves, Amir Saffari
-
Measuring And Reducing Gendered Correlations In Pre-trained Models
(2020)
• Arxiv
• 107 citations
Webster et al.
-
AGIF: An Adaptive Graph-interactive Framework For Joint Multiple Intent Detection And Slot Filling
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 102 citations
Qin et al.
-
Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers
(2020)
• Arxiv
• 558 citations
Wang et al.
-
Probing Pretrained Language Models For Lexical Semantics
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 176 citations
Vulić et al.
-
Transformer-based Online Ctc/attention End-to-end Speech Recognition Architecture
(2020)
• ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 98 citations
Miao et al.
-
Recipes For Safety In Open-domain Chatbots
(2020)
• Arxiv
• 101 citations
Xu et al.
-
Incorporating External Knowledge Through Pre-training For Natural Language To Code Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Xu et al.
-
Learning Visual Representations With Caption Annotations
(2020)
• Lecture Notes in Computer Science
• 100 citations
Mert Bulent Sariyildiz, Julien Perez, Diane Larlus
-
An Empirical Study On Robustness To Spurious Correlations Using Pre-trained Language Models
(2020)
• Transactions of the Association for Computational Linguistics
• 116 citations
Tu et al.
-
Bert-of-theseus: Compressing BERT By Progressive Module Replacing
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 169 citations
Xu et al.
-
End-to-end Slot Alignment And Recognition For Cross-lingual NLU
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 110 citations
Weijia Xu, Batool Haider, Saab Mansour
-
Transfer Learning And Distant Supervision For Multilingual Transformer Models: A Study On African Languages
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 56 citations
Hedderich et al.
-
Phobert: Pre-trained Language Models For Vietnamese
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 320 citations
Dat Quoc Nguyen, Anh Tuan Nguyen
-
Aligning AI With Shared Human Values
(2020)
• Arxiv
• 99 citations
Hendrycks et al.
-
Pretrained Transformers Improve Out-of-distribution Robustness
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 278 citations
Hendrycks et al.
-
CLUE: A Chinese Language Understanding Evaluation Benchmark
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 228 citations
Xu et al.
-
Few-shot Text Generation With Pattern-exploiting Training
(2020)
• Arxiv
• 77 citations
Timo Schick, Hinrich Schütze
-
Learning To Summarize From Human Feedback
(2020)
• Arxiv
• 362 citations
Stiennon et al.
-
Constructing A Multi-hop QA Dataset For Comprehensive Evaluation Of Reasoning Steps
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 62 citations
Ho et al.
-
Improving Efficient Neural Ranking Models With Cross-architecture Knowledge Distillation
(2020)
• Arxiv
• 65 citations
Hofstätter et al.
-
Language And Visual Entity Relationship Graph For Agent Navigation
(2020)
• Arxiv
• 70 citations
Hong et al.
-
Ternarybert: Distillation-aware Ultra-low Bit BERT
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 135 citations
Zhang et al.
-
A Simple Language Model For Task-oriented Dialogue
(2020)
• Arxiv
• 264 citations
Hosseini-Asl et al.
-
Dynabert: Dynamic BERT With Adaptive Width And Depth
(2020)
• Arxiv
• 123 citations
Hou et al.
-
Linformer: Self-attention With Linear Complexity
(2020)
• Arxiv
• 872 citations
Wang et al.
-
XTREME: A Massively Multilingual Multi-task Benchmark For Evaluating Cross-lingual Generalization
(2020)
• Arxiv
• 326 citations
Hu et al.
-
OCNLI: Original Chinese Natural Language Inference
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 74 citations
Hu et al.
-
Bertweet: A Pre-trained Language Model For English Tweets
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
• 693 citations
Dat Quoc Nguyen, Thanh Vu, Anh Tuan Nguyen
-
Challenges In Building Intelligent Open-domain Dialog Systems
(2020)
• ACM Transactions on Information Systems
• 273 citations
Minlie Huang, Xiaoyan Zhu, Jianfeng Gao
-
Leveraging Unpaired Text Data For Training End-to-end Speech-to-intent Systems
(2020)
• ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 59 citations
Huang et al.
-
Improve Transformer Models With Better Relative Position Embeddings
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 90 citations
Huang et al.
-
GRADE: Automatic Graph-enhanced Coherence Metric For Evaluating Open-domain Dialogue Systems
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 67 citations
Huang et al.
-
Knowledge Graph-augmented Abstractive Summarization With Semantic-driven Cloze Reward
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 151 citations
Luyang Huang, Lingfei Wu, Lu Wang
-
Pixel-bert: Aligning Image Pixels With Text By Deep Multi-modal Transformers
(2020)
• Arxiv
• 284 citations
Huang et al.
-
Tabtransformer: Tabular Data Modeling Using Contextual Embeddings
(2020)
• Arxiv
• 115 citations
Huang et al.
-
Reducing Gender Bias In Neural Machine Translation As A Domain Adaptation Problem
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 107 citations
Danielle Saunders, Bill Byrne
-
Mind The Trade-off: Debiasing NLU Models Without Degrading The In-distribution Performance
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 96 citations
Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych
-
Fquad: French Question Answering Dataset
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 65 citations
D'Hoffschmidt et al.
-
Speech Translation And The End-to-end Promise: Taking Stock Of Where We Are
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 83 citations
Matthias Sperber, Matthias Paulik
-
Towards Controllable Biases In Language Generation
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 85 citations
Sheng et al.
-
Social Biases In NLP Models As Barriers For Persons With Disabilities
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 245 citations
Hutchinson et al.
-
Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 84 citations
Jeretic et al.
-
Deebert: Dynamic Early Exiting For Accelerating BERT Inference
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 257 citations
Xin et al.
-
Imitation Attacks And Defenses For Black-box Machine Translation Systems
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 56 citations
Eric Wallace, Mitchell Stern, Dawn Song
-
Syntactic Data Augmentation Increases Robustness To Inference Heuristics
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 151 citations
Min et al.
-
Espnet-st: All-in-one Speech Translation Toolkit
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 139 citations
Inaguma et al.
-
Pretraining With Contrastive Sentence Objectives Improves Discourse Performance Of Language Models
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 73 citations
Iter et al.
-
Towards Faithfully Interpretable NLP Systems: How Should We Define And Evaluate Faithfulness?
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 336 citations
Alon Jacovi, Yoav Goldberg
-
You Impress Me: Dialogue Generation Via Mutual Persona Perception
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 131 citations
Liu et al.
-
Towards Automated Neural Interaction Discovery For Click-through Rate Prediction
(2020)
• Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
• 84 citations
Song et al.
-
Learning To Faithfully Rationalize By Construction
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 145 citations
Jain et al.
-
Mpnet: Masked And Permuted Pre-training For Language Understanding
(2020)
• Arxiv
• 427 citations
Song et al.
-
Automatic Detection Of Machine Generated Text: A Critical Survey
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 110 citations
Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan
-
Adversarial Training For Large Neural Language Models
(2020)
• Arxiv
• 91 citations
Liu et al.
-
The Effect Of Natural Distribution Shift On Question Answering Models
(2020)
• Arxiv
• 62 citations
Miller et al.
-
XCOPA: A Multilingual Dataset For Causal Commonsense Reasoning
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 130 citations
Ponti et al.
-
Jiant: A Software Toolkit For Research On General-purpose Text Understanding Models
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 75 citations
Pruksachatkun et al.
-
Language Generation With Multi-hop Reasoning On Commonsense Knowledge Graph
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 93 citations
Ji et al.
-
Asking And Answering Questions To Evaluate The Factual Consistency Of Summaries
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 291 citations
Alex Wang, Kyunghyun Cho, Mike Lewis
-
Convbert: Improving BERT With Span-based Dynamic Convolution
(2020)
• Arxiv
• 98 citations
Jiang et al.
-
X-FACTR: Multilingual Factual Knowledge Retrieval From Pretrained Language Models
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 84 citations
Jiang et al.
-
In Defense Of Grid Features For Visual Question Answering
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 340 citations
Jiang et al.
-
Neural CRF Model For Sentence Alignment In Text Simplification
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 95 citations
Jiang et al.
-
Seq2edits: Sequence Transduction Using Span-level Edit Operations
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 59 citations
Felix Stahlberg, Shankar Kumar
-
Balancing Training For Multilingual Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 77 citations
Xinyi Wang, Yulia Tsvetkov, Graham Neubig
-
NL4DV: A Toolkit For Generating Analytic Specifications For Data Visualization From Natural Language Queries
(2020)
• IEEE Transactions on Visualization and Computer Graphics
• 135 citations
Arpit Narechania, Arjun Srinivasan, John Stasko
-
Robust Encodings: A Framework For Combating Adversarial Typos
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 87 citations
Jones et al.
-
ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 98 citations
Xiao et al.
-
Chatbots As Conversational Healthcare Services
(2020)
• IEEE Internet Computing
• 98 citations
Mlađan Jovanović, Marcos Baez, Fabio Casati
-
Reasoning With Latent Structure Refinement For Document-level Relation Extraction
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 261 citations
Nan et al.
-
WT5?! Training Text-to-text Models To Explain Their Predictions
(2020)
• Arxiv
• 110 citations
Narang et al.
-
Exploring Controllable Text Generation Techniques
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 63 citations
Shrimai Prabhumoye, Alan W Black, Ruslan Salakhutdinov
-
Text-to-text Pre-training For Data-to-text Tasks
(2020)
• Proceedings of the 13th International Conference on Natural Language Generation
• 121 citations
Mihir Kale, Abhinav Rastogi
-
Logiqa: A Challenge Dataset For Machine Reading Comprehension With Logical Reasoning
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 91 citations
Liu et al.
-
Norm-based Curriculum Learning For Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 99 citations
Liu et al.
-
Selective Question Answering Under Domain Shift
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 116 citations
Amita Kamath, Robin Jia, Percy Liang
-
Spatially Aware Multimodal Transformers For Textvqa
(2020)
• Lecture Notes in Computer Science
• 79 citations
Kant et al.
-
Dynamic Context Selection For Document-level Neural Machine Translation Via Reinforcement Learning
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 61 citations
Kang et al.
-
Improved Natural Language Generation Via Loss Truncation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 76 citations
Daniel Kang, Tatsunori Hashimoto
-
Learning An Unreferenced Metric For Online Dialogue Evaluation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 66 citations
Sinha et al.
-
Flowtron: An Autoregressive Flow-based Generative Network For Text-to-speech Synthesis
(2020)
• Arxiv
• 79 citations
Valle et al.
-
Colbert: Efficient And Effective Passage Search Via Contextualized Late Interaction Over BERT
(2020)
• Arxiv
• 197 citations
Omar Khattab, Matei Zaharia
-
Can You Put It All Together: Evaluating Conversational Agents' Ability To Blend Skills
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 167 citations
Smith et al.
-
Towards Conversational Recommendation Over Multi-type Dialogs
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 144 citations
Liu et al.
-
Mucko: Multi-layer Cross-modal Knowledge Reasoning For Fact-based Visual Question Answering
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 105 citations
Zhu et al.
-
Scaling Laws For Neural Language Models
(2020)
• Arxiv
• 1278 citations
Kaplan et al.
-
Dense Passage Retrieval For Open-domain Question Answering
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 2007 citations
Karpukhin et al.
-
Fastbert: A Self-distilling BERT With Adaptive Inference Time
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 239 citations
Liu et al.
-
Understanding The Difficulty Of Training Transformers
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 143 citations
Liu et al.
-
Mitigating Gender Bias For Neural Dialogue Generation With Adversarial Learning
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 55 citations
Liu et al.
-
A Survey On Contextual Embeddings
(2020)
• Arxiv
• 113 citations
Qi Liu, Matt J. Kusner, Phil Blunsom
-
Leveraging Monolingual Data With Self-supervision For Multilingual Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 58 citations
Siddhant et al.
-
Multilingual Denoising Pre-training For Neural Machine Translation
(2020)
• Transactions of the Association for Computational Linguistics
• 763 citations
Liu et al.
-
Rikinet: Reading Wikipedia Pages For Natural Question Answering
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 56 citations
Liu et al.
-
Unifiedqa: Crossing Format Boundaries With A Single QA System
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 452 citations
Khashabi et al.
-
Nearest Neighbor Machine Translation
(2020)
• Arxiv
• 130 citations
Khandelwal et al.
-
Gluecos : An Evaluation Benchmark For Code-switched NLP
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 103 citations
Khanuja et al.
-
Asking Questions The Human Way: Scalable Question-answer Generation From Text Corpus
(2020)
• Proceedings of The Web Conference 2020
• 84 citations
Liu et al.
-
More Bang For Your Buck: Natural Perturbation For Robust Question Answering
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 63 citations
Daniel Khashabi, Tushar Khot, Ashish Sabharwal
-
Improving Vision-and-language Navigation With Image-text Pairs From The Web
(2020)
• Lecture Notes in Computer Science
• 140 citations
Majumdar et al.
-
A Large-scale Chinese Short-text Conversation Dataset
(2020)
• Lecture Notes in Computer Science
• 95 citations
Wang et al.
-
Synthesizer: Rethinking Self-attention In Transformer Models
(2020)
• Arxiv
• 193 citations
Tay et al.
-
Long Range Arena: A Benchmark For Efficient Transformers
(2020)
• Arxiv
• 181 citations
Tay et al.
-
LEGAL-BERT: The Muppets Straight Out Of Law School
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 571 citations
Chalkidis et al.
-
Pre-training Tasks For Embedding-based Large-scale Retrieval
(2020)
• Arxiv
• 117 citations
Chang et al.
-
Convokit: A Toolkit For The Analysis Of Conversations
(2020)
• Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 76 citations
Chang et al.
-
Low-resource Languages: A Review Of Past Work And Future Challenges
(2020)
• Arxiv
• 88 citations
Alexandre Magueresse, Vincent Carles, Evan Heetderks
-
Counterfactual Samples Synthesizing For Robust Visual Question Answering
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 313 citations
Chen et al.
-
Adversarial Robustness: From Self-supervised Pre-training To Fine-tuning
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 157 citations
Chen et al.
-
Adabert: Task-adaptive BERT Compression With Differentiable Neural Architecture Search
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 70 citations
Chen et al.
-
Accurate Word Alignment Induction From Neural Machine Translation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 58 citations
Chen et al.
-
Task-oriented Dialogue As Dataflow Synthesis
(2020)
• Transactions of the Association for Computational Linguistics
• 100 citations
MacHines et al.
-
Efficient Document Re-ranking For Transformers By Precomputing Term Representations
(2020)
• Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 104 citations
MacAvaney et al.
-
Logical Natural Language Generation From Open-domain Tables
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 117 citations
Chen et al.
-
KGPT: Knowledge-grounded Pre-training For Data-to-text Generation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 104 citations
Chen et al.
-
Learning Modality Interaction For Temporal Sentence Localization And Event Captioning In Videos
(2020)
• Lecture Notes in Computer Science
• 81 citations
Chen et al.
-
Low-resource Domain Adaptation For Compositional Task-oriented Semantic Parsing
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 65 citations
Chen et al.
-
Recall And Learn: Fine-tuning Deep Pretrained Language Models With Less Forgetting
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 127 citations
Chen et al.
-
Modeling Global And Local Node Contexts For Text Generation From Knowledge Graphs
(2020)
• Transactions of the Association for Computational Linguistics
• 65 citations
Ribeiro et al.
-
Imagebert: Cross-modal Pre-training With Large-scale Weak-supervised Image-text Data
(2020)
• Arxiv
• 158 citations
Qi et al.
-
What Happens To BERT Embeddings During Fine-tuning?
(2020)
• Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP
• 128 citations
Merchant et al.
-
Gender Bias In Multilingual Embeddings And Cross-lingual Transfer
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 61 citations
Zhao et al.
-
Grounded Adaptation For Zero-shot Executable Semantic Parsing
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 79 citations
Zhong et al.
-
Rapidly Bootstrapping A Question Answering Dataset For COVID-19
(2020)
• Arxiv
• 57 citations
Tang et al.
-
Aligntts: Efficient Feed-forward Text-to-speech System Without Explicit Alignment
(2020)
• ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 60 citations
Zeng et al.
-
A Novel Graph-based Multi-modal Fusion Encoder For Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 129 citations
Yin et al.
-
MIME: Mimicking Emotions For Empathetic Response Generation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 154 citations
Majumder et al.
-
Sentibert: A Transferable Transformer-based Architecture For Compositional Sentiment Semantics
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 116 citations
da Yin, Tao Meng, Kai-Wei Chang
-
HAT: Hardware-aware Transformers For Efficient Natural Language Processing
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 193 citations
Wang et al.
-
Multilingual Translation With Extensible Multilingual Pretraining And Finetuning
(2020)
• Arxiv
• 174 citations
Tang et al.
-
Curriculum Pre-training For End-to-end Speech Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 93 citations
Wang et al.
-
SAFER: A Structure-free Approach For Certified Robustness To Adversarial Word Substitutions
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 79 citations
Mao Ye, Chengyue Gong, Qiang Liu
-
Leveraging Code Generation To Improve Code Retrieval And Summarization Via Dual Learning
(2020)
• Proceedings of The Web Conference 2020
• 65 citations
Ye et al.
-
Object-and-action Aware Model For Visual Language Navigation
(2020)
• Lecture Notes in Computer Science
• 77 citations
Qi et al.
-
Multiwoz 2.2 : A Dialogue Dataset With Additional Annotation Corrections And State Tracking Baselines
(2020)
• Proceedings of the 2nd Workshop on Natural Language Processing for Conversational AI
• 167 citations
Zang et al.
-
Advaug: Robust Adversarial Augmentation For Neural Machine Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 97 citations
Cheng et al.
-
GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference
(2020)
• 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)
• 105 citations
Zadeh et al.
-
Tabert: Pretraining For Joint Understanding Of Textual And Tabular Data
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 346 citations
Yin et al.
-
Coreferential Reasoning Learning For Language Representation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 162 citations
Ye et al.
-
Keep CALM And Explore: Language Models For Action Generation In Text-based Games
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 63 citations
Yao et al.
-
Charbert: Character-aware Pre-trained Language Model
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 80 citations
Ma et al.
-
Univl: A Unified Video And Language Pre-training Model For Multimodal Understanding And Generation
(2020)
• Arxiv
• 165 citations
Luo et al.
-
Overcoming Language Priors With Self-supervised Learning For Visual Question Answering
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 92 citations
Zhu et al.
-
Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 95 citations
Clement et al.
-
Towards Persona-based Empathetic Conversational Models
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 102 citations
Zhong et al.
-
Vlanet: Video-language Alignment Network For Weakly-supervised Video Moment Retrieval
(2020)
• Lecture Notes in Computer Science
• 69 citations
Ma et al.
-
Rethinking Embedding Coupling In Pre-trained Language Models
(2020)
• Arxiv
• 69 citations
Chung et al.
-
Tydi QA: A Benchmark For Information-seeking Question Answering In Typologically Diverse Languages
(2020)
• Arxiv
• 114 citations
Clark et al.
-
ELECTRA: Pre-training Text Encoders As Discriminators Rather Than Generators
(2020)
• Arxiv
• 1607 citations
Clark et al.
-
Transformers As Soft Reasoners Over Language
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 160 citations
Peter Clark, Oyvind Tafjord, Kyle Richardson
-
Learning To Discretely Compose Reasoning Module Networks For Video Captioning
(2020)
• Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
• 55 citations
Tan et al.
-
SPECTER: Document-level Representation Learning Using Citation-informed Transformers
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 309 citations
Cohan et al.
-
Overview Of The TREC 2019 Deep Learning Track
(2020)
• Arxiv
• 90 citations
Craswell et al.
-
Pre-training Is (almost) All You Need: An Application To Commonsense Reasoning
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Tamborrino et al.
-
Universal Natural Language Processing With Limited Annotations: Try Few-shot Textual Entailment As A Start
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 64 citations
Yin et al.
-
Mutual: A Dataset For Multi-turn Dialogue Reasoning
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 111 citations
Cui et al.
-
Revisiting Pre-trained Models For Chinese Natural Language Processing
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 639 citations
Cui et al.
-
Attention Flows: Analyzing And Comparing Attention Mechanisms In Language Models
(2020)
• IEEE Transactions on Visualization and Computer Graphics
• 75 citations
Joseph F Derose, Jiayao Wang, Matthew Berger
-
Underspecification Presents Challenges For Credibility In Modern Machine Learning
(2020)
• Arxiv
• 379 citations
D'Amour et al.
-
Funnel-transformer: Filtering Out Sequential Redundancy For Efficient Language Processing
(2020)
• Arxiv
• 104 citations
Dai et al.
-
KLEJ: Comprehensive Benchmark For Polish Language Understanding
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Rybak et al.
-
Totto: A Controlled Table-to-text Generation Dataset
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 178 citations
Parikh et al.
-
A Survey Of The State Of Explainable AI For Natural Language Processing
(2020)
• Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing
• 118 citations
Danilevsky et al.
-
Learning To Update Natural Language Comments Based On Code Changes
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 56 citations
Panthaplackel et al.
-
Tensorflow Lite Micro: Embedded Machine Learning On Tinyml Systems
(2020)
• Arxiv
• 293 citations
David et al.
-
VD-BERT: A Unified Vision And Dialog Transformer With BERT
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 63 citations
Wang et al.
-
Goemotions: A Dataset Of Fine-grained Emotions
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 499 citations
Demszky et al.
-
Robbert: A Dutch Roberta-based Language Model
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 178 citations
Pieter Delobelle, Thomas Winters, Bettina Berendt
-
Lexically Constrained Neural Machine Translation With Levenshtein Transformer
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 59 citations
Raymond Hendy Susanto, Shamil Chollampatt, Liling Tan
-
Residual Energy-based Models For Text Generation
(2020)
• ICLR 2020
• 61 citations
Deng et al.
-
A Monolingual Approach To Contextualized Word Embeddings For Mid-resource Languages
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 154 citations
Pedro Javier Ortiz Suárez, Laurent Romary, Benoît Sagot
-
Intellicode Compose: Code Generation Using Transformer
(2020)
• Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
• 349 citations
Svyatkovskiy et al.
-
Calibration Of Pre-trained Transformers
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 170 citations
Shrey Desai, Greg Durrett
-
Improving Massively Multilingual Neural Machine Translation And Zero-shot Translation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 203 citations
Zhang et al.
-
Jukebox: A Generative Model For Music
(2020)
• Arxiv
• 232 citations
Dhariwal et al.
-
An Information Bottleneck Approach For Controlling Conciseness In Rationale Extraction
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 80 citations
Paranjape et al.
-
Non-autoregressive Machine Translation With Latent Alignments
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 120 citations
Saharia et al.
-
Progressive Transformers For End-to-end Sign Language Production
(2020)
• Lecture Notes in Computer Science
• 84 citations
Ben Saunders, Necati Cihan Camgoz, Richard Bowden
-
What Do Position Embeddings Learn? An Empirical Study Of Pre-trained Language Model Positional Encoding
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 68 citations
Yu-An Wang, Yun-Nung Chen
-
TORQUE: A Reading Comprehension Dataset Of Temporal Ordering Questions
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 81 citations
Ning et al.
-
The Sockeye 2 Neural Machine Translation Toolkit At AMTA 2020
(2020)
• Arxiv
• 71 citations
Domhan et al.
-
Enabling Language Models To Fill In The Blanks
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 135 citations
Chris Donahue, Mina Lee, Percy Liang
-
End-to-end Adversarial Text-to-speech
(2020)
• Arxiv
• 70 citations
Donahue et al.
-
Udapter: Language Adaptation For Truly Universal Dependency Parsing
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 100 citations
Üstün et al.
-
Big Bird: Transformers For Longer Sequences
(2020)
• Neural Information Processing Systems (NeurIPS) 2020
• 823 citations
Zaheer et al.
-
Semantic Graphs For Generating Deep Questions
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 74 citations
Pan et al.
-
A Contextual Hierarchical Attention Network With Adaptive Objective For Dialogue State Tracking
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 62 citations
Shan et al.
-
X-linear Attention Networks For Image Captioning
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 587 citations
Pan et al.
-
Document-level Event Role Filler Extraction Using Multi-granularity Contextualized Encoding
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 91 citations
Xinya Du, Claire Cardie
-
A Streaming On-device End-to-end Model Surpassing Server-side Conventional Model Quality And Latency
(2020)
• ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 203 citations
Sainath et al.
-
The Birth Of Romanian BERT
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 55 citations
Stefan Daniel Dumitrescu, Andrei-Marius Avram, Sampo Pyysalo
-
FEQA: A Question Answering Evaluation Framework For Faithfulness Assessment In Abstractive Summarization
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 279 citations
Esin Durmus, He He, Mona Diab
-
Transmodality: An End2end Fusion Method With Transformer For Multimodal Sentiment Analysis
(2020)
• Proceedings of The Web Conference 2020
• 116 citations
Zilong Wang, Zhaohong Wan, Xiaojun Wan
-
Transform And Tell: Entity-aware News Image Captioning
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 77 citations
Alasdair Tran, Alexander Mathews, Lexing Xie
-
Prottrans: Towards Cracking The Language Of Life's Code Through Self-supervised Deep Learning And High Performance Computing
(2020)
• Arxiv
• 197 citations
Elnaggar et al.
-
A Comparison Of LSTM And BERT For Small Corpus
(2020)
• Arxiv
• 62 citations
Aysu Ezen-Can
-
Template-based Question Generation From Retrieved Sentences For Improved Unsupervised Question Answering
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 74 citations
Fabbri et al.
-
Beyond English-centric Multilingual Machine Translation
(2020)
• Arxiv
• 418 citations
Fan et al.
-
Generative Data Augmentation For Commonsense Reasoning
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 90 citations
Yang et al.
-
CERT: Contrastive Self-supervised Learning For Language Understanding
(2020)
• Arxiv
• 187 citations
Fang et al.
-
The Tatoeba Translation Challenge -- Realistic Data Sets For Low Resource And Multilingual MT
(2020)
• Arxiv
• 74 citations
Jörg Tiedemann
-
Retrofitting Structure-aware Transformer Language Model For End Tasks
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 55 citations
Hao Fei, Yafeng Ren, Donghong Ji
-
Zero-shot Cross-lingual Transfer With Meta Learning
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 102 citations
Nooralahzadeh et al.
-
Codebert: A Pre-trained Model For Programming And Natural Languages
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 1588 citations
Feng et al.
-
Doc2dial: A Goal-oriented Document-grounded Dialogue Dataset
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 82 citations
Feng et al.
-
Scalable Multi-hop Relational Reasoning For Knowledge-aware Question Answering
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 189 citations
Feng et al.
-
End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 56 citations
Shakeri et al.
-
LUKE: Deep Contextualized Entity Representations With Entity-aware Self-attention
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 503 citations
Yamada et al.
-
Chart-to-text: Generating Natural Language Descriptions For Charts By Adapting The Transformer Model
(2020)
• Proceedings of the 13th International Conference on Natural Language Generation
• 69 citations
Jason Obeid, Enamul Hoque
-
Compositional Generalization In Semantic Parsing: Pre-training Vs. Specialized Architectures
(2020)
• Arxiv
• 81 citations
Furrer et al.
-
End-to-end Speech-translation With Knowledge Distillation: FBK@IWSLT2020
(2020)
• Proceedings of the 17th International Conference on Spoken Language Translation
• 64 citations
Gaido et al.
-
Large-scale Adversarial Training For Vision-and-language Representation Learning
(2020)
• Arxiv
• 270 citations
Gan et al.
-
Recent Neural Methods On Slot Filling And Intent Classification For Task-oriented Dialogue Systems: A Survey
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 75 citations
Samuel Louvan, Bernardo Magnini
-
How Effective Is Task-agnostic Data Augmentation For Pretrained Transformers?
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 56 citations
Shayne Longpre, Yu Wang, Christopher Dubois
-
Paraphrase Augmented Task-oriented Dialog Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 77 citations
Gao et al.
-
Dialogue Response Ranking Training With Large-scale Human Feedback Data
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 62 citations
Gao et al.
-
Generating Question Titles For Stack Overflow From Mined Code Snippets
(2020)
• ACM Transactions on Software Engineering and Methodology
• 58 citations
Gao et al.
-
Multi-modal Graph Neural Network For Joint Reasoning On Vision And Scene Text
(2020)
• 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 104 citations
Gao et al.
-
SUPERT: Towards New Frontiers In Unsupervised Evaluation Metrics For Multi-document Summarization
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 109 citations
Yang Gao, Wei Zhao, Steffen Eger
-
Adv-bert: BERT Is Not Robust On Misspellings! Generating Nature Adversarial Samples On BERT
(2020)
• Arxiv
• 83 citations
Sun et al.
-
Mobilebert: A Compact Task-agnostic BERT For Resource-limited Devices
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 555 citations
Sun et al.
-
What The [MASK]? Making Sense Of Language-specific BERT Models
(2020)
• Arxiv
• 84 citations
Debora Nozza, Federico Bianchi, Dirk Hovy
-
Contrastive Distillation On Intermediate Representations For Language Model Compression
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 58 citations
Sun et al.
-
Colake: Contextualized Language And Knowledge Embedding
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 143 citations
Sun et al.
-
Mixup-transformer: Dynamic Data Augmentation For NLP Tasks
(2020)
• Proceedings of the 28th International Conference on Computational Linguistics
• 75 citations
Sun et al.
-
Evaluating Models' Local Decision Boundaries Via Contrast Sets
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 278 citations
Gardner et al.
-
BAE: Bert-based Adversarial Examples For Text Classification
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 394 citations
Siddhant Garg, Goutham Ramakrishnan
-
Realtoxicityprompts: Evaluating Neural Toxic Degeneration In Language Models
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 434 citations
Gehman et al.
-
TRIE: End-to-end Text Reading And Information Extraction For Document Understanding
(2020)
• Proceedings of the 28th ACM International Conference on Multimedia
• 95 citations
Zhang et al.
-
Document Ranking With A Pretrained Sequence-to-sequence Model
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 331 citations
Rodrigo Nogueira, Zhiying Jiang, Jimmy Lin
-
Injecting Numerical Reasoning Skills Into Language Models
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 161 citations
Mor Geva, Ankit Gupta, Jonathan Berant
-
End-to-end Neural Word Alignment Outperforms GIZA++
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 62 citations
Thomas Zenkel, Joern Wuebker, John Denero
-
Aligned Cross Entropy For Non-autoregressive Machine Translation
(2020)
• Arxiv
• 67 citations
Ghazvininejad et al.
-
MUTANT: A Training Paradigm For Out-of-distribution Generalization In Visual Question Answering
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 121 citations
Gokhale et al.
-
VQA-LOL: Visual Question Answering Under The Lens Of Logic
(2020)
• Lecture Notes in Computer Science
• 73 citations
Gokhale et al.
-
Content Planning For Neural Story Generation With Aristotelian Rescoring
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 105 citations
Goldfarb-Tarrant et al.
-
Evaluating Factuality In Generation With Dependency-level Entailment
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 87 citations
Tanya Goyal, Greg Durrett
-
Compressing BERT: Studying The Effects Of Weight Pruning On Transfer Learning
(2020)
• Proceedings of the 5th Workshop on Representation Learning for NLP
• 233 citations
Mitchell A. Gordon, Kevin Duh, Nicholas Andrews
-
Neural Syntactic Preordering For Controlled Paraphrase Generation
(2020)
• Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
• 101 citations
Tanya Goyal, Greg Durrett
-
Speaker-aware BERT For Multi-turn Response Selection In Retrieval-based Chatbots
(2020)
• Proceedings of the 29th ACM International Conference on Information & Knowledge Management
• 147 citations
Gu et al.
-
A Knowledge-enhanced Pretraining Model For Commonsense Story Generation
(2020)
• Transactions of the Association for Computational Linguistics
• 235 citations
Guan et al.
-
BERT Loses Patience: Fast And Robust Inference With Early Exit
(2020)
• Arxiv
• 163 citations
Zhou et al.
-
Supervised Contrastive Learning For Pre-trained Language Model Fine-tuning
(2020)
• Arxiv
• 220 citations
Gunel et al.
-
Sequence-level Mixed Sample Data Augmentation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 71 citations
Demi Guo, Yoon Kim, Alexander M. Rush
-
Connecting The Dots: A Knowledgeable Path Generator For Commonsense Question Answering
(2020)
• Findings of the Association for Computational Linguistics: EMNLP 2020
• 68 citations
Wang et al.
-
Contrastive Learning For Weakly Supervised Phrase Grounding
(2020)
• Lecture Notes in Computer Science
• 114 citations
Gupta et al.
-
Cat-gen: Improving Robustness In NLP Models Via Controlled Adversarial Text Generation
(2020)
• Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
• 56 citations
Wang et al.
-
REALM: Retrieval-augmented Language Model Pre-training
(2020)
• Arxiv
• 517 citations
Guu et al.
-
The Evolved Transformer
(2019)
• Arxiv
• 206 citations
David R. So, Chen Liang, Quoc V. Le
-
Multi-hop Reading Comprehension Through Question Decomposition And Rescoring
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 185 citations
Min et al.
-
Saliency-guided Attention Network For Image-sentence Matching
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 101 citations
Ji et al.
-
Neural Arabic Question Answering
(2019)
• Proceedings of the Fourth Arabic Natural Language Processing Workshop
• 121 citations
Mozannar et al.
-
Exploiting Persona Information For Diverse Generation Of Conversational Responses
(2019)
• Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
• 122 citations
Song et al.
-
Certified Robustness To Adversarial Word Substitutions
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 235 citations
Jia et al.
-
Automatic Generation Of Pull Request Descriptions
(2019)
• 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE)
• 98 citations
Liu et al.
-
Attention Interpretability Across NLP Tasks
(2019)
• Arxiv
• 98 citations
Vashishth et al.
-
Extractive Summarization Of Long Documents By Combining Global And Local Context
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 161 citations
Wen Xiao, Giuseppe Carenini
-
Self-assembling Modular Networks For Interpretable Multi-hop Reasoning
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 63 citations
Yichen Jiang, Mohit Bansal
-
Improving Neural Response Diversity With Frequency-aware Cross-entropy Loss
(2019)
• The World Wide Web Conference
• 56 citations
Jiang et al.
-
Language As An Abstraction For Hierarchical Deep Reinforcement Learning
(2019)
• Arxiv
• 58 citations
Jiang et al.
-
Improving Multi-task Deep Neural Networks Via Knowledge Distillation For Natural Language Understanding
(2019)
• Arxiv
• 166 citations
Liu et al.
-
Towards Unsupervised Image Captioning With Shared Multimodal Embeddings
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 97 citations
Iro Laina, Christian Rupprecht, Nassir Navab
-
Attentive History Selection For Conversational Question Answering
(2019)
• Proceedings of the 28th ACM International Conference on Information and Knowledge Management
• 90 citations
Qu et al.
-
Direct Speech-to-speech Translation With A Sequence-to-sequence Model
(2019)
• Interspeech 2019
• 152 citations
Jia et al.
-
BERT With History Answer Embedding For Conversational Question Answering
(2019)
• Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 207 citations
Qu et al.
-
End-to-end Speech Translation With Knowledge Distillation
(2019)
• Interspeech 2019
• 147 citations
Liu et al.
-
Unsupervised Question Answering By Cloze Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 110 citations
Patrick Lewis, Ludovic Denoyer, Sebastian Riedel
-
TWEETQA: A Social Media Focused Question Answering Dataset
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 61 citations
Xiong et al.
-
Opennre: An Open And Extensible Toolkit For Neural Relation Extraction
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
• 131 citations
Han et al.
-
Probing Neural Network Comprehension Of Natural Language Arguments
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 401 citations
Timothy Niven, Hung-Yu Kao
-
Evaluating Sequence-to-sequence Models For Handwritten Text Recognition
(2019)
• 2019 International Conference on Document Analysis and Recognition (ICDAR)
• 108 citations
Michael et al.
-
The Curious Case Of Neural Text Degeneration
(2019)
• Arxiv
• 1199 citations
Holtzman et al.
-
BERT And Pals: Projected Attention Layers For Efficient Adaptation In Multi-task Learning
(2019)
• Arxiv
• 117 citations
Asa Cooper Stickland, Iain Murray
-
Visualizing And Understanding The Effectiveness Of BERT
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 171 citations
Hao et al.
-
Modeling Recurrence For Transformer
(2019)
• Proceedings of the 2019 Conference of the North
• 92 citations
Hao et al.
-
The Woman Worked As A Babysitter: On Biases In Language Generation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 375 citations
Sheng et al.
-
Curriculum Learning For Domain Adaptation In Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 120 citations
Zhang et al.
-
Unifying Human And Statistical Evaluation For Natural Language Generation
(2019)
• Proceedings of the 2019 Conference of the North
• 180 citations
Tatsunori B. Hashimoto, Hugh Zhang, Percy Liang
-
Neural Text Generation With Unlikelihood Training
(2019)
• Arxiv
• 255 citations
Welleck et al.
-
Humor Detection: A Transformer Gets The Last Laugh
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 105 citations
Orion Weller, Kevin Seppi
-
Compositional Semantic Parsing Across Graphbanks
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Matthias Lindemann, Jonas Groschwitz, Alexander Koller
-
Human-grounded Evaluations Of Explanation Methods For Text Classification
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 63 citations
Piyawat Lertvittayakumjorn, Francesca Toni
-
Moverscore: Text Generation Evaluating With Contextualized Embeddings And Earth Mover Distance
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 443 citations
Zhao et al.
-
Non-monotonic Sequential Text Generation
(2019)
• Arxiv
• 68 citations
Welleck et al.
-
Revisiting Self-training For Neural Sequence Generation
(2019)
• Arxiv
• 152 citations
He et al.
-
Robust Sequence-to-sequence Acoustic Modeling With Stepwise Monotonic Attention For Neural TTS
(2019)
• Interspeech 2019
• 82 citations
Mutian He, Yan Deng, Lei He
-
Pointing Novel Objects In Image Captioning
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 80 citations
Li et al.
-
Cycle-consistency For Robust Visual Question Answering
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 152 citations
Shah et al.
-
Improving Robustness Of Machine Translation With Synthetic Noise
(2019)
• Proceedings of the 2019 Conference of the North
• 70 citations
Vaibhav et al.
-
Imitation Learning For Non-autoregressive Neural Machine Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 72 citations
Wei et al.
-
Domain Robustness In Neural Machine Translation
(2019)
• Arxiv
• 59 citations
Mathias Müller, Annette Rios, Rico Sennrich
-
EQUATE: A Benchmark Evaluation Framework For Quantitative Reasoning In Natural Language Inference
(2019)
• Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
• 76 citations
Ravichander et al.
-
Cross-lingual Language Model Pretraining
(2019)
• Arxiv
• 1748 citations
Guillaume Lample, Alexis Conneau
-
Large Memory Layers With Product Keys
(2019)
• Arxiv
• 57 citations
Lample et al.
-
ALBERT: A Lite BERT For Self-supervised Learning Of Language Representations
(2019)
• Arxiv
• 4215 citations
Lan et al.
-
Way Off-policy Batch Deep Reinforcement Learning Of Implicit Human Preferences In Dialog
(2019)
• Arxiv
• 141 citations
Jaques et al.
-
Semantic Neural Machine Translation Using AMR
(2019)
• Transactions of the Association for Computational Linguistics
• 104 citations
Song et al.
-
MASS: Masked Sequence To Sequence Pre-training For Language Generation
(2019)
• Arxiv
• 524 citations
Song et al.
-
Soft Contextual Data Augmentation For Neural Machine Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 112 citations
Zhu et al.
-
Clevr-ref+: Diagnosing Visual Reasoning With Referring Expressions
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 95 citations
Liu et al.
-
Language Modeling With Deep Transformers
(2019)
• Interspeech 2019
• 179 citations
Irie et al.
-
Comparison Of Diverse Decoding Methods From Conditional Language Models
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 89 citations
Ippolito et al.
-
Answer Them All! Toward Universal Visual Question Answering Models
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 80 citations
Robik Shrestha, Kushal Kafle, Christopher Kanan
-
Distilling Translations With Visual Awareness
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 84 citations
Julia Ive, Pranava Madhyastha, Lucia Specia
-
Learning To Speak And Act In A Fantasy Text Adventure Game
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 143 citations
Urbanek et al.
-
An Evaluation Dataset For Intent Classification And Out-of-scope Prediction
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 334 citations
Larson et al.
-
Attention Is Not Explanation
(2019)
• Arxiv
• 541 citations
Sarthak Jain, Byron C. Wallace
-
Stay On The Path: Instruction Fidelity In Vision-and-language Navigation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 132 citations
Jain et al.
-
Well-read Students Learn Better: On The Importance Of Pre-training Compact Models
(2019)
• Arxiv
• 438 citations
Turc et al.
-
Multimodal Transformer Networks For End-to-end Video-grounded Dialogue Systems
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 96 citations
Le et al.
-
Multilingual End-to-end Speech Translation
(2019)
• 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 80 citations
Inaguma et al.
-
Unicoder: A Universal Language Encoder By Pre-training With Multiple Cross-lingual Tasks
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 166 citations
Huang et al.
-
Transferable Representation Learning In Vision-and-language Navigation
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 83 citations
Huang et al.
-
ANA At Semeval-2019 Task 3: Contextual Emotion Detection In Conversations Through Hierarchical Lstms And BERT
(2019)
• Proceedings of the 13th International Workshop on Semantic Evaluation
• 72 citations
Chenyang Huang, Amine Trabelsi, Osmar R. Zaïane
-
Cosmos QA: Machine Reading Comprehension With Contextual Commonsense Reasoning
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 268 citations
Huang et al.
-
Sunny And Dark Outside?! Improving Answer Consistency In VQA Through Entailed Question Generation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 56 citations
Ray et al.
-
Fast Transformer Decoding: One Write-head Is All You Need
(2019)
• Arxiv
• 61 citations
Noam Shazeer
-
A Stack-propagation Framework With Token-level Intent Detection For Spoken Language Understanding
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 293 citations
Qin et al.
-
Latent Retrieval For Weakly Supervised Open Domain Question Answering
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 711 citations
Kenton Lee, Ming-Wei Chang, Kristina Toutanova
-
Flaubert: Unsupervised Language Model Pre-training For French
(2019)
• Arxiv
• 125 citations
Le et al.
-
Attention-passing Models For Robust And Data-efficient End-to-end Speech Translation
(2019)
• Transactions of the Association for Computational Linguistics
• 106 citations
Sperber et al.
-
A Neural Model For Generating Natural Language Summaries Of Program Subroutines
(2019)
• 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)
• 280 citations
Alexander Leclair, Siyuan Jiang, Collin McMillan
-
Passage Re-ranking With BERT
(2019)
• Arxiv
• 392 citations
Rodrigo Nogueira, Kyunghyun Cho
-
On NMT Search Errors And Model Errors: Cat Got Your Tongue?
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 121 citations
Felix Stahlberg, Bill Byrne
-
Compare-mt: A Tool For Holistic Comparison Of Language Generation Systems
(2019)
• Proceedings of the 2019 Conference of the North
• 118 citations
Neubig et al.
-
Learning By Abstraction: The Neural State Machine
(2019)
• Arxiv
• 124 citations
Drew A. Hudson, Christopher D. Manning
-
Poly-encoders: Transformer Architectures And Pre-training Strategies For Fast And Accurate Multi-sentence Scoring
(2019)
• Arxiv
• 172 citations
Humeau et al.
-
Transformers Without Tears: Improving The Normalization Of Self-attention
(2019)
• Arxiv
• 133 citations
Toan Q. Nguyen, Julian Salazar
-
Large-scale Representation Learning From Visually Grounded Untranscribed Speech
(2019)
• Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
• 64 citations
Gabriel Ilharco, Yuan Zhang, Jason Baldridge
-
A Comprehensive Exploration On Wikisql With Table-aware Word Contextualization
(2019)
• Arxiv
• 119 citations
Hwang et al.
-
On Evaluation Of Adversarial Perturbations For Sequence-to-sequence Models
(2019)
• Proceedings of the 2019 Conference of the North
• 118 citations
Michel et al.
-
Question Answering For Privacy Policies: Combining Computational And Legal Perspectives
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 90 citations
Ravichander et al.
-
Training Neural Response Selection For Task-oriented Dialogue Systems
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 86 citations
Henderson et al.
-
A Repository Of Conversational Datasets
(2019)
• Proceedings of the First Workshop on NLP for Conversational AI
• 81 citations
Henderson et al.
-
What Would Elsa Do? Freezing Layers During Transformer Fine-tuning
(2019)
• Arxiv
• 69 citations
Jaejun Lee, Raphael Tang, Jimmy Lin
-
Entity-consistent End-to-end Task-oriented Dialogue System With KB Retriever
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 60 citations
Qin et al.
-
Using Pre-training Can Improve Model Robustness And Uncertainty
(2019)
• Arxiv
• 415 citations
Dan Hendrycks, Kimin Lee, Mantas Mazeika
-
Sentence-bert: Sentence Embeddings Using Siamese Bert-networks
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 7804 citations
Nils Reimers, Iryna Gurevych
-
Fooling Neural Network Interpretations Via Adversarial Model Manipulation
(2019)
• NeurIPS 2019 ICCV workshop 2019
• 67 citations
Juyeon Heo, Sunghwan Joo, Taesup Moon
-
Bertje: A Dutch BERT Model
(2019)
• Arxiv
• 224 citations
Vries et al.
-
Adversarial Representation Learning For Text-to-image Matching
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 214 citations
Nikolaos Sarafianos, Xiang Xu, Ioannis A. Kakadiaris
-
Insertion Transformer: Flexible Sequence Generation Via Insertion Operations
(2019)
• Arxiv
• 160 citations
Stern et al.
-
Adaptive Attention Span In Transformers
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 287 citations
Sukhbaatar et al.
-
Learning To Compose And Reason With Language Tree Structures For Visual Grounding
(2019)
• IEEE Transactions on Pattern Analysis and Machine Intelligence
• 145 citations
Hong et al.
-
Pretrained Encyclopedia: Weakly Supervised Knowledge-pretrained Language Model
(2019)
• Arxiv
• 165 citations
Xiong et al.
-
BERT Post-training For Review Reading Comprehension And Aspect-based Sentiment Analysis
(2019)
• Arxiv
• 291 citations
Xu et al.
-
Parameter-efficient Transfer Learning For NLP
(2019)
• Arxiv
• 1104 citations
Houlsby et al.
-
Adaptation Of Deep Bidirectional Multilingual Transformers For Russian Language
(2019)
• Arxiv
• 252 citations
Yuri Kuratov, Mikhail Arkhipov
-
Automatic Source Code Summarization With Extended Tree-lstm
(2019)
• 2019 International Joint Conference on Neural Networks (IJCNN)
• 81 citations
Shido et al.
-
ACUTE-EVAL: Improved Dialogue Evaluation With Optimized Questions And Multi-turn Comparisons
(2019)
• Arxiv
• 84 citations
Margaret Li, Jason Weston, Stephen Roller
-
Do Attention Heads In BERT Track Syntactic Dependencies?
(2019)
• Arxiv
• 103 citations
Htut et al.
-
Do NLP Models Know Numbers? Probing Numeracy In Embeddings
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 226 citations
Wallace et al.
-
Few-shot Representation Learning For Out-of-vocabulary Words
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 56 citations
Hu et al.
-
Are You Looking? Grounding To Multiple Modalities In Vision-and-language Navigation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 77 citations
Hu et al.
-
Domain Adaptation Of Neural Machine Translation By Lexicon Induction
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 65 citations
Hu et al.
-
A Multi-type Multi-span Network For Reading Comprehension That Requires Discrete Reasoning
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 83 citations
Hu et al.
-
Parabank: Monolingual Bitext Generation And Sentential Paraphrasing Via Lexically-constrained Neural Machine Translation
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 85 citations
Hu et al.
-
Convlab: Multi-domain End-to-end Dialog System Platform
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 90 citations
Lee et al.
-
Topic-guided Variational Autoencoders For Text Generation
(2019)
• Arxiv
• 62 citations
Wang et al.
-
Improving Question Answering Over Incomplete Kbs With Knowledge-aware Reader
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 119 citations
Xiong et al.
-
End-to-end Knowledge-routed Relational Dialogue System For Automatic Diagnosis
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 144 citations
Xu et al.
-
Revealing The Importance Of Semantic Retrieval For Machine Reading At Scale
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 153 citations
Yixin Nie, Songhe Wang, Mohit Bansal
-
Universal Adversarial Triggers For Attacking And Analyzing NLP
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 596 citations
Wallace et al.
-
Lattice Transformer For Speech Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 57 citations
Zhang et al.
-
PEGASUS: Pre-training With Extracted Gap-sentences For Abstractive Summarization
(2019)
• Arxiv
• 895 citations
Zhang et al.
-
Build It Break It Fix It For Dialogue Safety: Robustness From Adversarial Human Attack
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 157 citations
Dinan et al.
-
Adversarial Domain Adaptation For Machine Reading Comprehension
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 60 citations
Wang et al.
-
Recosa: Detecting The Relevant Contexts With Self-attention For Multi-turn Dialogue Generation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 121 citations
Zhang et al.
-
A Simple Convolutional Generative Network For Next Item Recommendation
(2019)
• Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining
• 508 citations
Yuan et al.
-
Simpler And Faster Learning Of Adaptive Policies For Simultaneous Translation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 85 citations
Zheng et al.
-
Are Transformers Universal Approximators Of Sequence-to-sequence Functions?
(2019)
• Arxiv
• 74 citations
Yun et al.
-
Audio-visual Scene-aware Dialog
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 152 citations
Alamri et al.
-
Unsupervised Paraphrasing Without Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 68 citations
Aurko Roy, David Grangier
-
To Tune Or Not To Tune? Adapting Pretrained Representations To Diverse Tasks
(2019)
• Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)
• 398 citations
Matthew E. Peters, Sebastian Ruder, Noah A. Smith
-
Self-critical Reasoning For Robust Visual Question Answering
(2019)
• Arxiv
• 93 citations
Jialin Wu, Raymond J. Mooney
-
Structured Pruning Of A Bert-based Question Answering Model
(2019)
• Arxiv
• 77 citations
J. S. McCarley, Rishav Chakravarti, Avirup Sil
-
How Language-neutral Is Multilingual BERT?
(2019)
• Arxiv
• 82 citations
Jindřich Libovický, Rudolf Rosa, Alexander Fraser
-
Domain Adaptive Dialog Generation Via Meta Learning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 138 citations
Kun Qian, Zhou Yu
-
Corpora Generation For Grammatical Error Correction
(2019)
• Proceedings of the 2019 Conference of the North
• 145 citations
Lichtarge et al.
-
UER: An Open-source Toolkit For Pre-training Models
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
• 72 citations
Zhao et al.
-
Massively Multilingual Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 412 citations
Roee Aharoni, Melvin Johnson, Orhan Firat
-
Explain Yourself! Leveraging Language Models For Commonsense Reasoning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 365 citations
Rajani et al.
-
Structured Fusion Networks For Dialog
(2019)
• Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue
• 93 citations
Shikib Mehri, Tejas Srinivasan, Maxine Eskenazi
-
Pretraining Methods For Dialog Context Representation Learning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 76 citations
Mehri et al.
-
Language2pose: Natural Language Grounded Pose Forecasting
(2019)
• 2019 International Conference on 3D Vision (3DV)
• 182 citations
Chaitanya Ahuja, Louis-Philippe Morency
-
Analyzing The Structure Of Attention In A Transformer Language Model
(2019)
• Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 285 citations
Jesse Vig, Yonatan Belinkov
-
Consistency By Agreement In Zero-shot Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 64 citations
Maruan Al-Shedivat, Ankur P. Parikh
-
Reasoning Over Paragraph Effects In Situations
(2019)
• Proceedings of the 2nd Workshop on Machine Reading for Question Answering
• 98 citations
Lin et al.
-
Right For The Wrong Reasons: Diagnosing Syntactic Heuristics In Natural Language Inference
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 947 citations
R. Thomas McCoy, Ellie Pavlick, Tal Linzen
-
Very Deep Self-attention Networks For End-to-end Speech Recognition
(2019)
• Interspeech 2019
• 182 citations
Pham et al.
-
Sticking To The Facts: Confident Decoding For Faithful Data-to-text Generation
(2019)
• Arxiv
• 57 citations
Tian et al.
-
Synthetic QA Corpora Generation With Roundtrip Consistency
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 217 citations
Alberti et al.
-
A BERT Baseline For The Natural Questions
(2019)
• Arxiv
• 111 citations
Chris Alberti, Kenton Lee, Michael Collins
-
Fusion Of Detected Objects In Text For Visual Question Answering
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 171 citations
Alberti et al.
-
Asking Clarifying Questions In Open-domain Information-seeking Conversations
(2019)
• Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 156 citations
Aliannejadi et al.
-
A Multiscale Visualization Of Attention In The Transformer Model
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 458 citations
Jesse Vig
-
Analyzing And Interpreting Neural Networks For NLP: A Report On The First Blackboxnlp Workshop
(2019)
• Natural Language Engineering
• 60 citations
Afra Alishahi, Grzegorz Chrupała, Tal Linzen
-
Moel: Mixture Of Empathetic Listeners
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 190 citations
Lin et al.
-
Publicly Available Clinical BERT Embeddings
(2019)
• Arxiv
• 670 citations
Alsentzer et al.
-
Fine-tuning Pre-trained Transformer Language Models To Distantly Supervised Relation Extraction
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 113 citations
Christoph Alt, Marc Hübner, Leonhard Hennig
-
Mathqa: Towards Interpretable Math Word Problem Solving With Operation-based Formalisms
(2019)
• Arxiv
• 107 citations
Amini et al.
-
Factor Graph Attention
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 109 citations
Schwartz et al.
-
Kagnet: Knowledge-aware Graph Networks For Commonsense Reasoning
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 461 citations
Lin et al.
-
Pushing The Limits Of Low-resource Morphological Inflection
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 78 citations
Antonios Anastasopoulos, Graham Neubig
-
Paraphrasing With Large Language Models
(2019)
• Proceedings of the 3rd Workshop on Neural Generation and Translation
• 79 citations
Sam Witteveen, Martin Andrews
-
Do Massively Pretrained Language Models Make Better Storytellers?
(2019)
• Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
• 153 citations
See et al.
-
Giving BERT A Calculator: Finding Operations And Arguments With Reading Comprehension
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 84 citations
Andor et al.
-
Sequential Latent Spaces For Modeling The Intention During Diverse Image Captioning
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 60 citations
Aneja et al.
-
Extending Machine Language Models Toward Human-level Language Understanding
(2019)
• Arxiv
• 68 citations
McClelland et al.
-
The Bottom-up Evolution Of Representations In The Transformer: A Study With Machine Translation And Language Modeling Objectives
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 141 citations
Elena Voita, Rico Sennrich, Ivan Titov
-
Attention Is Not Not Explanation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 757 citations
Sarah Wiegreffe, Yuval Pinter
-
Massively Multilingual Neural Machine Translation In The Wild: Findings And Challenges
(2019)
• Arxiv
• 320 citations
Arivazhagan et al.
-
Monotonic Infinite Lookback Attention For Simultaneous Machine Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 177 citations
Arivazhagan et al.
-
The Missing Ingredient In Zero-shot Neural Machine Translation
(2019)
• Arxiv
• 98 citations
Arivazhagan et al.
-
Vision-and-dialog Navigation
(2019)
• Arxiv
• 127 citations
Thomason et al.
-
Evaluating Recurrent Neural Network Explanations
(2019)
• Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 69 citations
Arras et al.
-
Learning To Select Knowledge For Response Generation In Dialog Systems
(2019)
• Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
• 185 citations
Lian et al.
-
Deep Unknown Intent Detection With Margin Loss
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 138 citations
Ting-En Lin, Hua Xu
-
What Makes A Good Conversation? How Controllable Attributes Affect Human Judgments
(2019)
• Proceedings of the 2019 Conference of the North
• 242 citations
See et al.
-
Context-aware Monolingual Repair For Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 100 citations
Elena Voita, Rico Sennrich, Ivan Titov
-
Learning To Retrieve Reasoning Paths Over Wikipedia Graph For Question Answering
(2019)
• Arxiv
• 161 citations
Asai et al.
-
Personalizing Dialogue Agents Via Meta-learning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 183 citations
Lin et al.
-
Structural Supervision Improves Learning Of Non-local Grammatical Dependencies
(2019)
• Proceedings of the 2019 Conference of the North
• 67 citations
Wilcox et al.
-
Code-switched Language Models Using Neural Based Synthetic Data From Parallel Sentences
(2019)
• Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
• 92 citations
Winata et al.
-
Syntax-enhanced Neural Machine Translation With Syntax-aware Word Representations
(2019)
• Proceedings of the 2019 Conference of the North
• 59 citations
Zhang et al.
-
Improving Grounded Natural Language Understanding Through Human-robot Dialog
(2019)
• 2019 International Conference on Robotics and Automation (ICRA)
• 63 citations
Thomason et al.
-
Knowledge Enhanced Contextual Word Representations
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 651 citations
Peters et al.
-
Cloze-driven Pretraining Of Self-attention Networks
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 203 citations
Baevski et al.
-
Language Models As Knowledge Bases?
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 1642 citations
Petroni et al.
-
Embodied Question Answering In Photorealistic Environments With Point Cloud Perception
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 112 citations
Wijmans et al.
-
Summary Level Training Of Sentence Rewriting For Abstractive Summarization
(2019)
• Proceedings of the 2nd Workshop on New Frontiers in Summarization
• 63 citations
Bae et al.
-
A Comparative Study On End-to-end Speech To Text Translation
(2019)
• 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 80 citations
Parnia Bahar, Tobias Bieschke, Hermann Ney
-
Compressive Transformers For Long-range Sequence Modelling
(2019)
• Arxiv
• 148 citations
Rae et al.
-
Selective Attention For Context-aware Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 169 citations
Sameen Maruf, André F. T. Martins, Gholamreza Haffari
-
Lingvo: A Modular And Scalable Framework For Sequence-to-sequence Modeling
(2019)
• Arxiv
• 202 citations
Shen et al.
-
Enhancing Amr-to-text Generation With Dual Graph Representations
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 68 citations
Leonardo F. R. Ribeiro, Claire Gardent, Iryna Gurevych
-
Simultaneous Translation With Flexible Policy Via Restricted Imitation Learning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 74 citations
Zheng et al.
-
Chid: A Large-scale Chinese Idiom Dataset For Cloze Test
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 55 citations
Chujie Zheng, Minlie Huang, Aixin Sun
-
Constrained Decoding For Neural NLG From Compositional Representations In Task-oriented Dialogue
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 77 citations
Balakrishnan et al.
-
Huggingface's Transformers: State-of-the-art Natural Language Processing
(2019)
• Arxiv
• 3242 citations
Wolf et al.
-
Non-parametric Adaptation For Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 77 citations
Ankur Bapna, Orhan Firat
-
Wikimatrix: Mining 135M Parallel Sentences In 1620 Language Pairs From Wikipedia
(2019)
• Arxiv
• 152 citations
Schwenk et al.
-
Interpretable Neural Predictions With Differentiable Binary Variables
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 200 citations
Jasmijn Bastings, Wilker Aziz, Ivan Titov
-
Sparc: Cross-domain Semantic Parsing In Context
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 111 citations
Yu et al.
-
Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents
(2019)
• Arxiv
• 289 citations
Wolf et al.
-
On Adversarial Removal Of Hypothesis-only Bias In Natural Language Inference
(2019)
• Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019)
• 68 citations
Belinkov et al.
-
Scibert: A Pretrained Language Model For Scientific Text
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 2517 citations
Iz Beltagy, Kyle Lo, Arman Cohan
-
Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 228 citations
Selvaraju et al.
-
Multimodal Transformer With Multi-view Visual Representation For Image Captioning
(2019)
• IEEE Transactions on Circuits and Systems for Video Technology
• 400 citations
Yu et al.
-
BLOCK: Bilinear Superdiagonal Fusion For Visual Question Answering And Visual Relationship Detection
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 198 citations
Ben-Younes et al.
-
Abductive Commonsense Reasoning
(2019)
• Arxiv
• 240 citations
Bhagavatula et al.
-
Improving Neural Story Generation By Targeted Common Sense Grounding
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 56 citations
Mao et al.
-
Personalized Dialogue Generation With Diversified Traits
(2019)
• Arxiv
• 87 citations
Zheng et al.
-
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 450 citations
Marino et al.
-
Multi-target Embodied Question Answering
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 70 citations
Yu et al.
-
Efficient 8-bit Quantization Of Transformer Neural Machine Language Translation Model
(2019)
• Arxiv
• 65 citations
Bhandare et al.
-
Playing The Lottery With Rewards And Multiple Languages: Lottery Tickets In RL And NLP
(2019)
• Arxiv
• 82 citations
Yu et al.
-
Deep Modular Co-attention Networks For Visual Question Answering
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 809 citations
Yu et al.
-
Robust Zero-shot Cross-domain Slot Filling With Example Values
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 86 citations
Shah et al.
-
Mixture Models For Diverse Machine Translation: Tricks Of The Trade
(2019)
• Arxiv
• 78 citations
Shen et al.
-
Activitynet-qa: A Dataset For Understanding Complex Web Videos Via Question Answering
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 168 citations
Yu et al.
-
Conditional Teacher-student Learning
(2019)
• ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 89 citations
Meng et al.
-
Fine-tune Bert For Docred With Two-step Process
(2019)
• Arxiv
• 115 citations
Wang et al.
-
Scene Text Visual Question Answering
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 228 citations
Biten et al.
-
BERT Rediscovers The Classical NLP Pipeline
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 1264 citations
Ian Tenney, Dipanjan Das, Ellie Pavlick
-
Augmenting Neural Networks With First-order Logic
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 70 citations
Tao Li, Vivek Srikumar
-
Proactive Human-machine Conversation With Explicit Conversation Goals
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 150 citations
Wu et al.
-
Global-to-local Memory Pointer Networks For Task-oriented Dialogue
(2019)
• Arxiv
• 130 citations
Chien-Sheng Wu, Richard Socher, Caiming Xiong
-
Are Sixteen Heads Really Better Than One?
(2019)
• Arxiv
• 315 citations
Paul Michel, Omer Levy, Graham Neubig
-
Generating Personalized Recipes From Historical User Preferences
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 64 citations
Majumder et al.
-
Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection
(2019)
• Arxiv
• 80 citations
Zhao et al.
-
Towards Automatic Face-to-face Translation
(2019)
• Proceedings of the 27th ACM International Conference on Multimedia
• 91 citations
R et al.
-
Large-batch Training For LSTM And Beyond
(2019)
• Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
• 76 citations
You et al.
-
Transferable Multi-domain State Generator For Task-oriented Dialogue Systems
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 409 citations
Wu et al.
-
What Do You Learn From Context? Probing For Sentence Structure In Contextualized Word Representations
(2019)
• Arxiv
• 439 citations
Tenney et al.
-
A Unified Neural Coherence Model
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 55 citations
Moon et al.
-
Pay Less Attention With Lightweight And Dynamic Convolutions
(2019)
• Arxiv
• 350 citations
Wu et al.
-
"mask And Infill" : Applying Masked Language Model To Sentiment Transfer
(2019)
• Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
• 99 citations
Wu et al.
-
Neural Legal Judgment Prediction In English
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 249 citations
Ilias Chalkidis, Ion Androutsopoulos, Nikolaos Aletras
-
On Identifiability In Transformers
(2019)
• Arxiv
• 95 citations
Brunner et al.
-
Investigating Entity Knowledge In BERT With Simple Neural End-to-end Entity Linking
(2019)
• Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
• 64 citations
Samuel Broscheit
-
Multi-task Learning For Conversational Question Answering Over A Large-scale Knowledge Base
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 94 citations
Shen et al.
-
Simple And Effective Curriculum Pointer-generator Networks For Reading Comprehension Over Long Narratives
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 91 citations
Tay et al.
-
Taskmaster-1: Toward A Realistic And Diverse Dialog Dataset
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 143 citations
Byrne et al.
-
Rubi: Reducing Unimodal Biases In Visual Question Answering
(2019)
• Advances in Neural Information Processing Systems 2019 (pp. 839-850)
• 182 citations
Cadene et al.
-
MUREL: Multimodal Relational Reasoning For Visual Question Answering
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 306 citations
Cadene et al.
-
Probing The Need For Visual Context In Multimodal Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 146 citations
Caglayan et al.
-
How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations
(2019)
• Arxiv
• 61 citations
Aken et al.
-
Streamlined Dense Video Captioning
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 137 citations
Mun et al.
-
Genie: A Generator Of Natural Language Semantic Parsers For Virtual Assistant Commands
(2019)
• Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation
• 65 citations
Campagna et al.
-
Is Attention Interpretable?
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 495 citations
Sofia Serrano, Noah A. Smith
-
Unsupervised Neural Machine Translation With SMT As Posterior Regularization
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 62 citations
Ren et al.
-
Revisiting Low-resource Neural Machine Translation: A Case Study
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 166 citations
Rico Sennrich, Biao Zhang
-
Counterfactual Story Reasoning And Generation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 106 citations
Qin et al.
-
Transfer Learning In Biomedical Natural Language Processing: An Evaluation Of BERT And Elmo On Ten Benchmarking Datasets
(2019)
• Proceedings of the 18th BioNLP Workshop and Shared Task
• 758 citations
Yifan Peng, Shankai Yan, Zhiyong Lu
-
Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
• 129 citations
Wallace et al.
-
Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer
(2019)
• Arxiv
• 8262 citations
Raffel et al.
-
Hierarchical Temporal Convolutional Networks For Dynamic Recommender Systems
(2019)
• The World Wide Web Conference
• 99 citations
You et al.
-
Tagged Back-translation
(2019)
• Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers)
• 188 citations
Isaac Caswell, Ciprian Chelba, David Grangier
-
Cosql: A Conversational Text-to-sql Challenge Towards Cross-domain Natural Language Interfaces To Databases
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 104 citations
Yu et al.
-
Automatic Argument Quality Assessment -- New Datasets And Methods
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 69 citations
Toledo et al.
-
Interpreting And Improving Natural-language Processing (in Machines) With Natural Language-processing (in The Brain)
(2019)
• Arxiv
• 83 citations
Mariya Toneva, Leila Wehbe
-
Compositional Generalization For Primitive Substitutions
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 59 citations
Li et al.
-
KERMIT: Generative Insertion-based Modeling For Sequences
(2019)
• Arxiv
• 78 citations
Chan et al.
-
Neural Keyphrase Generation Via Reinforcement Learning With Adaptive Rewards
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 84 citations
Chan et al.
-
Visual Semantic Reasoning For Image-text Matching
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 497 citations
Li et al.
-
Learning And Evaluating General Linguistic Intelligence
(2019)
• Arxiv
• 177 citations
Yogatama et al.
-
BERT-DST: Scalable End-to-end Dialogue State Tracking With Bidirectional Encoder Representations From Transformer
(2019)
• Interspeech 2019
• 114 citations
Guan-Lin Chao, Ian Lane
-
Roberta: A Robustly Optimized BERT Pretraining Approach
(2019)
• Arxiv
• 16039 citations
Liu et al.
-
On Tiny Episodic Memories In Continual Learning
(2019)
• Arxiv
• 333 citations
Chaudhry et al.
-
Controllable Text-to-image Generation
(2019)
• Arxiv
• 143 citations
Li et al.
-
Decomposable Neural Paraphrase Generation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 86 citations
Li et al.
-
A Surprisingly Robust Trick For Winograd Schema Challenge
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 109 citations
Kocijan et al.
-
Answering Complex Open-domain Questions Through Iterative Query Generation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 104 citations
Qi et al.
-
Bidirectional Attentive Memory Networks For Question Answering Over Knowledge Bases
(2019)
• Proceedings of the 2019 Conference of the North
• 120 citations
Yu Chen, Lingfei Wu, Mohammed J. Zaki
-
Behavior Sequence Transformer For E-commerce Recommendation In Alibaba
(2019)
• Proceedings of the 1st International Workshop on Deep Learning Practice for High-Dimensional Sparse Data
• 331 citations
Chen et al.
-
BERT For Joint Intent Classification And Slot Filling
(2019)
• Arxiv
• 422 citations
Qian Chen, Zhu Zhuo, Wen Wang
-
Controllable Paraphrase Generation With A Syntactic Exemplar
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 105 citations
Chen et al.
-
Hint-based Training For Non-autoregressive Machine Translation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 65 citations
Li et al.
-
Towards Knowledge-based Recommender Dialog System
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 188 citations
Chen et al.
-
Reinforcement Learning Based Graph-to-sequence Model For Natural Question Generation
(2019)
• Arxiv
• 74 citations
Yu Chen, Lingfei Wu, Mohammed J. Zaki
-
Multi-hop Question Answering Via Reasoning Chains
(2019)
• Arxiv
• 69 citations
Jifan Chen, Shih-Ting Lin, Greg Durrett
-
Semantically Conditioned Dialog Response Generation Via Hierarchical Disentangled Self-attention
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 134 citations
Chen et al.
-
Understanding Dataset Design Choices For Multi-hop Reasoning
(2019)
• Proceedings of the 2019 Conference of the North
• 102 citations
Jifan Chen, Greg Durrett
-
Reinforcement Learning Based Curriculum Optimization For Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 67 citations
Kumar et al.
-
Probing What Different NLP Tasks Teach Machines About Function Word Comprehension
(2019)
• Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019)
• 110 citations
Kim et al.
-
Zero-shot Cross-lingual Dialogue Systems With Transferable Latent Variables
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 78 citations
Liu et al.
-
Entity-relation Extraction As Multi-turn Question Answering
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 328 citations
Li et al.
-
Findings Of The First Shared Task On Machine Translation Robustness
(2019)
• Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)
• 58 citations
Li et al.
-
Rethinking Action Spaces For Reinforcement Learning In End-to-end Dialog Agents With Latent Variable Models
(2019)
• Proceedings of the 2019 Conference of the North
• 127 citations
Tiancheng Zhao, Kaige Xie, Maxine Eskenazi
-
Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding
(2019)
• Arxiv
• 138 citations
Wang et al.
-
Multilingual Neural Machine Translation With Knowledge Distillation
(2019)
• Arxiv
• 146 citations
Tan et al.
-
Incremental Transformer With Deliberation Decoder For Document Grounded Conversations
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 105 citations
Li et al.
-
Dual Attention Networks For Visual Reference Resolution In Visual Dialog
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 88 citations
Gi-Cheon Kang, Jaeseo Lim, Byoung-Tak Zhang
-
Does It Make Sense? And Why? A Pilot Study For Sense Making And Explanation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 112 citations
Wang et al.
-
Probing Biomedical Embeddings From Language Models
(2019)
• Proceedings of the 3rd Workshop on Evaluating Vector Space Representations for
• 105 citations
Jin et al.
-
Visually Grounded Neural Syntax Acquisition
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 63 citations
Shi et al.
-
Release Strategies And The Social Impacts Of Language Models
(2019)
• Arxiv
• 256 citations
Solaiman et al.
-
An Entity-driven Framework For Abstractive Summarization
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 69 citations
Sharma et al.
-
Improving Multi-turn Dialogue Modelling With Utterance Rewriter
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 104 citations
Su et al.
-
VL-BERT: Pre-training Of Generic Visual-linguistic Representations
(2019)
• Arxiv
• 714 citations
Su et al.
-
Extracting Multiple-relations In One-pass With Pre-trained Transformers
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 94 citations
Wang et al.
-
The Photobook Dataset: Building Common Ground Through Visually-grounded Dialogue
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 73 citations
Haber et al.
-
Dykgchat: Benchmarking Dialogue Generation Grounding On Dynamic Knowledge Graphs
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 75 citations
Yi-Lin Tuan, Yun-Nung Chen, Hung-Yi Lee
-
Neural Module Networks For Reasoning Over Text
(2019)
• Arxiv
• 73 citations
Gupta et al.
-
Synchronous Bidirectional Neural Machine Translation
(2019)
• Transactions of the Association for Computational Linguistics
• 118 citations
Long Zhou, Jiajun Zhang, Chengqing Zong
-
Clinically Accurate Chest X-ray Report Generation
(2019)
• Arxiv
• 108 citations
Liu et al.
-
Speech Model Pre-training For End-to-end Spoken Language Understanding
(2019)
• Interspeech 2019
• 242 citations
Lugosch et al.
-
Multi-style Generative Reading Comprehension
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 82 citations
Nishida et al.
-
On Learning Meaningful Code Changes Via Neural Machine Translation
(2019)
• 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)
• 203 citations
Tufano et al.
-
Addressing Semantic Drift In Question Generation For Semi-supervised Question Answering
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 113 citations
Shiyue Zhang, Mohit Bansal
-
Visual Entailment: A Novel Task For Fine-grained Image Understanding
(2019)
• Arxiv
• 166 citations
Xie et al.
-
Augmenting Data With Mixup For Sentence Classification: An Empirical Study
(2019)
• Arxiv
• 153 citations
Hongyu Guo, Yongyi Mao, Richong Zhang
-
Rewarding Smatch: Transition-based AMR Parsing With Reinforcement Learning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 65 citations
Naseem et al.
-
Towards Complex Text-to-sql In Cross-domain Database With Intermediate Representation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 322 citations
Guo et al.
-
Image-question-answer Synergistic Network For Visual Dialog
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 80 citations
Dalu Guo, Chang Xu, Dacheng Tao
-
Star-transformer
(2019)
• Proceedings of the 2019 Conference of the North
• 137 citations
Guo et al.
-
Fast Structured Decoding For Sequence Models
(2019)
• Arxiv
• 60 citations
Sun et al.
-
Linking Artificial And Human Neural Representations Of Language
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 80 citations
Jon Gauthier, Roger Levy
-
Do Neural Dialog Systems Use The Conversation History Effectively? An Empirical Study
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 115 citations
Sankar et al.
-
GLTR: Statistical Detection And Visualization Of Generated Text
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 247 citations
Sebastian Gehrmann, Hendrik Strobelt, Alexander M. Rush
-
Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation
(2019)
• Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)
• 131 citations
Marcin Junczys-Dowmunt
-
Answering While Summarizing: Multi-task Learning For Multi-hop QA With Evidence Extraction
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 101 citations
Nishida et al.
-
DREAM: A Challenge Dataset And Models For Dialogue-based Reading Comprehension
(2019)
• Arxiv
• 76 citations
Sun et al.
-
How To Fine-tune BERT For Text Classification?
(2019)
• Lecture Notes in Computer Science
• 1197 citations
Sun et al.
-
Approximating Interactive Human Evaluation With Self-play For Open-domain Dialog Systems
(2019)
• Arxiv
• 55 citations
Ghandeharioun et al.
-
Mask-predict: Parallel Decoding Of Conditional Masked Language Models
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 443 citations
Ghazvininejad et al.
-
Better Automatic Evaluation Of Open-domain Dialogue Systems With Contextualized Embeddings
(2019)
• Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural Language Generation
• 85 citations
Ghazarian et al.
-
Freelb: Enhanced Adversarial Training For Natural Language Understanding
(2019)
• Arxiv
• 185 citations
Zhu et al.
-
LAMOL: Language Modeling For Lifelong Language Learning
(2019)
• Arxiv
• 78 citations
Fan-Keng Sun, Cheng-Hao Ho, Hung-Yi Lee
-
Samsum Corpus: A Human-annotated Dialogue Dataset For Abstractive Summarization
(2019)
• Proceedings of the 2nd Workshop on New Frontiers in Summarization
• 149 citations
Gliwa et al.
-
Hyst: A Hybrid Approach For Flexible And Accurate Dialogue State Tracking
(2019)
• Interspeech 2019
• 84 citations
Rahul Goel, Shachi Paul, Dilek Hakkani-Tür
-
Human Vs. Muppet: A Conservative Estimate Of Human Performance On The GLUE Benchmark
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 57 citations
Nikita Nangia, Samuel R. Bowman
-
Bert4rec: Sequential Recommendation With Bidirectional Encoder Representations From Transformer
(2019)
• Arxiv
• 264 citations
Sun et al.
-
Adding Interpretable Attention To Neural Translation Models Improves Word Alignment
(2019)
• Arxiv
• 85 citations
Thomas Zenkel, Joern Wuebker, John Denero
-
Nlprolog: Reasoning With Weak Unification For Question Answering In Natural Language
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 76 citations
Weber et al.
-
Superglue: A Stickier Benchmark For General-purpose Language Understanding Systems
(2019)
• Arxiv
• 923 citations
Wang et al.
-
AMR Parsing As Sequence-to-graph Transduction
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 114 citations
Zhang et al.
-
Topical-chat: Towards Knowledge-grounded Open-domain Conversations
(2019)
• Interspeech 2019
• 206 citations
Gopalakrishnan et al.
-
Real-time Open-domain Question Answering With Dense-sparse Phrase Index
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 143 citations
Seo et al.
-
Simple Applications Of BERT For Ad Hoc Document Retrieval
(2019)
• Arxiv
• 142 citations
Wei Yang, Haotian Zhang, Jimmy Lin
-
Using Natural Language For Reward Shaping In Reinforcement Learning
(2019)
• Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
• 61 citations
Prasoon Goyal, Scott Niekum, Raymond J. Mooney
-
Levenshtein Transformer
(2019)
• Arxiv
• 205 citations
Jiatao Gu, Changhan Wang, Jake Zhao
-
Insertion-based Decoding With Automatically Inferred Generation Order
(2019)
• Transactions of the Association for Computational Linguistics
• 93 citations
Jiatao Gu, Qi Liu, Kyunghyun Cho
-
Improved Zero-shot Neural Machine Translation Via Ignoring Spurious Correlations
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 88 citations
Gu et al.
-
Interactive Matching Network For Multi-turn Response Selection In Retrieval-based Chatbots
(2019)
• Proceedings of the 28th ACM International Conference on Information and Knowledge Management
• 71 citations
Jia-Chen Gu, Zhen-Hua Ling, Quan Liu
-
A Comparative Study On Transformer Vs RNN In Speech Applications
(2019)
• 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 462 citations
Karita et al.
-
Information Maximizing Visual Question Generation
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 89 citations
Ranjay Krishna, Michael Bernstein, Li Fei-Fei
-
Complexity-weighted Loss And Diverse Reranking For Sentence Simplification
(2019)
• Proceedings of the 2019 Conference of the North
• 62 citations
Kriz et al.
-
Single Headed Attention RNN: Stop Thinking With Your Head
(2019)
• Arxiv
• 59 citations
Stephen Merity
-
Cognitive Graph For Multi-hop Reading Comprehension At Scale
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 205 citations
Ding et al.
-
The Second Conversational Intelligence Challenge (convai2)
(2019)
• The Springer Series on Challenges in Machine Learning
• 388 citations
Dinan et al.
-
Saliency-driven Word Alignment Interpretation For Neural Machine Translation
(2019)
• Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers)
• 60 citations
Shuoyang Ding, Hainan Xu, Philipp Koehn
-
Xlnet: Generalized Autoregressive Pretraining For Language Understanding
(2019)
• Arxiv
• 5705 citations
Yang et al.
-
Pretrained Transformers For Simple Question Answering Over Knowledge Graphs
(2019)
• Lecture Notes in Computer Science
• 59 citations
D. Lukovnikov, A. Fischer, J. Lehmann
-
Multimodal Abstractive Summarization For How2 Videos
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 94 citations
Palaskar et al.
-
Deep Session Interest Network For Click-through Rate Prediction
(2019)
• Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
• 240 citations
Feng et al.
-
Knowledge-enriched Transformer For Emotion Detection In Textual Conversations
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 270 citations
Peixiang Zhong, di Wang, Chunyan Miao
-
Thieves On Sesame Street! Model Extraction Of Bert-based Apis
(2019)
• Arxiv
• 71 citations
Krishna et al.
-
Integrating Text And Image: Determining Multimodal Document Intent In Instagram Posts
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 93 citations
Kruk et al.
-
Compact Trilinear Interaction For Visual Question Answering
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 64 citations
Do et al.
-
Unified Language Model Pre-training For Natural Language Understanding And Generation
(2019)
• Arxiv
• 833 citations
Dong et al.
-
Editnts: An Neural Programmer-interpreter Model For Sentence Simplification Through Explicit Editing
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 149 citations
Dong et al.
-
RAVEN: A Dataset For Relational And Analogical Visual Reasoning
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 160 citations
Zhang et al.
-
Meta-sim: Learning To Generate Synthetic Datasets
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 213 citations
Kar et al.
-
Sentence Embedding Alignment For Lifelong Relation Extraction
(2019)
• Proceedings of the 2019 Conference of the North
• 107 citations
Wang et al.
-
Reducing Gender Bias In Word-level Language Models With A Gender-equalizing Loss Function
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
• 61 citations
Qian et al.
-
Massively Multilingual Transfer For NER
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 185 citations
Afshin Rahimi, Yuan Li, Trevor Cohn
-
Editing-based SQL Query Generation For Cross-domain Context-dependent Questions
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 114 citations
Zhang et al.
-
Investigating Meta-learning Algorithms For Low-resource Natural Language Understanding Tasks
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 116 citations
Zi-Yi Dou, Keyi Yu, Antonios Anastasopoulos
-
Multi-task Deep Neural Networks For Natural Language Understanding
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 1052 citations
Liu et al.
-
Tree Transformer: Integrating Tree Structures Into Self-attention
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 123 citations
Yau-Shian Wang, Hung-Yi Lee, Yun-Nung Chen
-
Guided Dialog Policy Learning: Reward Estimation For Multi-domain Task-oriented Dialog
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 71 citations
Ryuichi Takanobu, Hanlin Zhu, Minlie Huang
-
How Multilingual Is Multilingual BERT?
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 1058 citations
Telmo Pires, Eva Schlinger, Dan Garrette
-
A Closer Look At Feature Space Data Augmentation For Few-shot Intent Classification
(2019)
• Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019)
• 84 citations
Kumar et al.
-
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
(2019)
• Arxiv
• 210 citations
Dua et al.
-
Evaluating Coherence In Dialogue Systems Using Entailment
(2019)
• Proceedings of the 2019 Conference of the North
• 75 citations
Dziri et al.
-
Semantic Noise Matters For Neural Natural Language Generation
(2019)
• Proceedings of the 12th International Conference on Natural Language Generation
• 86 citations
Ondřej Dušek, David M. Howcroft, Verena Rieser
-
Evaluating The State-of-the-art Of End-to-end Natural Language Generation: The E2E NLG Challenge
(2019)
• Computer Speech & Language
• 151 citations
Ondřej Dušek, Jekaterina Novikova, Verena Rieser
-
Large-scale Multilingual Speech Recognition With A Streaming End-to-end Model
(2019)
• Interspeech 2019
• 151 citations
Kannan et al.
-
CLUTRR: A Diagnostic Benchmark For Inductive Reasoning From Text
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 93 citations
Sinha et al.
-
Training On Synthetic Noise Improves Robustness To Natural Noise In Machine Translation
(2019)
• Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019)
• 100 citations
Karpukhin et al.
-
Pre-trained Language Model Representations For Language Generation
(2019)
• Proceedings of the 2019 Conference of the North
• 125 citations
Sergey Edunov, Alexei Baevski, Michael Auli
-
Span-based Joint Entity And Relation Extraction With Transformer Pre-training
(2019)
• Arxiv
• 152 citations
Markus Eberts, Adrian Ulges
-
Towards VQA Models That Can Read
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 444 citations
Singh et al.
-
Howto100m: Learning A Text-video Embedding By Watching Hundred Million Narrated Video Clips
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 786 citations
Miech et al.
-
Multifit: Efficient Multi-lingual Language Model Fine-tuning
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 101 citations
Eisenschlos et al.
-
Recommendation As A Communication Game: Self-supervised Bot-play For Goal-oriented Dialogue
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 71 citations
Kang et al.
-
ELI5: Long Form Question Answering
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 277 citations
Fan et al.
-
Multiwoz 2.1: A Consolidated Multi-domain Dialogue Dataset With State Corrections And State Tracking Baselines
(2019)
• Arxiv
• 170 citations
Eric et al.
-
Question Answering As An Automatic Evaluation Metric For News Article Summarization
(2019)
• Proceedings of the 2019 Conference of the North
• 102 citations
Matan Eyal, Tal Baumel, Michael Elhadad
-
Recent Advances In Neural Question Generation
(2019)
• Arxiv
• 85 citations
Pan et al.
-
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
(2019)
• 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 68 citations
Wang et al.
-
Strategies For Structuring Story Generation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 193 citations
Angela Fan, Mike Lewis, Yann Dauphin
-
Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 255 citations
Fan et al.
-
Reducing Transformer Depth On Demand With Structured Dropout
(2019)
• Arxiv
• 293 citations
Angela Fan, Edouard Grave, Armand Joulin
-
Using Local Knowledge Graph Construction To Scale Seq2seq Models To Multi-document Inputs
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 101 citations
Fan et al.
-
Multimodal Transformer For Unaligned Multimodal Language Sequences
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 1085 citations
Tsai et al.
-
Multi-hop Paragraph Retrieval For Open-domain Question Answering
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 94 citations
Yair Feldman, Ran El-Yaniv
-
Investigating Multilingual NMT Representations At Scale
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 109 citations
Kudugunta et al.
-
Scene Memory Transformer For Embodied Agents In Long-horizon Tasks
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 169 citations
Fang et al.
-
Learning To Collocate Neural Modules For Image Captioning
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 94 citations
Xu Yang, Hanwang Zhang, Jianfei Cai
-
Language Models With Transformers
(2019)
• Arxiv
• 67 citations
Chenguang Wang, Mu Li, Alexander J. Smola
-
Critically Examining The "neural Hype": Weak Baselines And The Additivity Of Effectiveness Gains From Neural Ranking Models
(2019)
• Arxiv
• 91 citations
Yang et al.
-
A Hybrid Retrieval-generation Neural Conversation Model
(2019)
• Proceedings of the 28th ACM International Conference on Information and Knowledge Management
• 64 citations
Yang et al.
-
Making History Matter: History-advantage Sequence Training For Visual Dialog
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 64 citations
Tianhao Yang, Zheng-Jun Zha, Hanwang Zhang
-
Context-aware Self-attention Networks
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 92 citations
Yang et al.
-
Meta-learning For Low-resource Natural Language Generation In Task-oriented Dialogue Systems
(2019)
• Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
• 86 citations
Mi et al.
-
Jasper: An End-to-end Convolutional Neural Acoustic Model
(2019)
• Interspeech 2019
• 212 citations
Li et al.
-
Mirrorgan: Learning Text-to-image Generation By Redescription
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 560 citations
Qiao et al.
-
Pullnet: Open Domain Question Answering With Iterative Retrieval On Knowledge Bases And Text
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 282 citations
Haitian Sun, Tania Bedrax-Weiss, William W. Cohen
-
PAWS-X: A Cross-lingual Adversarial Dataset For Paraphrase Identification
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 226 citations
Yang et al.
-
Neural Data-to-text Generation: A Comparison Between Pipeline And End-to-end Architectures
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 102 citations
Ferreira et al.
-
Can Neural Networks Understand Monotonicity Reasoning?
(2019)
• Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 57 citations
Yanaka et al.
-
MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension
(2019)
• Proceedings of the 2nd Workshop on Machine Reading for Question Answering
• 242 citations
Fisch et al.
-
Context-aware Visual Policy Network For Fine-grained Image Captioning
(2019)
• IEEE Transactions on Pattern Analysis and Machine Intelligence
• 144 citations
Zha et al.
-
Mixout: Effective Regularization To Finetune Large-scale Pretrained Language Models
(2019)
• Arxiv
• 103 citations
Cheolhyoung Lee, Kyunghyun Cho, Wanmo Kang
-
Equalizing Gender Biases In Neural Machine Translation With Word Embeddings Techniques
(2019)
• Proceedings of the First Workshop on Gender Bias in Natural Language Processing
• 112 citations
Joel Escudé Font, Marta R. Costa-Jussà
-
Do Neural Language Representations Learn Physical Commonsense?
(2019)
• Arxiv
• 55 citations
Maxwell Forbes, Ari Holtzman, Yejin Choi
-
Modeling Graph Structure In Transformer For Better Amr-to-text Generation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 97 citations
Zhu et al.
-
Competence-based Curriculum Learning For Neural Machine Translation
(2019)
• Proceedings of the 2019 Conference of the North
• 270 citations
Platanios et al.
-
Text-based Editing Of Talking-head Video
(2019)
• ACM Transactions on Graphics
• 258 citations
Fried et al.
-
APE At Scale And Its Implications On MT Evaluation Biases
(2019)
• Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers)
• 56 citations
Markus Freitag, Isaac Caswell, Scott Roy
-
Cyclical Annealing Schedule: A Simple Approach To Mitigating KL Vanishing
(2019)
• Arxiv
• 164 citations
Fu et al.
-
From Language To Goals: Inverse Reinforcement Learning For Vision-based Instruction Following
(2019)
• Arxiv
• 75 citations
Fu et al.
-
Multi-hop Reading Comprehension Across Multiple Documents By Reasoning Over Heterogeneous Graphs
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 147 citations
Tu et al.
-
Visualbert: A Simple And Performant Baseline For Vision And Language
(2019)
• Arxiv
• 1212 citations
Li et al.
-
Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks
(2019)
• Arxiv
• 1532 citations
Lu et al.
-
Jointly Learning To Align And Translate With Transformer Models
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 132 citations
Garg et al.
-
Neural Language Models As Psycholinguistic Subjects: Representations Of Syntactic State
(2019)
• Proceedings of the 2019 Conference of the North
• 173 citations
Futrell et al.
-
Understanding The Behaviors Of BERT In Ranking
(2019)
• Arxiv
• 153 citations
Qiao et al.
-
Language Models As Or For Knowledge Bases
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 1642 citations
Razniewski et al.
-
Fine-tuning Language Models From Human Preferences
(2019)
• Arxiv
• 364 citations
Ziegler et al.
-
Multi-step Reasoning Via Recurrent Dual Attention For Visual Dialog
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 107 citations
Gan et al.
-
Generalizable Neuro-symbolic Systems For Commonsense Question Answering
(2019)
• Proceedings of the First Workshop on Commonsense Inference in Natural Language Processing
• 73 citations
Oltramari et al.
-
Structured Two-stream Attention Network For Video Question Answering
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 63 citations
Gao et al.
-
Generating Multiple Diverse Responses For Short-text Conversation
(2019)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 58 citations
Gao et al.
-
Dialog State Tracking: A Neural Reading Comprehension Approach
(2019)
• Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue
• 167 citations
Gao et al.
-
Product-aware Answer Generation In E-commerce Question-answering
(2019)
• Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining
• 80 citations
Gao et al.
-
Jointly Optimizing Diversity And Relevance In Neural Response Generation
(2019)
• Proceedings of the 2019 Conference of the North
• 96 citations
Gao et al.
-
Multi-modality Latent Interaction Network For Visual Question Answering
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 69 citations
Gao et al.
-
A Study Of BFLOAT16 For Deep Learning Training
(2019)
• Arxiv
• 130 citations
Kalamkar et al.
-
Generalization In Generation: A Closer Look At Exposure Bias
(2019)
• Proceedings of the 3rd Workshop on Neural Generation and Translation
• 65 citations
Florian Schmidt
-
Dynamically Fused Graph Network For Multi-hop Reasoning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 181 citations
Xiao et al.
-
Patient Knowledge Distillation For BERT Model Compression
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 555 citations
Sun et al.
-
TENER: Adapting Transformer Encoder For Named Entity Recognition
(2019)
• Arxiv
• 229 citations
Yan et al.
-
Compositional Questions Do Not Necessitate Multi-hop Reasoning
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 148 citations
Min et al.
-
Videobert: A Joint Model For Video And Language Representation Learning
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 1048 citations
Sun et al.
-
Semantics Disentangling For Text-to-image Generation
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 192 citations
Yin et al.
-
Inoculation By Fine-tuning: A Method For Analyzing Challenge Datasets
(2019)
• Proceedings of the 2019 Conference of the North
• 95 citations
Nelson F. Liu, Roy Schwartz, Noah A. Smith
-
Context And Attribute Grounded Dense Captioning
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 61 citations
Yin et al.
-
Distilling Task-specific Knowledge From BERT Into Simple Neural Networks
(2019)
• Arxiv
• 358 citations
Tang et al.
-
ERNIE: Enhanced Language Representation With Informative Entities
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 1313 citations
Zhang et al.
-
Learning To Speak Fluently In A Foreign Language: Multilingual Speech Synthesis And Cross-language Voice Cloning
(2019)
• Interspeech 2019
• 150 citations
Zhang et al.
-
PAWS: Paraphrase Adversaries From Word Scrambling
(2019)
• Arxiv
• 136 citations
Yuan Zhang, Jason Baldridge, Luheng He
-
Bertscore: Evaluating Text Generation With BERT
(2019)
• Arxiv
• 1981 citations
Zhang et al.
-
A Logic-driven Framework For Consistency Of Neural Models
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 63 citations
Li et al.
-
Generalized Data Augmentation For Low-resource Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 111 citations
Xia et al.
-
Conversing By Reading: Contentful Neural Conversation With On-demand Machine Reading
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 107 citations
Qin et al.
-
Zero-shot Text Classification With Generative Language Models
(2019)
• Arxiv
• 78 citations
Raul Puri, Bryan Catanzaro
-
Target-guided Open-domain Conversation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 113 citations
Tang et al.
-
Generating Long Sequences With Sparse Transformers
(2019)
• Arxiv
• 644 citations
Child et al.
-
Robust Navigation With Language Pretraining And Stochastic Sampling
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 85 citations
Li et al.
-
Adapt Or Get Left Behind: Domain Adaptation Through BERT Language Model Finetuning For Aspect-target Sentiment Classification
(2019)
• Arxiv
• 126 citations
Rietzler et al.
-
Pivot-based Transfer Learning For Neural Machine Translation Between Non-english Languages
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 65 citations
Kim et al.
-
When And Why Is Document-level Context Useful In Neural Machine Translation?
(2019)
• Proceedings of the Fourth Workshop on Discourse in Machine Translation (DiscoMT 2019)
• 68 citations
Yunsu Kim, Duc Thanh Tran, Hermann Ney
-
Text Summarization With Pretrained Encoders
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 1484 citations
Yang Liu, Mirella Lapata
-
Broad-coverage Semantic Parsing As Transduction
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 83 citations
Zhang et al.
-
Bp-transformer: Modelling Long-range Context Via Binary Partitioning
(2019)
• Arxiv
• 61 citations
Ye et al.
-
Robust Neural Machine Translation With Doubly Adversarial Inputs
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 234 citations
Yong Cheng, Lu Jiang, Wolfgang MacHerey
-
Deepcopy: Grounded Response Generation With Hierarchical Pointer Networks
(2019)
• Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue
• 84 citations
Yavuz et al.
-
Neural Machine Reading Comprehension: Methods And Trends
(2019)
• Applied Sciences
• 152 citations
Liu et al.
-
Relation-aware Graph Attention Network For Visual Question Answering
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 358 citations
Li et al.
-
Q8BERT: Quantized 8bit BERT
(2019)
• 2019 Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS Edition (EMC2-NIPS)
• 383 citations
Zafrir et al.
-
Progressive Attention Memory Network For Movie Story Question Answering
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 91 citations
Kim et al.
-
Long And Diverse Text Generation With Planning-based Hierarchical Variational Model
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 94 citations
Shao et al.
-
Grounding Human-to-vehicle Advice For Self-driving Vehicles
(2019)
• 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
• 87 citations
Kim et al.
-
Episodic Memory In Lifelong Language Learning
(2019)
• Arxiv
• 106 citations
D'Autume et al.
-
Inferring Which Medical Treatments Work From Reports Of Clinical Trials
(2019)
• Proceedings of the 2019 Conference of the North
• 106 citations
Lehman et al.
-
Linguistic Knowledge And Transferability Of Contextual Representations
(2019)
• Proceedings of the 2019 Conference of the North
• 714 citations
Liu et al.
-
Effective Cross-lingual Transfer Of Neural Machine Translation Models Without Shared Vocabularies
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 83 citations
Yunsu Kim, Yingbo Gao, Hermann Ney
-
Language Learning Using Speech To Image Retrieval
(2019)
• Interspeech 2019
• 59 citations
Danny Merkx, Stefan L. Frank, Mirjam Ernestus
-
Hellaswag: Can A Machine Really Finish Your Sentence?
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 481 citations
Zellers et al.
-
Mixture Content Selection For Diverse Sequence Generation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 63 citations
Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi
-
Find Or Classify? Dual Strategy For Slot-value Predictions On Multi-domain Dialog State Tracking
(2019)
• Arxiv
• 104 citations
Zhang et al.
-
LXMERT: Learning Cross-modality Encoder Representations From Transformers
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 2007 citations
Hao Tan, Mohit Bansal
-
The Eighth Dialog System Technology Challenge
(2019)
• Arxiv
• 60 citations
Kim et al.
-
An Embarrassingly Simple Approach For Transfer Learning From Pretrained Language Models
(2019)
• Proceedings of the 2019 Conference of the North
• 113 citations
Alexandra Chronopoulou, Christos Baziotis, Alexandros Potamianos
-
The Effect Of Translationese In Machine Translation Test Sets
(2019)
• Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers)
• 73 citations
Mike Zhang, Antonio Toral
-
Monotonic Multihead Attention
(2019)
• Arxiv
• 78 citations
Ma et al.
-
CONAN -- Counter Narratives Through Nichesourcing: A Multilingual Dataset Of Responses To Fight Online Hate Speech
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 163 citations
Chung et al.
-
Boolq: Exploring The Surprising Difficulty Of Natural Yes/no Questions
(2019)
• Arxiv
• 206 citations
Clark et al.
-
From 'F' To 'A' On The N.Y. Regents Science Exams: An Overview Of The Aristo Project
(2019)
• Arxiv
• 81 citations
Clark et al.
-
What Makes A Good Conversation? Challenges In Designing Truly Conversational Agents
(2019)
• Arxiv
• 133 citations
Clark et al.
-
On The Use Of BERT For Neural Machine Translation
(2019)
• Proceedings of the 3rd Workshop on Neural Generation and Translation
• 100 citations
Stéphane Clinchant, Kweon Woo Jung, Vassilina Nikoulina
-
Visualizing And Measuring The Geometry Of BERT
(2019)
• Arxiv
• 218 citations
Coenen et al.
-
Pretrained Language Models For Sequential Sentence Classification
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 109 citations
Cohan et al.
-
Pretraining-based Natural Language Generation For Text Summarization
(2019)
• Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
• 183 citations
Haoyu Zhang, Jianjun Xu, Ji Wang
-
Affect-driven Dialog Generation
(2019)
• Proceedings of the 2019 Conference of the North
• 107 citations
Colombo et al.
-
A Tensorized Transformer For Language Modeling
(2019)
• Arxiv
• 63 citations
Ma et al.
-
Supervised Multimodal Bitransformers For Classifying Images And Text
(2019)
• Arxiv
• 163 citations
Kiela et al.
-
Adaptively Sparse Transformers
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 192 citations
Gonçalo M. Correia, Vlad Niculae, André F. T. Martins
-
Revealing The Dark Secrets Of BERT
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 497 citations
Kovaleva et al.
-
Joey NMT: A Minimalist NMT Toolkit For Novices
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
• 90 citations
Julia Kreutzer, Jasmijn Bastings, Stefan Riezler
-
Learning Deep Transformer Models For Machine Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 570 citations
Wang et al.
-
CSS10: A Collection Of Single Speaker Speech Datasets For 10 Languages
(2019)
• Interspeech 2019
• 71 citations
Kyubyong Park, Thomas Mulc
-
Improving Neural Conversational Models With Entropy-based Data Filtering
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 64 citations
Richard Csaky, Patrik Purgai, Gabor Recski
-
Ensemble-based Deep Reinforcement Learning For Chatbots
(2019)
• Neurocomputing
• 75 citations
Cuayáhuitl et al.
-
Nemo: A Toolkit For Building AI Applications Using Neural Modules
(2019)
• Arxiv
• 166 citations
Kuchaiev et al.
-
Flowseq: Non-autoregressive Conditional Sequence Generation With Generative Flow
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 152 citations
Ma et al.
-
Defending Against Neural Fake News
(2019)
• Arxiv
• 398 citations
Zellers et al.
-
Multiqa: An Empirical Investigation Of Generalization And Transfer In Reading Comprehension
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 184 citations
Alon Talmor, Jonathan Berant
-
Knowledge Aware Conversation Generation With Explainable Reasoning Over Augmented Graphs
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 99 citations
Liu et al.
-
Fine-tune BERT For Extractive Summarization
(2019)
• Arxiv
• 361 citations
Yang Liu
-
Neural Text Summarization: A Critical Evaluation
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 347 citations
Kryściński et al.
-
Transformer-xl: Attentive Language Models Beyond A Fixed-length Context
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 2957 citations
Dai et al.
-
Deeper Text Understanding For IR With Contextual Neural Language Modeling
(2019)
• Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
• 233 citations
Zhuyun Dai, Jamie Callan
-
Question Answering As Global Reasoning Over Semantic Abstractions
(2019)
• Arxiv
• 60 citations
Khashabi et al.
-
Learning To Navigate Unseen Environments: Back Translation With Environmental Dropout
(2019)
• Proceedings of the 2019 Conference of the North
• 274 citations
Hao Tan, Licheng Yu, Mohit Bansal
-
Sample Efficient Text Summarization Using A Single Pre-trained Transformer
(2019)
• Arxiv
• 67 citations
Khandelwal et al.
-
Multi-step Retriever-reader Interaction For Scalable Open-domain Question Answering
(2019)
• Arxiv
• 113 citations
Das et al.
-
Plug And Play Language Models: A Simple Approach To Controlled Text Generation
(2019)
• Arxiv
• 440 citations
Dathathri et al.
-
Quoref: A Reading Comprehension Dataset With Questions Requiring Coreferential Reasoning
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 164 citations
Dasigi et al.
-
Align2ground: Weakly Supervised Phrase Grounding Guided By Image-caption Alignment
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 90 citations
Datta et al.
-
R-transformer: Recurrent Neural Network Enhanced Transformer
(2019)
• Arxiv
• 88 citations
Wang et al.
-
Openkiwi: An Open Source Framework For Quality Estimation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
• 107 citations
Kepler et al.
-
Reflective Decoding Network For Image Captioning
(2019)
• 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
• 96 citations
Ke et al.
-
Positional Encoding To Control Output Sequence Length
(2019)
• Proceedings of the 2019 Conference of the North
• 88 citations
Sho Takase, Naoaki Okazaki
-
Bridging The Gap Between Training And Inference For Neural Machine Translation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 214 citations
Zhang et al.
-
Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism
(2019)
• Arxiv
• 795 citations
Shoeybi et al.
-
XLDA: Cross-lingual Data Augmentation For Natural Language Inference And Question Answering
(2019)
• Arxiv
• 69 citations
Singh et al.
-
Generalization Through Memorization: Nearest Neighbor Language Models
(2019)
• Arxiv
• 214 citations
Khandelwal et al.
-
Adversarial Learning With Contextual Embeddings For Zero-resource Cross-lingual Classification And NER
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 78 citations
Phillip Keung, Yichao Lu, Vikas Bhardwaj
-
Stabilizing Transformers For Reinforcement Learning
(2019)
• Arxiv
• 90 citations
Parisotto et al.
-
Quartz: An Open-domain Dataset Of Qualitative Relationship Questions
(2019)
• Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
• 72 citations
Tafjord et al.
-
Hierarchical Transformers For Long Document Classification
(2019)
• 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 193 citations
Pappagari et al.
-
CTRL: A Conditional Transformer Language Model For Controllable Generation
(2019)
• Arxiv
• 824 citations
Keskar et al.
-
Handling Divergent Reference Texts When Evaluating Table-to-text Generation
(2019)
• Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
• 143 citations
Dhingra et al.
-
Pythia: Ai-assisted Code Completion System
(2019)
• Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
• 136 citations
Svyatkovskiy et al.
-
Unsupervised Neural Machine Translation With Weight Sharing
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 137 citations
Yang et al.
-
Maskgan: Better Text Generation Via Filling In The______
(2018)
• Arxiv
• 276 citations
William Fedus, Ian Goodfellow, Andrew M. Dai
-
Neural Automated Essay Scoring And Coherence Modeling For Adversarially Crafted Input
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 86 citations
Youmna Farag, Helen Yannakoudakis, Ted Briscoe
-
Response Ranking With Deep Matching Networks And External Knowledge In Information-seeking Conversation Systems
(2018)
• The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
• 114 citations
Yang et al.
-
Adventure: Adversarial Training For Textual Entailment With Knowledge-guided Examples
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 69 citations
Kang et al.
-
Under The Hood: Using Diagnostic Classifiers To Investigate And Improve How Language Models Track Agreement Information
(2018)
• Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 196 citations
Giulianelli et al.
-
Mcscript: A Novel Dataset For Assessing Machine Comprehension Using Script Knowledge
(2018)
• Arxiv
• 70 citations
Ostermann et al.
-
Sounding Board: A User-centric And Content-driven Social Chatbot
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations
• 62 citations
Fang et al.
-
Improving Abstraction In Text Summarization
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 151 citations
Kryściński et al.
-
Self-attentive Sequential Recommendation
(2018)
• 2018 IEEE International Conference on Data Mining (ICDM)
• 2043 citations
Wang-Cheng Kang, Julian McAuley
-
Modeling Localness For Self-attention Networks
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 217 citations
Yang et al.
-
Modeling Multi-turn Conversation With Deep Utterance Aggregation
(2018)
• COLING 2018 pages 3740-3752
• 157 citations
Zhang et al.
-
An Annotated Corpus For Machine Reading Of Instructions In Wet Lab Protocols
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 62 citations
Kulkarni et al.
-
Duorc: Towards Complex Language Understanding With Paraphrased Reading Comprehension
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 130 citations
Saha et al.
-
Assessing Composition In Sentence Vector Representations
(2018)
• In Proceedings of the 27th International Conference on Computational Linguistics (pp. 1790-1801)
• 70 citations
Ettinger et al.
-
Zero-shot Cross-lingual Classification Using Multilingual Neural Machine Translation
(2018)
• Arxiv
• 80 citations
Eriguchi et al.
-
Dialog-based Interactive Image Retrieval
(2018)
• Arxiv
• 91 citations
Guo et al.
-
An Interpretable Reasoning Network For Multi-relation Question Answering
(2018)
• Arxiv
• 70 citations
Mantong Zhou, Minlie Huang, Xiaoyan Zhu
-
Can Neural Networks Understand Logical Entailment?
(2018)
• Arxiv
• 63 citations
Evans et al.
-
Soft Layer-specific Multi-task Summarization With Entailment And Question Generation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 145 citations
Han Guo, Ramakanth Pasunuru, Mohit Bansal
-
Sequential Copying Networks
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 59 citations
Zhou et al.
-
Neural System Combination For Machine Translation
(2018)
• Lecture Notes in Computer Science
• 336 citations
Zhou et al.
-
Learning Factorized Multimodal Representations
(2018)
• Arxiv
• 192 citations
Tsai et al.
-
Imagine This! Scripts To Compositions To Videos
(2018)
• Lecture Notes in Computer Science
• 64 citations
Gupta et al.
-
Zero-shot User Intent Detection Via Capsule Neural Networks
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 209 citations
Xia et al.
-
Conversational Recommender System
(2018)
• The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
• 345 citations
Yueming Sun, Yi Zhang
-
Question Generation From SQL Queries Improves Neural Semantic Parsing
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 56 citations
Guo et al.
-
Sentence Encoders On Stilts: Supplementary Training On Intermediate Labeled-data Tasks
(2018)
• Arxiv
• 291 citations
Jason Phang, Thibault Févry, Samuel R. Bowman
-
Complex Sequential Question Answering: Towards Learning To Converse Over Linked Question Answer Pairs With A Knowledge Graph
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 171 citations
Saha et al.
-
Parameter Sharing Methods For Multilingual Self-attentional Translation Models
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 114 citations
Devendra Singh Sachan, Graham Neubig
-
Modeling Diverse Relevance Patterns In Ad-hoc Retrieval
(2018)
• The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
• 68 citations
Fan et al.
-
Back-translation Sampling By Targeting Difficult Words In Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 83 citations
Marzieh Fadaee, Christof Monz
-
Hierarchical Neural Story Generation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 1081 citations
Angela Fan, Mike Lewis, Yann Dauphin
-
Content Preserving Text Generation With Attribute Controls
(2018)
• Arxiv
• 84 citations
Lajanugen Logeswaran, Honglak Lee, Samy Bengio
-
Improving The Transformer Translation Model With Document-level Context
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 264 citations
Zhang et al.
-
Universal Neural Machine Translation For Extremely Low Resource Languages
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 251 citations
Gu et al.
-
Hybrid Retrieval-generation Reinforced Agent For Medical Image Report Generation
(2018)
• Arxiv
• 135 citations
Li et al.
-
Reasoning About Actions And State Changes By Injecting Commonsense Knowledge
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 91 citations
Tandon et al.
-
Scaling Neural Machine Translation
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 565 citations
Ott et al.
-
Learning Visual Knowledge Memory Networks For Visual Question Answering
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 62 citations
Su et al.
-
Variational Recurrent Neural Machine Translation
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 78 citations
Su et al.
-
Multi-passage Machine Reading Comprehension With Cross-passage Answer Verification
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 105 citations
Wang et al.
-
Building A Conversational Agent Overnight With Dialogue Self-play
(2018)
• Arxiv
• 177 citations
Shah et al.
-
Learning General Purpose Distributed Sentence Representations Via Large Scale Multi-task Learning
(2018)
• Arxiv
• 181 citations
Subramanian et al.
-
Generating Informative And Diverse Conversational Responses Via Adversarial Information Maximization
(2018)
• Arxiv
• 174 citations
Zhang et al.
-
Towards Deep Conversational Recommendations
(2018)
• Arxiv
• 126 citations
Li et al.
-
How Much Reading Does Reading Comprehension Require? A Critical Investigation Of Popular Benchmarks
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 276 citations
Divyansh Kaushik, Zachary C. Lipton
-
Coarse-to-fine Decoding For Neural Semantic Parsing
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 375 citations
Li Dong, Mirella Lapata
-
Confidence Modeling For Neural Semantic Parsing
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 68 citations
Li Dong, Chris Quirk, Mirella Lapata
-
Interactive Visual Grounding Of Referring Expressions For Human-robot Interaction
(2018)
• Robotics: Science and Systems XIV
• 116 citations
Mohit Shridhar, David Hsu
-
Unpaired Image Captioning By Language Pivoting
(2018)
• Lecture Notes in Computer Science
• 93 citations
Gu et al.
-
The Memad Submission To The WMT18 Multimodal Translation Task
(2018)
• Proceedings of the Third Conference on Machine Translation: Shared Task Papers
• 63 citations
Grönroos et al.
-
Meta-learning For Low-resource Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 324 citations
Gu et al.
-
Learning To Map Context-dependent Sentences To Executable Formal Queries
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 95 citations
Alane Suhr, Srinivasan Iyer, Yoav Artzi
-
Beyond Word Importance: Contextual Decomposition To Extract Interactions From Lstms
(2018)
• Arxiv
• 124 citations
W. James Murdoch, Peter J. Liu, Bin Yu
-
Multi-head Attention With Disagreement Regularization
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 177 citations
Li et al.
-
Learning Conditioned Graph Structures For Interpretable Visual Question Answering
(2018)
• Arxiv
• 114 citations
Will Norcliffe-Brown, Efstathios Vafeias, Sarah Parisot
-
Sentencepiece: A Simple And Language Independent Subword Tokenizer And Detokenizer For Neural Text Processing
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
• 2647 citations
Taku Kudo, John Richardson
-
Zero-shot Question Generation From Knowledge Graphs For Unseen Predicates And Entity Types
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 78 citations
Hady Elsahar, Christophe Gravier, Frederique Laforest
-
Neural User Simulation For Corpus-based Policy Optimisation For Spoken Dialogue Systems
(2018)
• Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue
• 56 citations
Kreyssig et al.
-
Self-attention With Relative Position Representations
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 1989 citations
Peter Shaw, Jakob Uszkoreit, Ashish Vaswani
-
Clarinet: Parallel Wave Generation In End-to-end Text-to-speech
(2018)
• Arxiv
• 245 citations
Wei Ping, Kainan Peng, Jitong Chen
-
Analyzing Uncertainty In Neural Machine Translation
(2018)
• Arxiv
• 119 citations
Ott et al.
-
Asynchronous Bidirectional Decoding For Neural Machine Translation
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 117 citations
Zhang et al.
-
Findings Of The E2E NLG Challenge
(2018)
• Proceedings of the 11th International Conference on Natural Language Generation
• 100 citations
Ondřej Dušek, Jekaterina Novikova, Verena Rieser
-
A Co-matching Model For Multi-choice Reading Comprehension
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 105 citations
Wang et al.
-
How2: A Large-scale Dataset For Multimodal Language Understanding
(2018)
• Arxiv
• 168 citations
Sanabria et al.
-
Colorless Green Recurrent Networks Dream Hierarchically
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 561 citations
Gulordava et al.
-
On Adversarial Examples For Character-level Neural Machine Translation
(2018)
• COLING 2018
• 155 citations
Javid Ebrahimi, Daniel Lowd, Dejing Dou
-
Document-level Neural Machine Translation With Hierarchical Attention Networks
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 282 citations
Miculicich et al.
-
Understanding Back-translation At Scale
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 1040 citations
Edunov et al.
-
Stack-captioning: Coarse-to-fine Learning For Image Captioning
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 174 citations
Gu et al.
-
Compact Personalized Models For Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 65 citations
Joern Wuebker, Patrick Simianer, John Denero
-
Exploiting Deep Representations For Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 93 citations
Dou et al.
-
Dialogwae: Multimodal Response Generation With Conditional Wasserstein Auto-encoder
(2018)
• Arxiv
• 103 citations
Gu et al.
-
Training Tips For The Transformer Model
(2018)
• The Prague Bulletin of Mathematical Linguistics
• 205 citations
Martin Popel, Ondřej Bojar
-
Clicr: A Dataset Of Clinical Case Reports For Machine Reading Comprehension
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 87 citations
Simon Šuster, Walter Daelemans
-
Accelerating Neural Transformer Via An Average Attention Network
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 116 citations
Biao Zhang, Deyi Xiong, Jinsong Su
-
Wronging A Right: Generating Better Errors To Improve Grammatical Error Detection
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 59 citations
Sudhanshu Kasewa, Pontus Stenetorp, Sebastian Riedel
-
Harvesting Paragraph-level Question-answer Pairs From Wikipedia
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 177 citations
Xinya Du, Claire Cardie
-
Emrqa: A Large Corpus For Question Answering On Electronic Medical Records
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 159 citations
Pampari et al.
-
Polite Dialogue Generation Without Parallel Data
(2018)
• Transactions of the Association for Computational Linguistics
• 174 citations
Tong Niu, Mohit Bansal
-
Von Mises-fisher Loss For Training Sequence To Sequence Models With Continuous Outputs
(2018)
• Arxiv
• 59 citations
Sachin Kumar, Yulia Tsvetkov
-
Graph2seq: Graph To Sequence Learning With Attention-based Neural Networks
(2018)
• Arxiv
• 163 citations
Xu et al.
-
Switchout: An Efficient Data Augmentation Algorithm For Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 203 citations
Wang et al.
-
Rapid Adaptation Of Neural Machine Translation To New Languages
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 209 citations
Graham Neubig, Junjie Hu
-
GLUE: A Multi-task Benchmark And Analysis Platform For Natural Language Understanding
(2018)
• Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 3643 citations
Wang et al.
-
Attention-based LSTM For Psychological Stress Detection From Spoken Language Using Distant Supervision
(2018)
• 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 58 citations
Genta Indra Winata, Onno Pepijn Kampman, Pascale Fung
-
Towards Exploiting Background Knowledge For Building Conversation Systems
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 168 citations
Moghe et al.
-
Stress Test Evaluation For Natural Language Inference
(2018)
• Arxiv
• 303 citations
Naik et al.
-
Attaining The Unattainable? Reassessing Claims Of Human Parity In Neural Machine Translation
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 206 citations
Toral et al.
-
Experience Replay For Continual Learning
(2018)
• Arxiv
• 359 citations
Rolnick et al.
-
Personalized Language Model For Query Auto-completion
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 55 citations
Aaron Jaech, Mari Ostendorf
-
Improved Fusion Of Visual And Language Representations By Dense Symmetric Co-attention For Visual Question Answering
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 335 citations
Duy-Kien Nguyen, Takayuki Okatani
-
Tracking State Changes In Procedural Text: A Challenge Dataset And Models For Process Paragraph Comprehension
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 118 citations
Mishra et al.
-
Explicit Reasoning Over End-to-end Neural Architectures For Visual Question Answering
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 58 citations
Somak Aditya, Yezhou Yang, Chitta Baral
-
Two Can Play This Game: Visual Dialog With Discriminative Question Generation And Answering
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 94 citations
Unnat Jain, Svetlana Lazebnik, Alexander Schwing
-
Adversarial Over-sensitivity And Over-stability Strategies For Dialogue Models
(2018)
• Proceedings of the 22nd Conference on Computational Natural Language Learning
• 61 citations
Tong Niu, Mohit Bansal
-
Retrieve And Refine: Improved Sequence Generation Models For Dialogue
(2018)
• Proceedings of the 2018 EMNLP Workshop SCAI: The 2nd International Workshop on Search-Oriented Conversational AI
• 186 citations
Jason Weston, Emily Dinan, Alexander H. Miller
-
Input Combination Strategies For Multi-source Transformer Decoder
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 65 citations
Jindřich Libovický, Jindřich Helcl, David Mareček
-
Flowqa: Grasping Flow In History For Conversational Machine Comprehension
(2018)
• Arxiv
• 86 citations
Hsin-Yuan Huang, Eunsol Choi, Wen-Tau Yih
-
Gpipe: Efficient Training Of Giant Neural Networks Using Pipeline Parallelism
(2018)
• Arxiv
• 836 citations
Huang et al.
-
End-to-end Non-autoregressive Neural Machine Translation With Connectionist Temporal Classification
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 130 citations
Jindřich Libovický, Jindřich Helcl
-
Deterministic Non-autoregressive Neural Sequence Modeling By Iterative Refinement
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 403 citations
Jason Lee, Elman Mansimov, Kyunghyun Cho
-
Exploiting Rich Syntactic Information For Semantic Parsing With Graph-to-sequence Model
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 71 citations
Xu et al.
-
Natural Language To Structured Query Generation Via Meta-learning
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 125 citations
Huang et al.
-
Improving Variational Encoder-decoders In Dialogue Generation
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 94 citations
Shen et al.
-
Global Encoding For Abstractive Summarization
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 153 citations
Lin et al.
-
Tensor2tensor For Neural Machine Translation
(2018)
• Arxiv
• 332 citations
Vaswani et al.
-
A Graph-to-sequence Model For Amr-to-text Generation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 201 citations
Song et al.
-
On The Alignment Problem In Multi-head Attention-based Neural Machine Translation
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 58 citations
Tamer Alkhouli, Gabriel Bretschner, Hermann Ney
-
Code2seq: Generating Sequences From Structured Representations Of Code
(2018)
• Arxiv
• 401 citations
Alon et al.
-
Simple Fusion: Return Of The Language Model
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 66 citations
Felix Stahlberg, James Cross, Veselin Stoyanov
-
Deep Relevance Ranking Using Enhanced Document-query Interactions
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 114 citations
Ryan McDonald, Georgios-Ioannis Brokos, Ion Androutsopoulos
-
Counting To Explore And Generalize In Text-based Games
(2018)
• Arxiv
• 57 citations
Yuan et al.
-
Tied Multitask Learning For Neural Speech Translation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 176 citations
Antonios Anastasopoulos, David Chiang
-
Representation Learning For Grounded Spatial Reasoning
(2018)
• Transactions of the Association for Computational Linguistics
• 59 citations
Michael Janner, Karthik Narasimhan, Regina Barzilay
-
A Reinforced Topic-aware Convolutional Sequence-to-sequence Model For Abstractive Text Summarization
(2018)
• Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
• 127 citations
Wang et al.
-
Collecting Diverse Natural Language Inference Problems For Sentence Representation Evaluation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 125 citations
Poliak et al.
-
Relational Recurrent Neural Networks
(2018)
• Arxiv
• 139 citations
Santoro et al.
-
Adapting The Neural Encoder-decoder Framework From Single To Multi-document Summarization
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 172 citations
Logan Lebanoff, Kaiqiang Song, Fei Liu
-
What Do RNN Language Models Learn About Filler-gap Dependencies?
(2018)
• Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 167 citations
Wilcox et al.
-
Did The Model Understand The Question?
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 160 citations
Mudrakarta et al.
-
Adafactor: Adaptive Learning Rates With Sublinear Memory Cost
(2018)
• Arxiv
• 290 citations
Noam Shazeer, Mitchell Stern
-
Focal Visual-text Attention For Visual Question Answering
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 115 citations
Liang et al.
-
Webly Supervised Joint Embedding For Cross-modal Image-text Retrieval
(2018)
• Proceedings of the 26th ACM international conference on Multimedia
• 73 citations
Mithun et al.
-
Move Forward And Tell: A Progressive Generator Of Video Descriptions
(2018)
• Lecture Notes in Computer Science
• 115 citations
Yilei Xiong, Bo Dai, Dahua Lin
-
When And Why Are Pre-trained Word Embeddings Useful For Neural Machine Translation?
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 349 citations
Qi et al.
-
Inferring Semantic Layout For Hierarchical Text-to-image Synthesis
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 361 citations
Hong et al.
-
Memory Augmented Policy Optimization For Program Synthesis And Semantic Parsing
(2018)
• Arxiv
• 105 citations
Liang et al.
-
Unsupervised Statistical Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 325 citations
Mikel Artetxe, Gorka Labaka, Eneko Agirre
-
Universal Language Model Fine-tuning For Text Classification
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 3484 citations
Jeremy Howard, Sebastian Ruder
-
A Visual Attention Grounding Neural Model For Multimodal Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 90 citations
Zhou et al.
-
Breaking NLI Systems With Sentences That Require Simple Lexical Inferences
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 380 citations
Max Glockner, Vered Shwartz, Yoav Goldberg
-
Joint Training For Neural Machine Translation Models With Monolingual Data
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 107 citations
Zhang et al.
-
Simple And Effective Semi-supervised Question Answering
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 69 citations
Bhuwan Dhingra, Danish Pruthi, Dheeraj Rajagopal
-
Modeling Naive Psychology Of Characters In Simple Commonsense Stories
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 82 citations
Rashkin et al.
-
Advancing The State Of The Art In Open Domain Dialog Systems Through The Alexa Prize
(2018)
• Arxiv
• 62 citations
Khatri et al.
-
BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding
(2018)
• Arxiv
• 38308 citations
Devlin et al.
-
Adaptive Document Retrieval For Deep Question Answering
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 60 citations
Bernhard Kratzwald, Stefan Feuerriegel
-
Concatenated Power Mean Word Embeddings As Universal Cross-lingual Sentence Representations
(2018)
• Arxiv
• 71 citations
Rücklé et al.
-
The Web As A Knowledge-base For Answering Complex Questions
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 435 citations
Alon Talmor, Jonathan Berant
-
Universal Transformers
(2018)
• Arxiv
• 413 citations
Dehghani et al.
-
Toward Scalable Neural Dialogue State Tracking Model
(2018)
• Arxiv
• 57 citations
Elnaz Nouri, Ehsan Hosseini-Asl
-
Transforming Question Answering Datasets Into Natural Language Inference Datasets
(2018)
• Arxiv
• 133 citations
Dorottya Demszky, Kelvin Guu, Percy Liang
-
Latent Alignment And Variational Attention
(2018)
• Arxiv
• 88 citations
Deng et al.
-
What Level Of Quality Can Neural Machine Translation Attain On Literary Text?
(2018)
• Machine Translation: Technologies and Applications
• 74 citations
Antonio Toral, Andy Way
-
A Hierarchical Latent Structure For Variational Conversation Modeling
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 109 citations
Yookoon Park, Jaemin Cho, Gunhee Kim
-
Fast Decoding In Sequence Models Using Discrete Latent Variables
(2018)
• Arxiv
• 180 citations
Kaiser et al.
-
Multimodal Explanations: Justifying Decisions And Pointing To The Evidence
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 337 citations
Park et al.
-
Controlling Personality-based Stylistic Variation With Neural Natural Language Generators
(2018)
• Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue
• 65 citations
Oraby et al.
-
Correcting Length Bias In Neural Machine Translation
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 141 citations
Kenton Murray, David Chiang
-
Query And Output: Generating Words By Querying Distributed Word Representations For Paraphrase Generation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 67 citations
Ma et al.
-
A Simple Method For Commonsense Reasoning
(2018)
• Arxiv
• 341 citations
Trieu H. Trinh, Quoc V. Le
-
Extreme Adaptation For Personalized Neural Machine Translation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 98 citations
Paul Michel, Graham Neubig
-
Has Machine Translation Achieved Human Parity? A Case For Document-level Evaluation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 280 citations
Samuel Läubli, Rico Sennrich, Martin Volk
-
Neural Open Information Extraction
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 131 citations
Lei Cui, Furu Wei, Ming Zhou
-
Learning To Summarize Radiology Findings
(2018)
• Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis
• 117 citations
Zhang et al.
-
Talk The Walk: Navigating New York City Through Grounded Dialogue
(2018)
• Arxiv
• 117 citations
Vries et al.
-
Speaker-follower Models For Vision-and-language Navigation
(2018)
• Arxiv
• 222 citations
Fried et al.
-
Bag-of-words As Target For Neural Machine Translation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 82 citations
Ma et al.
-
Incremental Decoding And Training Methods For Simultaneous Translation In Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 108 citations
Dalvi et al.
-
Towards End-to-end Prosody Transfer For Expressive Speech Synthesis With Tacotron
(2018)
• Arxiv
• 230 citations
Skerry-Ryan et al.
-
Bi-directional Neural Machine Translation With Synthetic Parallel Data
(2018)
• Proceedings of the 2nd Workshop on Neural Machine Translation and Generation
• 80 citations
Xing Niu, Michael Denkowski, Marine Carpuat
-
Semantic Parsing For Task Oriented Dialog Using Hierarchical Representations
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 189 citations
Gupta et al.
-
Visual Coreference Resolution In Visual Dialog Using Neural Module Networks
(2018)
• Lecture Notes in Computer Science
• 185 citations
Kottur et al.
-
Syllable-based Sequence-to-sequence Speech Recognition With The Transformer In Mandarin Chinese
(2018)
• Interspeech 2018
• 97 citations
Zhou et al.
-
Visual Question Answering As Reading Comprehension
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 177 citations
Li et al.
-
Evaluating Compositionality In Sentence Embeddings
(2018)
• Arxiv
• 118 citations
Dasgupta et al.
-
Large-scale QA-SRL Parsing
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 86 citations
Fitzgerald et al.
-
Neural Modular Control For Embodied Question Answering
(2018)
• Arxiv
• 78 citations
Das et al.
-
Context-aware Neural Machine Translation Learns Anaphora Resolution
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 315 citations
Voita et al.
-
Extending A Parser To Distant Domains Using A Few Dozen Partially Annotated Examples
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 98 citations
Vidur Joshi, Matthew Peters, Mark Hopkins
-
Ordered Neurons: Integrating Tree Structures Into Recurrent Neural Networks
(2018)
• Arxiv
• 184 citations
Shen et al.
-
Sdnet: Contextualized Attention-based Deep Network For Conversational Question Answering
(2018)
• Arxiv
• 124 citations
Chenguang Zhu, Michael Zeng, Xuedong Huang
-
Mem2seq: Effectively Incorporating Knowledge Bases Into End-to-end Task-oriented Dialog Systems
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 303 citations
Andrea Madotto, Chien-Sheng Wu, Pascale Fung
-
SWAG: A Large-scale Adversarial Dataset For Grounded Commonsense Inference
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 712 citations
Zellers et al.
-
Learning To Ask Questions In Open-domain Conversational Systems With Typed Decoders
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 66 citations
Wang et al.
-
Chatpainter: Improving Text To Image Generation Using Dialogue
(2018)
• Arxiv
• 80 citations
Sharma et al.
-
Leveraging Intra-user And Inter-user Representation Learning For Automated Hate Speech Detection
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 93 citations
Qian et al.
-
Deep Dyna-q: Integrating Planning For Task-completion Dialogue Policy Learning
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 169 citations
Peng et al.
-
A Tree-based Decoder For Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 57 citations
Wang et al.
-
MTNT: A Testbed For Machine Translation Of Noisy Text
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 137 citations
Paul Michel, Graham Neubig
-
Neural Argument Generation Augmented With Externally Retrieved Evidence
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 57 citations
Xinyu Hua, Lu Wang
-
Can A Suit Of Armor Conduct Electricity? A New Dataset For Open Book Question Answering
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 506 citations
Mihaylov et al.
-
Getting Gender Right In Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 180 citations
Eva Vanmassenhove, Christian Hardmeier, Andy Way
-
Why Self-attention? A Targeted Evaluation Of Neural Machine Translation Architectures
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 292 citations
Tang et al.
-
Opennmt: Neural Machine Translation Toolkit
(2018)
• Arxiv
• 67 citations
Klein et al.
-
Scheduled Multi-task Learning: From Syntax To Translation
(2018)
• Transactions of the Association for Computational Linguistics
• 81 citations
Eliyahu Kiperwasser, Miguel Ballesteros
-
Atrank: An Attention-based User Behavior Modeling Framework For Recommendation
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 260 citations
Zhou et al.
-
From Eliza To Xiaoice: Challenges And Opportunities With Social Chatbots
(2018)
• Frontiers of Information Technology & Electronic Engineering
• 633 citations
Heung-Yeung Shum, Xiaodong He, di Li
-
Allennlp: A Deep Semantic Natural Language Processing Platform
(2018)
• Proceedings of Workshop for NLP Open Source Software (NLP-OSS)
• 1139 citations
Gardner et al.
-
Open Domain Question Answering Using Early Fusion Of Knowledge Bases And Text
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 403 citations
Sun et al.
-
TRANX: A Transition-based Neural Abstract Syntax Parser For Semantic Parsing And Code Generation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
• 192 citations
Pengcheng Yin, Graham Neubig
-
Neural-symbolic VQA: Disentangling Reasoning From Vision And Language Understanding
(2018)
• Arxiv
• 226 citations
Yi et al.
-
Approaching Neural Grammatical Error Correction As A Low-resource Machine Translation Task
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 179 citations
Junczys-Dowmunt et al.
-
What Makes Reading Comprehension Questions Easier?
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 111 citations
Sugawara et al.
-
Interpreting Recurrent And Attention-based Neural Models: A Case Study On Natural Language Inference
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 90 citations
Reza Ghaeini, Xiaoli Z. Fern, Prasad Tadepalli
-
Ms-uedin Submission To The WMT2018 APE Shared Task: Dual-source Transformer For Automatic Post-editing
(2018)
• Proceedings of the Third Conference on Machine Translation: Shared Task Papers
• 70 citations
Marcin Junczys-Dowmunt, Roman Grundkiewicz
-
Recipeqa: A Challenge Dataset For Multimodal Comprehension Of Cooking Recipes
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 122 citations
Yagcioglu et al.
-
Wizard Of Wikipedia: Knowledge-powered Conversational Agents
(2018)
• Arxiv
• 501 citations
Dinan et al.
-
A Deep Ensemble Model With Slot Alignment For Sequence-to-sequence Natural Language Generation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 79 citations
Juraska et al.
-
Record: Bridging The Gap Between Human And Machine Commonsense Reading Comprehension
(2018)
• Arxiv
• 238 citations
Zhang et al.
-
Question-guided Hybrid Convolution For Visual Question Answering
(2018)
• Lecture Notes in Computer Science
• 77 citations
Gao et al.
-
Interpretable Charge Predictions For Criminal Cases: Learning To Generate Court Views From Fact Descriptions
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 131 citations
Ye et al.
-
CNN+CNN: Convolutional Decoders For Image Captioning
(2018)
• Arxiv
• 65 citations
Qingzhong Wang, Antoni B. Chan
-
Towards Robust Neural Machine Translation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 182 citations
Cheng et al.
-
Reaching Human-level Performance In Automatic Grammatical Error Correction: An Empirical Study
(2018)
• Arxiv
• 100 citations
Tao Ge, Furu Wei, Ming Zhou
-
Rankme: Reliable Human Ratings For Natural Language Generation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 80 citations
Jekaterina Novikova, Ondřej Dušek, Verena Rieser
-
Exploring Visual Relationship For Image Captioning
(2018)
• Lecture Notes in Computer Science
• 878 citations
Yao et al.
-
Revisiting Character-based Neural Machine Translation With Capacity And Compression
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 98 citations
Cherry et al.
-
Neural Metaphor Detection In Context
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 121 citations
Gao et al.
-
Babyai: A Platform To Study The Sample Efficiency Of Grounded Language Learning
(2018)
• Arxiv
• 69 citations
Chevalier-Boisvert et al.
-
Multi-reward Reinforced Summarization With Saliency And Entailment
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 153 citations
Ramakanth Pasunuru, Mohit Bansal
-
End-to-end Content And Plan Selection For Data-to-text Generation
(2018)
• Proceedings of the 11th International Conference on Natural Language Generation
• 81 citations
Gehrmann et al.
-
Abstractive Dialogue Summarization With Sentence-gated Modeling Optimized By Dialogue Acts
(2018)
• 2018 IEEE Spoken Language Technology Workshop (SLT)
• 109 citations
Chih-Wen Goo, Yun-Nung Chen
-
Explainable Recommendation Via Multi-task Learning In Opinionated Text Data
(2018)
• The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
• 111 citations
Wang et al.
-
Ultra-fine Entity Typing
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 216 citations
Choi et al.
-
Fine-grained Attention Mechanism For Neural Machine Translation
(2018)
• Neurocomputing
• 197 citations
Heeyoul Choi, Kyunghyun Cho, Yoshua Bengio
-
Quac : Question Answering In Context
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 688 citations
Choi et al.
-
Guiding Neural Machine Translation With Retrieved Translation Pieces
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 117 citations
Zhang et al.
-
Rearranging The Familiar: Testing Compositional Generalization In Recurrent Networks
(2018)
• Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 106 citations
João Loula, Marco Baroni, Brenden M. Lake
-
Black-box Generation Of Adversarial Text Sequences To Evade Deep Learning Classifiers
(2018)
• 2018 IEEE Security and Privacy Workshops (SPW)
• 614 citations
Gao et al.
-
Learning Semantic Textual Similarity From Conversations
(2018)
• Proceedings of The Third Workshop on Representation Learning for NLP
• 159 citations
Yang et al.
-
Text Data Augmentation Made Simple By Leveraging NLP Cloud Apis
(2018)
• Arxiv
• 82 citations
Claude Coulombe
-
Are All Languages Equally Hard To Language-model?
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 94 citations
Cotterell et al.
-
Fully Convolutional Speech Recognition
(2018)
• Arxiv
• 90 citations
Zeghidour et al.
-
Improved Training Of End-to-end Attention Models For Speech Recognition
(2018)
• Interspeech 2018
• 292 citations
Zeyer et al.
-
Contextual Parameter Generation For Universal Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 170 citations
Platanios et al.
-
Dr-bilstm: Dependent Reading Bidirectional LSTM For Natural Language Inference
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 94 citations
Ghaeini et al.
-
Dialogue Learning With Human Teaching And Feedback In End-to-end Trainable Task-oriented Dialogue Systems
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 157 citations
Liu et al.
-
XNLI: Evaluating Cross-lingual Sentence Representations
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 938 citations
Conneau et al.
-
What You Can Cram Into A Single Vector: Probing Sentence Embeddings For Linguistic Properties
(2018)
• Arxiv
• 321 citations
Conneau et al.
-
An Analysis Of Neural Language Modeling At Multiple Scales
(2018)
• Arxiv
• 161 citations
Stephen Merity, Nitish Shirish Keskar, Richard Socher
-
RETURNN As A Generic Flexible Neural Toolkit With Application To Translation And Speech Recognition
(2018)
• Proceedings of ACL 2018, System Demonstrations
• 73 citations
Albert Zeyer, Tamer Alkhouli, Hermann Ney
-
A Survey Of Domain Adaptation For Neural Machine Translation
(2018)
• Arxiv
• 137 citations
Chenhui Chu, Rui Wang
-
BLEU Is Not Suitable For The Evaluation Of Text Simplification
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 139 citations
Elior Sulem, Omri Abend, Ari Rappoport
-
Textual Explanations For Self-driving Vehicles
(2018)
• Lecture Notes in Computer Science
• 250 citations
Kim et al.
-
Integrating Transformer And Paraphrase Rules For Sentence Simplification
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 108 citations
Zhao et al.
-
Multi-task Learning For Joint Language Understanding And Dialogue State Tracking
(2018)
• Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue
• 65 citations
Abhinav Rastogi, Raghav Gupta, Dilek Hakkani-Tur
-
Microsoft Dialogue Challenge: Building End-to-end Task-completion Dialogue Systems
(2018)
• Arxiv
• 68 citations
Li et al.
-
GLAC Net: Glocal Attention Cascading Networks For Multi-image Cued Story Generation
(2018)
• Arxiv
• 56 citations
Kim et al.
-
Extending Neural Generative Conversational Model Using External Knowledge Sources
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 72 citations
Prasanna Parthasarathi, Joelle Pineau
-
Multimodal Dual Attention Memory For Video Story Question Answering
(2018)
• Lecture Notes in Computer Science
• 78 citations
Kim et al.
-
Bilinear Attention Networks
(2018)
• Arxiv
• 586 citations
Jin-Hwa Kim, Jaehyun Jun, Byoung-Tak Zhang
-
Attention-guided Answer Distillation For Machine Reading Comprehension
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 58 citations
Hu et al.
-
A Large-scale Test Set For The Evaluation Of Context-aware Pronoun Translation In Neural Machine Translation
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 152 citations
Müller et al.
-
Adversarial Example Generation With Syntactically Controlled Paraphrase Networks
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 624 citations
Iyyer et al.
-
Multilingual Extractive Reading Comprehension By Runtime Machine Translation
(2018)
• Arxiv
• 66 citations
Asai et al.
-
Training Millions Of Personalized Dialogue Agents
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 218 citations
Mazaré et al.
-
The Natural Language Decathlon: Multitask Learning As Question Answering
(2018)
• Arxiv
• 349 citations
McCann et al.
-
A Skeleton-based Model For Promoting Coherence Among Sentences In Narrative Story Generation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 102 citations
Xu et al.
-
Sql-to-text Generation With Graph-to-sequence Model
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 72 citations
Xu et al.
-
A Unified Model For Extractive And Abstractive Summarization Using Inconsistency Loss
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 279 citations
Hsu et al.
-
Know What You Don't Know: Unanswerable Questions For Squad
(2018)
• Arxiv
• 134 citations
Pranav Rajpurkar, Robin Jia, Percy Liang
-
Deep Contextualized Word Representations
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 11074 citations
Peters et al.
-
Explainable Neural Computation Via Stack Neural Module Networks
(2018)
• Lecture Notes in Computer Science
• 189 citations
Hu et al.
-
DP-GAN: Diversity-promoting Generative Adversarial Network For Generating Informative And Diversified Text
(2018)
• Arxiv
• 58 citations
Xu et al.
-
Adaptive Input Representations For Neural Language Modeling
(2018)
• Arxiv
• 266 citations
Alexei Baevski, Michael Auli
-
Sequence-to-sequence Data Augmentation For Dialogue Language Understanding
(2018)
• Arxiv
• 121 citations
Hou et al.
-
Dissecting Contextual Word Embeddings: Architecture And Representation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 447 citations
Peters et al.
-
Systematic Generalization: What Is Required And Can It Be Learned?
(2018)
• Arxiv
• 78 citations
Bahdanau et al.
-
Learning To Understand Goal Specifications By Modelling Reward
(2018)
• Arxiv
• 64 citations
Bahdanau et al.
-
Trellis Networks For Sequence Modeling
(2018)
• Arxiv
• 71 citations
Shaojie Bai, J. Zico Kolter, Vladlen Koltun
-
Generating More Interesting Responses In Neural Conversation Models With Distributional Constraints
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 99 citations
Baheti et al.
-
An Empirical Evaluation Of Generic Convolutional And Recurrent Networks For Sequence Modeling
(2018)
• Arxiv
• 4005 citations
Shaojie Bai, J. Zico Kolter, Vladlen Koltun
-
Incsql: Training Incremental Text-to-sql Parsers With Non-deterministic Oracles
(2018)
• Arxiv
• 59 citations
Shi et al.
-
Neural Machine Translation Into Language Varieties
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 55 citations
Surafel M. Lakew, Aliia Erofeeva, Marcello Federico
-
Ranking Paragraphs For Improving Answer Recall In Open-domain Question Answering
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 97 citations
Lee et al.
-
Qanet: Combining Local Convolution With Global Self-attention For Reading Comprehension
(2018)
• Arxiv
• 471 citations
Yu et al.
-
Improving Automatic Source Code Summarization Via Deep Reinforcement Learning
(2018)
• Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering
• 372 citations
Wan et al.
-
Unpaired Sentiment-to-sentiment Translation: A Cycled Reinforcement Learning Approach
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 208 citations
Xu et al.
-
Overcoming Language Priors In Visual Question Answering With Adversarial Regularization
(2018)
• Arxiv
• 132 citations
Sainandan Ramakrishnan, Aishwarya Agrawal, Stefan Lee
-
Low-resource Speech-to-text Translation
(2018)
• Interspeech 2018
• 59 citations
Bansal et al.
-
Deriving Machine Attention From Human Rationales
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 92 citations
Bao et al.
-
Multimodal Named Entity Recognition For Short Social Media Posts
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 150 citations
Seungwhan Moon, Leonardo Neves, Vitor Carvalho
-
Evaluation Of Sentence Embeddings In Downstream And Linguistic Probing Tasks
(2018)
• Arxiv
• 105 citations
Christian S. Perone, Roberto Silveira, Thomas S. Paula
-
Training Deeper Neural Machine Translation Models With Transparent Attention
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 146 citations
Bapna et al.
-
No Metrics Are Perfect: Adversarial Reward Learning For Visual Storytelling
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 178 citations
Wang et al.
-
Measuring Abstract Reasoning In Neural Networks
(2018)
• Arxiv
• 133 citations
Barrett et al.
-
Targeted Syntactic Evaluation Of Language Models
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 401 citations
Rebecca Marvin, Tal Linzen
-
Jump To Better Conclusions: SCAN Both Left And Right
(2018)
• Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
• 58 citations
Bastings et al.
-
Commonsense For Generative Multi-hop Question Answering Tasks
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 173 citations
Lisa Bauer, Yicheng Wang, Mohit Bansal
-
On Accurate Evaluation Of Gans For Language Generation
(2018)
• Arxiv
• 83 citations
Stanislau Semeniuta, Aliaksei Severyn, Sylvain Gelly
-
Neural Code Comprehension: A Learnable Representation Of Code Semantics
(2018)
• Arxiv
• 81 citations
Tal Ben-Nun, Alice Shoshana Jakobovits, Torsten Hoefler
-
Tell-and-answer: Towards Explainable Visual Question Answering Using Attributes And Captions
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 58 citations
Li et al.
-
Generating Wikipedia By Summarizing Long Sequences
(2018)
• Arxiv
• 551 citations
Liu et al.
-
A Study Of Reinforcement Learning For Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 155 citations
Wu et al.
-
Compositional Attention Networks For Machine Reasoning
(2018)
• Arxiv
• 241 citations
Drew A. Hudson, Christopher D. Manning
-
Look Before You Leap: Bridging Model-free And Model-based Reinforcement Learning For Planned-ahead Vision-and-language Navigation
(2018)
• Lecture Notes in Computer Science
• 196 citations
Wang et al.
-
Response Selection With Topic Clues For Retrieval-based Chatbots
(2018)
• Neurocomputing
• 59 citations
Wu et al.
-
Video Captioning Via Hierarchical Reinforcement Learning
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 270 citations
Wang et al.
-
A Joint Sequence Fusion Model For Video Question Answering And Retrieval
(2018)
• Lecture Notes in Computer Science
• 324 citations
Youngjae Yu, Jongseok Kim, Gunhee Kim
-
Mattnet: Modular Attention Network For Referring Expression Comprehension
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 750 citations
Yu et al.
-
The Unreasonable Effectiveness Of The Forget Gate
(2018)
• Arxiv
• 70 citations
Jos van Der Westhuizen, Joan Lasenby
-
Typesql: Knowledge-based Type-aware Neural Text-to-sql Generation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 208 citations
Yu et al.
-
Learning To Extract Coherent Summary Via Deep Reinforcement Learning
(2018)
• Proceedings of the AAAI Conference on Artificial Intelligence
• 134 citations
Yuxiang Wu, Baotian Hu
-
Morphosyntactic Tagging With A Meta-bilstm Model Over Context Sensitive Token Encodings
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 84 citations
Bohnet et al.
-
A Comparison Of Transformer And Recurrent Neural Networks On Multilingual Neural Machine Translation
(2018)
• Arxiv
• 59 citations
Surafel M. Lakew, Mauro Cettolo, Marcello Federico
-
Learning To Write With Cooperative Discriminators
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 219 citations
Holtzman et al.
-
Mapping Instructions To Actions In 3D Environments With Visual Goal Prediction
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 142 citations
Misra et al.
-
Discourse-aware Neural Rewards For Coherent Text Generation
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 80 citations
Bosselut et al.
-
Learning To Split And Rephrase From Wikipedia Edit History
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 68 citations
Botha et al.
-
How Agents See Things: On Visual Representations In An Emergent Language Game
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 82 citations
Diane Bouchacourt, Marco Baroni
-
CUNI System For The WMT18 Multimodal Translation Task
(2018)
• Proceedings of the Third Conference on Machine Translation: Shared Task Papers
• 60 citations
Jindřich Helcl, Jindřich Libovický, Dušan Variš
-
Stochastic Answer Networks For Natural Language Inference
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 195 citations
Xiaodong Liu, Kevin Duh, Jianfeng Gao
-
Multiwoz -- A Large-scale Multi-domain Wizard-of-oz Dataset For Task-oriented Dialogue Modelling
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 864 citations
Budzianowski et al.
-
Generative Code Modeling With Graphs
(2018)
• Arxiv
• 84 citations
Brockschmidt et al.
-
Phrase-based & Neural Unsupervised Machine Translation
(2018)
• Arxiv
• 260 citations
Lample et al.
-
Grounding Visual Explanations
(2018)
• Lecture Notes in Computer Science
• 204 citations
Hendricks et al.
-
Global-locally Self-attentive Dialogue State Tracker
(2018)
• Arxiv
• 76 citations
Victor Zhong, Caiming Xiong, Richard Socher
-
Multi-task Learning For Argumentation Mining In Low-resource Settings
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 65 citations
Schulz et al.
-
Learning Visual Question Answering By Bootstrapping Hard Attention
(2018)
• Lecture Notes in Computer Science
• 103 citations
Malinowski et al.
-
Object Hallucination In Image Captioning
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 254 citations
Rohrbach et al.
-
Leveraging Grammar And Reinforcement Learning For Neural Program Synthesis
(2018)
• Arxiv
• 72 citations
Bunel et al.
-
Using Monolingual Data In Neural Machine Translation: A Systematic Study
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 123 citations
Franck Burlot, François Yvon
-
End-to-end Automatic Speech Translation Of Audiobooks
(2018)
• 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 130 citations
Bérard et al.
-
Hypothesis Only Baselines In Natural Language Inference
(2018)
• Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics
• 577 citations
Poliak et al.
-
Robust Machine Comprehension Models Via Adversarial Training
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 104 citations
Yicheng Wang, Mohit Bansal
-
Conversational AI: The Science Behind The Alexa Prize
(2018)
• Alexa.Prize.Proceedings https://developer.amazon.com/alexaprize/proceedings accessed (2018)-01-01
• 222 citations
Ram et al.
-
E-snli: Natural Language Inference With Natural Language Explanations
(2018)
• Arxiv
• 279 citations
Camburu et al.
-
Predicting Expressive Speaking Style From Text In End-to-end Speech Synthesis
(2018)
• 2018 IEEE Spoken Language Technology Workshop (SLT)
• 109 citations
Daisy Stanton, Yuxuan Wang, Rj Skerry-Ryan
-
Knowledgeable Reader: Enhancing Cloze-style Reading Comprehension With External Commonsense Knowledge
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 190 citations
Todor Mihaylov, Anette Frank
-
A Dataset For Document Grounded Conversations
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 207 citations
Kangyan Zhou, Shrimai Prabhumoye, Alan W Black
-
Toward Diverse Text Generation With Inverse Reinforcement Learning
(2018)
• Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
• 86 citations
Shi et al.
-
Yuanfudao At Semeval-2018 Task 11: Three-way Attention And Relational Knowledge For Commonsense Machine Comprehension
(2018)
• Proceedings of The 12th International Workshop on Semantic Evaluation
• 83 citations
Wang et al.
-
Learning To Describe Differences Between Pairs Of Similar Images
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 105 citations
Harsh Jhamtani, Taylor Berg-Kirkpatrick
-
Retrieval-based Neural Code Generation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 85 citations
Hayati et al.
-
Learning Neural Templates For Text Generation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 214 citations
Sam Wiseman, Stuart M. Shieber, Alexander M. Rush
-
An Empirical Study Of Example Forgetting During Deep Neural Network Learning
(2018)
• Arxiv
• 200 citations
Toneva et al.
-
Dopamine: A Research Framework For Deep Reinforcement Learning
(2018)
• Arxiv
• 176 citations
Castro et al.
-
Deep Communicating Agents For Abstractive Summarization
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
• 337 citations
Celikyilmaz et al.
-
Universal Sentence Encoder
(2018)
• Arxiv
• 1368 citations
Cer et al.
-
Ukp-athene: Multi-sentence Textual Entailment For Claim Verification
(2018)
• Proceedings of the First Workshop on Fact Extraction and VERification (FEVER)
• 189 citations
Hanselowski et al.
-
Achieving Human Parity On Automatic Chinese To English News Translation
(2018)
• Arxiv
• 573 citations
Hassan et al.
-
Sentiment Adaptive End-to-end Dialog Systems
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 76 citations
Weiyan Shi, Zhou Yu
-
Learning To Ask Good Questions: Ranking Clarification Questions Using Neural Expected Value Of Perfect Information
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 153 citations
Sudha Rao, Hal Daumé
-
Unsupervised Discrete Sentence Representation Learning For Interpretable Neural Dialog Generation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 139 citations
Tiancheng Zhao, Kyusong Lee, Maxine Eskenazi
-
"found In Translation": Predicting Outcomes Of Complex Organic Chemistry Reactions Using Neural Sequence-to-sequence Models
(2018)
• Chemical Science
• 423 citations
Schwaller et al.
-
Semi-autoregressive Neural Machine Translation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 80 citations
Chunqi Wang, Ji Zhang, Haiqing Chen
-
Trivial Transfer Learning For Low-resource Neural Machine Translation
(2018)
• Proceedings of the Third Conference on Machine Translation: Research Papers
• 158 citations
Tom Kocmi, Ondřej Bojar
-
Do Explanations Make VQA Models More Predictable To A Human?
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 57 citations
Chandrasekaran et al.
-
Zero-shot Dialog Generation With Cross-domain Latent Actions
(2018)
• Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue
• 55 citations
Tiancheng Zhao, Maxine Eskenazi
-
Diverse And Coherent Paragraph Generation From Images
(2018)
• Lecture Notes in Computer Science
• 67 citations
Moitreya Chatterjee, Alexander G. Schwing
-
Efficient And Robust Question Answering From Minimal Context Over Documents
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 151 citations
Min et al.
-
Quality Expectations Of Machine Translation
(2018)
• Machine Translation: Technologies and Applications
• 84 citations
Andy Way
-
Semstyle: Learning To Generate Stylised Image Captions Using Unaligned Text
(2018)
• 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
• 132 citations
Alexander Mathews, Lexing Xie, Xuming He
-
Neural Machine Translation Decoding With Terminology Constraints
(2018)
• Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
• 93 citations
Hasler et al.
-
Texygen: A Benchmarking Platform For Text Generation Models
(2018)
• Arxiv
• 164 citations
Zhu et al.
-
Recurrent Fusion Network For Image Captioning
(2018)
• Lecture Notes in Computer Science
• 304 citations
Jiang et al.
-
Twitter Sentiment Analysis Via Bi-sense Emoji Embedding And Attention-based LSTM
(2018)
• Proceedings of the 26th ACM international conference on Multimedia
• 60 citations
Chen et al.
-
The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 401 citations
Chen et al.
-
Enhancing Sentence Embedding With Generalized Pooling
(2018)
• Arxiv
• 60 citations
Qian Chen, Zhen-Hua Ling, Xiaodan Zhu
-
Fast Abstractive Summarization With Reinforce-selected Sentence Rewriting
(2018)
• Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 624 citations
Yen-Chun Chen, Mohit Bansal
-
A Retrieve-and-edit Framework For Predicting Structured Outputs
(2018)
• Arxiv
• 96 citations
Hashimoto et al.
-
Tree-to-tree Neural Networks For Program Translation
(2018)
• Arxiv
• 91 citations
Xinyun Chen, Chang Liu, Dawn Song
-
Pythia V0.1: The Winning Entry To The VQA Challenge 2018
(2018)
• Arxiv
• 177 citations
Jiang et al.
-
DRCD: A Chinese Machine Reading Comprehension Dataset
(2018)
• Arxiv
• 91 citations
Shao et al.
-
Guided Neural Language Generation For Abstractive Summarization Using Abstract Meaning Representation
(2018)
• Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
• 70 citations
Hardy, Andreas Vlachos
-
Seq2seq-vis: A Visual Debugging Tool For Sequence-to-sequence Models
(2018)
• IEEE Transactions on Visualization and Computer Graphics
• 219 citations
Strobelt et al.
-
Escape: A Large-scale Synthetic Corpus For Automatic Post-editing
(2018)
• Arxiv
• 55 citations
Negri et al.
-
Flexible End-to-end Dialogue System For Knowledge Grounded Conversation
(2017)
• Arxiv
• 89 citations
Zhu et al.
-
Compressing Word Embeddings Via Deep Compositional Code Learning
(2017)
• Arxiv
• 95 citations
Raphael Shu, Hideki Nakayama
-
Incorporating Copying Mechanism In Image Captioning For Learning Novel Objects
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 152 citations
Yao et al.
-
A Question Answering Approach To Emotion Cause Extraction
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 183 citations
Gui et al.
-
A Deep Reinforced Model For Abstractive Summarization
(2017)
• Arxiv
• 1294 citations
Romain Paulus, Caiming Xiong, Richard Socher
-
Question Answering Through Transfer Learning From Large Fine-grained Supervision Data
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 120 citations
Sewon Min, Minjoon Seo, Hannaneh Hajishirzi
-
Breaking The Softmax Bottleneck: A High-rank RNN Language Model
(2017)
• Arxiv
• 270 citations
Yang et al.
-
Skeleton Key: Image Captioning By Skeleton-attribute Decomposition
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 99 citations
Wang et al.
-
Multi-task Video Captioning With Video And Entailment Generation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 100 citations
Ramakanth Pasunuru, Mohit Bansal
-
Fine-grained Human Evaluation Of Neural Versus Phrase-based Machine Translation
(2017)
• The Prague Bulletin of Mathematical Linguistics
• 84 citations
Filip Klubička, Antonio Toral, Víctor M. Sánchez-Cartagena
-
Opennmt: Open-source Toolkit For Neural Machine Translation
(2017)
• Proceedings of ACL 2017, System Demonstrations
• 1819 citations
Klein et al.
-
Turing At Semeval-2017 Task 8: Sequential Approach To Rumour Stance Classification With Branch-lstm
(2017)
• Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)
• 117 citations
Elena Kochkina, Maria Liakata, Isabelle Augenstein
-
RACE: Large-scale Reading Comprehension Dataset From Examinations
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 963 citations
Lai et al.
-
A Syntactic Neural Model For General-purpose Code Generation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 545 citations
Pengcheng Yin, Graham Neubig
-
Question Answering On Knowledge Bases And Text Using Universal Schema And Memory Networks
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 143 citations
Das et al.
-
Neural Machine Translation With Source-side Latent Graph Parsing
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 59 citations
Kazuma Hashimoto, Yoshimasa Tsuruoka
-
Personalization In Goal-oriented Dialog
(2017)
• Arxiv
• 68 citations
Chaitanya K. Joshi, Fei Mi, Boi Faltings
-
Modulating Early Visual Processing By Language
(2017)
• Arxiv
• 235 citations
Vries et al.
-
Composite Task-completion Dialogue Policy Learning Via Hierarchical Deep Reinforcement Learning
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 164 citations
Peng et al.
-
Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 327 citations
Das et al.
-
Natural Language Generation For Spoken Dialogue System Using RNN Encoder-decoder Networks
(2017)
• Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017)
• 55 citations
van-Khanh Tran, Le-Minh Nguyen
-
TAC-GAN - Text Conditioned Auxiliary Classifier Generative Adversarial Network
(2017)
• Arxiv
• 108 citations
Dash et al.
-
Parlai: A Dialog Research Software Platform
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
• 307 citations
Miller et al.
-
Actor-critic Sequence Training For Image Captioning
(2017)
• Arxiv
• 110 citations
Zhang et al.
-
OBJ2TEXT: Generating Visually Descriptive Language From Object Layouts
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 62 citations
Xuwang Yin, Vicente Ordonez
-
Modeling Source Syntax For Neural Machine Translation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 133 citations
Li et al.
-
A Teacher-student Framework For Zero-resource Neural Machine Translation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 110 citations
Chen et al.
-
Hierarchical LSTM With Adjusted Temporal Attention For Video Captioning
(2017)
• Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
• 162 citations
Song et al.
-
Improving Semantic Relevance For Sequence-to-sequence Learning Of Chinese Social Media Text Summarization
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 62 citations
Ma et al.
-
Adversarial Examples For Evaluating Reading Comprehension Systems
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 1353 citations
Robin Jia, Percy Liang
-
Evidence Aggregation For Answer Re-ranking In Open-domain Question Answering
(2017)
• Arxiv
• 116 citations
Wang et al.
-
A Survey On Dialogue Systems: Recent Advances And New Frontiers
(2017)
• Arxiv
• 396 citations
Chen et al.
-
Recurrent Neural Network-based Sentence Encoder With Gated Attention For Natural Language Inference
(2017)
• Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP
• 94 citations
Chen et al.
-
Improved Neural Machine Translation With A Syntax-aware Encoder And Decoder
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 132 citations
Chen et al.
-
Deep Learning For User Comment Moderation
(2017)
• Proceedings of the First Workshop on Abusive Language Online
• 104 citations
John Pavlopoulos, Prodromos Malakasiotis, Ion Androutsopoulos
-
Selective Encoding For Abstractive Sentence Summarization
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 233 citations
Zhou et al.
-
Cross-domain Semantic Parsing Via Paraphrasing
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 84 citations
Yu Su, Xifeng Yan
-
Curriculum Learning And Minibatch Bucketing In Neural Machine Translation
(2017)
• RANLP 2017 - Recent Advances in Natural Language Processing Meet Deep Learning
• 97 citations
Tom Kocmi, Ondrej Bojar
-
Paying Attention To Descriptions Generated By Image Captioning Models
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 78 citations
Tavakoli et al.
-
Neural Paraphrase Identification Of Questions With Noisy Pretraining
(2017)
• Proceedings of the First Workshop on Subword and Character Level Models in NLP
• 65 citations
Tomar et al.
-
Generative Encoder-decoder Models For Task-oriented Spoken Dialog Systems With Chatting Capability
(2017)
• Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue
• 87 citations
Zhao et al.
-
A Joint Model For Question Answering And Question Generation
(2017)
• Arxiv
• 87 citations
Tong Wang, Xingdi Yuan, Adam Trischler
-
A Multifaceted Evaluation Of Neural Versus Phrase-based Machine Translation For 9 Language Directions
(2017)
• Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
• 115 citations
Antonio Toral, Víctor M. Sánchez-Cartagena
-
Learning To Rank Question Answer Pairs With Holographic Dual LSTM Architecture
(2017)
• Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 119 citations
Tay et al.
-
Tacotron: Towards End-to-end Speech Synthesis
(2017)
• Interspeech 2017
• 1523 citations
Wang et al.
-
End-to-end Optimization Of Goal-driven And Visually Grounded Dialogue Systems
(2017)
• Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
• 94 citations
Strub et al.
-
Neural Speed Reading Via Skim-rnn
(2017)
• Arxiv
• 56 citations
Seo et al.
-
Grounded Language Learning In A Simulated 3D World
(2017)
• Arxiv
• 163 citations
Hermann et al.
-
Multi-modal Factorized Bilinear Pooling With Co-attention Learning For Visual Question Answering
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 659 citations
Yu et al.
-
Simulating Action Dynamics With Neural Process Networks
(2017)
• Arxiv
• 79 citations
Bosselut et al.
-
Contextual Sequence Modeling For Recommendation With Recurrent Neural Networks
(2017)
• Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems
• 159 citations
Elena Smirnova, Flavian Vasile
-
Learning Discourse-level Diversity For Neural Dialog Models Using Conditional Variational Autoencoders
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 728 citations
Tiancheng Zhao, Ran Zhao, Maxine Eskenazi
-
MAT: A Multimodal Attentive Translator For Image Captioning
(2017)
• Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
• 57 citations
Liu et al.
-
Adversarial Learning For Neural Dialogue Generation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 796 citations
Li et al.
-
Topically Driven Neural Language Model
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 63 citations
Jey Han Lau, Timothy Baldwin, Trevor Cohn
-
Speaking The Same Language: Matching Machine To Human Captions By Adversarial Training
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 242 citations
Shetty et al.
-
Affect-lm: A Neural Language Model For Customizable Affective Text Generation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 158 citations
Ghosh et al.
-
Massive Exploration Of Neural Machine Translation Architectures
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 467 citations
Britz et al.
-
S-net: From Answer Extraction To Answer Generation For Machine Reading Comprehension
(2017)
• Arxiv
• 72 citations
Tan et al.
-
Learning Structured Natural Language Representations For Semantic Parsing
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 62 citations
Cheng et al.
-
Hierarchically-attentive RNN For Album Summarization And Storytelling
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 74 citations
Licheng Yu, Mohit Bansal, Tamara L. Berg
-
Neural Machine Translation With Extended Context
(2017)
• Proceedings of the Third Workshop on Discourse in Machine Translation
• 247 citations
Jörg Tiedemann, Yves Scherrer
-
Structured Attentions For Visual Question Answering
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 110 citations
Zhu et al.
-
Semeval-2017 Task 1: Semantic Textual Similarity - Multilingual And Cross-lingual Focused Evaluation
(2017)
• Arxiv
• 345 citations
Cer et al.
-
Quasar: Datasets For Question Answering By Search And Reading
(2017)
• Arxiv
• 149 citations
Bhuwan Dhingra, Kathryn Mazaitis, William W. Cohen
-
Question Answering And Question Generation As Dual Tasks
(2017)
• Arxiv
• 176 citations
Tang et al.
-
An End-to-end Trainable Neural Network Model With Belief Tracking For Task-oriented Dialog
(2017)
• Interspeech 2017
• 99 citations
Bing Liu, Ian Lane
-
Grammatical Error Correction With Neural Reinforcement Learning
(2017)
• Arxiv
• 59 citations
Keisuke Sakaguchi, Matt Post, Benjamin van Durme
-
Convolutional Sequence To Sequence Learning
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 435 citations
Gehring et al.
-
Neural AMR: Sequence-to-sequence Models For Parsing And Generation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 271 citations
Konstas et al.
-
Seq2sql: Generating Structured Queries From Natural Language Using Reinforcement Learning
(2017)
• Arxiv
• 800 citations
Victor Zhong, Caiming Xiong, Richard Socher
-
Incorporating Global Visual Features Into Attention-based Neural Machine Translation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 154 citations
Iacer Calixto, Qun Liu, Nick Campbell
-
Visual Reference Resolution Using Attention Memory For Visual Dialog
(2017)
• Arxiv
• 91 citations
Seo et al.
-
Shakespearizing Modern Language Using Copy-enriched Sequence-to-sequence Models
(2017)
• Proceedings of the Workshop on Stylistic Variation
• 166 citations
Jhamtani et al.
-
Doubly-attentive Decoder For Multi-modal Neural Machine Translation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 181 citations
Iacer Calixto, Qun Liu, Nick Campbell
-
Ensemble Distillation For Neural Machine Translation
(2017)
• Arxiv
• 67 citations
Markus Freitag, Yaser Al-Onaizan, Baskaran Sankaran
-
Fast And Accurate Entity Recognition With Iterated Dilated Convolutions
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 466 citations
Strubell et al.
-
Deal Or No Deal? End-to-end Learning For Negotiation Dialogues
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 332 citations
Lewis et al.
-
Latent Variable Dialogue Models And Their Diversity
(2017)
• Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
• 55 citations
Kris Cao, Stephen Clark
-
Nlp2code: Code Snippet Content Assist Via Natural Language Tasks
(2017)
• 2017 IEEE International Conference on Software Maintenance and Evolution (ICSME)
• 64 citations
Brock Angus Campbell, Christoph Treude
-
Language Generation With Recurrent Generative Adversarial Networks Without Pre-training
(2017)
• Arxiv
• 90 citations
Press et al.
-
Dynamic Entity Representations In Neural Language Models
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 108 citations
Ji et al.
-
Triviaqa: A Large Scale Distantly Supervised Challenge Dataset For Reading Comprehension
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 1389 citations
Joshi et al.
-
Modeling Target-side Inflection In Neural Machine Translation
(2017)
• Proceedings of the Second Conference on Machine Translation
• 60 citations
Aleš Tamchyna, Marion Weller-di Marco, Alexander Fraser
-
Shapeworld - A New Test Methodology For Multimodal Language Understanding
(2017)
• Arxiv
• 55 citations
Alexander Kuhnle, Ann Copestake
-
Neural Semantic Parsing By Character-based Translation: Experiments With Abstract Meaning Representations
(2017)
• Arxiv
• 85 citations
Rik van Noord, Johan Bos
-
Bpemb: Tokenization-free Pre-trained Subword Embeddings In 275 Languages
(2017)
• Arxiv
• 129 citations
Benjamin Heinzerling, Michael Strube
-
Nematus: A Toolkit For Neural Machine Translation
(2017)
• Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics
• 350 citations
Sennrich et al.
-
Efficient Natural Language Response Suggestion For Smart Reply
(2017)
• Arxiv
• 220 citations
Henderson et al.
-
Robust Incremental Neural Semantic Graph Parsing
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 80 citations
Jan Buys, Phil Blunsom
-
Personalizing Session-based Recommendations With Hierarchical Recurrent Neural Networks
(2017)
• Proceedings of the Eleventh ACM Conference on Recommender Systems
• 664 citations
Quadrana et al.
-
DCN+: Mixed Objective And Deep Residual Coattention For Question Answering
(2017)
• Arxiv
• 91 citations
Caiming Xiong, Victor Zhong, Richard Socher
-
LIUM-CVC Submissions For WMT17 Multimodal Translation Task
(2017)
• Proceedings of the Second Conference on Machine Translation
• 73 citations
Caglayan et al.
-
NMTPY: A Flexible Toolkit For Advanced Neural Machine Translation Systems
(2017)
• The Prague Bulletin of Mathematical Linguistics
• 71 citations
Caglayan et al.
-
Reinforcement Learning For Bandit Neural Machine Translation With Simulated Human Feedback
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 105 citations
Khanh Nguyen, Hal Daumé, Jordan Boyd-Graber
-
The University Of Edinburgh's Neural MT Systems For WMT17
(2017)
• Proceedings of the Second Conference on Machine Translation
• 161 citations
Sennrich et al.
-
Neural End-to-end Learning For Computational Argumentation Mining
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 168 citations
Steffen Eger, Johannes Daxenberger, Iryna Gurevych
-
Findings Of The Second Shared Task On Multimodal Machine Translation And Multilingual Image Description
(2017)
• Proceedings of the Second Conference on Machine Translation
• 194 citations
Elliott et al.
-
A Copy-augmented Sequence-to-sequence Architecture Gives Good Performance On Task-oriented Dialogue
(2017)
• Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
• 126 citations
Mihail Eric, Christopher D. Manning
-
Dynamic Integration Of Background Knowledge In Neural NLU Systems
(2017)
• Arxiv
• 68 citations
Dirk Weissenborn, Tomáš Kočiský, Chris Dyer
-
Neural Net Models For Open-domain Discourse Coherence
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 121 citations
Jiwei Li, Dan Jurafsky
-
Modeling Coherence For Neural Machine Translation With Dynamic And Topic Caches
(2017)
• Arxiv
• 87 citations
Kuang et al.
-
Latent Intention Dialogue Models
(2017)
• Arxiv
• 137 citations
Wen et al.
-
Deep Voice 3: Scaling Text-to-speech With Convolutional Sequence Learning
(2017)
• Arxiv
• 281 citations
Ping et al.
-
Searchqa: A New Q&A Dataset Augmented With Context From A Search Engine
(2017)
• Arxiv
• 423 citations
Dunn et al.
-
Flexible And Creative Chinese Poetry Generation Using Neural Memory
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 80 citations
Zhang et al.
-
Neural Lattice-to-sequence Models For Uncertain Inputs
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 75 citations
Sperber et al.
-
Just ASK: Building An Architecture For Extensible Self-service Spoken Language Understanding
(2017)
• Arxiv
• 58 citations
Kumar et al.
-
Learning To Ask: Neural Question Generation For Reading Comprehension
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 638 citations
Xinya Du, Junru Shao, Claire Cardie
-
Inter-session Modeling For Session-based Recommendation
(2017)
• Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems
• 72 citations
Massimiliano Ruocco, Ole Steinar Lillestøl Skrede, Helge Langseth
-
Imagination Improves Multimodal Translation
(2017)
• Arxiv
• 91 citations
Desmond Elliott, Ákos Kádár
-
Comprehension-guided Referring Expressions
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 148 citations
Ruotian Luo, Gregory Shakhnarovich
-
Addressing The Data Sparsity Issue In Neural AMR Parsing
(2017)
• Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
• 77 citations
Peng et al.
-
MEMEN: Multi-layer Embedding With Memory Networks For Machine Comprehension
(2017)
• Arxiv
• 70 citations
Pan et al.
-
Non-autoregressive Neural Machine Translation
(2017)
• Arxiv
• 471 citations
Gu et al.
-
Towards String-to-tree Neural Machine Translation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 150 citations
Roee Aharoni, Yoav Goldberg
-
Creativity: Generating Diverse Questions Using Variational Autoencoders
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 134 citations
Unnat Jain, Ziyu Zhang, Alexander Schwing
-
Adversarial Ranking For Language Generation
(2017)
• Arxiv
• 192 citations
Lin et al.
-
Weighted Transformer Network For Machine Translation
(2017)
• Arxiv
• 138 citations
Karim Ahmed, Nitish Shirish Keskar, Richard Socher
-
Story Generation From Sequence Of Independent Short Descriptions
(2017)
• Arxiv
• 84 citations
Jain et al.
-
Key-value Retrieval Networks For Task-oriented Dialogue
(2017)
• Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue
• 398 citations
Mihail Eric, Christopher D. Manning
-
Trainable Greedy Decoding For Neural Machine Translation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 59 citations
Jiatao Gu, Kyunghyun Cho, Victor O. K. Li
-
Attention Strategies For Multi-source Sequence-to-sequence Learning
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 176 citations
Jindřich Libovický, Jindřich Helcl
-
Fluency-guided Cross-lingual Image Captioning
(2017)
• Proceedings of the 25th ACM international conference on Multimedia
• 57 citations
Weiyu Lan, Xirong Li, Jianfeng Dong
-
Fusionnet: Fusing Via Fully-aware Attention With Application To Machine Comprehension
(2017)
• Arxiv
• 90 citations
Huang et al.
-
Generating Steganographic Text With Lstms
(2017)
• Proceedings of ACL 2017, Student Research Workshop
• 110 citations
Tina Fang, Martin Jaggi, Katerina Argyraki
-
Visual Translation Embedding Network For Visual Relation Detection
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 550 citations
Zhang et al.
-
Learning To Parse And Translate Improves Neural Machine Translation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 153 citations
Akiko Eriguchi, Yoshimasa Tsuruoka, Kyunghyun Cho
-
A Deep Reinforcement Learning Chatbot
(2017)
• Arxiv
• 210 citations
Serban et al.
-
TGIF-QA: Toward Spatio-temporal Reasoning In Visual Question Answering
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 418 citations
Jang et al.
-
An Empirical Analysis Of Nmt-derived Interlingual Embeddings And Their Use In Parallel Sentence Identification
(2017)
• IEEE Journal of Selected Topics in Signal Processing
• 80 citations
España-Bonet et al.
-
Word-entity Duet Representations For Document Ranking
(2017)
• Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
• 78 citations
Chenyan Xiong, Jamie Callan, Tie-Yan Liu
-
Machine Comprehension By Text-to-text Neural Question Generation
(2017)
• Proceedings of the 2nd Workshop on Representation Learning for NLP
• 166 citations
Yuan et al.
-
Attention Is All You Need
(2017)
• Arxiv
• 60613 citations
Vaswani et al.
-
Does Neural Machine Translation Benefit From Larger Context?
(2017)
• Arxiv
• 139 citations
Jean et al.
-
Iterative Policy Learning In End-to-end Trainable Task-oriented Neural Dialog Models
(2017)
• 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 98 citations
Bing Liu, Ian Lane
-
Learning A Neural Semantic Parser From User Feedback
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 328 citations
Iyer et al.
-
Natural Language Inference Over Interaction Space
(2017)
• Arxiv
• 197 citations
Yichen Gong, Heng Luo, Jian Zhang
-
A Causal Framework For Explaining The Predictions Of Black-box Sequence-to-sequence Models
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 177 citations
David Alvarez-Melis, Tommi S. Jaakkola
-
Multi-task Learning For Speaker-role Adaptation In Neural Conversation Models
(2017)
• Arxiv
• 66 citations
Luan et al.
-
I2T2I: Learning Text To Image Synthesis With Textual Data Augmentation
(2017)
• 2017 IEEE International Conference on Image Processing (ICIP)
• 59 citations
Dong et al.
-
A Simple Neural Network Module For Relational Reasoning
(2017)
• Arxiv
• 1056 citations
Santoro et al.
-
Learning To Paraphrase For Question Answering
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 192 citations
Dong et al.
-
Data Augmentation For Low-resource Neural Machine Translation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 519 citations
Marzieh Fadaee, Arianna Bisazza, Christof Monz
-
Steering Output Style And Topic In Neural Response Generation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 71 citations
Wang et al.
-
From Language To Programs: Bridging Reinforcement Learning And Maximum Marginal Likelihood
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 159 citations
Guu et al.
-
The Microsoft 2017 Conversational Speech Recognition System
(2017)
• 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
• 313 citations
Xiong et al.
-
Generating High-quality And Informative Conversation Responses With Sequence-to-sequence Models
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 194 citations
Shao et al.
-
THUMT: An Open Source Toolkit For Neural Machine Translation
(2017)
• Arxiv
• 88 citations
Zhang et al.
-
An Analysis Of Visual Question Answering Algorithms
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 238 citations
Kushal Kafle, Christopher Kanan
-
Stronger Baselines For Trustable Results In Neural Machine Translation
(2017)
• Proceedings of the First Workshop on Neural Machine Translation
• 106 citations
Michael Denkowski, Graham Neubig
-
Image-grounded Conversations: Multimodal Context For Natural Question And Response Generation
(2017)
• Arxiv
• 121 citations
Mostafazadeh et al.
-
Rasa: Open Source Language Understanding And Dialogue Management
(2017)
• Arxiv
• 219 citations
Bocklisch et al.
-
Dynamic Data Selection For Neural Machine Translation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 138 citations
Marlies van Der Wees, Arianna Bisazza, Christof Monz
-
A Sequential Matching Framework For Multi-turn Response Selection In Retrieval-based Chatbots
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 552 citations
Wu et al.
-
A Simple And Accurate Syntax-agnostic Neural Model For Dependency-based Semantic Role Labeling
(2017)
• Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017)
• 122 citations
Diego Marcheggiani, Anton Frolov, Ivan Titov
-
The E2E Dataset: New Challenges For End-to-end Generation
(2017)
• Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue
• 188 citations
Jekaterina Novikova, Ondřej Dušek, Verena Rieser
-
Lexically Constrained Decoding For Sequence Generation Using Grid Beam Search
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 291 citations
Chris Hokamp, Qun Liu
-
MUTAN: Multimodal Tucker Fusion For Visual Question Answering
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 676 citations
Ben-Younes et al.
-
Adversarial Neural Machine Translation
(2017)
• Arxiv
• 78 citations
Wu et al.
-
Learning To Remember Rare Events
(2017)
• Arxiv
• 256 citations
Kaiser et al.
-
Sockeye: A Toolkit For Neural Machine Translation
(2017)
• Arxiv
• 205 citations
Hieber et al.
-
Sentence Simplification With Deep Reinforcement Learning
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 333 citations
Xingxing Zhang, Mirella Lapata
-
Reinforced Video Captioning With Entailment Rewards
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 120 citations
Ramakanth Pasunuru, Mohit Bansal
-
R\(^3\): Reinforced Reader-ranker For Open-domain Question Answering
(2017)
• Arxiv
• 92 citations
Wang et al.
-
A Unified Query-based Generative Model For Question Generation And Question Answering
(2017)
• Arxiv
• 56 citations
Linfeng Song, Zhiguo Wang, Wael Hamza
-
A Conditional Variational Framework For Dialog Generation
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 117 citations
Shen et al.
-
Relevance Of Unsupervised Metrics In Task-oriented Dialogue For Evaluating Natural Language Generation
(2017)
• Arxiv
• 206 citations
Sharma et al.
-
Deep Reinforcement Learning-based Image Captioning With Embedding Reward
(2017)
• 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 303 citations
Ren et al.
-
Deep Architectures For Neural Machine Translation
(2017)
• Proceedings of the Second Conference on Machine Translation
• 107 citations
Barone et al.
-
A Neural Architecture For Generating Natural Language Descriptions From Source Code Changes
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 112 citations
Pablo Loyola, Edison Marrese-Taylor, Yutaka Matsuo
-
AMR Parsing Using Stack-lstms
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 74 citations
Miguel Ballesteros, Yaser Al-Onaizan
-
Challenges In Data-to-document Generation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 527 citations
Sam Wiseman, Stuart M. Shieber, Alexander M. Rush
-
Predicting Target Language CCG Supertags Improves Neural Machine Translation
(2017)
• Proceedings of the Second Conference on Machine Translation
• 79 citations
Nadejde et al.
-
Evaluating Quality Of Chatbots And Intelligent Conversational Agents
(2017)
• Arxiv
• 197 citations
Nicole M. Radziwill, Morgan C. Benton
-
Towards Zero-shot Frame Semantic Parsing For Domain Scaling
(2017)
• Interspeech 2017
• 134 citations
Bapna et al.
-
The Repeval 2017 Shared Task: Multi-genre Natural Language Inference With Sentence Representations
(2017)
• Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP
• 87 citations
Nangia et al.
-
Controlling Linguistic Style Aspects In Neural Language Generation
(2017)
• Proceedings of the Workshop on Stylistic Variation
• 276 citations
Jessica Ficler, Yoav Goldberg
-
Learning Joint Multilingual Sentence Representations With Neural Machine Translation
(2017)
• Proceedings of the 2nd Workshop on Representation Learning for NLP
• 203 citations
Holger Schwenk, Matthijs Douze
-
Making Neural QA As Simple As Possible But Not Simpler
(2017)
• Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017)
• 186 citations
Dirk Weissenborn, Georg Wiese, Laura Seiffe
-
What Do Neural Machine Translation Models Learn About Morphology?
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 281 citations
Belinkov et al.
-
Evaluating Layers Of Representation In Neural Machine Translation On Part-of-speech And Semantic Tagging Tasks
(2017)
• IJCNLP 8 (2017) volume 1 1-10
• 127 citations
Belinkov et al.
-
Synthetic And Natural Noise Both Break Neural Machine Translation
(2017)
• Arxiv
• 434 citations
Yonatan Belinkov, Yonatan Bisk
-
Towards An Automatic Turing Test: Learning To Evaluate Dialogue Responses
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 363 citations
Lowe et al.
-
Dailydialog: A Manually Labelled Multi-turn Dialogue Dataset
(2017)
• Arxiv
• 641 citations
Li et al.
-
A Parallel Corpus Of Python Functions And Documentation Strings For Automated Code Documentation And Code Generation
(2017)
• Arxiv
• 68 citations
Antonio Valerio Miceli Barone, Rico Sennrich
-
Improved Variational Autoencoders For Text Modeling Using Dilated Convolutions
(2017)
• Arxiv
• 262 citations
Yang et al.
-
Shortcut-stacked Sentence Encoders For Multi-domain Inference
(2017)
• Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP
• 127 citations
Yixin Nie, Mohit Bansal
-
A Hybrid Convolutional Variational Autoencoder For Text Generation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 222 citations
Stanislau Semeniuta, Aliaksei Severyn, Erhardt Barth
-
Adversarial Generation Of Natural Language
(2017)
• Proceedings of the 2nd Workshop on Representation Learning for NLP
• 120 citations
Rajeswar et al.
-
Regularization Techniques For Fine-tuning In Neural Machine Translation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 98 citations
Barone et al.
-
Neural Semantic Encoders
(2017)
• Arxiv
• 92 citations
Tsendsuren Munkhdalai, Hong Yu
-
Unsupervised Pretraining For Sequence To Sequence Learning
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 275 citations
Prajit Ramachandran, Peter J. Liu, Quoc V. Le
-
Assessing State-of-the-art Sentiment Models On State-of-the-art Sentiment Datasets
(2017)
• Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
• 66 citations
Jeremy Barnes, Roman Klinger, Sabine Schulte Im Walde
-
Are Emojis Predictable?
(2017)
• Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
• 130 citations
Francesco Barbieri, Miguel Ballesteros, Horacio Saggion
-
Advances In Joint Ctc-attention Based End-to-end Speech Recognition With A Deep CNN Encoder And RNN-LM
(2017)
• Interspeech 2017
• 318 citations
Hori et al.
-
Revisiting Recurrent Networks For Paraphrastic Sentence Embeddings
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 75 citations
John Wieting, Kevin Gimpel
-
Compressing Recurrent Neural Network With Tensor Train
(2017)
• 2017 International Joint Conference on Neural Networks (IJCNN)
• 87 citations
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
-
Inferring And Executing Programs For Visual Reasoning
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 530 citations
Johnson et al.
-
FOIL It! Find One Mismatch Between Image And Language Caption
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 106 citations
Shekhar et al.
-
End-to-end Task-completion Neural Dialogue Systems
(2017)
• Arxiv
• 251 citations
Li et al.
-
Get To The Point: Summarization With Pointer-generator Networks
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 3798 citations
Abigail See, Peter J. Liu, Christopher D. Manning
-
Learning Paraphrastic Sentence Embeddings From Back-translated Bitext
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 93 citations
John Wieting, Jonathan Mallinson, Kevin Gimpel
-
Emotional End-to-end Neural Speech Synthesizer
(2017)
• Arxiv
• 61 citations
Younggun Lee, Azam Rabiee, Soo-Young Lee
-
Unsupervised Neural Machine Translation
(2017)
• Arxiv
• 632 citations
Artetxe et al.
-
Sample-efficient Actor-critic Reinforcement Learning With Supervised Data For Dialogue Management
(2017)
• Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue
• 123 citations
Su et al.
-
Diversity Driven Attention Model For Query-based Abstractive Summarization
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 171 citations
Nema et al.
-
Learning Simpler Language Models With The Differential State Framework
(2017)
• Neural Computation
• 67 citations
Alexander G. Ororbia, Tomas Mikolov, David Reitter
-
Outrageously Large Neural Networks: The Sparsely-gated Mixture-of-experts Layer
(2017)
• Arxiv
• 575 citations
Shazeer et al.
-
Learned In Translation: Contextualized Word Vectors
(2017)
• Arxiv
• 538 citations
McCann et al.
-
Toward Controlled Generation Of Text
(2017)
• Arxiv
• 781 citations
Hu et al.
-
Frustratingly Short Attention Spans In Neural Language Modeling
(2017)
• Arxiv
• 80 citations
Daniluk et al.
-
Learning To Reason: End-to-end Module Networks For Visual Question Answering
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 509 citations
Hu et al.
-
Frames: A Corpus For Adding Memory To Goal-oriented Dialogue Systems
(2017)
• Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue
• 230 citations
Asri et al.
-
Conll-sigmorphon 2017 Shared Task: Universal Morphological Reinflection In 52 Languages
(2017)
• Proceedings of the CoNLL SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection
• 202 citations
Cotterell et al.
-
Deepstory: Video Story QA By Deep Embedded Memory Networks
(2017)
• Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
• 136 citations
Kim et al.
-
Exploiting Cross-sentence Context For Neural Machine Translation
(2017)
• Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
• 203 citations
Wang et al.
-
Regularizing And Optimizing LSTM Language Models
(2017)
• Arxiv
• 430 citations
Stephen Merity, Nitish Shirish Keskar, Richard Socher
-
Dissent: Sentence Representation Learning From Explicit Discourse Relations
(2017)
• Arxiv
• 63 citations
Allen Nie, Erin D. Bennett, Noah D. Goodman
-
VQS: Linking Segmentations To Questions And Answers For Supervised Attention In VQA And Question-focused Semantic Segmentation
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 106 citations
Gan et al.
-
Scalable Multi-domain Dialogue State Tracking
(2017)
• 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
• 125 citations
Abhinav Rastogi, Dilek Hakkani-Tur, Larry Heck
-
Hybrid Code Networks: Practical And Efficient End-to-end Dialog Control With Supervised And Reinforcement Learning
(2017)
• Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 357 citations
Jason D. Williams, Kavosh Asadi, Geoffrey Zweig
-
Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model
(2017)
• Arxiv
• 93 citations
Lu et al.
-
Learning To Compute Word Embeddings On The Fly
(2017)
• Arxiv
• 77 citations
Bahdanau et al.
-
Robustfill: Neural Program Learning Under Noisy I/O
(2017)
• Arxiv
• 112 citations
Devlin et al.
-
Learning To Generate One-sentence Biographies From Wikidata
(2017)
• Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
• 97 citations
Andrew Chisholm, Will Radford, Ben Hachey
-
A Read-write Memory Network For Movie Story Understanding
(2017)
• 2017 IEEE International Conference on Computer Vision (ICCV)
• 95 citations
Na et al.
-
A Study Of Matchpyramid Models On Ad-hoc Retrieval
(2016)
• Arxiv
• 90 citations
Pang et al.
-
Multimodal Pivots For Image Caption Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 92 citations
Julian Hitschler, Shigehiko Schamoni, Stefan Riezler
-
Multimodal Residual Learning For Visual QA
(2016)
• Arxiv
• 219 citations
Kim et al.
-
Conditional Generation And Snapshot Learning In Neural Dialogue Systems
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 74 citations
Wen et al.
-
Interactive Attention For Neural Machine Translation
(2016)
• Arxiv
• 65 citations
Meng et al.
-
Lattice-based Recurrent Neural Network Encoders For Neural Machine Translation
(2016)
• Arxiv
• 65 citations
Su et al.
-
Multi-source Neural Translation
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 292 citations
Barret Zoph, Kevin Knight
-
Learning Language Games Through Interaction
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 143 citations
Sida I. Wang, Percy Liang, Christopher D. Manning
-
A Decomposable Attention Model For Natural Language Inference
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 1416 citations
Parikh et al.
-
MS MARCO: A Human Generated Machine Reading Comprehension Dataset
(2016)
• Arxiv
• 1354 citations
Bajaj et al.
-
Embracing Data Abundance: Booktest Dataset For Reading Comprehension
(2016)
• Arxiv
• 63 citations
Ondrej Bajgar, Rudolf Kadlec, Jan Kleindienst
-
Systran's Pure Neural Machine Translation Systems
(2016)
• Arxiv
• 89 citations
Crego et al.
-
Learning Python Code Suggestion With A Sparse Pointer Network
(2016)
• Arxiv
• 71 citations
Bhoopchand et al.
-
Attentive Pooling Networks
(2016)
• Arxiv
• 341 citations
Santos et al.
-
Training With Exploration Improves A Greedy Stack-lstm Parser
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 66 citations
Ballesteros et al.
-
Fast Domain Adaptation For Neural Machine Translation
(2016)
• Arxiv
• 181 citations
Markus Freitag, Yaser Al-Onaizan
-
A Character-level Decoder Without Explicit Segmentation For Neural Machine Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 238 citations
Junyoung Chung, Kyunghyun Cho, Yoshua Bengio
-
Exploring The Limits Of Language Modeling
(2016)
• Arxiv
• 989 citations
Jozefowicz et al.
-
An Actor-critic Algorithm For Sequence Prediction
(2016)
• Arxiv
• 326 citations
Bahdanau et al.
-
Who Did What: A Large-scale Person-centered Cloze Dataset
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 121 citations
Onishi et al.
-
Latent Predictor Networks For Code Generation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 301 citations
Ling et al.
-
Bidirectional Long-short Term Memory For Video Description
(2016)
• Proceedings of the 24th ACM international conference on Multimedia
• 65 citations
Bin et al.
-
Multilingual Part-of-speech Tagging With Bidirectional Long Short-term Memory Models And Auxiliary Loss
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 474 citations
Barbara Plank, Anders Søgaard, Yoav Goldberg
-
Simverb-3500: A Large-scale Evaluation Set Of Verb Similarity
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 219 citations
Gerz et al.
-
Multi-domain Neural Network Language Generation For Spoken Dialogue Systems
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 194 citations
Wen et al.
-
Dynamic Memory Networks For Visual And Textual Question Answering
(2016)
• Arxiv
• 600 citations
Caiming Xiong, Stephen Merity, Richard Socher
-
Compositional Sequence Labeling Models For Error Detection In Learner Writing
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 91 citations
Marek Rei, Helen Yannakoudakis
-
A Persona-based Neural Conversation Model
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 934 citations
Li et al.
-
Sequence-level Knowledge Distillation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 761 citations
Yoon Kim, Alexander M. Rush
-
Character-based Neural Machine Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 233 citations
Marta R. Costa-Jussà, José A. R. Fonollosa
-
Image-to-markup Generation With Coarse-to-fine Attention
(2016)
• Arxiv
• 87 citations
Deng et al.
-
Towards End-to-end Learning For Dialog State Tracking And Management Using Deep Reinforcement Learning
(2016)
• Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 188 citations
Tiancheng Zhao, Maxine Eskenazi
-
Hierarchical Question-image Co-attention For Visual Question Answering
(2016)
• Arxiv
• 1235 citations
Lu et al.
-
Zero-resource Translation With Multi-lingual Neural Machine Translation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 274 citations
Firat et al.
-
Incorporating Structural Alignment Biases Into An Attentional Neural Translation Model
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 181 citations
Cohn et al.
-
End-to-end Lstm-based Dialog Control Optimized With Supervised And Reinforcement Learning
(2016)
• Arxiv
• 127 citations
Jason D. Williams, Geoffrey Zweig
-
Edinburgh Neural Machine Translation Systems For WMT 16
(2016)
• Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
• 471 citations
Rico Sennrich, Barry Haddow, Alexandra Birch
-
Memory-enhanced Decoder For Neural Machine Translation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 63 citations
Wang et al.
-
Learning To Generalize To New Compositions In Image Understanding
(2016)
• Arxiv
• 61 citations
Atzmon et al.
-
Simpleds: A Simple Deep Reinforcement Learning Dialogue System
(2016)
• Lecture Notes in Electrical Engineering
• 70 citations
Heriberto Cuayáhuitl
-
Pre-translation For Neural Machine Translation
(2016)
• Arxiv
• 74 citations
Niehues et al.
-
A Sequence-to-sequence Model For User Simulation In Spoken Dialogue Systems
(2016)
• Interspeech 2016
• 91 citations
Layla El Asri, Jing He, Kaheer Suleman
-
Sequence-to-sequence Learning As Beam-search Optimization
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 463 citations
Sam Wiseman, Alexander M. Rush
-
How NOT To Evaluate Your Dialogue System: An Empirical Study Of Unsupervised Evaluation Metrics For Dialogue Response Generation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 955 citations
Liu et al.
-
Zero-shot Visual Question Answering
(2016)
• Arxiv
• 67 citations
Damien Teney, Anton van Den Hengel
-
Machine Comprehension Using Match-lstm And Answer Pointer
(2016)
• Arxiv
• 418 citations
Shuohang Wang, Jing Jiang
-
Sequence To Backward And Forward Sequences: A Content-introducing Approach To Generative Short-text Conversation
(2016)
• Arxiv
• 200 citations
Mou et al.
-
An Empirical Evaluation Of Doc2vec With Practical Insights Into Document Embedding Generation
(2016)
• Proceedings of the 1st Workshop on Representation Learning for NLP
• 560 citations
Jey Han Lau, Timothy Baldwin
-
Dual Learning For Machine Translation
(2016)
• NIPS 2016
• 639 citations
Xia et al.
-
Learning A Natural Language Interface With Neural Programmer
(2016)
• Arxiv
• 78 citations
Neelakantan et al.
-
Deep Recurrent Models With Fast-forward Connections For Neural Machine Translation
(2016)
• Transactions of the Association for Computational Linguistics
• 228 citations
Zhou et al.
-
Can Neural Machine Translation Do Simultaneous Translation?
(2016)
• Arxiv
• 145 citations
Kyunghyun Cho, Masha Esipova
-
Neural Generation Of Regular Expressions From Natural Language With Minimal Domain Knowledge
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 71 citations
Locascio et al.
-
RIGA At Semeval-2016 Task 8: Impact Of Smatch Extensions And Character-level Neural Translation On AMR Parsing Accuracy
(2016)
• Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)
• 56 citations
Guntis Barzdins, Didzis Gosko
-
Neural Versus Phrase-based Machine Translation Quality: A Case Study
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 293 citations
Bentivogli et al.
-
Building An Evaluation Scale Using Item Response Theory
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 63 citations
John P. Lalor, Hao Wu, Hong Yu
-
Multimodal Compact Bilinear Pooling For Visual Question Answering And Visual Grounding
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 1353 citations
Fukui et al.
-
Wikireading: A Novel Large-scale Language Understanding Task Over Wikipedia
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 149 citations
Hewlett et al.
-
Text Understanding With The Attention Sum Reader Network
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 313 citations
Kadlec et al.
-
Explaining Predictions Of Non-linear Classifiers In NLP
(2016)
• Proceedings of the 1st Workshop on Representation Learning for NLP
• 100 citations
Arras et al.
-
Compression Of Neural Machine Translation Models Via Pruning
(2016)
• Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning
• 166 citations
Abigail See, Minh-Thang Luong, Christopher D. Manning
-
Modeling Context In Referring Expressions
(2016)
• Lecture Notes in Computer Science
• 822 citations
Yu et al.
-
Using Sentence-level LSTM Language Models For Script Inference
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 65 citations
Karl Pichotta, Raymond J. Mooney
-
Topic Aware Neural Response Generation
(2016)
• Arxiv
• 331 citations
Xing et al.
-
Learning End-to-end Goal-oriented Dialog
(2016)
• Arxiv
• 481 citations
Antoine Bordes, Y-Lan Boureau, Jason Weston
-
Image Captioning With Deep Bidirectional Lstms
(2016)
• Proceedings of the 24th ACM international conference on Multimedia
• 253 citations
Wang et al.
-
Reward Augmented Maximum Likelihood For Neural Structured Prediction
(2016)
• Arxiv
• 93 citations
Norouzi et al.
-
Rationale-augmented Convolutional Neural Networks For Text Classification
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 259 citations
Ye Zhang, Iain Marshall, Byron C. Wallace
-
Sk_p: A Neural Program Corrector For Moocs
(2016)
• Companion Proceedings of the 2016 ACM SIGPLAN International Conference on Systems, Programming, Languages and Applications: Software for Humanity
• 121 citations
Pu et al.
-
Improving Lstm-based Video Description With Linguistic Knowledge Mined From Text
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 116 citations
Venugopalan et al.
-
The LAMBADA Dataset: Word Prediction Requiring A Broad Discourse Context
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 190 citations
Paperno et al.
-
Character-level Question Answering With Attention
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 110 citations
David Golub, Xiaodong He
-
Natural Language Generation Enhances Human Decision-making With Uncertain Information
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 58 citations
Dimitra Gkatzia, Oliver Lemon, Verena Rieser
-
Deep API Learning
(2016)
• Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering
• 490 citations
Gu et al.
-
A Deep Language Model For Software Code
(2016)
• Arxiv
• 74 citations
Hoa Khanh Dam, Truyen Tran, Trang Pham
-
Attention-based Recurrent Neural Network Models For Joint Intent Detection And Slot Filling
(2016)
• Interspeech 2016
• 722 citations
Bing Liu, Ian Lane
-
A User Simulator For Task-completion Dialogues
(2016)
• Arxiv
• 146 citations
Li et al.
-
Neural Machine Translation With External Phrase Memory
(2016)
• Arxiv
• 56 citations
Tang et al.
-
An Attentive Neural Architecture For Fine-grained Entity Type Classification
(2016)
• Proceedings of the 5th Workshop on Automated Knowledge Base Construction
• 95 citations
Shimaoka et al.
-
Google's Neural Machine Translation System: Bridging The Gap Between Human And Machine Translation
(2016)
• Arxiv
• 5827 citations
Wu et al.
-
Reasoning About Pragmatics With Neural Listeners And Speakers
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 122 citations
Jacob Andreas, Dan Klein
-
Learning To Compose Neural Networks For Question Answering
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 490 citations
Andreas et al.
-
Language To Logical Form With Neural Attention
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 667 citations
Li Dong, Mirella Lapata
-
Guided Alignment Training For Topic-aware Neural Machine Translation
(2016)
• Arxiv
• 85 citations
Chen et al.
-
Mutual Information And Diverse Decoding Improve Neural Machine Translation
(2016)
• Arxiv
• 99 citations
Jiwei Li, Dan Jurafsky
-
Learning Language-visual Embedding For Movie Understanding With Natural-language
(2016)
• Arxiv
• 79 citations
Atousa Torabi, Niket Tandon, Leonid Sigal
-
SPICE: Semantic Propositional Image Caption Evaluation
(2016)
• Lecture Notes in Computer Science
• 1736 citations
Anderson et al.
-
LSTM Based Conversation Models
(2016)
• Arxiv
• 60 citations
Yi Luan, Yangfeng Ji, Mari Ostendorf
-
Policy Networks With Two-stage Training For Dialogue Systems
(2016)
• Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 70 citations
Fatemi et al.
-
A Thorough Examination Of The Cnn/daily Mail Reading Comprehension Task
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 465 citations
Danqi Chen, Jason Bolton, Christopher D. Manning
-
Recurrent Memory Networks For Language Modeling
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 78 citations
Ke Tran, Arianna Bisazza, Christof Monz
-
Supervised Attentions For Neural Machine Translation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 129 citations
Haitao Mi, Zhiguo Wang, Abe Ittycheriah
-
Quasi-recurrent Neural Networks
(2016)
• Arxiv
• 350 citations
Bradbury et al.
-
Multi-perspective Context Matching For Machine Comprehension
(2016)
• Arxiv
• 119 citations
Wang et al.
-
A Fast Unified Model For Parsing And Sentence Understanding
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 247 citations
Bowman et al.
-
Neural Text Generation From Structured Data With Application To The Biography Domain
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 459 citations
Remi Lebret, David Grangier, Michael Auli
-
A Focused Dynamic Attention Model For Visual Question Answering
(2016)
• Arxiv
• 137 citations
Ilija Ilievski, Shuicheng Yan, Jiashi Feng
-
Learning Natural Language Inference Using Bidirectional LSTM Model And Inner-attention
(2016)
• Arxiv
• 238 citations
Liu et al.
-
Vocabulary Manipulation For Neural Machine Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 68 citations
Haitao Mi, Zhiguo Wang, Abe Ittycheriah
-
Dataset And Neural Recurrent Sequence Labeling Model For Open-domain Factoid Question Answering
(2016)
• Arxiv
• 78 citations
Li et al.
-
Two Are Better Than One: An Ensemble Of Retrieval- And Generation-based Dialog Systems
(2016)
• Arxiv
• 89 citations
Song et al.
-
Cached Long Short-term Memory Neural Networks For Document-level Sentiment Classification
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 184 citations
Xu et al.
-
Syntactically Guided Neural Machine Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 80 citations
Stahlberg et al.
-
CFO: Conditional Focused Neural Question Answering With Large-scale Knowledge Bases
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 125 citations
Zihang Dai, Lei Li, Wei Xu
-
Learning Through Dialogue Interactions By Asking Questions
(2016)
• Arxiv
• 78 citations
Li et al.
-
A Convolutional Attention Network For Extreme Summarization Of Source Code
(2016)
• Arxiv
• 358 citations
Miltiadis Allamanis, Hao Peng, Charles Sutton
-
Generating Visual Explanations
(2016)
• Lecture Notes in Computer Science
• 499 citations
Hendricks et al.
-
Context-aware Natural Language Generation With Recurrent Neural Networks
(2016)
• Arxiv
• 73 citations
Tang et al.
-
Neural Paraphrase Generation With Stacked Residual LSTM Networks
(2016)
• Arxiv
• 229 citations
Prakash et al.
-
Deep Reinforcement Learning For Dialogue Generation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 1066 citations
Li et al.
-
Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation
(2016)
• Arxiv
• 121 citations
Johnson et al.
-
Strategic Attentive Writer For Learning Macro-actions
(2016)
• Arxiv
• 88 citations
Alexander et al.
-
Tree-to-sequence Attentional Neural Machine Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 256 citations
Akiko Eriguchi, Kazuma Hashimoto, Yoshimasa Tsuruoka
-
Improving Sentence Compression By Learning To Predict Gaze
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 101 citations
Sigrid Klerke, Yoav Goldberg, Anders Søgaard
-
Adversarial Evaluation Of Dialogue Models
(2016)
• Arxiv
• 72 citations
Anjuli Kannan, Oriol Vinyals
-
Diverse Beam Search: Decoding Diverse Solutions From Neural Sequence Models
(2016)
• Arxiv
• 381 citations
Vijayakumar et al.
-
Smart Reply: Automated Response Suggestion For Email
(2016)
• Arxiv
• 152 citations
Kannan et al.
-
Generative Deep Neural Networks For Dialogue: A Short Review
(2016)
• Arxiv
• 64 citations
Serban et al.
-
Neural Machine Translation With Supervised Attention
(2016)
• Arxiv
• 134 citations
Liu et al.
-
Query-reduction Networks For Question Answering
(2016)
• Arxiv
• 66 citations
Seo et al.
-
Long Short-term Memory-networks For Machine Reading
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 1041 citations
Jianpeng Cheng, Li Dong, Mirella Lapata
-
Semi-supervised Learning For Neural Machine Translation
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 91 citations
Cheng et al.
-
CUNI System For WMT16 Automatic Post-editing And Multimodal Translation Tasks
(2016)
• Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
• 60 citations
Libovický et al.
-
Conversational Contextual Cues: The Case Of Personalization And History For Response Ranking
(2016)
• Arxiv
• 59 citations
Al-Rfou et al.
-
Data Recombination For Neural Semantic Parsing
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 494 citations
Robin Jia, Percy Liang
-
Language Modeling With Gated Convolutional Networks
(2016)
• Arxiv
• 935 citations
Dauphin et al.
-
Scaling Memory-augmented Neural Networks With Sparse Reads And Writes
(2016)
• Arxiv
• 88 citations
Rae et al.
-
Attention-based Convolutional Neural Network For Machine Comprehension
(2016)
• Proceedings of the Workshop on Human-Computer Question Answering
• 92 citations
Wenpeng Yin, Sebastian Ebert, Hinrich Schütze
-
Revisiting Visual Question Answering Baselines
(2016)
• Lecture Notes in Computer Science
• 247 citations
Allan Jabri, Armand Joulin, Laurens van Der Maaten
-
The AMU-UEDIN Submission To The WMT16 News Translation Task: Attention-based NMT Models As Feature Functions In Phrase-based SMT
(2016)
• Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
• 66 citations
Marcin Junczys-Dowmunt, Tomasz Dwojak, Rico Sennrich
-
Rationalizing Neural Predictions
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 613 citations
Tao Lei, Regina Barzilay, Tommi Jaakkola
-
Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-answer Corpus
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 286 citations
Serban et al.
-
Emoji2vec: Learning Emoji Representations From Their Description
(2016)
• Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media
• 250 citations
Eisner et al.
-
A Neural Knowledge Language Model
(2016)
• Arxiv
• 126 citations
Ahn et al.
-
Language As A Latent Variable: Discrete Generative Models For Sentence Compression
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 195 citations
Yishu Miao, Phil Blunsom
-
The Role Of Context Types And Dimensionality In Learning Word Embeddings
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 105 citations
Melamud et al.
-
Log-linear Combinations Of Monolingual And Bilingual Neural Machine Translation Models For Automatic Post-editing
(2016)
• Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
• 110 citations
Marcin Junczys-Dowmunt, Roman Grundkiewicz
-
Automos: Learning A Non-intrusive Assessor Of Naturalness-of-speech
(2016)
• Arxiv
• 57 citations
Patton et al.
-
Show And Tell: Lessons Learned From The 2015 MSCOCO Image Captioning Challenge
(2016)
• IEEE Transactions on Pattern Analysis and Machine Intelligence
• 905 citations
Vinyals et al.
-
Online Segment To Segment Neural Transduction
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 84 citations
Lei Yu, Jan Buys, Phil Blunsom
-
Pointer Sentinel Mixture Models
(2016)
• Arxiv
• 687 citations
Merity et al.
-
Does Multimodality Help Human And Machine For Translation And Image Captioning?
(2016)
• Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers
• 77 citations
Caglayan et al.
-
Joint Copying And Restricted Generation For Paraphrase
(2016)
• Arxiv
• 65 citations
Cao et al.
-
Generating Natural Questions About An Image
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 279 citations
Mostafazadeh et al.
-
Title Generation For User Generated Videos
(2016)
• Lecture Notes in Computer Science
• 73 citations
Zeng et al.
-
Tracking The World State With Recurrent Entity Networks
(2016)
• ICLR 2017
• 168 citations
Henaff et al.
-
Dialog-based Language Learning
(2016)
• Arxiv
• 81 citations
Jason Weston
-
Chinese Song Iambics Generation With Neural Attention-based Model
(2016)
• Arxiv
• 59 citations
Wang et al.
-
Training Recurrent Answering Units With Joint Loss Minimization For VQA
(2016)
• Arxiv
• 77 citations
Hyeonwoo Noh, Bohyung Han
-
Improving Neural Language Models With A Continuous Cache
(2016)
• Arxiv
• 206 citations
Edouard Grave, Armand Joulin, Nicolas Usunier
-
Analyzing The Behavior Of Visual Question Answering Models
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 295 citations
Aishwarya Agrawal, Dhruv Batra, Devi Parikh
-
Linguistic Input Features Improve Neural Machine Translation
(2016)
• Proceedings of the First Conference on Machine Translation: Volume 1, Research Papers
• 352 citations
Rico Sennrich, Barry Haddow
-
Variational Neural Machine Translation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 198 citations
Zhang et al.
-
Reasonet: Learning To Stop Reading In Machine Comprehension
(2016)
• Arxiv
• 83 citations
Shen et al.
-
Assessing The Ability Of Lstms To Learn Syntax-sensitive Dependencies
(2016)
• Transactions of the Association for Computational Linguistics
• 847 citations
Tal Linzen, Emmanuel Dupoux, Yoav Goldberg
-
Multiplicative LSTM For Sequence Modelling
(2016)
• Arxiv
• 97 citations
Krause et al.
-
Contextual LSTM (CLSTM) Models For Large Scale NLP Tasks
(2016)
• Arxiv
• 191 citations
Ghosh et al.
-
Incorporating Copying Mechanism In Sequence-to-sequence Learning
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 1459 citations
Gu et al.
-
Visual Storytelling
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 198 citations
Ting-Hao et al.
-
Neural Autoregressive Collaborative Filtering For Implicit Feedback
(2016)
• Proceedings of the 1st Workshop on Deep Learning for Recommender Systems
• 79 citations
Zheng et al.
-
Neural Machine Translation In Linear Time
(2016)
• Arxiv
• 341 citations
Kalchbrenner et al.
-
Embedding Projector: Interactive Visualization And Interpretation Of Embeddings
(2016)
• Arxiv
• 152 citations
Smilkov et al.
-
Bidirectional Attention Flow For Machine Comprehension
(2016)
• Arxiv
• 1348 citations
Seo et al.
-
Transfer Learning For Low-resource Neural Machine Translation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 742 citations
Zoph et al.
-
Sequential Short-text Classification With Recurrent And Convolutional Neural Networks
(2016)
• Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
• 446 citations
Ji Young Lee, Franck Dernoncourt
-
Improved Recurrent Neural Networks For Session-based Recommendations
(2016)
• Proceedings of the 1st Workshop on Deep Learning for Recommender Systems
• 653 citations
Yong Kiam Tan, Xinxing Xu, Yong Liu
-
Recursive Recurrent Nets With Attention Modeling For OCR In The Wild
(2016)
• 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
• 455 citations
Chen-Yu Lee, Simon Osindero
-
On-line Active Reward Learning For Policy Optimisation In Spoken Dialogue Systems
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
• 102 citations
Su et al.
-
Learning Recurrent Span Representations For Extractive Question Answering
(2016)
• Arxiv
• 147 citations
Lee et al.
-
Multi30k: Multilingual English-german Image Descriptions
(2016)
• Proceedings of the 5th Workshop on Vision and Language
• 406 citations
Elliott et al.
-
Neural Language Correction With Character-based Attention
(2016)
• Arxiv
• 132 citations
Xie et al.
-
Detecting Text In Natural Image With Connectionist Text Proposal Network
(2016)
• Lecture Notes in Computer Science
• 1029 citations
Tian et al.
-
Sequence-to-sequence Generation For Spoken Dialogue Via Deep Syntax Trees And Strings
(2016)
• Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
• 68 citations
Ondřej Dušek, Filip Jurčíček
-
Coverage Embedding Models For Neural Machine Translation
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 137 citations
Mi et al.
-
Natural Language Comprehension With The Epireader
(2016)
• Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
• 68 citations
Trischler et al.
-
Dialogue Learning With Human-in-the-loop
(2016)
• Arxiv
• 71 citations
Li et al.
-
Crowd-sourcing NLG Data: Pictures Elicit Better Data
(2016)
• Proceedings of the 9th International Natural Language Generation conference
• 58 citations
Jekaterina Novikova, Oliver Lemon, Verena Rieser
-
Is Neural Machine Translation Ready For Deployment? A Case Study On 30 Translation Directions
(2016)
• Arxiv
• 155 citations
Marcin Junczys-Dowmunt, Tomasz Dwojak, Hieu Hoang
-
A Context-aware Natural Language Generator For Dialogue Systems
(2016)
• Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue
• 66 citations
Ondřej Dušek, Filip Jurčíček
-
E-commerce In Your Inbox: Product Recommendations At Scale
(2015)
• Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2015) Sydney Australia
• 201 citations
Grbovic et al.
-
Hierarchical Neural Language Models For Joint Representation Of Streaming Documents And Their Content
(2015)
• Proceedings of the 24th International Conference on World Wide Web
• 78 citations
Djuric et al.
-
Generating Different Story Tellings From Semantic Representations Of Narrative
(2013)
• Lecture Notes in Computer Science
• 55 citations
Rishes et al.
-
Natural Language Processing (almost) From Scratch
(2011)
• Arxiv
• 5334 citations
Collobert et al.
Showing first 12 while collapsed. Click to expand and reveal all 3897.