Accepted Short Papers

Congratulations! Please follow these instructions to prepare the final, camera-ready versions of your papers.

What can we learn from Semantic Tagging?
Mostafa Abdou, Artur Kulmizev, Vinit Ravishankar, Lasha Abzianidze and Johan Bos

Prediction Improves Simultaneous Neural Machine Translation
Ashkan Alinejad, Maryam Siahbani and Anoop Sarkar

Generating Natural Language Adversarial Examples
Moustafa Alzantot, Yash Sharma, Ahmed Elgohary, Bo-Jhang Ho, Mani Srivastava and Kai-Wei Chang

Word Sense Induction with Neural biLM and Symmetric Patterns
Asaf Amrami and Yoav Goldberg

Sanskrit Sandhi Splitting using seq2(seq)2
Rahul Aralikatte, Neelamadhav Gantayat, Naveen Panwar, Anush Sankaran and Senthil Mani

Towards Two-Dimensional Sequence to Sequence Model in Neural Machine Translation
Parnia Bahar, Christopher Brix and Hermann Ney

Conversational Decision Making Model for Predicting King’s Decision in the Annals of the Joseon Dynasty
JinYeong Bak and Alice Oh

Part-of-Speech Tagging for Code-Switched, Transliterated Texts without Explicit Language Identification
Kelsey Ball and Dan Garrette

Training Deeper Neural Machine Translation Models with Transparent Attention
Ankur Bapna, Mia Chen, Orhan Firat, Yuan Cao and Yonghui Wu

Interpretable Emoji Prediction via Multi-Attention LSTMs
Francesco Barbieri, Luis Espinosa Anke, Jose Camacho-Collados, Steven Schockaert and Horacio Saggion

Adversarial training for multi-context joint entity and relation extraction
Giannis Bekoulis, Johannes Deleu, Thomas Demeester and Chris Develder

Topic Intrusion for Automatic Topic Model Evaluation
Shraey Bhatia, Jey Han Lau and Timothy Baldwin

The Lazy Encoder: A Fine-Grained Analysis of the Role of Morphology in Neural Machine Translation
Arianna Bisazza and Clara Tump

Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation
Nikolay Bogoychev, Kenneth Heafield, Alham Fikri Aji and Marcin Junczys-Dowmunt

Learning To Split and Rephrase From Wikipedia Edit History
Jan A. Botha, Manaal Faruqui, John Alex, Jason Baldridge and Dipanjan Das

How agents see things: On visual representations in an emergent language game
Diane Bouchacourt and Marco Baroni

Modeling Empathy and Distress in Reaction to News Stories
Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar and Joao Sedoc

Encoding Gated Translation Memory into Neural Machine Translation
Qian Cao and Deyi Xiong

Exploring Optimism and Pessimism in Twitter Using Deep Learning
Cornelia Caragea, Liviu P. Dinu and Bogdan Dumitru

Do explanation modalities make VQA models more predictable to a human?
Arjun Chandrasekaran, Deshraj Yadav, Prithvijit Chattopadhyay, Viraj Prabhu and Devi Parikh

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates
Di Chen, Jiachen Du, Lidong Bing and Ruifeng Xu

Joint Learning for Emotion Classification and Emotion Cause Detection
Ying Chen, Wenjun Hou and Xiyao Cheng

The BQ Corpus: A Large-scale Domain-specific Chinese Corpus For Sentence Semantic Equivalence Identification
Jing Chen, Qingcai Chen, Xin Liu, Haijun Yang, Daohe Lu and Buzhou Tang

Word Relation Autoencoder for Unseen Hypernym Extraction Using Word Embeddings
Hong-You Chen, Cheng-Syuan Lee, Keng-Te Liao and Shou-de Lin

Bayesian Compression for Natural Language Processing
Nadezhda Chirkova, Ekaterina Lobacheva and Dmitry Vetrov

Conditional Word Embedding and Hypothesis Testing via Bayes-by-Backprop
Kyunghyun Cho, Michael Gill, Rujun Han and Arthur Spirling

Generating Syntactic Paraphrases
Emilie Colin and Claire Gardent

Multi-Source Syntactic Neural Machine Translation
Anna Currey and Kenneth Heafield

A Framework for Understanding the Role of Morphology in Universal Dependency Parsing
Mathieu Dehouck and Pascal Denis

Coherence-Aware Neural Topic Modeling
Ran Ding, Ramesh Nallapati and Bing Xiang

Unsupervised Bilingual Lexicon Induction via Latent Variable Models
Zi-Yi Dou and Zhi-Hao Zhou

Adversarial Evaluation of Multimodal Machine Translation
Desmond Elliott

Identifying Well-formed Natural Language Questions
Manaal Faruqui and Dipanjan Das

Identifying Domain Adjacent Instances for Semantic Parsers
James Ferguson, Janara Christensen, Edward Li and Edgar Gonzàlez Pellicer

The Importance of Generation Order in Language Modeling
Nicolas Ford, Daniel Duckworth, Mohammad Norouzi and George Dahl

Neural Metaphor Detection in Context
Ge Gao, Eunsol Choi, Yejin Choi and Luke Zettlemoyer

Learning Sequence Encoders for Temporal Knowledge Graph Completion
Alberto Garcia-Duran, Sebastijan Dumančić and Mathias Niepert

Code-switched Language Models Using Dual RNNs and Same-Source Pretraining
Saurabh Garg, Tanmay Parekh and Preethi Jyothi

A Dataset for Telling the Stories of Social Media Videos
Spandana Gella, Mike Lewis and Marcus Rohrbach

Toward Understanding and Explaining Complex Deep Models in NLP
Reza Ghaeini, Xiaoli Fern and Prasad Tadepalli

The Remarkable Benefit of User-Level Aggregation for Lexical-based Population-Level Predictions
Salvatore Giorgi, Daniel Preoţiuc-Pietro, Anneke Buffone, Daniel Rieman, Lyle Ungar and H. Andrew Schwartz

A strong baseline for question relevancy ranking
Ana Valeria Gonzalez-Garduño, Isabelle Augenstein and Anders Søgaard

Modeling Input Uncertainty in Neural Network Dependency Parsing
Rob van der Goot and Gertjan van Noord

Supervised and Unsupervised Methods for Robust Separation of Section Titles and Prose Text in Web Documents
Abhijith Athreya Mysore Gopinath, Shomir Wilson and Norman Sadeh

Marginal Likelihood Training of BiLSTM-CRF for Biomedical Named Entity Recognition from Disjoint Label Sets
Nathan Greenberg, Trapit Bansal, Patrick Verga and Andrew McCallum

Automatic Reference-Based Evaluation of Pronoun Translation Misses the Point
Liane Guillou and Christian Hardmeier

How to represent a word and predict it, too: improving tied architectures for language modelling
Kristina Gulordava, Laura Aina and Gemma Boleda

Improving Reinforcement Learning Based Image Captioning with Natural Language Prior
Tszhang Guo, Shiyu Chang, Mo Yu and Kun Bai

Semantic Parsing for Task Oriented Dialog using Hierarchical Representations
Sonal Gupta, Rushin Shah, Mrinal Mohit, Anuj Kumar and Mike Lewis

FewRel: A Large-Scale Supervised Few-shot Relation Classification Dataset with State-of-the-Art Evaluation
Xu Han, Hao Zhu, Pengfei Yu, Ziyun Wang, Yuan Yao, Zhiyuan Liu and Maosong Sun

Guided Neural Language Generation for Abstractive Summarization using Abstract Meaning Representation
Hardy Hardy and Andreas Vlachos

Why is unsupervised alignment of English embeddings from different algorithms so hard?
Mareike Hartmann, Yova Kementchedjhieva and Anders Søgaard

Retrieval-Based Neural Code Generation
Shirley Anugrah Hayati, Raphael Olivier, Pravalika Avvaru, Pengcheng Yin, Anthony Tomasic and Graham Neubig

Entity Tracking Improves Cloze-style Reading Comprehension
Luong Hoang, Sam Wiseman and Alexander Rush

Improving Author Attribute Prediction by Retrofitting Linguistic Representations with Homophily
Dirk Hovy and Tommaso Fornaciari

Grammar Induction with Neural Language Models: Flawed Experiments, Important Results
Phu Mon Htut, Kyunghyun Cho and Samuel Bowman

Somm: Into the Model
Shengli Hu

WikiConv: A Corpus of the Complete Conversational History of a Large Online Collaborative Community
Yiqing Hua, Cristian Danescu-Niculescu-Mizil, Dario Taraborelli, Nithum Thain, Jeffery Sorensen and Lucas Dixon

Chinese Pinyin Aided IME, Input What You Have Not Keystroked Yet
Yafang Huang and Hai Zhao

Modeling Temporality of Human Intentions by Domain Adaptation
Xiaolei Huang, Lixing Liu, Stefan Scherer, Brian Borsari and Joshua Woolley

Parameterized Convolutional Neural Networks for Aspect Level Sentiment Classification
Binxuan Huang and Kathleen Carley

Cut to the Chase: A Context Zoom-in Network for Reading Comprehension
Sathish Reddy Indurthi, Seunghak Yu, Seohyun Back and Heriberto Cuayahuitl

Improving the results of string kernels by adapting them to your test set
Radu Tudor Ionescu and Andrei M. Butnaru

Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion
Armand Joulin, Piotr Bojanowski, Tomas Mikolov, Edouard Grave and Hervé Jégou

Decipherment of Substitution Ciphers with Neural Language Models
Nishant Kambhatla, Anahita Mansouri Bigvand and Anoop Sarkar

Bridging Knowledge Gaps in Neural Entailment via Symbolic Models
Dongyeop Kang, Tushar Khot, Ashish Sabharwal and Peter Clark

Harnessing Popularity in Social Media for Extractive Summarization of Online Conversations
Ryuji Kano, Yasuhide Miura, Motoki Taniguchi, Yan-Ying Chen, Francine Chen and Tomoko Ohkuma

SafeCity: Understanding Diverse Forms of Sexual Harassment Personal Stories
Sweta Karlekar and Mohit Bansal

Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection
Sudhanshu Kasewa, Pontus Stenetorp and Sebastian Riedel

How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks
Divyansh Kaushik and Zachary C. Lipton

Fine-Grained Emotion Detection in Health-Related Online Posts
Hamed Khanpour and Cornelia Caragea

Improving Unsupervised Word-by-Word Translation with Language Model and Denoising Autoencoder
Yunsu Kim, Jiahui Geng and Hermann Ney

Supervised Domain Enablement Attention for Personalized Domain Classification
Joo-Kyung Kim and Young-Bum Kim

Strong Baselines for Learning Generic Sentence Embeddings
Jamie Kiros and William Chan

Context and Copying in Neural Machine Translation
Rebecca Knowles and Philipp Koehn

Representing Social Media Users for Sarcasm Detection
Y. Alex Kolchinski and Christopher Potts

LemmaTag: Jointly Tagging and Lemmatizing for Morphologically Rich Languages with BRNNs
Daniel Kondratyuk, Tomáš Gavenčiak and Milan Straka

Similarity-Based Reconstruction Loss for Meaning Representation
Olga Kovaleva, Anna Rumshisky and Alexey Romanov

Adaptive Document Retrieval for Deep Question Answering
Bernhard Kratzwald and Stefan Feuerriegel

Revisiting the Importance of Encoding Logic Rules in Sentiment Classification
Kalpesh Krishna, Preethi Jyothi and Mohit Iyyer

Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study
John Lalor, Hao Wu, Tsendsuren Munkhdalai and Hong Yu

Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning
Sotiris Lamprinidis, Daniel Hardt and Dirk Hovy

Improving Large-Scale Fact-Checking using Decomposable Attention Models and Lexical Tagging
Nayeon Lee, Chien-Sheng Wu and Pascale Fung

Ranking Paragraphs for Improving Answer Recall in Open-Domain Question Answering
Jinhyuk Lee, Seongjun Yun, Hyunjae Kim, Miyoung Ko and Jaewoo Kang

Parameter sharing between dependency parsers for related languages
Miryam de Lhoneux, Johannes Bjerva, Isabelle Augenstein and Anders Søgaard

A Co-attention Neural Network Model for Emotion Cause Analysis with Emotional Context Awareness
Xiangju Li, Kaisong Song, Shi Feng and Daling Wang

A Syntactic Constraint Based Bidirectional-Asynchronous Approach for Emotional Conversation Generation
Jingyuan Li and Xiao Sun

Multi-Head Attention with Disagreement Regularization
Jian Li, Zhaopeng Tu, Baosong Yang, Michael R. Lyu and Tong Zhang

End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification
Jindřich Libovický and Jindřich Helcl

Learning When to Concentrate or Divert Attention: Automatic Control of Attention Temperature for Neural Machine Translation
Junyang Lin, Xuancheng Ren, Qi Su, Muyu Li and Xu SUN

Exploiting Contextual Information via Dynamic Memory Network for Event Detection
Shaobo Liu, Rui Cheng, Xiaoming Yu and Xueqi Cheng

Toward Less Generic Responses in Neural Conversation Models: A Statistical Re-weighting Method
Yahui Liu, Wei Bi, Jun Gao, Xiaojiang Liu, Jian Yao and Shuming Shi

Similar but not the Same - Word Sense Disambiguation Improves Event Detection via Neural Representation Matching
Weiyi Lu and Thien Huu Nguyen

Labeled Anchors and a Scalable, Transparent, and Interactive Classifier
Jeffrey Lund, Stephen Cowley, Wilson Fearn, Emily Hales and Kevin Seppi

Learning Word Representations with Cross-Sentence Dependencyfor End-to-End Co-reference Resolution
Hongyin Luo and Jim Glass

Has Neural Machine Translation Achieved Human Parity? A Case for Document-level Evaluation
Samuel Läubli, Rico Sennrich and Martin Volk

Bi-LSTMs Are State-of-the-art for Chinese Word Segmentation
Ji Ma, Kuzman Ganchev and David Weiss

Joint Learning for Targeted Sentiment Analysis
Dehong Ma, Sujian Li and Houfeng WANG

Imitation Learning for Neural Morphological String Transduction
Peter Makarov and Simon Clematide

Adversarial Training for Multi-task and Multi-lingual Joint Modeling of Utterance Intent Classification
Ryo Masumura, Yusuke Shinohara, Ryuichiro Higashinaka and Yushi Aono

Training Millions of Personalized Dialogue Agents
Pierre-Emmanuel Mazare, Samuel Humeau, Martin Raison and Antoine Bordes

Towards Semi-Supervised Learning for Deep Semantic Role Labeling
Sanket Vaibhav Mehta, Jay Yoon Lee and Jaime Carbonell

Training for Diversity in Image Paragraph Captioning
Luke Melas-Kyriazi, Alexander Rush and George Han

Towards Document-Level Neural Machine Translation with Hierarchical Attention Networks
Lesly Miculicich, Dhananjay Ram, James Henderson and Nikolaos Pappas

Listening Comprehension over Argumentative Content
Shachar Mirkin, Guy Moshkowich, Matan Orbach, Lili Kotlerman, Yoav Kantor, Tamar Lavee, Michal Jacovi, Yonatan Bilu, Ranit Aharonov and Noam Slonim

Is Nike female? Predicting brand name gender across product categories
Sridhar Moorthy, Ruth Pogacar, Samin Khan and Yang Xu

Effective Use of Context in Noisy Entity Linking
David Mueller and Greg Durrett

Learning Unsupervised Word Translations Without Adversaries
Tanmoy Mukherjee, Makoto Yamada and Timothy Hospedales

Rapid Adaptation of Neural Machine Translation to New Languages
Graham Neubig and Junjie Hu

Multimodal neural pronunciation modeling for spoken languages with logographic origin
Minh Nguyen, Gia H Ngo and Nancy Chen

Semantic Linking in Convolutional Neural Networks for Answer Sentence Selection
Massimo Nicosia and Alessandro Moschitti

Towards Dynamic Computation Graphs via Sparse Latent Structure
Vlad Niculae, André F. T. Martins and Claire Cardie

Estimating Marginal Probabilities of n-grams for Recurrent Neural Language Models
Thanapon Noraset, Doug Downey and Lidong Bing

Event Detection with Neural Networks: A Rigorous Empirical Evaluation
Walker Orr, Prasad Tadepalli and Xiaoli Fern

Reducing Gender Bias in Abusive Language Detection
Ji Ho Park, Jamin Shin and Pascale Fung

Extending Neural Generative Conversational Model using External Knowledge Sources
Prasanna Parthasarathi and Joelle Pineau

Mapping natural language commands to web elements
Panupong Pasupat, Tian-Shun Jiang, Evan Liu, Kelvin Guu and Percy Liang

S2SPMN:A Simple and Effective Framework for Response Generation with Relevant Information
Jiaxin Pei and Chenliang Li

SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach
Michael Petrochuk and Luke Zettlemoyer

Fixing Translation Divergences in Parallel Corpora for Neural MT
Minh Quang Pham, Josep Crego, Jean Senellart and François Yvon

SwitchOut: an Efficient Data Augmentation Algorithm for Neural Machine Translation
Hieu Pham, Xinyi Wang, Zihang Dai and Graham Neubig

Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging
Barbara Plank and Željko Agić

Facts That Matter
Marco Ponza, Luciano Del Corro and Gerhard Weikum

Learning multiview embeddings for assessing dementia
Chloé Pou-Prom and Frank Rudzicz

Word Embeddings for Code-Mixed Language Processing
Adithya Pratapa, Monojit Choudhury and Sunayana Sitaram

Learning Concept Abstractness Using Weak Supervision
Ella Rabinovich, Benjamin Sznajder, Artem Spector, Ilya Shnayderman, Ranit Aharonov, David Konopnicki and Noam Slonim

Self-Governing Neural Networks for On-Device Short Text Classification
Sujith Ravi and Zornitsa Kozareva

Towards Universal Dialogue State Tracking
Liliang Ren, Kaige Xie, Lu Chen and Kai Yu

Identifying Control in Social Media from Crowd Annotations
Masoud Rouhizadeh, Kokil Jaidka, Laura Smith, H. Andrew Schwartz, Anneke Buffone and Lyle Ungar

Neural Davidsonian Semantic Proto-role Labeling
Rachel Rudinger, Adam Teichert, Ryan Culkin, Sheng Zhang and Benjamin Van Durme

Out-of-domain Detection based on Generative Adversarial Network
Seonghan Ryu, Sangjun Koo, Hwanjo Yu and Gary Geunbae Lee

Data Augmentation via Dependency Tree Morphing for Low-Resource Languages
Gozde Gul Sahin and Mark Steedman

A Neural Model of Adaptation in Reading
Marten van Schijndel and Tal Linzen

The glass ceiling in NLP
Natalie Schluter

When data permutations are pathological: the case of neural natural language inference
Natalie Schluter and Daniel Varab

Joint Aspect and Polarity Classification for Aspect-based Sentiment Analysis with End-to-End Neural Networks
Martin Schmitt, Simon Steinheber, Konrad Schreiber and Benjamin Roth

Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension
Minjoon Seo, Tom Kwiatkowski, Ankur Parikh, Ali Farhadi and Hannaneh Hajishirzi

A Graph-Theoretic Summary Evaluation for Rouge
Elaheh ShafieiBavani, Mohammad Ebrahimi, Raymond Wong and Fang Chen

Adversarial Domain Adaptation for Duplicate Question Detection
Darsh Shah, Tao Lei, Alessandro Moschitti, Salvatore Romeo and Preslav Nakov

Surprisingly Easy Hard-Attention for Sequence to Sequence Learning
Shiv Shankar, Siddhant Garg and Sunita Sarawagi

Greedy Search with Probabilistic N-gram Matching for Neural Machine Translation
Chenze Shao, Xilin Chen and Yang Feng

Evaluating Multiple System Summary Lengths: A Case Study
Ori Shapira, David Gabay, Hadar Ronen, Judit Bar-Ilan, Yael Amsterdamer, Ani Nenkova and Ido Dagan

Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning
Chen Shi, Qi Chen, Lei Sha, Sujian Li, Xu Sun, Houfeng WANG and Lintao Zhang

Genre Separation Network with Adversarial Training for Cross-genre Relation Extraction
Ge Shi, Chong Feng, Lifu Huang, Boliang Zhang, Heng Ji, Lejian Liao and Heyan Huang

Recovering Missing Characters in Old Hawaiian Writing
Brendan Shillingford and Oiwi Parker Jones

HFT-CNN: Learning Hierarchical Category Structure for Multi-label Short Text Categorization
Kazuya Shimura, Jiyi Li and Fumiyo Fukumoto

Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study
Aditya Siddhant and Zachary C. Lipton

An Encoder-Decoder Approach to the Paradigm Cell Filling Problem
Miikka Silfverberg and Mans Hulden

Structured Multi-Label Biomedical Text Tagging via Attentive Neural Tree Decoding
Gaurav Singh, James Thomas, Iain Marshall, John Shawe-Taylor and Byron C. Wallace

A Hierarchical Neural Attention-based Text Classifier
Koustuv Sinha, Yue Dong, Jackie Chi Kit Cheung and Derek Ruths

Deep Exhaustive Model for Nested Named Entity Recognition
Mohammad Golam Sohrab and Makoto Miwa

A Deep Neural Network Sentence Level Classification Method with Context Information
Xingyi Song, Johann Petrak and Angus Roberts

BLEU is Not Suitable for the Evaluation of Text Simplification
Elior Sulem, Omri Abend and Ari Rappoport

Unsupervised Neural Word Segmentation for Chinese via Segmental Language Modeling
Zhiqing Sun and Zhi-Hong Deng

The importance of Being Recurrent for Modeling Hierarchical Structure
Ke Tran, Arianna Bisazza and Christof Monz

Refining Pretrained Word Embeddings Using Layer-wise Relevance Propagation
Akira Utsumi

Getting Gender Right in Neural MT
Eva Vanmassenhove, Christian Hardmeier and Andy Way

Syntactical Analysis of the Weaknesses of Sentiment Analyzers
Rohil Verma, Samuel Kim and David Walter

Automatic Post-Editing of Machine Translation: A Neural Programmer-Interpreter Approach
Thuy-Trang Vu and Gholamreza Haffari

A Reinforcement Learning Framework for Automatic Essay Scoring Incorporating Rating Schema
Yucheng Wang, Zhongyu Wei, Yaqian Zhou and Xuanjing Huang

A Tree-based Decoder for Neural Machine Translation
Xinyi Wang, Hieu Pham, Pengcheng Yin and Graham Neubig

Improved Dependency Parsing using Implicit Word Connections Learned from Unlabeled Data
Wenhui Wang, Baobao Chang and Mairgup Mansur

Learning to Jointly Translate and Predict Dropped Pronouns with a Shared Reconstruction Mechanism
Longyue Wang, Zhaopeng Tu, Andy Way and Qun Liu

Neural Transition-based Model for Nested Mention Recognition
Bailin Wang, Wei Lu, Yu Wang and Hongxia Jin

Three Strategies to Improve One-to-Many Multilingual Translation
Yining Wang, Jiajun Zhang, Feifei Zhai, Jingfang Xu and Chengqing Zong

Toward Fast and Accurate Neural Discourse Segmentation
Yizhong Wang and Sujian Li

Translating Math Word Problem to Expression Tree
Lei Wang, Yan Wang, Deng Cai, Dongxiang Zhang and Xiaojiang Liu

Neural Latent Relational Analysis to Capture Lexical Semantic Relation
Koki Washio and Tsuneaki Kato

Dual Fixed-Size Ordinally Forgetting Encoding (FOFE) for Competitive Neural Language Models
Sedtawut Watcharawittayakul, Mingbin Xu and Hui Jiang

Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models
Daniel Watson, Nasser Zalmout and Nizar Habash

Lexicosyntactic inference in neural models
Aaron Steven White, Rachel Rudinger, Kyle Rawlins and Benjamin Van Durme

Natural Language Processing Not-At-All from Scratch: Evaluating The Utility of Hand-crafted Features in Deep Learning
Minghao Wu, Fei Liu and Trevor Cohn

Compact Personalized Models for Neural Machine Translation
Joern Wuebker, Patrick Simianer and John DeNero

Put It Back: Entity Typing with Language Model Enhancement
Ji Xin, Hao Zhu, Xu Han, Zhiyuan Liu and Maosong Sun

Session-level Language Modeling for Conversational Speech
Wayne Xiong, Lingfeng Wu, Jun Zhang and Andreas Stolcke

An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation
Jingjing Xu, Liangchen Luo, Xu SUN, Junyang Lin and Qi Zeng

Exploiting Rich Syntactic Information for Semantic Parsing with Graph-to-Sequence Model
Kun Xu, Lingfei Wu, Zhiguo Wang, Mo Yu, Liwei Chen and Vadim Sheinin

SQL-to-Text Generation with Graph-to-Sequence Model
Kun Xu, Lingfei Wu, Zhiguo Wang, Yansong Feng and Vadim Sheinin

Using Active Learning to Expand Training Data for Implicit Discourse Relation Recognition
Yang Xu, Yu Hong, Huibin Ruan, Jianmin Yao, Min Zhang and Guodong Zhou

Multi-View Learning: Multilingual and Multi-Representation Entity Typing
Yadollah Yaghoobzadeh and Hinrich Schütze

Classifying Referential and Non-referential It Using Gaze
Victoria Yaneva, Le An Ha, Richard Evans and Ruslan Mitkov

Breaking the Beam Search Curse: A Study of (Re-)Scoring Methods and Stopping Criteria for Neural Machine Translation
Yilin Yang, Liang Huang and Mingbo Ma

Convolutional Neural Networks with Recurrent Neural Filters
Yi Yang

Cascaded Mutual Modulation for Visual Reasoning
Yiqun Yao, Jiaming Xu and Bo Xu

Improving Multi-label Emotion Classification via Sentiment Classification with Dual Attention Transfer Network
Jianfei Yu, Luis Marujo, Jing Jiang, Pradeep Karuturi and William Brendel

The Internal Structure of Name Tokens: A Multilingual Study
Xiaodong Yu, Stephen Mayhew, Mark Sammons and Dan Roth

Attention-Based Capsule Network with Dynamic Routing for Relation Extraction
Ningyu Zhang, Shumin Deng, Huajun Chen, Zhanling Sun, Yiyi Zhang and Xiaoqian Li

Exploring Recombination for Efficient Decoding of Neural Machine Translation
Zhisong Zhang, Rui Wang, Masao Utiyama, Eiichiro Sumita and Hai Zhao

Learning Sentiment Memories for Sentiment Modification without Parallel Data
Yi Zhang, Xu SUN, Jingjing Xu, Pengcheng Yang and Xuancheng Ren

Neural Latent Extractive Document Summarization
Xingxing Zhang, Mirella Lapata, Furu Wei and Ming Zhou

On the Abstractiveness of Neural Document Summarization
Fangfang Zhang, Jin-ge Yao and Rui Yan

PubSE: A Hierarchical Model for Publication Extraction from Academic Homepages
Yiqing Zhang, Jianzhong Qi, Rui Zhang and Chuandong Yin

A dataset and baselines for sequential open-domain question answering
Chen Zhao, Ahmed Elgohary and Jordan Boyd-Graber

Generalizing Word Embeddings using Bag of Subwords
Jinman Zhao, Sidharth Mudgal and Yingyu Liang

Learning Gender-Neutral Word Embeddings
Jieyu Zhao, YICHAO ZHOU, Zeyu Li, Wei Wang and Kai-Wei Chang

A Dataset for Document Grounded Conversations
Kangyan Zhou, Shrimai Prabhumoye and Alan W Black

Quantifying Context Overlap for Training Word Embeddings
Yimeng Zhuang, Jinghui Xie, Yinhe Zheng and Xuan Zhu