7 results on '"Xiaodan Liang"'
Search Results
2. Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning
- Author
-
Song-Chun Zhu, Ran Gong, Liang Qiu, Pan Lu, Xiaodan Liang, Shibiao Jiang, and Siyuan Huang
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Parsing ,Formal Languages and Automata Theory (cs.FL) ,Computer Science - Artificial Intelligence ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Formal Languages and Automata Theory ,Geometry ,Construct (python library) ,Solver ,Object (computer science) ,computer.software_genre ,Artificial Intelligence (cs.AI) ,Path (graph theory) ,Formal language ,Benchmark (computing) ,Computation and Language (cs.CL) ,computer ,Axiom - Abstract
Geometry problem solving has attracted much attention in the NLP community recently. The task is challenging as it requires abstract problem understanding and symbolic reasoning with axiomatic knowledge. However, current datasets are either small in scale or not publicly available. Thus, we construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language. We further propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem Solver (Inter-GPS). Inter-GPS first parses the problem text and diagram into formal language automatically via rule-based text parsing and neural object detecting, respectively. Unlike implicit learning in existing methods, Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step. Also, a theorem predictor is designed to infer the theorem application sequence fed to the symbolic solver for the more efficient and reasonable searching path. Extensive experiments on the Geometry3K and GEOS datasets demonstrate that Inter-GPS achieves significant improvements over existing methods. The project with code and data is available at https://lupantech.github.io/inter-gps., Accepted to ACL 2021, 13 pages, 6 figures
- Published
- 2021
3. Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks
- Author
-
Liang Lin, Yining Hong, Jinghui Qin, Jianheng Tang, and Xiaodan Liang
- Subjects
FOS: Computer and information sciences ,Word problem (mathematics education) ,Set (abstract data type) ,Consistency (database systems) ,Computer Science - Computation and Language ,Theoretical computer science ,Computer science ,Scalability ,Duality (mathematics) ,Benchmark (computing) ,Solver ,Computation and Language (cs.CL) ,Task (project management) - Abstract
Previous math word problem solvers following the encoder-decoder paradigm fail to explicitly incorporate essential math symbolic constraints, leading to unexplainable and unreasonable predictions. Herein, we propose Neural-Symbolic Solver (NS-Solver) to explicitly and seamlessly incorporate different levels of symbolic constraints by auxiliary tasks. Our NS-Solver consists of a problem reader to encode problems, a programmer to generate symbolic equations, and a symbolic executor to obtain answers. Along with target expression supervision, our solver is also optimized via 4 new auxiliary objectives to enforce different symbolic reasoning: a) self-supervised number prediction task predicting both number quantity and number locations; b) commonsense constant prediction task predicting what prior knowledge (e.g. how many legs a chicken has) is required; c) program consistency checker computing the semantic loss between predicted equation and target equation to ensure reasonable equation mapping; d) duality exploiting task exploiting the quasi duality between symbolic equation generation and problem's part-of-speech generation to enhance the understanding ability of a solver. Besides, to provide a more realistic and challenging benchmark for developing a universal and scalable solver, we also construct a new large-scale MWP benchmark CM17K consisting of 4 kinds of MWPs (arithmetic, one-unknown linear, one-unknown non-linear, equation set) with more than 17K samples. Extensive experiments on Math23K and our CM17k demonstrate the superiority of our NS-Solver compared to state-of-the-art methods., ACL 2021
- Published
- 2021
4. A Data-Centric Framework for Composable NLP Workflows
- Author
-
Swapnil Singhavi, Zecong Hu, Zhiting Hu, Shikun Zhang, Avinash Bukkittu, Haoran Shi, Pengzhi Gao, Xin Gao, Zhengzhong Liu, Guanxiong Ding, Eric P. Xing, Atif Ahmed, Linwei Li, Mansi Gupta, Xiaodan Liang, Wei Wei, and Teruko Mitamura
- Subjects
0303 health sciences ,business.industry ,Computer science ,Deep learning ,Interoperability ,02 engineering and technology ,External Data Representation ,computer.software_genre ,Database-centric architecture ,Visualization ,03 medical and health sciences ,Interoperation ,Annotation ,Workflow ,Health care ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Natural language processing ,030304 developmental biology - Abstract
Empirical natural language processing (NLP) systems in application domains (e.g., healthcare, finance, education) involve interoperation among multiple components, ranging from data ingestion, human annotation, to text retrieval, analysis, generation, and visualization. We establish a unified open-source framework to support fast development of such sophisticated NLP workflows in a composable manner. The framework introduces a uniform data representation to encode heterogeneous results by a wide range of NLP tasks. It offers a large repository of processors for NLP tasks, visualization, and annotation, which can be easily assembled with full interoperability under the unified representation. The highly extensible framework allows plugging in custom processors from external off-the-shelf NLP and deep learning libraries. The whole framework is delivered through two modularized yet integratable open-source projects, namely Forte (for workflow infrastructure and NLP function processors) and Stave (for user interaction, visualization, and annotation).
- Published
- 2020
5. Data-to-Text Generation with Style Imitation
- Author
-
Eric P. Xing, Shuai Lin, Xiaodan Liang, Zhiting Hu, Frank F. Xu, Wentao Wang, and Zichao Yang
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer science ,media_common.quotation_subject ,Realization (linguistics) ,Fidelity ,02 engineering and technology ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Machine Learning (cs.LG) ,Writing style ,0202 electrical engineering, electronic engineering, information engineering ,Control (linguistics) ,0105 earth and related environmental sciences ,media_common ,Computer Science - Computation and Language ,business.industry ,Constraint (information theory) ,Artificial Intelligence (cs.AI) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Imitation ,Computation and Language (cs.CL) ,computer ,Word (computer architecture) ,Sentence ,Natural language processing - Abstract
Recent neural approaches to data-to-text generation have mostly focused on improving content fidelity while lacking explicit control over writing styles (e.g., word choices, sentence structures). More traditional systems use templates to determine the realization of text. Yet manual or automatic construction of high-quality templates is difficult, and a template acting as hard constraints could harm content fidelity when it does not match the record perfectly. We study a new way of stylistic control by using existing sentences as soft templates. That is, the model learns to imitate the writing style of any given exemplar sentence, with automatic adaptions to faithfully describe the content record. The problem is challenging due to the lack of parallel data. We develop a neural approach that includes a hybrid attention-copy mechanism, learns with weak supervisions, and is enhanced with a new content coverage constraint. We conduct experiments in restaurants and sports domains. Results show our approach achieves stronger performance than a range of comparison methods. Our approach balances well between content fidelity and style control given exemplars that match the records to varying degrees., Comment: Accepted by EMNLP 2020 Findings. Significant updates over the previous version. Code & data are available at https://github.com/ha-lins/DTG-SI
- Published
- 2020
6. Target-Guided Open-Domain Conversation
- Author
-
Zhiting Hu, Xiaodan Liang, Eric P. Xing, Chenyan Xiong, Jianheng Tang, and Tiancheng Zhao
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer science ,Transition (fiction) ,media_common.quotation_subject ,Control (management) ,Supervised learning ,Subject (documents) ,Conversational system ,02 engineering and technology ,010501 environmental sciences ,01 natural sciences ,Machine Learning (cs.LG) ,Artificial Intelligence (cs.AI) ,Human–computer interaction ,0202 electrical engineering, electronic engineering, information engineering ,Open domain ,020201 artificial intelligence & image processing ,Conversation ,Computation and Language (cs.CL) ,0105 earth and related environmental sciences ,media_common - Abstract
Many real-world open-domain conversation applications have specific goals to achieve during open-ended chats, such as recommendation, psychotherapy, education, etc. We study the problem of imposing conversational goals on open-domain chat agents. In particular, we want a conversational system to chat naturally with human and proactively guide the conversation to a designated target subject. The problem is challenging as no public data is available for learning such a target-guided strategy. We propose a structured approach that introduces coarse-grained keywords to control the intended content of system responses. We then attain smooth conversation transition through turn-level supervised learning, and drive the conversation towards the target with discourse-level constraints. We further derive a keyword-augmented conversation dataset for the study. Quantitative and human evaluations show our system can produce meaningful and effective conversations, significantly improving over other approaches., ACL 2019. Data and code available at https://github.com/squareRoot3/Target-Guided-Conversation. fixed typos
- Published
- 2019
7. Texar: A Modularized, Versatile, and Extensible Toolbox for Text Generation
- Author
-
Zhengzhong Liu, Zichao Yang, Xiaodan Liang, Devendra Singh Chaplot, Tiancheng Zhao, Junxian He, Lianhui Qin, Bowen Tan, Zhiting Hu, Di Wang, Xuezhe Ma, Haoran Shi, Eric P. Xing, and Xingjiang Yu
- Subjects
Set (abstract data type) ,Machine translation ,Programming language ,Generalization ,Computer science ,Text generation ,Systems design ,computer.software_genre ,computer ,Extensibility ,Toolbox - Abstract
We introduce Texar, an open-source toolkit aiming to support the broad set of text generation tasks. Different from many existing toolkits that are specialized for specific applications (e.g., neural machine translation), Texar is designed to be highly flexible and versatile. This is achieved by abstracting the common patterns underlying the diverse tasks and methodologies, creating a library of highly reusable modules and functionalities, and enabling arbitrary model architectures and various algorithmic paradigms. The features make Texar particularly suitable for technique sharing and generalization across different text generation applications. The toolkit emphasizes heavily on extensibility and modularized system design, so that components can be freely plugged in or swapped out. We conduct extensive experiments and case studies to demonstrate the use and advantage of the toolkit.
- Published
- 2018
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.