Back to Search
Start Over
Improvements in Transition Based Systems for Dependency Parsing
- Publication Year :
- 2015
-
Abstract
- This thesis investigates transition based systems for parsing of natural language using dependency grammars. Dependency parsing provides a good and simple syntactic representation of the grammatical relations in a sentence. In the last years, this basic task has become a fundamental step in many applications that deal with natural language processing. Specifically, transition based systems have strong practical and psycholinguistic motivations. From a practical point of view, these systems are the only parsing systems that are fast enough to be used in web-scale applications. From a psycholinguistic point of view, they very closely resemble how humans incrementally process the language. However, these systems fall back in accuracy when compared with graph-based parsing, a family of parsing techniques that are based on a more traditional graph theoretic / dynamic programming approach, and that are more demanding on a computational perspective. Recently, some techniques have been developed in order to improve the accuracy of transition based systems. Most successful techniques are based on beam search or on the combination of the output of different parsing algorithms. However, all these techniques have a negative impact on parsing time. In this thesis, I will explore an alternative approach for transition based parsing, one that improves the accuracy without sacrificing computational efficiency. I will focus on greedy transition based systems and I will show how it is possible to improve the accuracy by using a dynamic oracle and a flexible parsing strategy. Dynamic oracles allow to reduce the error propagation at parsing time. Dynamic oracles may have some impact on training time, but there is no efficiency loss at parsing time. A flexible parsing strategy allows to reduce constraints over the parsing process and the time impact in both training and parsing time is almost negligible. Finally, these two techniques work really well when combined together, and they are orthogonal to previously explored proposals such as beam search or system combinations. As far as I know, the obtained experimental results are still state-of-the-art for greedy transition based parsing based on dependency grammars.
- Subjects :
- TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES
Settore INF/01 - Informatica
ING-INF/05 Sistemi di elaborazione delle informazioni
Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
INF/01 Informatica
parsing natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Accession number :
- edsair.dedup.wf.001..a5f223c5c620613815368f58af95820b