Back to Search
Start Over
Constrained Decoding for Computationally Efficient Named Entity Recognition Taggers
- Source :
- EMNLP (Findings)
- Publication Year :
- 2020
- Publisher :
- arXiv, 2020.
-
Abstract
- Current state-of-the-art models for named entity recognition (NER) are neural models with a conditional random field (CRF) as the final layer. Entities are represented as per-token labels with a special structure in order to decode them into spans. Current work eschews prior knowledge of how the span encoding scheme works and relies on the CRF learning which transitions are illegal and which are not to facilitate global coherence. We find that by constraining the output to suppress illegal transitions we can train a tagger with a cross-entropy loss twice as fast as a CRF with differences in F1 that are statistically insignificant, effectively eliminating the need for a CRF. We analyze the dynamics of tag co-occurrence to explain when these constraints are most effective and provide open source implementations of our tagger in both PyTorch and TensorFlow.<br />Comment: Findings of EMNLP 2020
- Subjects :
- Scheme (programming language)
Conditional random field
Structure (mathematical logic)
FOS: Computer and information sciences
Computer Science - Computation and Language
Computer science
Speech recognition
02 engineering and technology
computer.software_genre
03 medical and health sciences
0302 clinical medicine
Named-entity recognition
030221 ophthalmology & optometry
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Layer (object-oriented design)
computer
Computation and Language (cs.CL)
Decoding methods
computer.programming_language
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- EMNLP (Findings)
- Accession number :
- edsair.doi.dedup.....f55d44d2274ce2f08ab05a696c26090a
- Full Text :
- https://doi.org/10.48550/arxiv.2010.04362