1. UNH at SemEval-2019 Task 12: Toponym Resolution in Scientific Papers
- Author
-
Laura Dietz and Matthew Magnusson
- Subjects
0301 basic medicine ,Conditional random field ,Computer science ,business.industry ,02 engineering and technology ,Toponymy ,computer.software_genre ,Perceptron ,Convolutional neural network ,SemEval ,Task (project management) ,03 medical and health sciences ,Identification (information) ,030104 developmental biology ,Named-entity recognition ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
The SemEval-2019 Task 12 is toponym resolution in scientific papers. We focus on Subtask 1: Toponym Detection which is the identification of spans of text for place names mentioned in a document. We propose two methods: 1) sliding window convolutional neural network using ELMo embeddings (cnn-elmo), and 2) sliding window multi-Layer perceptron using ELMo embeddings (mlp-elmo). We also submit Bi-lateral LSTM with Conditional Random Fields (bi-LSTM) as a strong baseline given its state-of-art performance in Named Entity Recognition (NER) task. Our best performing model is cnn-elmo with a F1 of 0.844 which was below bi-LSTM F1 of 0.862 when evaluated on overlap macro detection. Eight teams participated in this subtask with a total of 21 submissions.
- Published
- 2019