1. Information Extraction for Additive Manufacturing Using News Data
- Author
-
Neha Sehgal and Andrew Crampton
- Subjects
Information extraction ,Open data ,Matching (statistics) ,Information retrieval ,Web mining ,Named-entity recognition ,Machine translation ,Computer science ,Question answering ,Key (cryptography) ,computer.software_genre ,computer - Abstract
Recognizing named entities like Person, Organization, Locations and Date are very useful for web mining. Named Entity Recognition (NER) is an emerging research area which aims to address problems such as Machine Translation, Question Answering Systems and Semantic Web Search. The study focuses on proposing a methodology based on the integration of an NER system and Text Analytics to provide information necessary for business in Additive Manufacturing. The study proposes a foundation of utilizing the Stanford NER system for tagging news data related to the keywords “Additive Manufacturing”. The objective is to first derive the organization names from news data. This information is useful to define the digital footprints of an organization in the Additive Manufacturing sector. The existence of an organization derived using the NER approach is validated by matching their names with companies listed on the Companies House portal. The organization names will be matched using a Fuzzy-based text matching algorithm. Further information on company profile, officers and key financial data is extracted to provide information about companies interested and working within the Additive Manufacturing sector. This data gives an insight into which companies have digital footprints in the Additive Manufacturing sector within the UK.
- Published
- 2019
- Full Text
- View/download PDF