Back to Search
Start Over
Research of the Web Information Extraction Technology on Tourism Theme
- Source :
- Applied Mechanics and Materials. 614:503-506
- Publication Year :
- 2014
- Publisher :
- Trans Tech Publications, Ltd., 2014.
-
Abstract
- With the development of web technology, the use of dynamic web pages and the personalization of page contents become more and more popular. Currently, the information of page is protean and the structures of different pages are vastly different, the traditional thinking of web information extraction technology has been difficult to adapt to the situation. In this paper, proposes a web information extraction method based on extended XPath policy through the analysis of structural features of web pages on tourist theme. This algorithm avoids the defects of traditional web information extraction technology; it is simple, practical, high cleaning efficiency, accuracy, and saving the overhead of the system.
- Subjects :
- Web standards
Web analytics
Web server
medicine.medical_specialty
Web development
Web 2.0
computer.internet_protocol
Computer science
Dynamic web page
computer.software_genre
Social Semantic Web
Personalization
World Wide Web
Web design
Web page
Website Parse Template
medicine
Web navigation
Semantic Web Stack
XPath
business.industry
General Medicine
Web application security
Information extraction
Web mapping
Web service
business
Web intelligence
computer
Site map
Web modeling
Tourism
Subjects
Details
- ISSN :
- 16627482
- Volume :
- 614
- Database :
- OpenAIRE
- Journal :
- Applied Mechanics and Materials
- Accession number :
- edsair.doi...........d53e159de10265d58caf29d918deee75