Combination of feature engineering and ranking models for paper-author identification in KDD Cup 2013

Authors :
Shou-De Lin
Cheng-Kuang Wei
Cheng-Xia Chang
Chun-Pai Yang
Hsiao-Yu Tung
Kuan-Hao Huang
Yong Zhuang
Yu-Chen Lu
Tzu-Ming Kuo
Shan-Wei Lin
Jui-Pin Wang
Chih-Jen Lin
Wei-Cheng Chang
Tu-Chun Yin
Felix Wu
Yu-Chin Juan
Tong Yu
Young-San Lin
Ting-Wei Lin
Yu-Chuan Su
Wei-Sheng Chin
Chun-Liang Li
Cheng-Hao Tsai
Hsuan-Tien Lin
Source :
Scopus-Elsevier, KDD Cup

Abstract

The track 1 problem in KDD Cup 2013 is to distinguish papers confirmed by the given authors from the other, deleted papers. This paper describes the winning solution of the National Taiwan University team for track 1 of KDD Cup 2013. First, we conduct feature engineering to transform the various provided text information into 97 features. Second, we train classification and ranking models using these features. Last, we combine our individual models to boost performance, using results on the internal validation set and the official Valid set. Some effective post-processing techniques are also proposed. Our solution achieves a MAP score of 0.98259 and ranks first on the private leaderboard of the Test set.
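
The MAP score quoted in the abstract is mean average precision: each author's candidate papers are ranked, average precision is computed per author, and the values are averaged over authors. The following is a minimal sketch of the textbook definition, not the competition's official scorer. The function names, the integer paper and author IDs, and the choice to score an author with no confirmed papers as 0 are all assumptions made for illustration; the record does not say how the official evaluation treats duplicates or empty cases.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[int], confirmed: Set[int]) -> float:
    """Average precision of one author's ranked paper list.

    `ranked` is the predicted order (most likely confirmed first);
    `confirmed` holds the paper IDs the author actually confirmed.
    Assumption: an author with no confirmed papers scores 0.
    """
    if not confirmed:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, paper_id in enumerate(ranked, start=1):
        if paper_id in confirmed:
            hits += 1
            # Precision at this rank, counted only at confirmed papers.
            precision_sum += hits / rank
    return precision_sum / len(confirmed)


def mean_average_precision(
    rankings: Dict[int, List[int]], truth: Dict[int, Set[int]]
) -> float:
    """Mean of the per-author average precision values."""
    scores = [
        average_precision(ranked, truth.get(author, set()))
        for author, ranked in rankings.items()
    ]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # Hypothetical toy data: two authors, IDs chosen arbitrarily.
    rankings = {1: [10, 11, 12], 2: [20, 21]}
    truth = {1: {10, 12}, 2: {21}}
    print(mean_average_precision(rankings, truth))
```

In the toy data, author 1 has confirmed papers at ranks 1 and 3, giving an average precision of (1/1 + 2/3) / 2, and author 2 has a single confirmed paper at rank 2, giving 1/2. The mean of the two is the MAP.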

Details

Database :
OpenAIRE
Journal :
Scopus-Elsevier, KDD Cup
Accession number :
edsair.doi.dedup.....690e0a38bf5e1699ab75d5c170cf77d8