Journal of East China Normal University(Natural Sc ›› 2018, Vol. 2018 ›› Issue (1): 91-102,145.doi: 10.3969/j.issn.1000-5641.2018.01.009

Previous Articles     Next Articles

Forward stagewise additive modeling for entity ranking in documents

WANG Yan-hua   

  1. School of Data Science and Engineering, East China Normal University, EDWEI Shanghai 200062, China
  • Received:2016-12-01 Online:2018-01-25 Published:2018-01-11

Abstract: Key entities of a document can help to summarize the subjects of the events or the topics that the document describes, which can contribute to applications such as entity-oriented information retrieval and question-answering. However, entities in free text are unordered and hence it is important to rank entities of a document. In this paper, firstly, we make full use of features of entities that extracted from the document and draw support from Wikipedia and Word Embedding to generate external features. Then, we propose a novel ranking model named LA-FSAM(FSAM based on AUC Metric and Logistic Function) which is based on forward stagewise algorithm additive modeling. In LA-FSAM, we employ the AUC(Area Under the Curve) metric to construct the loss function and the logistic function to integrate features of entities. Finally, the stochastic gradient descent is utilized to optimize parameters of LA-FSAM model. After experiments, our evaluation shows the efficiency of the model we proposed.

Key words: entity ranking, forward stagewise additive modeling, area under the curve, logistic function, stochastic gradient descent

CLC Number: