Journal of East China Normal University (Natural Science), 2015, Vol. 2015, Issue (3): 80-90. doi: 10.3969/j.issn.1000-5641.2015.03.010

• Computer Science •

Rating prediction and recommendation based on review analysis

高祎璠,余文喆,晁平复,郑芷凌,张蓉   

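  1. Shanghai Key Laboratory of Trustworthy Computing, Institute for Data Science and Engineering, East China Normal University, Shanghai 200062
  • Received: 2014-12-15 Online: 2015-05-25 Published: 2015-05-28
  • Corresponding author: ZHANG Rong, female, Ph.D., associate professor; main research interests: data mining and information retrieval. E-mail: rzhang@sei.ecnu.edu.cn
  • About the author: GAO Yi-fan, female, master's student. E-mail: yfgao@ecnu.edu.cn.
  • Supported by:

    National Natural Science Foundation of China (61103039, 61402177); Key Program of the National Natural Science Foundation of China (61232002)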

Analyzing reviews for rating prediction and item recommendation

GAO Yi-fan,YU Wen-zhe,CHAO Ping-fu,ZHENG Zhi-ling,ZHANG Rong   

  • Received:2014-12-15 Online:2015-05-25 Published:2015-05-28

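Abstract (Chinese version): Recommender systems are widely used on online platforms. A recommendation model needs to predict users' preferences and help users find suitable items such as movies, books, and music. By analyzing users' ratings and reviews, one can discover the item features that users care about and, based on those features, infer how much a user will like an item. This paper proposes combining the semantic content implicit in reviews with ratings, and designs and implements a novel item recommendation model. First, a topic model is used to mine the latent topic distributions in review text, and these topic distributions are used to characterize user preferences and item profiles. The relationship between topics and ratings is then trained with a logistic regression model, so that the final rating can be viewed as a quantified measure of the similarity between a user's preferences and an item's profile. Finally, extensive comparative experiments on real data show that the model outperforms the compared systems and is stable.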

Key words (Chinese version): recommendation, latent topic, LDA, regression model, review analysis

Abstract: Recommender systems are widely deployed in Web applications that need to predict users' preferences for items. They help users find movies, books, music, and products in general. In this work, we design a method for item recommendation based on a novel model that captures correlations between hidden aspects in reviews and numeric ratings. It is motivated by the observation that a user's preference for an item is affected by the different aspects discussed in reviews. Our method first uses topic modeling to discover hidden aspects from review text. Profiles are then created for users and items separately, based on the aspects discovered in their reviews. Finally, we use logistic regression to model the user-item relationship, and the rating is modeled as the similarity between the user and item profiles. Experiments on real-world reviews demonstrate the advantage of our proposal over state-of-the-art solutions.

Key words: recommendation; hidden aspect; LDA; regression model; review analysis
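
The pipeline outlined in the abstract (per-review topic distributions, user and item profiles, then logistic regression on the user-item relationship) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the per-review topic distributions are assumed to come from an already fitted topic model such as LDA, and the toy data, the element-wise product features, the scaling of ratings to [0, 1], and all function names are assumptions introduced here.

```python
import math
from collections import defaultdict

def mean_vector(vectors):
    """Average a list of equal-length topic distributions."""
    k = len(vectors[0])
    return [sum(v[t] for v in vectors) / len(vectors) for t in range(k)]

def build_profiles(reviews):
    """Build user and item profiles as the mean topic distribution of their reviews."""
    by_user, by_item = defaultdict(list), defaultdict(list)
    for user, item, theta, _rating in reviews:
        by_user[user].append(theta)
        by_item[item].append(theta)
    users = {u: mean_vector(v) for u, v in by_user.items()}
    items = {i: mean_vector(v) for i, v in by_item.items()}
    return users, items

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def features(user_profile, item_profile):
    """Per-topic agreement between a user and an item (assumed feature choice)."""
    return [u * i for u, i in zip(user_profile, item_profile)]

def fit_logistic(X, y, lr=0.5, epochs=2000):
    """Batch gradient descent on cross-entropy loss with soft targets in [0, 1]."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for x, target in zip(X, y):
            err = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x))) - target
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict_rating(w, b, user_profile, item_profile, max_rating=5):
    """Map the logistic output back onto the 1..max_rating scale."""
    x = features(user_profile, item_profile)
    p = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x)))
    return 1 + (max_rating - 1) * p

# Toy data (hypothetical): (user, item, topic distribution of the review, rating).
reviews = [
    ("u1", "i1", [0.9, 0.1], 5),
    ("u1", "i2", [0.8, 0.2], 4),
    ("u2", "i1", [0.2, 0.8], 2),
    ("u2", "i2", [0.1, 0.9], 1),
]

users, items = build_profiles(reviews)
X = [features(users[u], items[i]) for u, i, _theta, _r in reviews]
y = [(r - 1) / 4 for _u, _i, _theta, r in reviews]  # scale ratings 1..5 to [0, 1]
w, b = fit_logistic(X, y)

high = predict_rating(w, b, users["u1"], items["i1"])
low = predict_rating(w, b, users["u2"], items["i2"])
print(f"u1/i1: {high:.2f}  u2/i2: {low:.2f}")
```

In this toy setting the learned weights reward agreement on the first topic and penalize agreement on the second, so the user-item pair with the higher observed rating also receives the higher prediction. A real system would replace the hand-written topic vectors with distributions inferred from review text.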
