基于协同过滤和word2vec算法的邮箱平台推荐系统
首发时间:2014-11-24
摘要:网上信息量的大幅增长,用户在面对大量信息时无法迅速获得对自己真正有用的那部分,出现信息过载的问题。推荐系统在电子商务、互联网企业中变得越来越重要,这些推荐系统帮助公司更好的了解用户,为用户推荐符合其喜好的商品及服务。电子邮件一直是人们工作、生活中不可或缺的工具,而邮箱中也包含着用户非常多的个人兴趣、爱好特征。本文提出了一种基于电子邮件平台的推荐系统,通过分析用户邮件信息通过中文分词算法生成用户兴趣矩阵,抓取图书数据作为推荐数据库,根据用户兴趣矩阵与图书特征相似矩阵进行推荐,为了消除用户兴趣漂移并增强系统的健壮性,本文加入了反馈系统,根据用户的行为,对推荐结果进行权值调整,使用基于协同过滤的方式进行推荐;当用户未初始用户,只有极少量邮件时,本文创新性地引入word2vec算法,通过进行词的相似性拓展,来解决推荐系统中的冷启动问题;
关键词: 人工智能 数据挖掘 推荐系统 word2vec 邮箱平台 协同过滤
For information in English, please click here
Recommender System Based on Collaborative Filtering and word2vec for Email Platform
Abstract:With the development of Web2.0 technology , the information the user really want submerges in the large amounts of information, the so-called information overload problem. So recommendation systems are becoming more important in e-commerce and Internet companies, which helps companies better understand and analyze users, then recommends for users of goods and services according to their tastes and interests. E-mail has been a indispensable tool with the appearance of the Internet, it contains a lot of the user's personal interests and hobbies features. This paper presents a recommendation system based on e-mail platforms, and use word2vec algorithm and Collaborative Filtering in order to better address the cold start problems which is a common problem for recommendation system. We analyze user mail messages and user behavior logs for user modeling, then generating user interest matrix by the Chinese word segmentation algorithm. We use books as our database to recommend. We recommend books based on the user interest similarity matrix and feature matrix of books. Besides, in order to eliminate drift and enhance user interest robustness of the system, we added a feedback system according to the user's behavior. When a user clicks one thing, the system will get similar things and then recommend to the user. The similarity between books is stored in thing similarity matrix, which was calculated by content-based and collaborative filtering techniques. what's more, if the user has only a few mails, we use word2vec algorithm to solve the cold start problem by performing word similarity expansion.
Keywords: Artificial Intelligence;Data Mining Recommender System word2vec;Email Platform Interest Matrix
基金:
论文图表:
引用
No.4618267101762914****
同行评议
共计0人参与
勘误表
基于协同过滤和word2vec算法的邮箱平台推荐系统
评论
全部评论0/1000