Commodity Recommendation System Based on Deep Reinforcement Learning

Huang Yangming; Kuang Jian

First published: 2021-01-04

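Huang Yangming ¹
Huang Yangming, male, full-time academic master's student; research interests: deep reinforcement learning and recommendation systems
Kuang Jian ¹
Kuang Jian (1966), male, master's supervisor; research interests: Internet of Things and intelligent systems, spaceborne software

1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100086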

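Abstract: Existing commodity recommendation systems usually assume that user data has already been fully collected and that users' behavioral characteristics remain unchanged over long periods. In practice, however, users interact with a recommendation system continuously and closely, and these interactions better reveal a user's current behavioral characteristics and give the system more evidence for accurate recommendation. To address this problem, this paper does two things. First, it designs and implements a multi-element approaching state mechanism and an action grouping mechanism. The multi-element approaching state mechanism provides more evidence when obtaining the reward value of a similar state and also yields state elements that are more similar. The action grouping mechanism collects identical actions into one group, reducing the computation required for each state. Second, it studies the extension and optimization of the commodity recommendation system: the system supports repeated updating and deployment of the deep reinforcement learning recommendation algorithm module, and a commodity recommendation system supporting user login, commodity recommendation, user operations on commodities, and user operation logging is designed and implemented.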

Keywords: deep reinforcement learning; multi-element approaching state; action grouping

Commodity recommendation system based on deep reinforcement learning

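Huang Yangming ¹
Huang Yangming, male, full-time academic master's student; research interests: deep reinforcement learning and recommendation systems
Kuang Jian ¹
Kuang Jian (1966), male, master's supervisor; research interests: Internet of Things and intelligent systems, spaceborne software

1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100086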

Abstract: Current commodity recommendation systems often assume that user data has been fully obtained and that users' behavioral characteristics will not change over a long period. However, users and recommendation systems interact continuously and closely, which better reveals a user's current behavioral characteristics and provides more evidence for accurate recommendation. To address this problem, this paper makes two contributions. First, it designs and implements a multi-element approaching state mechanism and an action grouping mechanism. The multi-element approaching state mechanism provides more evidence when obtaining the reward value of a similar state, and yields more similar state elements. The action grouping mechanism collects identical actions into a group, reducing the amount of computation for each state. Second, it studies the extension and optimization of the commodity recommendation system. The system supports repeated updates and deployment of the deep reinforcement learning recommendation algorithm module, and a commodity recommendation system supporting user login, commodity recommendation, user operations on commodities, and user operation records is designed and implemented.

Keywords: deep reinforcement learning; multi-element approaching state; action grouping

Citation

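Huang Yangming, Kuang Jian. Commodity Recommendation System Based on Deep Reinforcement Learning [EB/OL]. Beijing: Sciencepaper Online (中国科技论文在线) [2021-01-04]. https://www.paper.edu.cn/releasepaper/content/202101-6.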

Paper ID	202101-6
Paper title	Commodity Recommendation System Based on Deep Reinforcement Learning