基于深度强化学习的商品推荐系统
首发时间:2021-01-04
摘要:现在的商品推荐系统常建立在用户数据已充分获取,且其行为特征在长期时间内都不会发生改变的基础上。但用户和推荐系统往往会发生持续且密切的交互行为,从而更好的揭示当前用户的行为特征,为推荐系统进行精准推荐提供更多的依据。针对这一问题,本文主要做了一下两方面工作。第一,本文设计并实现了多元素逼近状态机制和动作分组机制。多元素逼近状态机制使得获取相近状态奖励值时有更多的凭证,也能获得更相近的状态元素。动作分组机制是收集同一动作为一组,减少对每一个状态的计算量。第二,本文对商品推荐系统进行了扩展和优化的研究,系统能支持多次的深度强化学习推荐算法模块的更新和落实,设计并实现了支持用户登录,商品推荐,用户商品操作,用户操作记录的商品推荐系统。?????
For information in English, please click here
Commodity recommendation system based on deep reinforcement learning
Abstract:Current commodity recommendation systems are often based on the fact that user data has been fully obtained, and their behavior characteristics will not change in a long time. However, users and recommendation systems often have continuous and close interactions, which can better reveal current user behavior characteristics and provide more basis for recommendation systems to make accurate recommendations. In response to this problem, this article has mainly done two aspects of work. First, this paper designs and implements a multi-element approaching state mechanism and an action grouping mechanism. The multi-element approaching state mechanism makes it possible to obtain more credentials when obtaining the reward value of the similar state, and to obtain more similar state elements. The action grouping mechanism is to collect the same action as a group, reducing the amount of calculation for each state. Second, this article studies the expansion and optimization of the product recommendation system. The system can support multiple updates and implementations of deep reinforcement learning recommendation algorithm modules. It is designed and implemented to support user login, product recommendation, user product operations, and user operations. Recorded commodity recommendation system.
Keywords: Deep reinforcement learning, multi-element approaching state, Action grouping
基金:
引用
No.****
动态公开评议
共计0人参与
勘误表
基于深度强化学习的商品推荐系统
评论
全部评论0/1000