A Distributed Rule Engine for Processing Big Data
First published: 2015-12-22
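Abstract: A rule engine accepts data as input, interprets business rules, and makes business decisions according to those rules. Because it is limited to running on a single machine, however, system performance degrades markedly when it processes large volumes of data. To address this problem, this paper improves on the traditional MapReduce framework and proposes a method for implementing a distributed rule engine. The method builds a parallel computing cluster to process large volumes of data, and every node in the cluster holds one branch of the Rete network. The Apriori algorithm is used during rule decomposition and deployment to improve system performance. The paper describes the design method in theory and also demonstrates the system's high performance with experimental data.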
Keywords: computer application technology; rule engine; big data; Rete algorithm; Apriori algorithm
A Distributed Architecture for Rule Engine to Deal with Big Data
Abstract: A rule engine, which accepts facts and draws conclusions by repeatedly matching facts against rules, is an effective means of knowledge representation and inference. However, because of its low computational efficiency and the limited capacity of a single machine, it does not cope well with big data. Because the traditional MapReduce architecture addresses this problem only under certain conditions, we make several improvements to it and propose a distributed implementation of the rule engine on a MapReduce-based architecture. The implementation is designed to process large amounts of data in parallel on a computing cluster of multiple machines, each of which runs a certain part of the Rete algorithm. In the phase that splits the rules and the Rete network, an improved Apriori algorithm is adopted to obtain better system performance. This paper describes the design and its implementation in detail and shows its high performance through several experiments.
Keywords: computer application technology; rule engine; big data; Rete algorithm; Apriori algorithm
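The abstract gives the architecture only in outline: rules are split across cluster nodes, each node matches incoming data against its own share of the rules, and the partial results are combined. The following is a minimal Python sketch of that general idea under several loudly stated assumptions, and it is not the paper's implementation. The rule and fact formats, the function names (`partition_rules`, `map_match`, `reduce_matches`, `run`), and the sample data are all hypothetical. Plain set-containment matching stands in for a per-node Rete network, and the condition-frequency count is only the first (single-item support) pass of Apriori, used here to co-locate rules that share a frequent condition.

```python
from collections import Counter, defaultdict

# Hypothetical toy rules: rule name -> set of (attribute, value) conditions.
RULES = {
    "r1": {("type", "order"), ("vip", True)},
    "r2": {("type", "order"), ("amount", "high")},
    "r3": {("type", "refund")},
    "r4": {("type", "refund"), ("vip", True)},
}

# Hypothetical facts: fact id -> attribute/value record.
FACTS = {
    "f1": {"type": "order", "vip": True, "amount": "low"},
    "f2": {"type": "refund", "vip": False},
}


def partition_rules(rules, n_nodes):
    """Group rules by their most frequent condition, then spread groups over nodes."""
    # Support of each single condition across all rules (Apriori's first pass only).
    support = Counter(c for conds in rules.values() for c in conds)
    groups = defaultdict(dict)
    for name, conds in rules.items():
        # Ties are broken by repr so the partition is deterministic.
        key = max(conds, key=lambda c: (support[c], repr(c)))
        groups[key][name] = conds
    nodes = [dict() for _ in range(n_nodes)]
    for i, key in enumerate(sorted(groups, key=repr)):
        nodes[i % n_nodes].update(groups[key])
    return nodes


def map_match(node_rules, facts):
    """Map step: one node tests every fact against its local rules only."""
    for fact_id, fact in facts.items():
        items = set(fact.items())
        for name, conds in node_rules.items():
            if conds <= items:  # every condition of the rule holds for this fact
                yield fact_id, name


def reduce_matches(pairs):
    """Reduce step: merge (fact, rule) pairs from all nodes into one result."""
    fired = defaultdict(set)
    for fact_id, name in pairs:
        fired[fact_id].add(name)
    return dict(fired)


def run(rules, facts, n_nodes=2):
    # Nodes are simulated sequentially; a real cluster would run them in parallel.
    nodes = partition_rules(rules, n_nodes)
    pairs = [p for node in nodes for p in map_match(node, facts)]
    return reduce_matches(pairs)


if __name__ == "__main__":
    print(run(RULES, FACTS))
```

Because every rule lives on exactly one simulated node and the reduce step is a plain union, the set of fired rules does not depend on how many nodes are used; only the work per node changes.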
Citation
No.4670162112088714****