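Comparative Analysis of Data Equalization Algorithms in Spatio-temporal Dynamic Models: A Case Study of the Three Gorges Reservoir Area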

LEI Yuhang; LIU Minghao; CHEN SiNan; LI Junyi

First published: 2019-12-25

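LEI Yuhang ^{1,2}
LEI Yuhang (b. 1994), male, master's student. Research interests: dynamic simulation of urban and rural land-use change, and applications of computer science and technology.
LIU Minghao ^{1,2}
LIU Minghao (b. 1970), male, Ph.D., professor and master's supervisor. Research interests: geocomputational intelligence, dynamic simulation of urban and rural land-use change, and land resource management.
CHEN SiNan ^{1,2}
LI Junyi ^{1,2}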

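1. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065
2. Spatial Information Research Center, Chongqing University of Posts and Telecommunications, Chongqing 400065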

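Abstract: In spatio-temporal models that combine machine learning with cellular automata (CA), the simulation accuracy of key minority land classes is often too low because the data distribution is imbalanced, and solving this problem has considerable practical value. Building on the Markov-MLP-CA spatio-temporal dynamic model and taking the Three Gorges Reservoir area as an example, this paper designs schemes with different data balance degrees and different sampling algorithms, and compares the Markov-MLP-CA simulation results obtained under each scheme. The results show: (1) As the balance degree of the training data set rises from 0.64% to 7.65%, 18.38%, 23.06% and 100%, the KAPPA of wetland, a minority land class, rises from 26.19% to 33.69%, 36.57%, 36.86% and 42.05%, and the KAPPA of shrub land improves as well. (2) After the training data are balanced, the accuracy of the minority land classes improves to varying degrees. (3) The model that couples Markov-MLP-CA with the SMOTE-Tomek sampling algorithm has an overall KAPPA of 0.8404, the smallest volatility of per-class KAPPA (49.08%), and the highest macro-F1 (0.7219). The study concludes: (1) Improving the balance degree of the training data and the sampling algorithm raises the simulation accuracy of minority land classes and lowers the volatility of the per-class KAPPA indices, which improves overall model performance. (2) Model performance evaluation should jointly consider KAPPA, the volatility of the KAPPA indices, and macro-F1. (3) By comparison, the model coupling Markov-MLP-CA with the SMOTE-Tomek sampling algorithm gives the better simulation performance.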
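The abstract reports balance degrees from 0.64% to 100% but does not define the measure. As a minimal sketch, one plausible reading is the ratio of the rarest class count to the most common class count; that definition, the function names `balance_degree` and `oversample_targets`, and the sample counts below are assumptions for illustration, not the authors' procedure.

```python
import math
from collections import Counter


def balance_degree(labels):
    """Assumed definition: rarest class count divided by most common class count."""
    counts = Counter(labels)
    return min(counts.values()) / max(counts.values())


def oversample_targets(labels, target_degree):
    """Per-class sample counts needed so that every class holds at least
    target_degree * (majority class count) samples. Classes already above
    that floor keep their original count."""
    counts = Counter(labels)
    floor = math.ceil(target_degree * max(counts.values()))
    return {cls: max(n, floor) for cls, n in counts.items()}


# Hypothetical training labels: one dominant class and two minority classes.
labels = ["forest"] * 1000 + ["shrub land"] * 120 + ["wetland"] * 10
print(balance_degree(labels))            # ratio of wetland to forest samples
print(oversample_targets(labels, 0.25))  # counts a sampler would have to reach
```

The targets only say how many samples each class needs; how the extra minority samples are produced (random duplication, SMOTE interpolation, SMOTE followed by Tomek-link cleaning) is a separate choice, and the abstract compares such choices.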

Keywords: land use change simulation; data imbalance; SMOTE algorithm; multilayer perceptron; cellular automata


Comparative Analysis of Data Equalization Algorithms in Spatio-temporal Dynamic Models: A Case Study of the Three Gorges Reservoir Area

LEI Yuhang ^{1,2}
LIU Minghao ^{1,2}
CHEN SiNan ^{1,2}
LI Junyi ^{1,2}

1. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065
2. Spatial Information Research Center, Chongqing University of Posts and Telecommunications, Chongqing 400065

Abstract: In spatio-temporal models that combine machine learning with cellular automata (CA), it is important to address the low simulation accuracy of key minority land classes caused by imbalanced data distribution. Based on the Markov-MLP-CA spatio-temporal dynamic model and taking the Three Gorges Reservoir area as an example, different data balance strategies and sampling algorithm schemes are designed, and the Markov-MLP-CA simulation results under these schemes are compared and analyzed. The results show: (1) When the balance degree of the training data set increases from 0.64% to 7.65%, 18.38%, 23.06% and 100%, the KAPPA of wetland, a minority land class, increases from 26.19% to 33.69%, 36.57%, 36.86% and 42.05%, respectively, and the KAPPA of shrub land increases correspondingly. (2) After the training data are balanced, the accuracy of the minority land classes improves to varying degrees. (3) The model coupling Markov-MLP-CA with the SMOTE-Tomek sampling algorithm has an overall KAPPA of 0.8404, the lowest volatility of KAPPA across land classes (49.08%), and the highest macro-F1 (0.7219). The study concludes: (1) Improving the balance degree of the training data and the sampling algorithm raises the simulation accuracy of the minority land classes, reduces the volatility of the KAPPA indices, and thereby improves the overall performance of the model. (2) KAPPA, the volatility of the KAPPA indices, and macro-F1 should be considered together when evaluating model performance. (3) In comparison, the model coupling Markov-MLP-CA with the SMOTE-Tomek sampling algorithm has better simulation performance.
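The abstract evaluates models with overall KAPPA and macro-F1. The sketch below computes both from a confusion matrix using their standard definitions (Cohen's kappa, and the unweighted mean of per-class F1). The function names and the 2x2 matrix are hypothetical; the paper's per-class KAPPA and its volatility measure are not defined in the abstract and are not reproduced here.

```python
def cohen_kappa(cm):
    """Cohen's kappa for a square confusion matrix (rows: reference, cols: predicted)."""
    k = len(cm)
    n = sum(sum(row) for row in cm)
    observed = sum(cm[i][i] for i in range(k)) / n
    # Chance agreement: product of row and column marginals, summed over classes.
    expected = sum(
        sum(cm[i]) * sum(cm[r][i] for r in range(k)) for i in range(k)
    ) / (n * n)
    return (observed - expected) / (1 - expected)


def macro_f1(cm):
    """Unweighted mean of per-class F1, so minority classes count as much as majority ones."""
    k = len(cm)
    scores = []
    for i in range(k):
        tp = cm[i][i]
        fp = sum(cm[r][i] for r in range(k)) - tp
        fn = sum(cm[i]) - tp
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / k


# Hypothetical matrix: a majority class (100 cells) and a minority class (10 cells).
cm = [[90, 10],
      [5, 5]]
print(cohen_kappa(cm), macro_f1(cm))
```

In this toy matrix about 86% of cells are labelled correctly, yet both scores are far lower because the minority class is predicted poorly. That is the kind of effect the abstract's recommendation to look beyond a single overall figure is meant to capture.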

Keywords: land use change simulation; imbalanced data; SMOTE algorithm; multilayer perceptron; cellular automata

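Cite as: LEI Yuhang, LIU Minghao, CHEN SiNan, et al. Comparative Analysis of Data Equalization Algorithms in Spatio-temporal Dynamic Models: A Case Study of the Three Gorges Reservoir Area [EB/OL]. Beijing: Sciencepaper Online (中国科技论文在线) [2019-12-25]. https://www.paper.edu.cn/releasepaper/content/201912-98.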

Paper No.: 201912-98