基于文本的地理空间数据挖掘与可视化
首发时间:2009-01-24
摘要:Web文本挖掘在现今各行业中越来越受欢迎,而基于文本的地理空间数据的挖掘与可视化还是一较新颖的研究领域。本文在综述文本数据挖掘研究现状的基础上,提出了基于传统文本的地理空间数据挖掘应用模型,利用Arcview作为构建模型的基础平台,实现了基于文本的地理信息的可视化,和历史景点相关属性信息的查询。 本文选取庐山“山北第二路”作为文本数据挖掘模型的应用实例研究。首先将文本格式的《庐山志》中与“山北第二路”相关的内容转换成电子文档,并进行必要的预处理,在此基础上,建立包括试验区主要景点及其空间关系在内的数据字典,同时按照点、线地物的分类进行编码,绘制草图,设计空间数据库框架;在准备好数据库表格后,利用Arcview3.2软件,绘制主要景点的点、线类图形,并载入空间属性库。试验模型系统具有历史地物景点的分类、统计与显示查询等功能,并可以与现代景点进行对比分析。论文最后对此模型的不足作相关分析,进而提出改进建议。
For information in English, please click here
Geo-Spatial Data Mining and Visualization Based on Texts
Abstract:Recently, Web Text Mining is more and more popular to different kinds of industries, while geographical Data Mining and Visualization based on text belong to an emerging field. After reviewing the status quo of text mining research, the thesis brings forward an application model of geographical Data Mining based on traditional texts, and implements visualization of geographical information extracted from text data mining and inquiry of related attributes of historic scene places, by using Arcview3.2 as modeling platform. This thesis selects the second road in the north of Lushan as an application example research of Text Data Mining Model. First of all, traditional textual contents related to the second road in the north of Lushan are converted to electrical documents and necessary preprocessing is performed. Then, a data dictionary is set up covering the primary scene places and their spatial connection and a code system is established in terms of the geometry classification, namely as points or lines, of the scenes in the case study. Then sketches are drawn and architecture of spatial database is designed. After filling out database tables, both the spatial and attributes data are stored as different themes in Arcview3.2. The experimental model system possesses a series functions, like classification, statistics, and presentation of historic scene places. Moreover, it can be used to make comparative analysis between historic and current scene places. Finally the thesis analyses the weakness of this model and that proposes suggestions for improvement.
Keywords: The second road in the north of Lushan Text Mining Data dictionary Query
基金:
论文图表:
引用
No.2830634973712327****
同行评议
共计0人参与
勘误表
基于文本的地理空间数据挖掘与可视化
评论
全部评论0/1000