华东师范大学学报(自然科学版) ›› 2017, Vol. ›› Issue (3): 145-152.doi: 10.3969/j.issn.1000-5641.2017.03.016

• 地理学 • 上一篇    

基于聚类关联规则的公交扒窃犯罪时空分析

闫密巧, 过仲阳, 任浙豪   

  1. 华东师范大学 地理科学学院, 上海 200241
  • 收稿日期:2016-06-17 出版日期:2017-05-25 发布日期:2017-05-18
  • 通讯作者: 过仲阳,男,教授,博士生导师,研究方向为数据挖掘、数据可视化.E-mail:zyguo@geo.ecnu.edu.cn E-mail:zyguo@geo.ecnu.edu.cn
  • 作者简介:闫密巧,女,硕士研究生,研究方向为数据挖掘
  • 基金资助:
    国家理科基地科研训练及科研能力提高项目(J1310028)

Spatio-temporal analysis of bus pickpocketing using association rules based on clustering

YAN Mi-qiao, GUO Zhong-yang, REN Zhe-hao   

  1. School of Geography Sciences, East China Normal University, Shanghai 200241, China
  • Received:2016-06-17 Online:2017-05-25 Published:2017-05-18

摘要: 提出了一种基于聚类的时空关联规则的公交犯罪挖掘算法.针对某市一个区的110报警数据库中的大量业务信息进行分析.首先,通过文本挖掘技术从案情信息中提取时间、地点等信息,并利用高德地图API的地理编码服务和POI搜索功能对提取的地址信息进行地址匹配,提取受害人上下车站点、乘坐公交线路等信息.其次,对提取得到的时空数据进行归并处理.最后,根据案发时段、季节以及是否节假日进行聚类分析,然后在簇内进行时空关联规则分析.这种挖掘方法具有以下特点:①在聚类基础上进行关联规则分析,减少扫描数据库次数,大大缩小数据扫描范围,提高算法效率,更加适合海量犯罪数据的挖掘.②聚类后簇内数据具有相似性,特征更加明显,在此基础上进行关联规则分析产生较小的频繁项集,并且提取出置信度较高的规则.③考虑犯罪行为的时空特性,挖掘过程中同时考虑了案发季节、是否节假日等因素.

关键词: 公交扒窃, 聚类分析, 关联规则, 犯罪模式识别

Abstract: This paper introduced the spatio-temporal association rules based on clustering minging to find out the spatio-temporal crime patterns of bus pickpocketing. It can be carried out through three steps. Firstly, extract time, places and other information from the case information by text extraction. Then, confirm the boarding stations and getting off stations of victims using the geocoding service and POI search capability of Amap API. Divide the bus routes into sections according to the bus stops and merge the crime time into time interval. Thirdly, the analysis of association rules based on clustering is carried out to discover the patterns of bus pickpocketing. The results prove that the proposed mining model has the following characteristics: ①This method can reduce the database scanning times, the candidate item sets amount and improve time efficiency of the searching. ②After clustering, the data in a cluster is similar and the characteristics are more obvious. On this basis, the association rules of high confidence are extracted. ③When the analysis was carried out, the temporal and spatial characteristics of the bus pickpocketing crime were also considered.

Key words: bus pickpocketing, clustering, spatio-temporal association rules, crime pattern recognition

中图分类号: