Citation: YING Yi, REN Kai, LIU Zheng-tao. Data Mining Based on Cloud-Computing Technology[J]. Microelectronics & Computer, 2013, 30(2): 161-164.

Data Mining Based on Cloud-Computing Technology

Abstract: Data mining systems that run on a single node hit a computational bottleneck when processing massive data sets. To address this problem, the paper proposes a data mining method based on cloud-computing technology: the large data set and the mining tasks are decomposed across multiple computers and processed in parallel. After adapting the classic Apriori algorithm to MapReduce, the authors build a parallel data mining platform on the open-source Hadoop framework and verify its effectiveness by mining the order (menu) data of a restaurant ordering system. The experimental results show that using cloud-computing technology to process large data sets on a cluster can significantly improve the efficiency of data mining.
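The abstract does not give the details of the MapReduce adaptation of Apriori, so the following is only a minimal sketch of the general idea, not the authors' implementation. It simulates the map and reduce steps in a single Python process instead of calling Hadoop, and the function names (`map_phase`, `reduce_phase`, `next_candidates`, `apriori`) and the sample orders are hypothetical. In each pass, the map step emits a count of 1 for every candidate itemset contained in a transaction, and the reduce step sums the counts and keeps the itemsets that meet the minimum support count.

```python
from collections import defaultdict
from itertools import combinations


def map_phase(transaction, candidates):
    """Map step: emit (itemset, 1) for each candidate found in one transaction."""
    items = set(transaction)
    for cand in candidates:
        if cand <= items:
            yield cand, 1


def reduce_phase(pairs, min_count):
    """Reduce step: sum counts per itemset and keep those meeting min_count."""
    counts = defaultdict(int)
    for itemset, value in pairs:
        counts[itemset] += value
    return {s: c for s, c in counts.items() if c >= min_count}


def next_candidates(frequent, k):
    """Build k-itemsets whose (k-1)-subsets are all frequent (Apriori pruning)."""
    items = sorted({item for itemset in frequent for item in itemset})
    return {
        frozenset(combo)
        for combo in combinations(items, k)
        if all(frozenset(sub) in frequent for sub in combinations(combo, k - 1))
    }


def apriori(transactions, min_count):
    """Run one simulated map/reduce pass per itemset size until no candidates remain."""
    candidates = {frozenset([item]) for t in transactions for item in t}
    all_frequent = {}
    k = 1
    while candidates:
        # On a real cluster, each mapper would handle one split of the transactions.
        pairs = (p for t in transactions for p in map_phase(t, candidates))
        frequent = reduce_phase(pairs, min_count)
        all_frequent.update(frequent)
        k += 1
        candidates = next_candidates(set(frequent), k)
    return all_frequent


# Hypothetical restaurant orders, used only to exercise the sketch.
orders = [
    ["noodles", "tea"],
    ["noodles", "tea", "dumplings"],
    ["noodles", "dumplings"],
    ["tea"],
]
frequent_itemsets = apriori(orders, min_count=2)
```

Because the map step treats each transaction independently, the transactions can be split across machines, which is the property that lets the counting passes run in parallel on a cluster.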

     
