郑斌. 基于改进遗传算法的不完整大数据填充挖掘算法[J]. 微电子学与计算机, 2016, 33(2): 98-101.
引用本文: 郑斌. 基于改进遗传算法的不完整大数据填充挖掘算法[J]. 微电子学与计算机, 2016, 33(2): 98-101.
ZHENG Bin. Incomplete Data Filling Mining Algorithm Based on the Improved Genetic Algorithm[J]. Microelectronics & Computer, 2016, 33(2): 98-101.
Citation: ZHENG Bin. Incomplete Data Filling Mining Algorithm Based on the Improved Genetic Algorithm[J]. Microelectronics & Computer, 2016, 33(2): 98-101.

基于改进遗传算法的不完整大数据填充挖掘算法

Incomplete Data Filling Mining Algorithm Based on the Improved Genetic Algorithm

  • 摘要: 传统的大数据填充挖掘过程存在耗时高、效率低的缺陷, 对此提出基于改进遗传算法的不完整大数据填充挖掘算法.该算法先融合于BP人工网络神经算法随机产生不完整大数据初始网络权值, 并对该组权值进行训练, 设计一个不完整大数据基因矩阵, 利用遗传算法以该基因矩阵为依据计算出不完整大数据适应值函数, 并同时进行编码、杂交、变异等遗传操作, 在此基础上进行不完整大数据信息的全局搜索, 以其搜索的结果为核心将大数据划分为完整与不完整数据, 采用信息论中熵值的概念利用同一类完整数据信息的属性值对缺失值进行填充.实验仿真证明, 基于改进遗传算法的不完整大数据填充挖掘方法能实现对不完整大数据的挖掘, 提高缺失数据的填充精度.

     

    Abstract: Traditional big data high filling mining process is time-consuming, the defects of low efficiency. Based on improved genetic algorithm of incomplete data filling mining algorithm. This algorithm fusion in the BP artificial neural network algorithm first randomly generated incomplete data the initial network weights, and to train the weights of the group to design a incomplete data gene matrix, by using the genetic algorithm based on the gene matrix to calculate the incomplete data adaptive value function, and at the same time coding, hybridization and mutation genetic operation, on the basis of incomplete big data information of global search, with its search results as the core will be divided into complete and incomplete data, using the concept of entropy in information theory, and the same kind of complete data information of attribute values to fill the missing value. The experimental simulation proves that the incomplete big data based on improved genetic algorithm of incomplete filling mining method can realize data mining, improve the filling precision of the missing data.

     

/

返回文章
返回