朱琪, 林果园. 基于改进随机森林算法的钓鱼网站检测方法研究[J]. 微电子学与计算机, 2019, 36(4): 43-46, 51.
引用本文: 朱琪, 林果园. 基于改进随机森林算法的钓鱼网站检测方法研究[J]. 微电子学与计算机, 2019, 36(4): 43-46, 51.
ZHU Qi, LIN Guo-yuan. Research on Detection Methods of Phishing Websites Based on Improved Random Forest Algorithm[J]. Microelectronics & Computer, 2019, 36(4): 43-46, 51.
Citation: ZHU Qi, LIN Guo-yuan. Research on Detection Methods of Phishing Websites Based on Improved Random Forest Algorithm[J]. Microelectronics & Computer, 2019, 36(4): 43-46, 51.

基于改进随机森林算法的钓鱼网站检测方法研究

Research on Detection Methods of Phishing Websites Based on Improved Random Forest Algorithm

  • 摘要: 为了更准确快捷的对钓鱼网站进行识别, 提出了一种基于改进随机森林算法的钓鱼网站检测方法.该方法挖掘钓鱼网页特征之间潜在的关联规则, 并对数据集进行分区, 以此区分特征数据的重要程度并计算权重以及数据选取的比例, 选取数据后对数据空间进行相应的集合化与剪辑以此优化森林的建立, 并根据建立的森林达到对钓鱼网站检测识别的目的.最终实验说明, 该方法对钓鱼网站的检测识别具有很好的效果和效率.

     

    Abstract: In order to improve the efficiency of phishing detection, a new algorithm was proposed to improve the traditional random forest algorithm. Potential association rules between web features are mined and used to partition the data set, in order to distinguish the features of different structures and calculate the weight of different data space to determine the scale of the selection. After selection of data, training data sets need to be aggregated and clipped to optimize the establishment of forests. Websites are trained and predicted using voting in decision forest. Experiments result shows that the new algorithm has obvious advantages in efficiency and effectiveness compared with the other two algorithm.

     

/

返回文章
返回