Abstract:
To solve the problem that the existing privacy preserving association rule mining algorithm cannot meet better trade-off between efficiency and accuracy,the paper proposes average information distributed clustering hybrid algorithm.The algorithm creates a vector of association rules,which uses content of information theory.Accumulation of calculation information source the number of times and extraction obvious features of a potential,the potential characteristics of the neighborhood as clustering object clustering,and the introduction of data mining association ontology concept,digging under the conditions of the non-monotonicity constraint,to overcome the weakening drawbacks associated space data by the Privacy.The experiments show that the algorithm can obtain a good tradeoff between accuracy and efficiency in the case of the protection of privacy.