基于轻量图卷积增强嵌入学习的点击率预测模型

张琳钰; 牛存良; 姚政

doi:10.19304/J.ISSN1000-7180.2021.1069

基于轻量图卷积增强嵌入学习的点击率预测模型

Enhanced embedding learning for CTR prediction based on LightGCN

摘要

摘要: 基于特征交互建模方法的点击率预测问题经广泛探索已经取得较大进展，它能缓解有效信息损失，但在一定程度上依赖于不同特征的共同出现，存在特征稀疏问题.因此，针对交互过程特征出现次数少不能高效学习特征表示的问题，提出了一个基于轻量图卷积增强嵌入层学习的点击率预测模型LGCDFM(LightGCN with DeepFM).在初始嵌入层采用分而治之的学习策略，提出图结构中区分不同类型节点，首先由同类型节点信息传播确保特征出现频率，再由高阶连通的不同类型节点间交互捕捉多跳邻居信息.轻量图卷积神经结构强大的特征提取和表示学习能力，且摒弃无益于交互的特征变换和非线性激活函数，成为处理简单用户-项目交互数据的协同过滤任务的优势，有效减轻特征稀疏性问题.最后，表示学习层应用点击率预测经典模型DeepFM端到端学习高阶和低阶特征组合，由隐向量从稀疏数据中学习，提升点击率预测任务性能.通过在Criteo、Avazu两个公开数据集上的实验表明，该模型在点击率预测和特征稀疏问题上的性能表现均优于现有方法.

Abstract: The Click-through rate problem based on feature interaction modeling method has been widely explored and has made great progress. It can alleviate the loss of effective information, but to a certain extent, it depends on the co-occurrence of different features, and there is a problem of feature sparseness. Therefore, in order to solve the problem that the feature representation cannot be learned efficiently because of the few occurrences of the interactive process features, a click-through rate prediction model LGCDFM (LightGCN with DeepFM) based on LightGCN enhanced embedding layer learning is proposed. In the initial embedding layer, a divide-and-conquer learning strategy is adopted. Different types of nodes are distinguished in the graph structure. The information of the same types of nodes is first transmitted to ensure the frequency of features, and then the information of multi-hop neighbors is captured by the interaction between different types of high-order connected nodes. LightGCN structure has powerful feature extraction and representation learning capabilities, and it discards feature transformations and nonlinear activation functions that are not conducive to interaction. It becomes the advantage of collaborative filtering tasks for processing simple user-item interaction data, effectively reducing feature sparsity problem. Finally, it means that the learning layer applies the classic model of click-through rate prediction DeepFM to end-to-end learning high-order and low-order feature combinations, and learns from sparse data by latent vectors to improve the performance of click-through rate prediction tasks. Experiments on two public datasets of Criteo and Avazu show that the performance of this model is better than existing methods in terms of click-through rate prediction and feature sparseness problems.

HTML全文

参考文献(20)

施引文献

资源附件(0)