1. School of Information Engineering, Huzhou University, Huzhou 313000;
2. Zhejiang Province Key Laboratory of Smart Management and Application of Modern Agricultural Resources, Huzhou University, Huzhou 313000;
3. School of Computer Science and Technology, Zhejiang Normal University, Jinhua 321004;
4. School of Science and Engineering, Huzhou College, Huzhou 313000
To address the insufficient exploitation of inter-node information and the overfitting problem in existing short text classification methods based on heterogeneous graph neural networks, a method for semi-supervised short text classification with a gated double-layer heterogeneous graph attention network (GDHG) is proposed. GDHG consists of two layers: a node attention mechanism and a gated heterogeneous graph attention network. First, the node attention mechanism is employed to train attention coefficients for different types of nodes, and these coefficients are then fed into the gated heterogeneous graph attention network to obtain the gated double-layer attention. Next, the gated double-layer attention is multiplied by the different states of the nodes to produce the aggregated node features. Finally, the short texts are classified with the softmax function. GDHG exploits the information-forgetting mechanism of the node attention mechanism and the gated heterogeneous graph attention network to aggregate node information, thereby capturing effective information from neighboring nodes, mining the hidden information of different neighbors, and improving the ability to aggregate information from remote nodes. Experiments on four short text datasets, Twitter, MR, Snippets and AGNews, demonstrate the superior performance of GDHG.
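To make the two-stage design concrete, the following is a minimal, illustrative PyTorch sketch of the pipeline the abstract describes: type-specific node attention coefficients are computed first, then a sigmoid forgetting gate blends the attended neighbor information with each node's own state, and the result is classified with softmax. All class, function, and parameter names here (GDHGLayer, type_proj, gate, and so on) are hypothetical illustrations, not the authors' implementation; the paper's exact per-type attention and gating equations may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GDHGLayer(nn.Module):
    """Illustrative sketch only: type-aware node attention followed by a
    gated (forgetting) aggregation, as outlined in the abstract."""

    def __init__(self, in_dim, hid_dim, num_classes, num_types):
        super().__init__()
        # One linear projection per node type (e.g., document, entity, topic).
        self.type_proj = nn.ModuleList(
            nn.Linear(in_dim, hid_dim) for _ in range(num_types))
        self.attn = nn.Linear(2 * hid_dim, 1)        # node-level attention scorer
        self.gate = nn.Linear(2 * hid_dim, hid_dim)  # information-forgetting gate
        self.cls = nn.Linear(hid_dim, num_classes)   # classifier head

    def forward(self, x, adj, node_type):
        # x: (n, in_dim) node features; adj: (n, n) adjacency with self-loops;
        # node_type: length-n list of integer type ids.
        h = torch.stack([self.type_proj[t](x[i])
                         for i, t in enumerate(node_type)])

        # Stage 1: attention coefficients over neighbors (GAT-style scoring).
        n = h.size(0)
        pair = torch.cat([h.unsqueeze(1).expand(n, n, -1),
                          h.unsqueeze(0).expand(n, n, -1)], dim=-1)
        scores = F.leaky_relu(self.attn(pair)).squeeze(-1)
        scores = scores.masked_fill(adj == 0, float('-inf'))
        alpha = F.softmax(scores, dim=-1)            # (n, n) coefficients

        # Stage 2: gated aggregation -- the sigmoid gate decides how much
        # attended neighbor information each node keeps versus forgets.
        m = alpha @ h                                # attended neighbor summary
        z = torch.sigmoid(self.gate(torch.cat([h, m], dim=-1)))
        h_new = z * m + (1 - z) * h                  # gated double-layer attention

        # Final step: softmax classification over the aggregated features.
        return F.log_softmax(self.cls(h_new), dim=-1)
```

In the semi-supervised setting the abstract describes, a full model would presumably stack such layers over the heterogeneous text graph and train with a cross-entropy loss computed only on the labeled document nodes.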