融合标签关系的法律文本多标签分类方法

doi:10.16451/j.cnki.issn1003-6059.202202009

摘要
图/表
参考文献(0)
相关文章 (9)

全文: PDF (727 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要

随着大数据技术的快速发展,多标签文本分类在司法领域也催生出诸多应用.在法律文本中通常存在多个要素标签,标签之间往往具有相互依赖性或相关性,准确识别这些标签需要多标签分类方法的支持.因此,文中提出融合标签关系的法律文本多标签分类方法.方法构建标签的共现矩阵,利用图卷积网络捕捉标签之间的依赖关系,并结合标签注意力机制,计算法律文本和标签每个词的相关程度,得到特定标签的法律文本语义表示.最后,融合标签图构建的依赖关系和特定标签的法律文本语义表示,对文本进行综合表示,实现文本的多标签分类.在法律数据集上的实验表明,文中方法获得较好的分类精度和稳定性.

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	宋泽宇
	李旸
	李德玉
	王素格

关键词 ：多标签分类, 文本表示, 图卷积神经网络, 标签注意力机制, 标签关系

Abstract：

With the rapid development of big data technology, multi-label text classification spawns many applications in the judicial field. There are multiple element labels in legal texts, and the labels are interdependent or correlated. Accurate identification of these labels requires the support of multi-label classification method. In this paper, a multi-label classification method of legal texts with fusion of label relations(MLC-FLR) is proposed. A graph convolution network model is utilized to capture the dependency relationship between labels by constructing the co-occurrence matrix of labels. The label attention mechanism is employed to calculate the degrees of correlation between a legal text and each label word, and the legal text semantic representation of a specific label can be obtained. Finally, the comprehensive representation of a text for multi-label classification is carried out by combining the dependency relationship and the legal text semantic representation of a specific label. Experimental results on the legal text datasets show that MLC-FLR achieves better classification accuracy and stability.

Key words： Multi-label Classification Document Representation Graph Convolutional Neural Network Label Attention Mechanism Label Relation

收稿日期: 2021-05-07

ZTFLH:

TP 391

基金资助:

国家自然科学基金项目(No.62072294,62076158,62106130,61906112)、山西省重点研发计划项目(No.201803D421024)、山西省研究生创新项目(No.2021Y149)资助

通讯作者: 李德玉,博士,教授,主要研究方向为粒计算、机器学习.E-mail:lidy@sxu.edu.cn.

作者简介: 宋泽宇,硕士研究生,主要研究方向为文本挖掘、自然语言处理.E-mail:szy5403@163.com.李旸,博士,讲师,主要研究方向为文本情感分析.E-mail:liyangprimrose@163.com.王素格,博士,教授,主要研究方向为自然语言处理、情感分析.E-mail:wsg@sxu.edu.cn.

引用本文:

宋泽宇, 李旸, 李德玉, 王素格. 融合标签关系的法律文本多标签分类方法[J]. 模式识别与人工智能, 2022, 35(2): 185-192. SONG Zeyu, LI Yang, LI Deyu, WANG Suge. Multi-label Classification of Legal Text with Fusion of Label Relations. Pattern Recognition and Artificial Intelligence, 2022, 35(2): 185-192.

链接本文:

http://manu46.magtech.com.cn/Jweb_prai/CN/10.16451/j.cnki.issn1003-6059.202202009 或 http://manu46.magtech.com.cn/Jweb_prai/CN/Y2022/V35/I2/185