Hierarchical Topic Model Based on Chain of Thought and Semantic Decoupling
WANG Zhihua1, LI Yang2, LI Deyu1,3, WANG Suge1,3
1. School of Computer and Information Technology, Shanxi University, Taiyuan 030006; 2. School of Finance, Shanxi University of Finance and Economics, Taiyuan 030006; 3. Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan 030006
Abstract: Hierarchical topic models can uncover latent topics in documents and model the hierarchical relationships among those topics, providing technical support for applications such as data governance, information retrieval, content classification and knowledge management. A hierarchical topic model based on chain of thought and semantic decoupling (CoT-SDHT-M) is proposed in this paper. First, a hierarchical topic generation module based on chain of thought is established: an initial hierarchical topic structure is generated by a large language model (LLM) under the guidance of a hierarchical topic generation chain of thought. Then, an LLM-based topic similarity discrimination mechanism is introduced to refine the generated topics and to guide the LLM in merging similar topics through examples, thereby improving the quality of the generated topics. Finally, a hierarchical topic optimization module based on optimal transport and semantic decoupling is designed: the initial hierarchical structure is incorporated as a topic prior for downstream modeling, the relationships between topics are modeled as an optimal transport problem, and parent-child topic decoupling is performed on the keywords of upper-layer and lower-layer topics to optimize the hierarchical topic structure. Experiments on standard public datasets, including NeurIPS, ACL and 20 Newsgroups, demonstrate that CoT-SDHT-M significantly outperforms existing baseline models on both topic quality metrics and hierarchical metrics.
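The optimal transport step in the optimization module can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes topic-to-topic costs come from some semantic distance (e.g. between topic embeddings), uses the standard entropy-regularized Sinkhorn iteration to compute a coupling between lower-layer (child) and upper-layer (parent) topics, and all names (`sinkhorn`, `cost`, the toy sizes) are illustrative.

```python
import numpy as np

def sinkhorn(cost, a, b, eps=0.1, n_iters=200):
    """Entropy-regularized optimal transport via Sinkhorn iterations.

    cost : (m, n) cost matrix, e.g. semantic distances between
           m lower-layer and n upper-layer topics (an assumption here).
    a, b : marginal weights of the two topic layers (each sums to 1).
    Returns the (m, n) transport plan coupling child to parent topics.
    """
    K = np.exp(-cost / eps)           # Gibbs kernel of the cost matrix
    u = np.ones_like(a)
    for _ in range(n_iters):          # alternate marginal projections
        v = b / (K.T @ u)
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]

# Toy example: 4 child topics coupled to 2 parent topics.
rng = np.random.default_rng(0)
C = rng.random((4, 2))                # illustrative pairwise distances
a = np.full(4, 1 / 4)                 # uniform mass over child topics
b = np.full(2, 1 / 2)                 # uniform mass over parent topics
P = sinkhorn(C, a, b)                 # transport plan, rows sum to a
parents = P.argmax(axis=1)            # hard parent assignment per child
```

In this reading, a large entry P[i, j] indicates that child topic i should attach under parent topic j in the optimized hierarchy; the entropy term eps controls how soft that assignment is.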
WANG Zhihua, LI Yang, LI Deyu, WANG Suge. Hierarchical Topic Model Based on Chain of Thought and Semantic Decoupling. Pattern Recognition and Artificial Intelligence, 2025, 38(7): 613-626.