Chinese Multi-paragraph Reading Comprehension Model
ZHAO Junyao1, 2, PANG Liang1, SU Lixin1, LAN Yanyan1, GUO Jiafeng1, CHENG Xueqi1
1. Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China;
2. School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing 100190, China
In the Chinese multi-paragraph reading comprehension task, three properties should be taken into account: the sparsity of evidence paragraphs, the diversity of Chinese semantics, and the validity of answer snippets. To address these problems, a Chinese multi-paragraph reading comprehension model, CMPReader, is proposed. In CMPReader, data augmentation is exploited to learn from paragraphs containing no answer. Word-level encoding and Chinese word tags are added to enrich the Chinese semantic representation, and the features of the answer snippet are employed by an answer verifier model to choose the right answer. CMPReader is applied to the CIPS-SOGOU factoid question answering dataset, and the results show that both the average exact match score and the F1 score are improved.
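For Chinese span-style answers, the exact match and F1 metrics reported above are commonly computed at the character level. A minimal sketch of such an evaluation (function names are illustrative, not taken from the paper; the paper's own scoring script may differ in normalization details):

```python
from collections import Counter

def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the predicted answer string equals the reference exactly."""
    return float(prediction.strip() == reference.strip())

def char_f1(prediction: str, reference: str) -> float:
    """Character-level F1: harmonic mean of precision and recall
    over the multiset of characters shared by prediction and reference."""
    pred_chars = Counter(prediction.strip())
    ref_chars = Counter(reference.strip())
    overlap = sum((pred_chars & ref_chars).values())  # multiset intersection
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred_chars.values())
    recall = overlap / sum(ref_chars.values())
    return 2 * precision * recall / (precision + recall)
```

For example, a prediction of "北京大学" against the reference "北京" scores 0 exact match but a character-level F1 of 2/3 (precision 0.5, recall 1.0).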