• 中文核心期刊要目总览
  • 中国科技核心期刊
  • 中国科学引文数据库(CSCD)
  • 中国科技论文与引文数据库(CSTPCD)
  • 中国学术期刊文摘数据库(CSAD)
  • 中国学术期刊(网络版)(CNKI)
  • 中文科技期刊数据库
  • 万方数据知识服务平台
  • 中国超星期刊域出版平台
  • 国家科技学术期刊开放平台
  • 荷兰文摘与引文数据库(SCOPUS)
  • 日本科学技术振兴机构数据库(JST)

QGAE:用于生成问答对的端到端无答案问题生成模型

QGAE: an end-to-end answer-agnostic question generation model for generating question-answer pairs

  • 摘要: 问题生成的目标是生成有意义且流畅的问题,以增加可用数据来解决问答类型标注语料库的缺乏问题。以带有可选答案的未注释文本作为输入内容,问题生成可以根据是否提供答案分为两种类型:有答案型和无答案型。即使在提供答案的情况下,生成问题也是具有挑战性的,更不用说在没有提供答案的情况下,对于人类和机器来说生成高质量的问题更加困难。为了解决这个问题,我们提出了一种名为QGAE的新型端到端模型,它能够通过直接提取候选答案,将无答案的问题生成转化为有答案的问题生成。这种方法有效地利用未标记的数据来生成高质量的问答对,其端到端的设计使其比多阶段方法更加方便,后者需要至少两个预训练模型。此外,我们的模型获得了更好的平均分数和更大的多样性。我们的实验结果表明,QGAE在生成问答对方面取得了显著的进展,成为了一种充满潜力的问题生成方法。

     

    Abstract: Question generation aims to generate meaningful and fluent questions, which can address the lack of a question-answer type annotated corpus by augmenting the available data. Using unannotated text with optional answers as input contents, question generation can be divided into two types based on whether answers are provided: answer-aware and answer-agnostic. While generating questions by providing answers is challenging, generating high-quality questions without providing answers is even more difficult for both humans and machines. To address this issue, we proposed a novel end-to-end model called question generation with answer extractor (QGAE), which is able to transform answer-agnostic question generation into answer-aware question generation by directly extracting candidate answers. This approach effectively utilizes unlabeled data for generating high-quality question-answer pairs, and its end-to-end design makes it more convenient than a multi-stage method that requires at least two pre-trained models. Moreover, our model achieves better average scores and greater diversity. Our experiments show that QGAE achieves significant improvements in generating question-answer pairs, making it a promising approach for question generation.

     

/

返回文章
返回