基于自注意力机制的多维跨度信息联合抽取实体与关系及在电力应急系统的应用
MSAM: Self-Attention-Based Joint Extraction of Entities and Relations from Multi-Dimensional Span Information and Its Application in Power Emergency Systems
-
摘要: 随着电力系统的复杂性日益增加,其稳定性和可靠性对于现代社会的经济发展和社会运行至关重要,提高电力故障应急响应的效率,从海量文本数据中准确提取有用信息,并对这些信息进行有效组织,是当前研究的重点。现有研究方法在处理长文本长距离依赖关系和上下文理解方面仍存在许多局限,导致错误传播和语义理解不足。针对这些问题,本文提出了一种基于自注意力机制的多维跨度信息联合抽取实体与关系方法(MSAM)。MSAM通过构建一个联合模型,同时处理实体识别和关系抽取任务,有效捕捉文本中的长、短距离依赖关系,增强上下文理解,减少错误传播。与现有研究相比,MSAM通过在两个公开数据集上的实验相较于现有先进模型有显著提高,特别是在关系抽取任务上。此外,MSAM模型在自建的电力应急数据集上的应用也验证了其在实际应用场景中的有效性。Abstract: With the increasing complexity of power systems, their stability and reliability are crucial to the economic development and social operation of modern society. Improving the efficiency of emergency response to power failures, accurately extracting useful information from massive text data, and effectively organizing such information have become key research focuses. Existing methods still face limitations in handling long-distance dependencies in long texts and context understanding, leading to error propagation and insufficient semantic comprehension. To address these issues, this paper proposes a Multi-dimensional Span-based Attention Mechanism (MSAM) for the joint extraction of entities and relations. By constructing a unified model, MSAM simultaneously performs named entity recognition and relation extraction tasks, effectively capturing both long and short-distance dependencies in text, enhancing context understanding, and reducing error propagation. Compared with state-of-the-art models, experimental results on two public datasets demonstrate that MSAM achieves significant improvements, particularly in relation extraction. Furthermore, the application of MSAM on a self-constructed power emergency dataset verifies its effectiveness in practical scenarios.
下载: