基于图元文本融合和局部敏感自注意力的电力设计CAD图全景符号定位

王文杰; 曹灿; 宋承明; 陈海原; 陈长征; 曹学刚; 钟杰; 杨甫

doi:10.19768/j.cnki.dgjs.2025.14.042

基于图元文本融合和局部敏感自注意力的电力设计CAD图全景符号定位

Panoramic Symbol Spotting of Power Design CAD Drawings Based on Text Fusion of Graphic Elements and Locally Sensitive Self-Attention

摘要

摘要: 电力设计CAD图全景符号定位(PSS)对于电力系统智能化具有重要意义。现有PSS方法应用于电力场景时，难以处理图元数量多、尺寸不均衡的电力设计CAD图，且无法利用图中的文本标注信息。为此，提出PFL-Net模型，该模型基于局部敏感重排和互协方差自注意力机制构建编码器，降低注意力复杂度以适应长图元序列；应用分块的多头双线性池化方法以融合图元文本标注信息。实验结果表明，PFL-Net在电力设计场景的PSS质量领先现有方法9.97%,预测速度从秒级提升至亚秒级，证实其具有更强的适用性。

Abstract: Panoramic symbol spotting(PSS) of power design CAD drawings is of great significance for power system intelligence. Existing PSS methods, when applied to power scenarios, encounter difficulties in handling power design CAD drawings with primitives of a large amount and uneven sizes, and cannot thoroughly utilize the text annotation information in the drawings. To address this, PFL-Net is proposed to construct an encoder based on locally sensitive reordering and a cross-covariance self-attention mechanism, which reduces the attention complexity to adapt to a long sequence of primitives. The model also applies a factorized multi-head bilinear pooling method to fuse the text information into vector graphic primitives. PFL-Net surpasses existing PSS methods by 9.97 percent in panoramic quality under the experiment using power design datasets, and improves prediction speed from second level to sub-second level, indicating its greater applicability.

HTML全文

参考文献(12)

施引文献