Semantic Extraction Method of Multi-scale Nuclear Power Quality Text Fault Information

doi:10.3969/j.issn.1004-132X.2023.08.012

China Mechanical Engineering ›› 2023, Vol. 34 ›› Issue (08): 976-981,992.DOI: 10.3969/j.issn.1004-132X.2023.08.012

Previous Articles Next Articles

Semantic Extraction Method of Multi-scale Nuclear Power Quality Text Fault Information

WU Tingwei1;WANG Mengling1;YI Shuping2;GUO Jingren3

1.Key Laboratory of Smart Manufacturing in Energy Chemical Process，Ministry of Education，East China University of Science and Technology，Shanghai，200237
2.College of Mechanical Engineering，Chongqing University，Chongqing，400044
3.China Nuclear Power Engineering Co.，Ltd.，Shenzhen，Guangdong，518000

Online:2023-04-25 Published:2023-05-17

多尺度核电质量文本故障信息语义抽取方法

吴庭伟1;王梦灵1;易树平2;郭景任3

1.华东理工大学能源化工过程智能制造教育部重点实验室，上海，200237
2.重庆大学机械工程学院，重庆，4000443.中广核工程有限公司，深圳，518000

通讯作者: 王梦灵（通信作者），女，1980年生，副教授。研究方向为数据挖掘、人工智能算法。发表论文30余篇。E-mail：wml_ling@ecust.edu.cn。
作者简介:吴庭伟，男，1998年生，硕士研究生。研究方向为文本分类、信息抽取。E-mail：y30200997@mail.ecust.edu.cn。
基金资助:
国家重点研发计划（2020YFB1711700）

Abstract

Abstract: A semantic extraction method of multi-scale nuclear power quality text fault information was proposed to obtain the information of fault equipment and their stages from nuclear power quality text. The quality text included the faulty equipment and normal equipment， while the whole value chain stages of design， procurement， construction, and commissioning were not described. Firstly， based on Transformer bidirectional encoding， the pre-trained language model were used to convert nuclear equipment quality text into text vectors. The bidirectional gated recurrent unit network with attention mechanism was introduced to mine the key semantic features of quality text defects. On the basis of those above， the conditional random field was used to predict the key semantic features and output the fault equipment. Fine-tuning the extracted key semantic features by multi-layer perceptron， the stages of fault equipment was interpreted. Finally， the experimental verification was conducted based on real nuclear power quality text datasets， and the F1 value reached 94.3%. The results show that the proposed method has good feasibility and effectiveness.

Key words: multi-scale, nuclear power quality text, semantic extraction, pre-trained language model, conditional random field

摘要： 提出了多尺度核电质量文本故障信息语义抽取方法，从核电质量文本描述中获取了存在质量缺陷的故障设备与所属阶段的信息。针对故障设备与正常设备并存，以及所属设计、采购、施工和调试的全价值链阶段未描述的问题，提出了多尺度故障信息抽取策略。基于Transformer双向编码的预训练语言模型将核电质量文本转化为文本向量；采用注意力机制的双向门控循环神经网络挖掘出质量缺陷的关键语义特征；采用条件随机场对关键语义特征进行实体预测，输出故障设备；通过多层感知机对提取的关键语义特征进行微调及推理，解译出故障设备所属阶段。最后，在真实的核电质量文本数据集上进行验证，F1值达到94.3%，表明提出的方法具有较好可行性和有效性。

关键词: 多尺度, 核电质量文本, 语义抽取, 预训练语言模型, 条件随机场

CLC Number:

TP391.1

WU Tingwei, WANG Mengling, YI Shuping, GUO Jingren. Semantic Extraction Method of Multi-scale Nuclear Power Quality Text Fault Information[J]. China Mechanical Engineering, 2023, 34(08): 976-981,992.

吴庭伟, 王梦灵, 易树平, 郭景任. 多尺度核电质量文本故障信息语义抽取方法[J]. 中国机械工程, 2023, 34(08): 976-981,992.

References

［1］ZHAO Y， DIAO X， HUANG J， et al. Automated Identification of Causal Relationships in Nuclear Power Plant Event Reports［J］. Nuclear Technology， 2019， 205（8）：1021-1034.
［2］CHOI Y S， NGUYEN M D， THOMAS N K. Syntactic and Semantic Information Extraction from NPP Procedures Utilizing Natural Language Processing Integrated with Rules［J］. Nuclear Engineering and Technology， 2021， 53（3）：866-878.
［3］WU P， LI X， LI C， et al. Sentiment Classification Using Attention Mechanism and Bidirectional Long Short-term Memory Network［J］. Applied Soft Computing， 2021， 112：107792.
［4］JURADO F. Journalistic Transparency Using CRFs to Identify the Reporter of Newspaper Articles in Spanish［J］. Applied Soft Computing， 2020， 95：106496.
［5］卢淑祺，窦志成，文继荣. 手术病例中结构化数据抽取研究［J］. 计算机学报， 2019， 42（12）：2754-2768.
LU Shuqi， DOU Zhicheng， WEN Jirong. Research on Structural Data Extraction in Surgical Cases［J］. Chinese Journal of Computers， 2019， 42（12）：2754-2768.
［6］NGUYEN M， LE D， LE L. Transformers-based Information Extraction with Limited Data for Domain-specific Business Documents［J］. Engineering Applications of Artificial Intelligence， 2021， 97：104100.
［7］WANG J， XU W， FU X， et al. ASTRAL：Adversarial Trained LSTM-CNN for Named Entity Recognition［J］. Knowledge-based Systems， 2020， 197：105842.
［8］CHO M， HA J， PARK C， et al. Combinatorial Feature Embedding Based on CNN and LSTM for Biomedical Named Entity Recognition［J］. Journal of Biomedical Informatics， 2020， 103：103381.
［9］DU C， HUANG L. Text Classification Research with Attention-based Recurrent Neural Networks［J］. International Journal of Computers Communications & Control， 2018， 13（1）：50-61.
［10］张靖宜，贺光辉，代洲，等. 融入BERT的企业年报命名实体识别方法［J］. 上海交通大学学报， 2021， 55（2）：117-123.
ZHANG Jingyi， HE Guanghui， DAI Zhou， et al. Named Entity Recognition of Enterprise Annual Report Integrated with BERT［J］. Journal of Shanghai Jiaotong University， 2021， 55（2）：117-123.
［11］JIA C， SHI Y， YANG Q， et al. Entity Enhanced BERT Pre-training for Chinese NER［C］∥Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing（EMNLP）. 2020：6384-6396.
［12］VASWANI A， SHAZZER N， PARMER N， et al. Attention Is All You Need［C］∥Proceedings of the 31st Conference on Neural Information Processing Systems. Long Beach， 2017：6000-6010.

[1]	ZHENG Jinde, CHEN Yan, TONG Jinyu, PAN Haiyang. RGCMvMRDE and Its Applications in Rolling Bearing Fault Diagnosis [J]. China Mechanical Engineering, 2023, 34(11): 1315-1325.
[2]	ZHAO Qiaoli, HOU Yuliang, LIU Zeyi, LI Cheng. Multi-scale Analysis of LVI and CAI Behaviors of Plain Woven Carbon-fiber-reinforced Composites [J]. China Mechanical Engineering, 2021, 32(14): 1732-1742.
[3]	LIU Zhiqiang, ZHAO Jie, WANG Kehuan, WU Yong, LYU Liangxing, LIU Gang, YUAN Shijian, . Research Progresses on Coupling Multi-scale Simulation of Deformation and Microstructure Evolution of Titanium Alloy in Hot Forming Processes [J]. China Mechanical Engineering, 2020, 31(22): 2678-2690,2698.
[4]	PENG Fangjin. A Robust Rail Surface Defect Detection Algorithm [J]. China Mechanical Engineering, 2019, 30(03): 266-270.
[5]	CHEN Tianyu, FENG Xuning, OUYANG Minggao, LU Languang. Model-based Multi-scale Thermal Safety Design of Traction Battery Systems [J]. China Mechanical Engineering, 2018, 29(15): 1840-1846,1874.
[6]	WANG Shenghuai1, 2;XU Fenghua1;CHEN Yurong1;XIE Tiebang2. A Kind of Multi-scale Integration Measurement System for Surface Textures [J]. China Mechanical Engineering, 2018, 29(06): 705-711,719.
[7]	LI Hongru1;YU He1;TIAN Zaike1;LI Baochen2. Degradation Trend Prediction of Rolling Bearings Based on Two-element Multiscale Entropy [J]. China Mechanical Engineering, 2017, 28(20): 2420-2425,2433.
[8]	Wang Guangbin, Du Xiaoyang, Luo Jun, . Multi-scale Laplace Feature Mapping for Rotor Fault Feature Extraction [J]. China Mechanical Engineering, 2016, 27(20): 2791-2797.
[9]	Zheng Jinde, Pan Haiyang, Qi Xiaoli, Pan Ziwei. Composite Hierarchical Fuzzy Entropy and Its Applications to Rolling Bearing Fault Diagnosis [J]. China Mechanical Engineering, 2016, 27(15): 2048-2055.
[10]	Meng Zong, Hu Meng, Gu Weiming, Zhao Dongfang. Rolling Bearing Fault Diagnosis Method Based on LMD Multi-scale Entropy and Probabilistic Neural Network [J]. China Mechanical Engineering, 2016, 27(04): 433-437.
[11]	Wang Yukui, Li Hongru, Ye Peng. Fault Identification of Hydraulic Pump Based on Multi-scale Permutation Entropy [J]. China Mechanical Engineering, 2015, 26(4): 518-523.
[12]	Yang Bin1;Liu Jibiao2;Cheng Junsheng1. Damage Identification Based on Multi-scale Transmissibility Function and Grey Moment Relative Entropy [J]. China Mechanical Engineering, 2015, 26(12): 1639-1644.
[13]	Zhang Weiwei, Song Xiaolin, Zhang Sanlin, Wu Xuncheng. Real-time Lane Recognition Method Based on Hardware-software Co-design [J]. China Mechanical Engineering, 2015, 26(10): 1337-1344.
[14]	Zhang Ying, Liu Zhansheng, Su Xianzhang. Multi-scale and Multi-structure Element Edge Detection of Parameter Images for Rotating Machinery [J]. China Mechanical Engineering, 2013, 24(23): 3176-3180.
[15]	Zheng Jinde;Cheng Junsheng;Yang Yu. Multi-scale Permutation Entropy and Its Applications to Rolling Bearing Fault Diagnosis [J]. China Mechanical Engineering, 2013, 24(19): 2641-2646.

Semantic Extraction Method of Multi-scale Nuclear Power Quality Text Fault Information

多尺度核电质量文本故障信息语义抽取方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles 0

Metrics