Volume 25 Issue 1
Feb.  2025
Turn off MathJax
Article Contents
ZHU Yan-jie, WANG Yu-chen, XIONG Wen, CAI Chun-sheng. Few-shot model for extracting inspection report information based on bridge inspection domain-task transfer[J]. Journal of Traffic and Transportation Engineering, 2025, 25(1): 248-262. doi: 10.19818/j.cnki.1671-1637.2025.01.018
Citation: ZHU Yan-jie, WANG Yu-chen, XIONG Wen, CAI Chun-sheng. Few-shot model for extracting inspection report information based on bridge inspection domain-task transfer[J]. Journal of Traffic and Transportation Engineering, 2025, 25(1): 248-262. doi: 10.19818/j.cnki.1671-1637.2025.01.018

Few-shot model for extracting inspection report information based on bridge inspection domain-task transfer

doi: 10.19818/j.cnki.1671-1637.2025.01.018
Funds:

National Natural Science Foundation of China 52478147

National Natural Science Foundation of China 52378135

More Information
  • Corresponding author: XIONG Wen(1982-), male, professor, PhD, wxiong@seu.edu.cn
  • Received Date: 2024-01-30
  • Publish Date: 2025-02-25
  • To reduce the reliance on large-scale manually annotated samples for extracting key information of bridge inspection, a bridge inspection key information extraction model applicable to few-shot scenarios was proposed. The method comprised a bridge inspection-specific pre-trained language model, a bi-directional long short-term memory (BiLSTM) network, and a conditional random field (CRF). Through the utilization of bridge-related corpora and inspection task data for the domain pre-training and task fine-tuning, a two-stage transfer from domain knowledge to task-specific features was implemented, resulting in a pre-trained language model with high generalization for bridge-specific terminology and inspection report formats. Subsequently, the BiLSTM was employed to capture the contextual dependencies within the bridge inspection reports, while the CRF was combined to enforce structured constraints on the final extraction results. According to industry specifications and relevant research, eight key information categories commonly found in bridge inspection reports were redefined. To validate the effectiveness of the proposed approach, few-shot datasets containing only 50 and 100 sentences were utilized for training, respectively, with performance evaluated on a test set containing 1 491 sentences. Experimental results show that, when trained with 50 and 100 samples respectively, the F1 scores of the proposed model reach 0.860 7 and 0.820 2, respectively, significantly outperforming four mainstream models. This confirms the model's superior capability in accurately extracting key information from bridge inspection reports under few-shot conditions. Moreover, ablation experiments further reveal that the two-stage transfer learning strategy effectively facilitates the rapid extraction of domain-relevant and task-discriminative features from few-shot data, thereby substantially enhancing the model's overall performance in few-shot scenarios. The proposed bridge inspection information extraction method under few-shot scenarios can be used to construct a knowledge graph for evaluating bridge structural conditions and predicting future service life.

     

  • loading
  • [1]
    XIA Ye, LEI Xiao-ming, WANG Peng, et al. A data-driven approach for regional bridge condition assessment using inspection reports[J]. Structural Control and Health Monitoring, 2022, 29(4): e2915.
    [2]
    YANG Jian-xi, YANG Xiao-xia, LI Ren, et al. BERT and hierarchical cross attention-based question answering over bridge inspection knowledge graph[J]. Expert Systems with Applications, 2023, 233: 120896.
    [3]
    JIANG Ya-li, YANG Gang, LI Hai-jiang, et al. Knowledge driven approach for smart bridge maintenance using big data mining[J]. Automation in Construction, 2023, 146: 104673. doi: 10.1016/j.autcon.2022.104673
    [4]
    YANG Jian-xi, XIANG Fang-yue, LI Ren, et al. Intelligent bridge management via big data knowledge engineering[J]. Automation in Construction, 2022, 135: 104118.
    [5]
    KALE A, RICKS B, GANDHI R. New measure to understand and compare bridge conditions based on inspections time-series data[J]. Journal of Infrastructure Systems, 2021, 27(4): 04021037. doi: 10.1061/(ASCE)IS.1943-555X.0000633
    [6]
    WANG Y C, CAI C S, HAN B, et al. A deep learning-based approach for assessment of bridge condition through fusion of multi-type inspection data[J]. Engineering Applications of Artificial Intelligence, 2024, 128: 107468.
    [7]
    YANG Xiao-xia, YANG Jian-xi, LI Ren, et al. Complex knowledge base question answering for intelligent bridge management based on multi-task learning and cross-task constraints[J]. ENTROPY, 2022, 24(12): 1805.
    [8]
    LIU Heng, ZHANG Yun-feng. Bridge condition rating data modeling using deep learning algorithm[J]. Structure and Infrastructure Engineering, 2020, 16(10): 1447-1460.
    [9]
    LIU Kai-jian, EL-GOHARY N. Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports[J]. Automation in Construction, 2017, 81: 313-327. doi: 10.1016/j.autcon.2017.02.003
    [10]
    LI T S, ALIPOUR M, HARRIS D K. Context-aware sequence labeling for condition information extraction from historical bridge inspection reports[J]. Advanced Engineering Informatics, 2021, 49: 101333. doi: 10.1016/j.aei.2021.101333
    [11]
    LI Ren, MO Tian-jin, YANG Jian-xi, et al. Bridge inspection named entity recognition via bert and lexicon augmented machine reading comprehension neural model[J]. Advanced Engineering Informatics, 2021, 50: 101416. doi: 10.1016/j.aei.2021.101416
    [12]
    MOON S, CHUNG S, CHI S. Bridge damage recognition from inspection reports using NER based on recurrent neural network with active learning[J]. Journal of Performance of Constructed Facilities, 2020, 34(6): 04020119. doi: 10.1061/(ASCE)CF.1943-5509.0001530
    [13]
    WANG X Y, EL-GOHARY N. Deep learning-based named entity recognition and resolution of referential ambiguities for enhanced information extraction from construction safety regulations[J]. Journal of Computing in Civil Engineering, 2023, 37(5): 04023023. doi: 10.1061/(ASCE)CP.1943-5487.0001064
    [14]
    ZHANG R C, EL-GOHARY N. A deep neural network-based method for deep information extraction using transfer learning strategies to support automated compliance checking[J]. Automation in Construction, 2021, 132: 103834. doi: 10.1016/j.autcon.2021.103834
    [15]
    HE Shuan-hai, WANG An-hua, ZHU Zhao, et al. Research progress on intelligent detection technologies of highway bridges[J]. China Journal of Highway and Transport, 2021, 34 (12): 12-24.
    [16]
    MOON S, LEE G, CHI S. Automated system for construction specification review using natural language processing[J]. Advanced Engineering Informatics, 2022, 51: 101495. doi: 10.1016/j.aei.2021.101495
    [17]
    LI Chuan-jiang, LI Shao-bo, WANG Huan, et al. Attention-based deep meta-transfer learning for few-shot fine-grained fault diagnosis[J]. Knowledge-Based Systems, 2023, 264: 110345. doi: 10.1016/j.knosys.2023.110345
    [18]
    ZHANG Hu, GUO Jia-yu, WANG Yu-jie, et al. Judicial nested named entity recognition method with mrc framework[J]. International Journal of Cognitive Computing in Engineering, 2023, 4: 118-126. doi: 10.1016/j.ijcce.2023.03.002
    [19]
    MOIRANGTHEM D S, LEE M. Hierarchical and lateral multiple timescales gated recurrent units with pre-trained encoder for long text classification[J]. Expert Systems with Applications, 2021, 165: 113898. doi: 10.1016/j.eswa.2020.113898
    [20]
    HU Zhong-jian, YANG Peng, LI Bing, et al. Biomedical extractive question answering based on dynamic routing and answer voting[J]. Information Processing and Management, 2023, 60(4): 103367. doi: 10.1016/j.ipm.2023.103367
    [21]
    DEVLIN J, CHANG M W, LEE K, et al. BERT: pre-training of deep bidirectional transformers for language understanding[J]. arXiv, 2017, DOI: 10.48550/arXiv.1810.04805.
    [22]
    LIU K J, EL-GOHARY N. Semantic neural network ensemble for automated dependency relation extraction from bridge inspection reports[J]. Journal of Computing in Civil Engineering, 2021, 35(4): 04021007. doi: 10.1061/(ASCE)CP.1943-5487.0000961
    [23]
    LIU K J, EL-GOHARY N. Improved similarity assessment and spectral clustering for unsupervised linking of data extracted from bridge inspection reports[J]. Advanced Engineering Informatics, 2022, 51: 101496. doi: 10.1016/j.aei.2021.101496
    [24]
    LAO Wu-lüe, CUI Chuang, ZHANG Deng-ke, et al. Crack identification method of orthotropic steel deck based on computer vision[J]. China Journal of Highway and Transport, 2023, 36 (3): 188-201. doi: 10.3969/j.issn.1001-7372.2023.03.016
    [25]
    JING Qiang, ZHENG Shun-chao, LIANG Peng, et al. Technologies and engineering practices of intelligent operation and maintenance of Hong Kong-Zhuhai-Macao Bridge[J]. China Journal of Highway and Transport, 2023, 36(6): 143-156.
    [26]
    VO S N, VO T T, LE B. Interpretable extractive text summarization with meta-learning and bi-lstm: a study of meta learning and explainability techniques[J]. Expert Systems with Applications, 2024, 245: 123045.
    [27]
    LI Dai-yi, YAN Li, YANG Jian-zhong, et al. Dependency syntax guided BERT-BiLSTM-GAM-CRF for chinese NER[J]. Expert Systems with Applications, 2022, 196: 116682.
    [28]
    ZHOU Shang-lian, SONG Wei. Deep learning-based roadway crack classification using laser-scanned range images: a comparative study on hyperparameter selection[J]. Automation in Construction, 2020, 114: 103171.
    [29]
    LAMBRECHTS G, DE GEETER F, VECOVEN N, et al. Warming up recurrent neural networks to maximise reachable multistability greatly improves learning[J]. Neural Networks, 2023, 166: 645-669.
    [30]
    LIU Ya-fei, WEI Si-qi, HUANG Hai-jun, et al. Naming entity recognition of citrus pests and diseases based on the BERT-BiLSTM-CRF model[J]. Expert Systems with Applications, 2023, 234: 121103.

Catalog

    Article Metrics

    Article views (17) PDF downloads(5) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return