[1]DI Jian,LIU Junhua,CAO Jingang.An improved HiNT text retrieval model using BERT and coverage mechanism[J].CAAI Transactions on Intelligent Systems,2024,19(3):719-727.[doi:10.11992/tis.202201020]
                                
                                Copy
                                
                                
                             
                            
                                
                                    CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
                                    19
                                    Number of periods:
                                    2024 3
                                    Page number:
                                    719-727
                                    Column:
                                    学术论文—自然语言处理与理解
                                    Public date:
                                    2024-05-05
                                
                                
                                    - Title:
- 
                                        An improved HiNT text retrieval model using BERT and coverage mechanism
                                    - Author(s):
- 
                                        DI Jian1; 2;  LIU Junhua1; 2;  CAO Jingang1; 2
- 
                                        1. School of Control and Computer Engineering, North China Electric Power University, Baoding 071003, China;
 2. Engineering Research Center of Intelligent Computing for Complex Energy Systems, Ministry of Education, Baoding 071003, China
- 
                                
                                    - Keywords:
- 
                                        bidirectional encoder representations from transformers; hierarchical neural matching model; coverage mechanism; text retrieval; semantic representation; feature extraction; natural language processing; similarity; multigranularity
                                    - CLC:
- 
                                        TP311
                                    - DOI:
- 
                                        10.11992/tis.202201020
                                    - Abstract:
- 
                                        To effectively improve the accuracy of text semantic retrieval, an improved hierarchical neural matching model is proposed, which can solve the problems of text ambiguity and polysemy when using text retrieval models to measure the relevance of queries and documents. The model first extracts key subject words from each segment of the document and then encodes them into multiple dense semantic vectors using the BERT model. Afterward, the local matching layer introduced with the coverage mechanism is used for processing so that the model can calculate the correlation according to the local segment-level granularity and the global document-level granularity of the document and improve the retrieval accuracy. The proposed model is compared with multiple retrieval models on the MS MARCO and webtext2019zh datasets, and the optimal results obtained verify the effectiveness of the proposed model.