[1]WU Nannan,GUO Zehao,ZHAO Yiming,et al.Name disambiguation method based on hyperbolic space feature fusion[J].CAAI Transactions on Intelligent Systems,2024,19(1):79-88.[doi:10.11992/tis.202209029]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 1
Page number:
79-88
Column:
学术论文—机器感知与模式识别
Public date:
2024-01-05
- Title:
-
Name disambiguation method based on hyperbolic space feature fusion
- Author(s):
-
WU Nannan1; GUO Zehao1; ZHAO Yiming1; ZHEN Zixu1; WANG Wenjun1; LIU Yan2
-
1. College of Intelligence and Computing, Tianjin University, Tianjin 300354, China;
2. School of Computer Science and Technology, Anhui University, Hefei 230039, China
-
- Keywords:
-
name disambiguation; Euclidean space; hyperbolic space; network alignment; network representation learning; graph embedding; feature fusion; anchor link prediction
- CLC:
-
TP39
- DOI:
-
10.11992/tis.202209029
- Abstract:
-
In view of the challenge of name duplication and the increasingly serious influence of name ambiguity in traditional user influence analysis and other research, the impact of name ambiguity is becoming increasingly serious. This paper proposes a network alignment model – geometry interaction network alignment (GINA) based on the fusion of hyperbolic space and Euclidean space features, fusing multiple spatial features. It effectively establishes a model to show the main function of a network structure for name disambiguation. The fundamental idea of this paper is to simultaneously utilize both Euclidean space and hyperbolic space for network representation learning, aiming to capture network structural information with distinct spatial characteristics. It employs cross-space network mapping and cross-space feature fusion to realize information exchange among different spaces and final network representation under the situations of reducing loss of spatial mapping as far as possible, implements network alignment and further name disambiguation. By performing network alignment based on the obtained representations, the paper accomplishes name disambiguation. On real datasets, the Chinese paper co-authorship network, English paper co-authorship network, and the Chinese patent co-authorship network are aligned in pair to construct the "Paper-Patent" empirical data network alignment dataset and the "Chinese-English" empirical data network alignment dataset to carry out the test demonstration of GINA model in two empirical scenarios for the identity recognition of the individuals with the same name and Chinese & foreign papers. The results show that the precision in the hyperbolic space combined with the Euclidean space is at least 24.9% higher than that in a single space, confirming effectiveness of the GINA method.