[1]HUANG Heyan,LIU Xiao.A survey on event extraction in new domains[J].CAAI Transactions on Intelligent Systems,2022,17(1):201-212.[doi:10.11992/tis.202109045]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
17
Number of periods:
2022 1
Page number:
201-212
Column:
人工智能院长论坛
Public date:
2022-01-05
- Title:
-
A survey on event extraction in new domains
- Author(s):
-
HUANG Heyan1; 2; 3; LIU Xiao1; 2; 3
-
1. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China;
2. Beijing Engineering Research Center of High-Volume Language Information Processing and Cloud Computing Applications, Beijing 100081, China;
3. Southeast Academy of Information Technology, Beijing Institute of Technology, Putian 351100, China
-
- Keywords:
-
event extraction; new domains; information extraction; event schema induction; collective extraction; event factuality prediction; natural language processing; knowledge base
- CLC:
-
TP391.4
- DOI:
-
10.11992/tis.202109045
- Abstract:
-
In the current Internet era, numerous unstructured text data in new domains often contain high-volume information. Studies on event extraction in new domains can accelerate building of domain knowledge bases, supporting downstream knowledge-based applications. However, the existing event extraction methods have substantial limitations of the domain. Building event extraction systems from scratch in new domains will heavily depend on the quality and scale of event schemas and annotated data, requiring a lot of human efforts and expertise. Moreover, it is common in the datasets that multiple associated event instances often appear in the same context, heavily hindering event extraction and factuality prediction. This paper summarizes the emerging research field of event extraction in new domains and investigates current research status from three directions: event schema induction, collective event extraction, and event factuality prediction. In addition, this paper discusses the existing difficulties and challengings and indicates the potential research work to be carried out in the future.