[1]周博文,熊伟丽.采用双层优选策略的主动学习算法及其应用[J].智能系统学报,2022,17(4):688-697.[doi:10.11992/tis.202106041]
ZHOU Bowen,XIONG Weili.Active learning algorithm and its application based on a two-tier optimization strategy[J].CAAI Transactions on Intelligent Systems,2022,17(4):688-697.[doi:10.11992/tis.202106041]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
17
期数:
2022年第4期
页码:
688-697
栏目:
学术论文—机器学习
出版日期:
2022-07-05
- Title:
-
Active learning algorithm and its application based on a two-tier optimization strategy
- 作者:
-
周博文1, 熊伟丽1,2
-
1. 江南大学 物联网工程学院,江苏 无锡 214122;
2. 江南大学 轻工过程先进控制教育部重点实验室,江苏 无锡 214122
- Author(s):
-
ZHOU Bowen1, XIONG Weili1,2
-
1. School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China;
2. Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University, Wuxi 214122, China
-
- 关键词:
-
主动学习; 双层优选; 不确定性; 分布信息; 评价指标; 冗余信息; 建模应用; 脱丁烷塔
- Keywords:
-
active learning; two-tier optimization; sample uncertainty; distribution information; evaluation indicator; redundant information; modeling application; debutanizer
- 分类号:
-
TP274
- DOI:
-
10.11992/tis.202106041
- 摘要:
-
针对工业生产过程中有标签样本少而人工标记代价高的问题,提出一种基于双层优选策略的主动学习算法。首先,建立不同预测模型对无标签样本的信息量进行评估;其次,充分考虑样本的分布信息,从样本的不确定性、差异性和代表性3个角度出发,提出新的评价指标,优选无标签样本,并去除冗余信息;最后,对双层优选的样本进行人工标记,重构有标签样本集后进行建模应用。通过脱丁烷塔的工业过程数据进行算法的应用仿真,验证了所提算法的有效性与性能。
- Abstract:
-
Aiming at the problem that the number of label samples is small and the cost of manual labeling is high in the industrial production process, an active learning algorithm based on a two-tier optimization strategy is proposed. First, establish different prediction models to evaluate the amount of information contained in unlabeled samples; secondly, fully consider the distribution information of the samples and, from the three perspectives of sample uncertainty, difference, and representativeness, propose new evaluation indicators, preferably unlabeled samples, and remove redundant information; finally, the double-layered preferred samples are manually labeled, and the labeled sample set is reconstructed for modeling application. The application simulation of the algorithm through the industrial process data of the debutanizer verifies the effectiveness and performance of the proposed algorithm.
更新日期/Last Update:
1900-01-01