[1]胡康,何思宇,左敏,等.基于CNN-BLSTM的化妆品违法违规行为分类模型[J].智能系统学报,2021,16(6):1151-1157.[doi:10.11992/tis.202104001]
HU Kang,HE Siyu,ZUO Min,et al.Classification model for judging illegal and irregular behavior for cosmetics based on CNN-BLSTM[J].CAAI Transactions on Intelligent Systems,2021,16(6):1151-1157.[doi:10.11992/tis.202104001]
点击复制
《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷:
16
期数:
2021年第6期
页码:
1151-1157
栏目:
吴文俊人工智能科学技术奖论坛
出版日期:
2021-11-05
- Title:
-
Classification model for judging illegal and irregular behavior for cosmetics based on CNN-BLSTM
- 作者:
-
胡康1, 何思宇2, 左敏2, 葛伟2
-
1. 中国食品药品检定研究院 信息中心, 北京 102629;
2. 北京工商大学 农产品质量安全追溯技术及应用国家工程实验室, 北京 100048
- Author(s):
-
HU Kang1, HE Siyu2, ZUO Min2, GE Wei2
-
1. Information Center, China National Institute for Food and Drug Control, Beijing 102629, China;
2. National Engineering Laboratory for Agri-product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
-
- 关键词:
-
化妆品; 双维度模型; 自然语言处理; 位置感知; 注意力机制; 卷积神经网络; 双向长短时记忆网络
- Keywords:
-
cosmetics; double-dimensional model; natural language processing; location awareness; attention mechanism; CNN; BLSTM
- 分类号:
-
TP391
- DOI:
-
10.11992/tis.202104001
- 摘要:
-
针对化妆品安全监管部门抽样检测所含违法违规行为自动识别且分类困难的问题,建立语义分类自动识别模型,辅助有关部门构建智能化管理体系,依靠数据实现科学决策及有效监管。本文分别使用中文词向量及字向量作为双路模型输入,采用CNN(convolutional neural network)网络模型训练字向量, BLSTM(bidirectional long short-term memory)网络模型训练词向量,并在BLSTM中引入位置注意力机制,构建基于CNN-BLSTM的字词双维度化妆品违法违规行为分类模型。在染发类化妆品抽样检测数据集上进行的对比实验结果表明,CNN-BLSTM 模型准确率比常用的几种深度神经网络模型均有明显提高,验证了其合理性和有效性。
- Abstract:
-
Aiming at the difficulties in the automatic recognition and classification of illegal and irregular behaviors of cosmetic supervision departments in sampling inspection, an automatic semantic classification model is established to assist relevant departments in building an intelligent management system so as to realize scientific decision-making and effective supervision with data. In this study, a Chinese word vector and character vector are used as two-way model input. The convolutional neural network (CNN) model is used to train the character vector, and the bidirectional long short-term memory (BLSTM) network model is used to train the word vector. Then, the positional attention mechanism is introduced to BLSTM to construct a character–word double-dimensional classification model for illegal and irregular behaviors in cosmetic sampling inspection based on CNN-BLSTM. The results of the comparative experiments on the sampling inspection dataset of hair dye cosmetics show that the accuracy of the CNN-BLSTM model is significantly higher than that of several commonly used deep neural network models, which verifies this model’s rationality and effectiveness.
备注/Memo
收稿日期:2021-04-01。
基金项目:国家自然科学基金项目(61873027);教育部人文社会科学研究青年基金项目(20YJCZH229);北京市自然科学基金项目(4202014);北京市教委科研计划项目(KM202010011011)
作者简介:胡康,高级工程师,主要研究方向为食品、药品、化妆品、医疗器械监管检验方面的人工智能和大数据研究。参与国家、省重点研发计划等项目;2017年获得吴文俊人工智能三等奖等。发表学术论文13篇;何思宇,硕士研究生,主要研究方向为自然语言处理、大数据;左敏,教授,博士生导师,主要研究方向为化妆品安全大数据、化妆品安全追溯。主持国家重点研发计划课题、承担中国工程院战略研究课题多项。2009年被评为“北京市中青年骨干教师”,2017年获得吴文俊人工智能三等奖。发表学术论文40余篇.
通讯作者:左敏.E-mail:zuomin12345@126.com
更新日期/Last Update:
2021-12-25