<-上一篇/Previous Article 下一篇/Next Article->

[1]熊铁妞,邱吉芳,胡建.基于深度学习技术的古彝文字图像搜集与整理方法[J].智能系统学报,2025,20(4):928-935.[doi:10.11992/tis.202406036]
　XIONG Tieniu,QIU Jifang,HU Jian.Collection and sorting method of ancient Yi character images based on deep learning technology[J].CAAI Transactions on Intelligent Systems,2025,20(4):928-935.[doi:10.11992/tis.202406036]

点击复制

基于深度学习技术的古彝文字图像搜集与整理方法

PDF下载 HTML

《智能系统学报》[ISSN 1673-4785/CN 23-1538/TP] 卷: 20 期数: 2025年第4期页码: 928-935 栏目: 学术论文—机器学习出版日期: 2025-08-05

Title:: Collection and sorting method of ancient Yi character images based on deep learning technology

作者:: 熊铁妞^1,2, 邱吉芳³, 胡建^1,2; 1. 西南民族大学计算机系统国家民委重点实验室, 四川成都 610225;
2. 西南民族大学计算机与人工智能学院, 四川成都 610225;
3. 西南民族大学中国语言文学学院, 四川成都 610225

Author(s):: XIONG Tieniu^1,2, QIU Jifang³, HU Jian^1,2; 1. The Key Laboratory for Computer Systems of State Ethnic Affairs Commission, Southwest Minzu University, Chengdu 610225, China;
2. College of Computer Science and Artificial Intelligence, Southwest Minzu University, Chengdu 610225, China;
3. School of Chinese Language and Literature, Southwest Minzu University, Chengdu 610225, China

关键词:: 深度学习; 古彝文字; 古籍; 图像处理; 相似度匹配; 特征提取; 目标检测; 数字化

Keywords:: deep learning; ancient Yi characters; ancient literatures; image processing; similarity matching; feature extraction; object detection; digitalization

分类号:: TP391.4; TP391.1

DOI:: 10.11992/tis.202406036

文献标志码:: 2025-2-21

摘要:: 古彝文字是中华文化的重要载体之一，但人工搜集、整理大量古彝文字耗时耗力，而且能辨识古彝文字的人已非常稀缺且越来越少，这使得整理工作变得更为困难。对此，本文提出一种基于深度学习技术的古彝文字图像搜集与整理的新思路。在古彝文字图像搜集方面，通过目标检测模型得到每个古彝文字在彝文古籍图像中的位置，据此在彝文古籍图像中截取出古彝文字图像，实现古彝文字搜集。在古彝文图像整理方面，首先根据规范彝文来源于古彝文的事实，采用规范彝文字体文件自动生成彝文字图像用于构建数据集，并将数据集应用于训练古彝文字图像特征算法，这有效回避了目前因古彝文字数量庞大、异体字众多、整理尚未完成，而尚无古彝文字图像数据集的问题；然后，通过匹配所搜集的古彝文字图像的特征与现已收录的古彝文字图像的特征的相似性，判断所搜集的古彝文字图像是否已被收录，从而整理出未收录的古彝文字图像。实验在多种典型的特征提取算法和相似性计算方式下进行，实验结果验证了方法的有效性。

Abstract:: The ancient Yi script is one of the important carriers of Chinese culture. However, manually collecting and organizing a large amount of ancient Yi script is time-consuming and labor-intensive. Additionally, very few people can recognize ancient Yi script, and their numbers are dwindling, which makes the task even more difficult. In response to this, this paper proposes a new approach to collecting and organizing images of the ancient Yi script based on deep learning technology. For image collection, the object detection model is used to locate each ancient Yi character in the images of ancient Yi manuscripts, and the characters are extracted from these images accordingly. For image organization, because modern standardized Yi characters are derived from ancient Yi characters, standardized Yi character font files are used to generate images of the Yi characters automatically to construct a dataset. This dataset is then used to train an algorithm for extracting features of ancient Yi script images, which effectively addresses the current lack of an ancient Yi script image dataset due to the large number of characters, many variants, and incomplete organization. Subsequently, matching the features of the collected ancient Yi script images with those of already cataloged images enables determining whether the collected images have been previously recorded and thereby organizing uncatalogued ancient Yi script images. Experiments conducted with various typical feature extraction algorithms and similarity computation methods validate the effectiveness of this approach.

参考文献/References:: [1] 孔祥卿. 彝文的源流[M]. 北京: 民族出版社, 2005.
[2] 韩旭. 彝文古籍字符检测和识别的研究与实现[D]. 重庆: 西南大学, 2020.
HAN Xu. Research and implementation of character detection and recognition in Yi ancient books[D]. Chongqing: Southwest University, 2020.
[3] 贾晓栋. 基于深度学习的手写彝文识别技术应用研究[D]. 北京: 中央民族大学, 2017.
JIA Xiaodong. Research on the application of handwritten Yi recognition technology based on deep learning[D]. Beijing: Central University for Nationalities, 2017.
[4] 胡峰, 李路正, 代劲, 等. 结合聚类边界采样的主动学习[J]. 智能系统学报, 2024, 19(2): 482-492.
HU Feng, LI Luzheng, DAI Jin, et al. Active learning combined with clustering boundary sampling[J]. CAAI transactions on intelligent systems, 2024, 19(2): 482-492.
[5] 王定旺. 彝文联机手写体识别的研究与应用[D]. 重庆: 西南大学, 2021.
WANG Dingwang. Research and application of online handwriting recognition in Yi language[D]. Chongqing: Southwest University, 2021.
[6] WANG Chongguang, EVANS K, HARTLEY D, et al. A systematic review of artificial neural network techniques for analysis of foot plantar pressure[J]. Biocybernetics and biomedical engineering, 2024, 44(1): 197-208.
[7] 陈善雄, 韩旭, 林小渝, 等. 基于MSER和CNN的彝文古籍文献的字符检测方法[J]. 华南理工大学学报(自然科学版), 2020, 48(6): 123-133.
CHEN Shanxiong, HAN Xu, LIN Xiaoyu, et al. MSER and CNN-based method for character detection in ancient Yi books[J]. Journal of South China University of Technology (natural science edition), 2020, 48(6): 123-133.
[8] CHEN Huizhong, TSAI S S, SCHROTH G, et al. Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions[C]//2011 18th IEEE International Conference on Image Processing. Brussels: IEEE, 2011: 2609-2612.
[9] 陈善雄, 王小龙, 韩旭, 等. 一种基于深度学习的古彝文识别方法[J]. 浙江大学学报(理学版), 2019, 46(3): 261-269.
CHEN Shanxiong, WANG Xiaolong, HAN Xu, et al. A recognition method of ancient Yi character based on deep learning[J]. Journal of Zhejiang University (science edition), 2019, 46(3): 261-269.
[10] CICHOCKI A, CRUCES S, AMARI S I. Generalized alpha-beta divergences and their application to robust nonnegative matrix factorization[J]. Entropy, 2011, 13(1): 134-170.
[11] 贵州省彝学研究会. 西南彝志[M]. 贵阳: 贵州民族出版社, 2015.
[12] 《彝文典籍集成》编委会. 彝文典籍集成·四川卷·教育[M]. 成都: 四川民族出版社, 2014.
[13] 滇川黔桂彝文协作组编. 滇川黔桂彝文字集[M]. 昆明: 云南民族出版社, 2004.
[14] JIANG Peiyuan, ERGU Daji, LIU Fangyao, et al. A review of yolo algorithm developments[J]. Procedia computer science, 2022, 199: 1066-1073.
[15] 沙马拉毅. 《规范彝文方案》推行30年实践效果述评[J]. 西南民族大学学报(人文社科版), 2010, 31(8): 28-31.
SHA M. A review of the practice effect of standardizing Yi language program for 30 years[J]. Journal of Southwest University for Nationalities (humanities and social science edition), 2010, 31(8): 28-31.
[16] REN Shaoqing, HE Kaiming, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(6): 1137-1149.
[17] FARHADI A, REDMON J. Yolov3: An incremental improvement[C]//Computer vision and pattern recognition. Berlin: Springer, 2018, 1804: 1-6.
[18] JANI M, FAYYAD J, AL-YOUNES Y, et al. Model compression methods for YOLOv5: a review[EB/OL]. (2023-07-21)[2024-06-21]. https://arxiv.org/abs/2307.11904v1.
[19] WANG Ao, CHEN Hui, LIU Lihao, et al. YOLOv10: real-time end-to-end object detection[EB/OL]. (2024-05-23)[2024-06-21]. https://arxiv.org/abs/2405.14458v2.
[20] TIAN Zhi, SHEN Chunhua, CHEN Hao, et al. FCOS: fully convolutional one-stage object detection[EB/OL]. (2019-04-02)[2024-06-21]. https://arxiv.org/abs/1904.01355v5.
[21] ZHANG T Y, SUEN C Y. A fast parallel algorithm for thinning digital patterns[J]. Communications of the acm, 1984, 27(3): 236-239.
[22] YADAV S, SAWALE M D. A review on image classification using deep learning[J]. World journal of advanced research and reviews, 2023, 17(1): 480-482.
[23] ZHAO Xia, WANG Limin, ZHANG Yufei, et al. A review of convolutional neural networks in computer vision[J]. Artificial intelligence review, 2024, 57(4): 99.
[24] SRIVASTAVA D, WADHVANI R, GYANCHANDANI M, et al. A review: color feature extraction methods for content based image retrieval[J]. International journal of computational engineering & management, 2015, 18(3): 9-13.
[25] DENG Jia, DONG Wei, SOCHER R, et al. ImageNet: a large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami: IEEE, 2009: 248-255.
[26] 曾婷, 唐孝, 谭阳, 等. 相似度三支决策模糊粗糙集模型的决策代价研究[J]. 智能系统学报, 2020, 15(6): 1068-1078.
ZENG Ting, TANG Xiao, TAN Yang, et al. Decision costs of the similarity three-way decision-theoretic fuzzy rough set model[J]. CAAI transactions on intelligent systems, 2020, 15(6): 1068-1078.
[27] 卞则康, 王士同. 基于混合距离学习的鲁棒的模糊C均值聚类算法[J]. 智能系统学报, 2017, 12(4): 450-458.
BIAN Zekang, WANG Shitong. Robust FCM clustering algorithm based on hybrid-distance learning[J]. CAAI transactions on intelligent systems, 2017, 12(4): 450-458.
[28] LEI Shiye, TAO Dacheng. A comprehensive survey of dataset distillation[J]. IEEE transactions on pattern analysis and machine intelligence, 2024, 46(1): 17-32.

相似文献/References:: [1]张媛媛,霍静,杨婉琪,等.深度信念网络的二代身份证异构人脸核实算法[J].智能系统学报,2015,10(2):193.[doi:10.3969/j.issn.1673-4785.201405060]
　ZHANG Yuanyuan,HUO Jing,YANG Wanqi,et al.A deep belief network-based heterogeneous face verification method for the second-generation identity card[J].CAAI Transactions on Intelligent Systems,2015,10():193.[doi:10.3969/j.issn.1673-4785.201405060]
[2]丁科,谭营.GPU通用计算及其在计算智能领域的应用[J].智能系统学报,2015,10(1):1.[doi:10.3969/j.issn.1673-4785.201403072]
　DING Ke,TAN Ying.A review on general purpose computing on GPUs and its applications in computational intelligence[J].CAAI Transactions on Intelligent Systems,2015,10():1.[doi:10.3969/j.issn.1673-4785.201403072]
[3]马晓,张番栋,封举富.基于深度学习特征的稀疏表示的人脸识别方法[J].智能系统学报,2016,11(3):279.[doi:10.11992/tis.201603026]
　MA Xiao,ZHANG Fandong,FENG Jufu.Sparse representation via deep learning features based face recognition method[J].CAAI Transactions on Intelligent Systems,2016,11():279.[doi:10.11992/tis.201603026]
[4]刘帅师,程曦,郭文燕,等.深度学习方法研究新进展[J].智能系统学报,2016,11(5):567.[doi:10.11992/tis.201511028]
　LIU Shuaishi,CHENG Xi,GUO Wenyan,et al.Progress report on new research in deep learning[J].CAAI Transactions on Intelligent Systems,2016,11():567.[doi:10.11992/tis.201511028]
[5]马世龙,乌尼日其其格,李小平.大数据与深度学习综述[J].智能系统学报,2016,11(6):728.[doi:10.11992/tis.201611021]
　MA Shilong,WUNIRI Qiqige,LI Xiaoping.Deep learning with big data: state of the art and development[J].CAAI Transactions on Intelligent Systems,2016,11():728.[doi:10.11992/tis.201611021]
[6]王亚杰,邱虹坤,吴燕燕,等.计算机博弈的研究与发展[J].智能系统学报,2016,11(6):788.[doi:10.11992/tis.201609006]
　WANG Yajie,QIU Hongkun,WU Yanyan,et al.Research and development of computer games[J].CAAI Transactions on Intelligent Systems,2016,11():788.[doi:10.11992/tis.201609006]
[7]黄心汉.A3I:21世纪科技之光[J].智能系统学报,2016,11(6):835.[doi:10.11992/tis.201605022]
　HUANG Xinhan.A3I: the star of science and technology for the 21st century[J].CAAI Transactions on Intelligent Systems,2016,11():835.[doi:10.11992/tis.201605022]
[8]宋婉茹,赵晴晴,陈昌红,等.行人重识别研究综述[J].智能系统学报,2017,12(6):770.[doi:10.11992/tis.201706084]
　SONG Wanru,ZHAO Qingqing,CHEN Changhong,et al.Survey on pedestrian re-identification research[J].CAAI Transactions on Intelligent Systems,2017,12():770.[doi:10.11992/tis.201706084]
[9]杨梦铎,栾咏红,刘文军,等.基于自编码器的特征迁移算法[J].智能系统学报,2017,12(6):894.[doi:10.11992/tis.201706037]
　YANG Mengduo,LUAN Yonghong,LIU Wenjun,et al.Feature transfer algorithm based on an auto-encoder[J].CAAI Transactions on Intelligent Systems,2017,12():894.[doi:10.11992/tis.201706037]
[10]王科俊,赵彦东,邢向磊.深度学习在无人驾驶汽车领域应用的研究进展[J].智能系统学报,2018,13(1):55.[doi:10.11992/tis.201609029]
　WANG Kejun,ZHAO Yandong,XING Xianglei.Deep learning in driverless vehicles[J].CAAI Transactions on Intelligent Systems,2018,13():55.[doi:10.11992/tis.201609029]

备注/Memo

收稿日期:2024-6-21。
基金项目:国家社会科学基金重大招标项目(19ZDA284)；西南民族大学中华民族共同体研究院团队项目(2024GTT-TD17)；西南民族大学中央高校基本科研业务费专项基金项目(ZYN2023009).
作者简介:熊铁妞，硕士研究生，主要研究方向为深度学习、图像处理、古彝文字数字化。E-mail：xiongtieniu@stu.swun.edu.cn。;邱吉芳，本科生，主要学习方向彝语语言学、彝语方言学。E-mail：18384496920@163.com。;胡建，教授，博士，主要研究方向为计算机视觉、群体智能、文献数字化。E-mail：hujian@swun.edu.cn。
通讯作者:胡建. E-mail：hujian@swun.edu.cn

更新日期/Last Update: 1900-01-01

基于深度学习技术的古彝文字图像搜集与整理方法 PDF下载HTML

备注/Memo

基于深度学习技术的古彝文字图像搜集与整理方法

PDF下载 HTML