[1]ZHAO Wenqing,XU Lijiao,CHEN Haoyang,et al.Blind image quality assessment based on multi-level feature fusion and semantic enhancement[J].CAAI Transactions on Intelligent Systems,2024,19(1):132-141.[doi:10.11992/tis.202301007]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 1
Page number:
132-141
Column:
学术论文—智能系统
Public date:
2024-01-05
- Title:
-
Blind image quality assessment based on multi-level feature fusion and semantic enhancement
- Author(s):
-
ZHAO Wenqing1; 2; XU Lijiao1; CHEN Haoyang1; LI Mengwei1
-
1. School of Control and Computer Engineering, North China Electric Power University, Baoding 071003, China;
2. Engineering Research Center of the Ministry of Education for Intelligent Computing of Complex Energy System Department, Baoding 071003, China
-
- Keywords:
-
deep learning; image quality; convolution neural network; feature extraction; channel attention structure; multi-level feature fusion; dilated convolution; triplet loss function
- CLC:
-
TP391.41
- DOI:
-
10.11992/tis.202301007
- Abstract:
-
Aiming at the low performance of the existing blind image quality assessment algorithm when facing the real distorted images, the paper proposes a new no-reference image quality assessment algorithm, namely multi-level feature fusion and semantic enhancement for NR (MFFSE-NR), which combines multi-level feature fusion and semantic information enhancement. The local and global distortion features of an image are extracted, then a feature fusion module is used to fuse the features in layers. The multi-layer dilated convolution is employed to enhance semantic information and further direct the mapping process from distorted image to quality fraction. Finally, a novel loss function called Lmix is created by combining the triplet ranking loss function and the L1 loss function, taking account of the relative ranking relationship between the predicted score and the subjective score. Validation and comparison experiments carried out on LIVEC dataset show that both the SROCC and PLCC index are improved respectively by 2.3% than the original algorithm; cross-dataset validation on the KonIQ-10k dataset and LIVEC dataset confirm that the proposed algorithm has good generalization ability when dealing with the real distorted images.