[1]YIN Zezhong,GUO Maozu,TIAN Le.Neural radiance field optimization based on Fourier frequency domain truncation[J].CAAI Transactions on Intelligent Systems,2024,19(5):1319-1330.[doi:10.11992/tis.202401036]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 5
Page number:
1319-1330
Column:
吴文俊人工智能科学技术奖论坛
Public date:
2024-09-05
- Title:
-
Neural radiance field optimization based on Fourier frequency domain truncation
- Author(s):
-
YIN Zezhong1; 2; GUO Maozu1; 2; TIAN Le1; 2
-
1. School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China;
2. Research on Intelligent Processing Method of Building Big Data Beijing Key Laboratory, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
-
- Keywords:
-
neural radiance field; frequency domain truncation; Fourier transform; 3D reconstruction; sparse view; fine rendering; positional encoding; scene representation
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202401036
- Abstract:
-
Neural radiance field (NeRF), as a general method of scene expression, can better understand the three-dimensional (3D) world and create a more realistic sensory experience. However, in practical application, it is a common problem that less input images lead to poor reconstruction effect. Thus, a sparse-view neural radiance field (Sv-NeRF) is proposed in this study based on Fourier frequency domain truncation. The position coding mechanism of NeRF is optimized by truncating the input frequency in the frequency domain space and applying a regularization strategy to control the input of high-frequency signals. This method effectively reduces high-frequency noise and retains key details to improve the quality and stability of rendering. Compared with other methods, the proposed method improves the ability of the model to understand the scene significantly, and improves the rendering quality and detail preservation ability. It is especially suitable for scene reconstruction from the sparse input perspective.