The overall display of PFSNet
Recently, we have witnessed the great progress of salient object detection (SOD), which benefits from the effectiveness of various feature aggregation strategies. However, existing methods usually aggregate the low-level features containing details and the high-level features containing semantics over a large span, which introduces noise into the aggregated features and generates inaccurate saliency maps. In this paper, we propose a pyramidal feature shrinking network (PFSNet), which aims to aggregate adjacent feature nodes in pairs with layer-by-layer shrinkage, so that the aggregated features fuse effective details and semantics and discard interference information. Specifically, a pyramidal shrinking decoder (PSD) is proposed to aggregate adjacent features hierarchically in an asymptotic manner. Unlike other methods that aggregate features with significantly different information, this method only focuses on adjacent feature nodes in each layer and shrinks them to a final unique feature node. Besides, we propose an adjacent fusion module (AFM) to perform mutual spatial enhancement between the adjacent features to dynamically weight the features and adaptively fuse the appropriate information. Besides, a scale-aware enrichment module (SEM) based on the features extracted from the backbone is utilized to obtain rich scale information and generate diverse initial features with dilated convolutions. Extensive quantitative and qualitative experiments demonstrate that the proposed intuitive framework outperforms 14 state-of-the-art approaches on 5 public datasets.
BibTex Citation
@article{Ma_Xia_Li_2021,
title={Pyramidal Feature Shrinking for Salient Object Detection},
volume={35},
url={https://ojs.aaai.org/index.php/AAAI/article/view/16331},
number={3},
journal={Proceedings of the AAAI Conference on Artificial Intelligence},
author={Ma, Mingcan and Xia, Changqun and Li, Jia},
year={2021},
month={May},
pages={2311-2318}
}