详细信息
文献类型:期刊文献
英文题名:Reconsidering Multi-Branch Aggregation for Semantic Segmentation
作者:Cai, Pengjie[1];Yang, Derong[1];Zou, Yonglin[1];Chen, Ruihan[1];Dai, Ming[1]
机构:[1]Guangdong Ocean Univ, Sch Math & Comp, Zhanjiang 524088, Peoples R China
年份:2023
卷号:12
期号:15
外文期刊名:ELECTRONICS
收录:SCI-EXPANDED(收录号:WOS:001045330300001)、、Scopus(收录号:2-s2.0-85167723350)、WOS
基金:This research was supported by a special grant from the Guangdong Provincial Science and Technology Innovation Strategy under Grant No. pdjh2022a0231, Guangdong Basic and Applied Basic Research Foundation under Grant No. 2023A1515011326, and program for scientific research start-up funds of Guangdong Ocean University under Grant No. 060302102101.
语种:英文
外文关键词:semantic segmentation; multi-branching; adaptive parameters; branch aggregation
外文摘要:For semantic segmentation tasks, there are problems in using multi-branch structures to enrich feature maps and aggregate different branches of feature maps in a certain network depth, such as the insufficient richness of feature maps and the incomplete aggregation of feature maps. Given multi-branch feature maps and branch aggregation, this study proposes a lightweight method, called multi-branch aggregation atrous spatial pyramidal pooling, by introducing an attention mechanism and CARAFE to enrich the features of the feature maps, giving the feature maps adaptive parameters, periodically adjusting the adaptive parameters, and aggregating the feature maps in both the vertical and horizontal directions. First, the atrous pyramid is retained and the attention mechanism and CARAFE are used to handle the pooling features to obtain 10 different feature maps. Secondly, giving each feature map a cascaded adaptive parameter and periodically adjusting the adaptive parameter to promote or suppress certain feature maps prevents the model from being at a local minimum or saddle point for long periods due to aleatory uncertainty. Finally, the feature maps are vertically aggregated, horizontally aggregated, and weighted for summation. This work demonstrates a competitive performance on the benchmark dataset, with an improvement of 1.88% in MPA, 0.66% in FWIoU, and 1.29% in MIoU compared to atrous spatial pyramid pooling. The benchmark datasets include PASCAL VOC 2012 and CIFAR-100.
参考文献:
正在载入数据...