TY - JOUR
T1 - Disentangled Capsule Routing for Fast Part-Object Relational Saliency
AU - Liu, Yi
AU - Zhang, Dingwen
AU - Liu, Nian
AU - Xu, Shoukun
AU - Han, Jungong
N1 - Publisher Copyright:
© 1992-2012 IEEE.
PY - 2022/10/25
Y1 - 2022/10/25
N2 - Recently, the Part-Object Relational (POR) saliency underpinned by the Capsule Network (CapsNet) has been demonstrated to be an effective modeling mechanism to improve the saliency detection accuracy. However, it is widely known that the current capsule routing operations have huge computational complexity, which seriously limited the usability of the POR saliency models in real-time applications. To this end, this paper takes an early step towards a fast POR saliency inference by proposing a novel disentangled part-object relational network. Concretely, we disentangle horizontal routing and vertical routing from the original omnidirectional capsule routing, thus generating Disentangled Capsule Routing (DCR). This mechanism enjoys two advantages. On one hand, DCR that disentangles orthogonal 1D (i.e., vertical and horizontal) routing greatly reduces parameters and routing complexity, resulting in much faster inference than omnidirectional 2D routing adopted by existing CapsNets. On the other hand, thanks to the light POR cues explored by DCR, we could conveniently integrate the part-object routing process to different feature layers in CNN, rather than just applying it to the small-scaled one as in previous works. This helps to increase saliency inference accuracy. Compared to previous POR saliency detectors, DPORTNet infers visual saliency (5 ∼ 9) × faster, and is more accurate. DPORTNet is available under the open-source license at https://github.com/liuyi1989/DCR.
AB - Recently, the Part-Object Relational (POR) saliency underpinned by the Capsule Network (CapsNet) has been demonstrated to be an effective modeling mechanism to improve the saliency detection accuracy. However, it is widely known that the current capsule routing operations have huge computational complexity, which seriously limited the usability of the POR saliency models in real-time applications. To this end, this paper takes an early step towards a fast POR saliency inference by proposing a novel disentangled part-object relational network. Concretely, we disentangle horizontal routing and vertical routing from the original omnidirectional capsule routing, thus generating Disentangled Capsule Routing (DCR). This mechanism enjoys two advantages. On one hand, DCR that disentangles orthogonal 1D (i.e., vertical and horizontal) routing greatly reduces parameters and routing complexity, resulting in much faster inference than omnidirectional 2D routing adopted by existing CapsNets. On the other hand, thanks to the light POR cues explored by DCR, we could conveniently integrate the part-object routing process to different feature layers in CNN, rather than just applying it to the small-scaled one as in previous works. This helps to increase saliency inference accuracy. Compared to previous POR saliency detectors, DPORTNet infers visual saliency (5 ∼ 9) × faster, and is more accurate. DPORTNet is available under the open-source license at https://github.com/liuyi1989/DCR.
KW - Salient object detection
KW - capsule network
KW - disentangled capsule routing
KW - multi-level information integration
KW - part-object relationship
UR - http://www.scopus.com/inward/record.url?scp=85141004172&partnerID=8YFLogxK
U2 - 10.1109/TIP.2022.3215887
DO - 10.1109/TIP.2022.3215887
M3 - Article
C2 - 36282823
SN - 1057-7149
VL - 31
SP - 6719
EP - 6732
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
ER -