Robust principal component analysis (RPCA) decomposes an observation matrix into low-rank background and sparse object components. This capability has enabled its application in tasks ranging from image restoration to segmentation. However, traditional RPCA models suffer from computational burdens caused by matrix operations, reliance on finely tuned hyperparameters, and rigid priors that limit adaptability in dynamic scenarios. To address these limitations, we propose RPCANet++, a sparse object segmentation framework that fuses the interpretability of RPCA with efficient deep architectures. Our approach unfolds a relaxed RPCA model into a structured network comprising a Background Approximation Module (BAM), an Object Extraction Module (OEM), and an Image Restoration Module (IRM). To mitigate inter-stage transmission loss in the BAM, we introduce a Memory-Augmented Module (MAM) to enhance background feature preservation, while a Deep Contrast Prior Module (DCPM) leverages saliency cues to expedite object extraction. Extensive experiments on diverse datasets demonstrate that RPCANet++ achieves state-of-the-art performance under various imaging scenarios. We further improve interpretability via visual and numerical low-rankness and sparsity measurements. By combining the theoretical strengths of RPCA with the efficiency of deep networks, our approach sets a new baseline for reliable and interpretable sparse object segmentation.
In segmentation-oriented RPCA tasks, our objective is to estimate the low-rank background \(\mathbf{B}\in \mathbb{R}^{m \times n}\) and extract the sparse object matrix \(\mathbf{O}\in \mathbb{R}^{m \times n}\). For an image \(\mathbf{D}\in \mathbb{R}^{m \times n}\), we cast the segmentation model as the following optimization problem: \begin{equation} \min \limits_{\mathbf{B},\mathbf{O}} \operatorname{rank}(\mathbf{B}) + \lambda \left\| \mathbf{O} \right\|_0 \quad \mathrm{s.t.}~\mathbf{D} = \mathbf{B} + \mathbf{O} \enspace, \label{eq_RPCA} \end{equation} where \(\lambda\) is a trade-off coefficient and \({\left\| \cdot \right\|_0}\) denotes the \(\ell_0\)-norm, i.e., the number of non-zero elements in a matrix. In complex scenarios, however, the background can exhibit varying degrees of complexity, so a solitary nuclear norm or rank function is insufficient to capture the practical constraints. Similarly, the sparsity of object elements can vary, so relying exclusively on the \(\ell_0\)- or \(\ell_1\)-norm may be inadequate. We therefore adopt a more general formulation, employing \(\mathcal{R}(\mathbf{B})\) and \(\mathcal{S}(\mathbf{O})\) as constraints that encode prior knowledge of the background and object images, respectively: \begin{equation} \min \limits_{\mathbf{B},\mathbf{O}} \mathcal{R}(\mathbf{B}) + \lambda \mathcal{S}(\mathbf{O}) \quad \mathrm{s.t.}~\mathbf{D} = \mathbf{B} + \mathbf{O} \enspace. \label{eq_relaxPCP} \end{equation}
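For intuition, consider the classical case where \(\mathcal{R}(\cdot)\) is the nuclear norm and \(\mathcal{S}(\cdot)\) is the \(\ell_1\)-norm: the resulting principal component pursuit problem admits the standard augmented-Lagrangian updates below (a sketch of the well-known ALM scheme, not necessarily the exact iteration our network unfolds): \begin{equation*} \begin{aligned} \mathbf{B}^{k+1} &= \mathcal{D}_{1/\mu}\big(\mathbf{D} - \mathbf{O}^{k} + \mu^{-1}\mathbf{Y}^{k}\big), \\ \mathbf{O}^{k+1} &= \mathcal{T}_{\lambda/\mu}\big(\mathbf{D} - \mathbf{B}^{k+1} + \mu^{-1}\mathbf{Y}^{k}\big), \\ \mathbf{Y}^{k+1} &= \mathbf{Y}^{k} + \mu\big(\mathbf{D} - \mathbf{B}^{k+1} - \mathbf{O}^{k+1}\big), \end{aligned} \end{equation*} where \(\mathcal{D}_{\tau}\) denotes singular value thresholding, \(\mathcal{T}_{\tau}(x) = \operatorname{sign}(x)\max(|x| - \tau, 0)\) is elementwise soft thresholding, \(\mathbf{Y}\) is the Lagrange multiplier, and \(\mu > 0\) is a penalty parameter.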
Iterative schemes of this kind motivate us to solve the above optimization problem iteratively and to unfold the resulting iteration into a deep network as follows:
The \(\textbf{RPCANet}^{++}\) framework unfolds the iterative, model-driven closed-form updates into a deep network comprising \(K\) corresponding stages. Transmissive elements are shown in different colors: \(\mathbf{D}\) for the restored image, \(\mathbf{B}\) for the low-rank background, \(\mathbf{O}\) for the sparse object matrix, \(\rho\) for the learnable parameter, and \([\mathcal{B}_h,\mathcal{B}_c]\) for the latent background features.
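A minimal PyTorch sketch of one unrolled stage is given below. The module internals (plain convolutional stacks standing in for the learned proximal mappings), channel sizes, and warm-start choices are illustrative assumptions, not the released implementation:

```python
import torch
import torch.nn as nn


class ConvBlock(nn.Sequential):
    """Small convolutional stack standing in for a learned proximal mapping."""

    def __init__(self, in_ch, out_ch, hidden=32):
        super().__init__(
            nn.Conv2d(in_ch, hidden, 3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(hidden, out_ch, 3, padding=1),
        )


class UnfoldedStage(nn.Module):
    """One stage k of the unrolled network: BAM -> OEM -> IRM.

    Between stages it passes D (restored image), B (low-rank background),
    O (sparse objects), and the latent background features mem = [B_h, B_c];
    rho is the stage's learnable step size.
    """

    def __init__(self, feat_ch=32):
        super().__init__()
        self.rho = nn.Parameter(torch.tensor(1.0))               # learnable parameter rho
        self.bam = ConvBlock(2 + 2 * feat_ch, 1 + 2 * feat_ch)   # background approximation
        self.oem = ConvBlock(2, 1)                               # object extraction
        self.irm = ConvBlock(2, 1)                               # image restoration

    def forward(self, D, B, O, mem):
        # BAM: refine the background and its memory features [B_h, B_c]
        x = self.bam(torch.cat([B, D - O, mem], dim=1))
        B, mem = x[:, :1], x[:, 1:]
        # OEM: learned proximal step on the sparse component, scaled by rho
        O = O + self.rho * self.oem(torch.cat([D - B, O], dim=1))
        # IRM: re-synthesize the observation from the two components
        D = self.irm(torch.cat([B, O], dim=1))
        return D, B, O, mem


# A K-stage network chains stages, warm-starting with B = D and O = 0.
K, feat_ch = 6, 32
stages = nn.ModuleList(UnfoldedStage(feat_ch) for _ in range(K))
D = torch.rand(1, 1, 64, 64)                   # observed image
B, O = D.clone(), torch.zeros_like(D)
mem = torch.zeros(1, 2 * feat_ch, 64, 64)      # latent features [B_h, B_c]
for stage in stages:
    D, B, O, mem = stage(D, B, O, mem)
```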
In the context of deep unfolding, the network is designed to iteratively yield intermediate results that mirror the algorithm's unrolled stages. Demonstrating the outcome of each stage is therefore vital for model validation, as sketched below.
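Continuing the illustrative stage sketch above, the intermediate \(\mathbf{O}^{k}\) can be collected during the forward pass and rendered as heatmaps (hypothetical plotting code; the figure further below was produced with the authors' own tooling):

```python
import torch
import matplotlib.pyplot as plt

# Fresh warm start, using the same conventions and `stages` as the sketch above.
D_k = torch.rand(1, 1, 64, 64)
B_k, O_k = D_k.clone(), torch.zeros_like(D_k)
mem_k = torch.zeros(1, 64, 64, 64)  # 2 * feat_ch = 64 latent channels

outputs = []
with torch.no_grad():
    for stage in stages:
        D_k, B_k, O_k, mem_k = stage(D_k, B_k, O_k, mem_k)
        outputs.append(O_k)  # keep the stage-k sparse output O^k

fig, axes = plt.subplots(1, len(outputs), figsize=(2.5 * len(outputs), 2.5))
for k, (ax, o) in enumerate(zip(axes, outputs), start=1):
    ax.imshow(o[0, 0].numpy(), cmap="jet")  # heatmap of the stage-k objects
    ax.set_title(f"O^{k}")
    ax.axis("off")
plt.show()
```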
Performance metrics, including IoU (%), F1 (%), \(P_d\) (%), \(F_a\) (\(\times 10^{-5}\)), and runtime, are evaluated for various methods on the NUDT-SIRST, IRSTD-1K, SIRST, and SIRST-AUG datasets. Parameter counts for the data-driven approaches are given in the second column (see the main manuscript for details).
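For reference, here is a minimal sketch of how these metrics are commonly computed in the IRSTD literature; the overlap-based matching rule for \(P_d\) is a simplifying assumption, and the paper's exact evaluation protocol may differ:

```python
import numpy as np
from scipy import ndimage


def pixel_metrics(pred, gt):
    """Pixel-level IoU (%) and F1 (%) for binary masks."""
    tp = np.logical_and(pred, gt).sum()
    fp = np.logical_and(pred, ~gt).sum()
    fn = np.logical_and(~pred, gt).sum()
    iou = tp / (tp + fp + fn + 1e-10)
    f1 = 2 * tp / (2 * tp + fp + fn + 1e-10)
    return 100.0 * iou, 100.0 * f1


def target_metrics(pred, gt):
    """Target-level Pd (%) and Fa (x1e-5): a ground-truth target counts as
    detected if any predicted pixel overlaps it; Fa is false pixels / all pixels."""
    labels, num_targets = ndimage.label(gt)
    hits = sum(pred[labels == i].any() for i in range(1, num_targets + 1))
    pd = 100.0 * hits / max(num_targets, 1)
    fa = np.logical_and(pred, ~gt).sum() / pred.size
    return pd, fa * 1e5


# Toy usage on random masks.
pred = np.random.rand(256, 256) > 0.999
gt = np.zeros((256, 256), dtype=bool)
gt[100:104, 100:104] = True
print(pixel_metrics(pred, gt), target_metrics(pred, gt))
```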
Refer to the arXiv paper linked above for more details on the qualitative, quantitative, and ablation studies.
Heatmaps of the stage-wise outputs \(\mathbf{O}^{k}\) produced by our RPCANet++ on various scenes from the IRSTD task.
@misc{wu2025rpcanetpp,
  author        = {Wu, Fengyi and Dai, Yimian and Zhang, Tianfang and Ding, Yixuan and Yang, Jian and Peng, Zhenming},
  title         = {RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation},
  year          = {2025},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CV}
}
@InProceedings{Wu_2024_WACV,
  author    = {Wu, Fengyi and Zhang, Tianfang and Li, Lei and Huang, Yian and Chen, Mingming and Peng, Zhenming},
  title     = {RPCANet: Deep Unfolding RPCA Based Infrared Small Target Detection},
  booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
  month     = {January},
  year      = {2024},
  pages     = {4809--4818}
}