F2D-Net

Factorized Stochastic Transport for Composite Degradation Image Restoration

Xin Su1,2,3,4, Jianshu Chao1,2,3*, Huifang Shen1,2,3, Anqi Chen1,2,3,5, Yuting Gao1,2,3,6, Jianya Yuan1,2,3

1Quanzhou Institute of Equipment Manufacturing, Haixi Institutes, Chinese Academy of Sciences
2Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences
3Fujian College, University of Chinese Academy of Sciences    4Fuzhou University
5College of Mechanical and Electrical Engineering, Fujian Agriculture and Forestry University
6College of Computer and Cyber Security, Fujian Normal University
*Corresponding author

Paper Code

Abstract

Composite image degradations, where rain, haze, low-light, and snow overlap spatially within a single frame, pose a fundamental challenge for unified restoration models. We identify two failure modes on such inputs. First, deterministic flow transport averages over feasible solutions and collapses to over-smoothed predictions. Second, image-level expert routing yields a selectivity ratio of only 1.2×, with near-uniform weights that leave experts under-specialized on localized distortions. Motivated by these observations, we propose F2D-Net, a factorized stochastic transport framework. The model factorizes the restoration velocity field into a shared backbone and degradation-specific expert increments, and a learned spatial gating assembles them at every pixel. The core mechanism is a state-dependent multiplicative noise whose magnitude scales with the local residual. The noise remains large far from the clean target to encourage diverse reconstructions, and contracts to zero at convergence to preserve fine detail. The closed-form log-normal transition collapses sampling to one network call, matching feed-forward speed. On CDD-11, F2D-Net attains the best structural and perceptual quality across all complexity tiers with substantially higher expert selectivity than image-level routing. It also remains competitive on standard three-task and five-task all-in-one benchmarks, and generalizes to real-world weather degradations on WeatherBench and RTTS.

Method

F2D-Net addresses two failure modes of all-in-one restorers on composite inputs: deterministic flow transport collapses to the conditional mean, and image-level expert routing leaves experts under-specialized.

F2D-Net Framework

Figure 1.  Overview of F2D-Net. The restoration velocity field is factorized into a shared backbone and four degradation-specific expert increments, assembled at every pixel via learned spatial gating maps and time-conditioned coefficients.

Stochastic transport with state-dependent noise. We formulate restoration as a forward-only stochastic transport process from the degraded observation to the clean target. The diffusion coefficient is scaled by the residual to the target, so the noise is large far from the solution and contracts to zero upon convergence:

dxt = θt(μxt) dt + σt diag(xtμ) dwt

This SDE admits a closed-form log-normal transition, which enables direct sampling of intermediate states during training and reduces inference to a single network evaluation through analytic transitions.

Factorized velocity field. Vector additivity of flow matching motivates decomposing the restoration velocity into a shared backbone and degradation-specific expert increments, assembled at every pixel through spatial intensity maps mi and time-conditioned gating weights αi:

fφ(xt, t, y) = fshare(xt, t, y) + Σi αi(t, w) · mi ⊙ Δfi(xt, t)

A lightweight CNN parser produces both the spatial maps mi and the global degradation weights w in one forward pass; the time-conditioned gate then schedules each expert across the transport trajectory. The shared backbone absorbs cross-degradation interactions while each Δfi specializes on a single atomic flow.

Visual Results

Drag the slider to compare degraded inputs with F2D-Net restored outputs on CDD-11.

Snow (S)

Restored Degraded
Degraded Restored

Low-light + Haze (L+H)

Restored Degraded
Degraded Restored

Haze + Snow (H+S)

Restored Degraded
Degraded Restored

Low-light + Haze + Rain (L+H+R)

Restored Degraded
Degraded Restored

Method Comparisons on CDD-11

Snow comparison: Input / AirNet / PromptIR / MoCE-IR / Ours / GT
Low-light+Haze comparison
Haze+Snow comparison
Low-light+Haze+Rain comparison

Real-World Generalization

Trained on synthetic CDD-11, F2D-Net transfers to real captures from WeatherBench (paired rain, snow, haze) without retraining for any single task.

Dehaze Per-pixel error map

WeatherBench dehaze qualitative comparison

F2D-Net retains tighter error distribution and preserves edge structure under real haze.

Derain Per-pixel error map

WeatherBench derain qualitative comparison

Largest single-task gain on WeatherBench; rain streaks are removed without smearing background texture.

Desnow Per-pixel error map

WeatherBench desnow qualitative comparison

Spatially adaptive expert gating localizes snow particles while leaving clean regions untouched.

Per-pixel error uses the inferno colormap; brighter pixels denote larger error against the paired ground truth. F2D-Net is the same single checkpoint across all three tasks.