F2D-Net: Factorized Stochastic Transport for Composite Degradation Image Restoration

Abstract

Composite image degradations, where rain, haze, low-light, and snow overlap spatially within a single frame, pose a fundamental challenge for unified restoration models. We identify two failure modes on such inputs. First, deterministic flow transport averages over feasible solutions and collapses to over-smoothed predictions. Second, image-level expert routing yields a selectivity ratio of only 1.2×, with near-uniform weights that leave experts under-specialized on localized distortions. Motivated by these observations, we propose F²D-Net, a factorized stochastic transport framework. The model factorizes the restoration velocity field into a shared backbone and degradation-specific expert increments, and a learned spatial gating assembles them at every pixel. The core mechanism is a state-dependent multiplicative noise whose magnitude scales with the local residual. The noise remains large far from the clean target to encourage diverse reconstructions, and contracts to zero at convergence to preserve fine detail. The closed-form log-normal transition collapses sampling to one network call, matching feed-forward speed. On CDD-11, F²D-Net attains the best structural and perceptual quality across all complexity tiers with substantially higher expert selectivity than image-level routing. It also remains competitive on standard three-task and five-task all-in-one benchmarks, and generalizes to real-world weather degradations on WeatherBench and RTTS.

Method

F²D-Net addresses two failure modes of all-in-one restorers on composite inputs: deterministic flow transport collapses to the conditional mean, and image-level expert routing leaves experts under-specialized.

Figure 1. Overview of F²D-Net. The restoration velocity field is factorized into a shared backbone and four degradation-specific expert increments, assembled at every pixel via learned spatial gating maps and time-conditioned coefficients.

Stochastic transport with state-dependent noise. We formulate restoration as a forward-only stochastic transport process from the degraded observation to the clean target. The diffusion coefficient is scaled by the residual to the target, so the noise is large far from the solution and contracts to zero upon convergence:

dx_t = θ_t(μ − x_t) dt + σ_t diag(x_t − μ) dw_t

This SDE admits a closed-form log-normal transition, which enables direct sampling of intermediate states during training and reduces inference to a single network evaluation through analytic transitions.

Factorized velocity field. Vector additivity of flow matching motivates decomposing the restoration velocity into a shared backbone and degradation-specific expert increments, assembled at every pixel through spatial intensity maps m_i and time-conditioned gating weights α_i:

f_φ(x_t, t, y) = f_share(x_t, t, y) + Σ_i α_i(t, w) · m_i ⊙ Δf_i(x_t, t)

A lightweight CNN parser produces both the spatial maps m_i and the global degradation weights w in one forward pass; the time-conditioned gate then schedules each expert across the transport trajectory. The shared backbone absorbs cross-degradation interactions while each Δf_i specializes on a single atomic flow.

Real-World Generalization

Trained on synthetic CDD-11, F²D-Net transfers to real captures from WeatherBench (paired rain, snow, haze) without retraining for any single task.

Dehaze Per-pixel error map

WeatherBench dehaze qualitative comparison

F²D-Net retains tighter error distribution and preserves edge structure under real haze.

Derain Per-pixel error map

WeatherBench derain qualitative comparison

Largest single-task gain on WeatherBench; rain streaks are removed without smearing background texture.

Desnow Per-pixel error map

WeatherBench desnow qualitative comparison

Spatially adaptive expert gating localizes snow particles while leaving clean regions untouched.

Per-pixel error uses the inferno colormap; brighter pixels denote larger error against the paired ground truth. F²D-Net is the same single checkpoint across all three tasks.

F²D-Net

Abstract

Method

Visual Results

Snow (S)

Low-light + Haze (L+H)

Haze + Snow (H+S)

Low-light + Haze + Rain (L+H+R)

Method Comparisons on CDD-11

Real-World Generalization

Dehaze Per-pixel error map

Derain Per-pixel error map

Desnow Per-pixel error map