Distribution-Aware Data Expansion with Diffusion Models
A data expansion framework that creates diverse, distribution-consistent samples, yielding significant accuracy improvements across image datasets.
DistDiff is a framework designed to address the persistent challenge of data scarcity. It combines hierarchical clustering with multi-step energy guidance to expand training data accurately and at scale. Unlike prior methods, which augment datasets with predefined perturbations, DistDiff refines the denoising process within the diffusion sampling mechanism itself, which leads to notably better optimization outcomes.
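To make the idea of guiding the denoising process concrete, here is a minimal toy sketch in numpy. The source does not specify the exact update rule, so the function names (`energy_grad`, `guided_step`), the squared-distance energy, the DDIM-style update, and the guidance scale are all illustrative assumptions, not the paper's actual method:

```python
import numpy as np

def energy_grad(x0_pred, prototypes):
    """Gradient of a toy energy E(x) = min_k ||x - p_k||^2 (assumed form):
    the gradient pulls x toward its nearest prototype."""
    dists = np.linalg.norm(prototypes - x0_pred, axis=1)
    nearest = prototypes[np.argmin(dists)]
    return 2.0 * (x0_pred - nearest)

def guided_step(x_t, x0_pred, alpha_bar_t, alpha_bar_prev,
                prototypes, scale=0.1):
    """One deterministic DDIM-style step where energy guidance is applied
    to the predicted clean sample x0_pred (a sketch, not the real method)."""
    # Nudge the predicted clean point toward the reference distribution.
    x0_guided = x0_pred - scale * energy_grad(x0_pred, prototypes)
    # Recover the implied noise from the guided clean estimate,
    # then take the standard DDIM update to the previous timestep.
    eps = (x_t - np.sqrt(alpha_bar_t) * x0_guided) / np.sqrt(1.0 - alpha_bar_t)
    return np.sqrt(alpha_bar_prev) * x0_guided + np.sqrt(1.0 - alpha_bar_prev) * eps
```

The key design point mirrored here is that the correction acts on the *predicted clean sample* inside the sampling loop, rather than perturbing the finished output.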
Central to DistDiff is a novel energy function that approximates the real data distribution using predicted clean data points. Hierarchical prototypes serve as anchor points for this energy guidance, and predictions are optimized across multiple sampling stages for finer control. A key step is determining the number of group-level prototypes required to faithfully reflect the real data distribution.
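One plausible way to build such hierarchical prototypes is a two-level scheme: a class-level prototype (the class mean) plus several group-level prototypes obtained by clustering within each class. The sketch below assumes k-means for the group level and a `groups_per_class` knob for the prototype count; both choices are illustrative, since the source does not pin down the clustering details:

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Plain Lloyd's k-means on rows of X; returns k cluster centers."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # Assign each point to its nearest center, then recompute means.
        labels = np.argmin(((X[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return centers

def hierarchical_prototypes(features, labels, groups_per_class=3):
    """Per class: one class-level prototype (mean) and up to
    groups_per_class group-level prototypes (within-class k-means)."""
    protos = {}
    for c in np.unique(labels):
        Xc = features[labels == c]
        k = min(groups_per_class, len(Xc))
        protos[c] = {"class": Xc.mean(axis=0), "group": kmeans(Xc, k)}
    return protos
```

In this toy setup, `groups_per_class` plays the role of the group-level prototype count the text highlights: too few prototypes smooth over intra-class structure, while too many overfit individual samples.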