Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization
BiDPO jointly optimizes image and text preferences with region-guided alignment to boost compositional fidelity in T2I generation.
12 days ago
BiDPO jointly optimizes image and text preferences with region-guided alignment to boost compositional fidelity in T2I generation.