| --- |
| base_model: |
| - stabilityai/stable-diffusion-3.5-medium |
| library_name: diffusers |
| pipeline_tag: text-to-image |
| --- |
| |
| # Model Card |
|
|
| ## Model Details |
|
|
| ### Model Description |
| This is a reproduced LoRA of SD3.5-Medium, post-trained with DiffusionNFT on multiple reward models, as presented in the paper [Diffusion Negative-aware FineTuning (DiffusionNFT)](https://huggingface.co/papers/2509.16117). |
|
|
| ### Paper Abstract |
| Online reinforcement learning (RL) has been central to post-training language |
| models, but its extension to diffusion models remains challenging due to |
| intractable likelihoods. Recent works discretize the reverse sampling process |
| to enable GRPO-style training, yet they inherit fundamental drawbacks, |
| including solver restrictions, forward-reverse inconsistency, and complicated |
| integration with classifier-free guidance (CFG). We introduce Diffusion |
| Negative-aware FineTuning (DiffusionNFT), a new online RL paradigm that |
| optimizes diffusion models directly on the forward process via flow matching. |
| DiffusionNFT contrasts positive and negative generations to define an implicit |
| policy improvement direction, naturally incorporating reinforcement signals |
| into the supervised learning objective. This formulation enables training with |
| arbitrary black-box solvers, eliminates the need for likelihood estimation, and |
| requires only clean images rather than sampling trajectories for policy |
| optimization. DiffusionNFT is up to 25times more efficient than FlowGRPO in |
| head-to-head comparisons, while being CFG-free. For instance, DiffusionNFT |
| improves the GenEval score from 0.24 to 0.98 within 1k steps, while FlowGRPO |
| achieves 0.95 with over 5k steps and additional CFG employment. By leveraging |
| multiple reward models, DiffusionNFT significantly boosts the performance of |
| SD3.5-Medium in every benchmark tested. |
|
|
| ### Model Sources |
|
|
| <!-- Provide the basic links for the model. --> |
|
|
| - **Repository:** https://github.com/NVlabs/DiffusionNFT |
| - **Paper:** https://huggingface.co/papers/2509.16117 |
| - **Project Page:** https://research.nvidia.com/labs/dir/DiffusionNFT |
|
|
| ## Uses |
|
|
| Please refer to the evaluation script in GitHub. |