cv-dbm-scheduler
Computer Visiondbim-codebaserigorous codebase
Description
Task: Time Scheduler for Diffusion Bridge Models (NFE=5)
Background
In diffusion bridge sampling, the time schedule controls the discretization step sizes. Currently, the uniform (linear) and karras schedules are the most widely used baselines in the field.
Objective
Design a novel time schedule optimized specifically for extremely low-step sampling (NFE = 5). Your goal is to achieve a better generation quality (lower FID) than the standard baselines.
⚠️ CRITICAL CONSTRAINT:
To maintain compatibility with our evaluation interface, your code MUST be written inside the function named get_sigmas_uniform. Please ignore the function name—do NOT implement a basic linear/uniform schedule. Use this exact function slot to implement your new, advanced mathematical curve.
Implementation
import torch
def get_sigmas_uniform(n, t_min, t_max, device="cpu"):
"""
Requirements:
1. Length: Must return a 1D PyTorch tensor of exactly length `n + 1`.
2. Monotonic: The sequence must strictly decrease from `t_max` to `t_min`.
3. Terminal Value: The final element (index `n`) must exactly equal `t_min`.
4. Device: Move the tensor to the requested `device`.
"""
# For this task, n will typically be 5 (NFE=5).
# Implement your novel schedule formulation here...
# Example return:
# return sigmas.to(device)Code
Results
| Model | Type | best fid edges2handbags ↓ | best fid Imagenet ↓ | best fid ↓ | fid ↓ | fid edges2handbags ↓ | fid Imagenet ↓ |
|---|---|---|---|---|---|---|---|
| cosine | baseline | 4.905 | 5.438 | - | - | - | - |
| karras | baseline | 4.416 | 12.025 | - | - | - | - |
| loglinear | baseline | 5.634 | 13.748 | - | - | - | - |
| uniform | baseline | 5.180 | 6.070 | - | - | - | - |
| claude-opus-4.6 | vanilla | 4.943 | 13.043 | 4.943 | 4.943 | 4.943 | 13.043 |
| deepseek-reasoner | vanilla | 5.634 | 13.748 | 5.634 | 5.634 | 5.634 | 13.748 |
| gemini-3.1-pro-preview | vanilla | 4.860 | 5.553 | 4.860 | 4.860 | 4.860 | 5.553 |
| qwen3.6-plus | vanilla | 4.339 | 8.159 | 4.339 | 4.339 | 4.339 | 8.159 |
| claude-opus-4.6 | agent | 4.937 | 13.043 | - | - | - | - |
| deepseek-reasoner | agent | 4.846 | 7.226 | - | - | - | - |
| gemini-3.1-pro-preview | agent | 4.370 | 7.313 | - | - | - | - |
| qwen3.6-plus | agent | 4.339 | 8.159 | - | - | - | - |