| --- |
| license: apache-2.0 |
| library_name: diffusers |
| pipeline_tag: unconditional-image-generation |
| base_model: shallowdream204/BitDance-ImageNet |
| language: |
| - en |
| tags: |
| - bitdance |
| - imagenet |
| - class-conditional |
| - custom-pipeline |
| - diffusers |
| --- |
| |
| # BitDance-ImageNet (Diffusers) |
|
|
| Diffusers-compatible BitDance ImageNet checkpoints for class-conditional generation at `256x256`. |
|
|
| ## Available Subfolders |
|
|
| - `BitDance_B_1x` (`parallel_num=1`) |
| - `BitDance_B_4x` (`parallel_num=4`) |
| - `BitDance_B_16x` (`parallel_num=16`) |
| - `BitDance_L_1x` (`parallel_num=1`) |
| - `BitDance_H_1x` (`parallel_num=1`) |
|
|
| All variants include a custom `BitDanceImageNetPipeline` and support ImageNet class IDs (`0-999`). |
|
|
| ## Requirements |
|
|
| - `flash-attn` is required for model execution and sampling. |
| - Install it in your environment before loading the pipeline. |
|
|
| ## Quickstart (native diffusers) |
|
|
| ```python |
| import torch |
| from diffusers import DiffusionPipeline |
| |
| repo_id = "BiliSakura/BitDance-ImageNet-diffusers" |
| subfolder = "BitDance_B_1x" # or BitDance_B_4x, BitDance_B_16x, BitDance_L_1x, BitDance_H_1x |
| |
| pipe = DiffusionPipeline.from_pretrained( |
| repo_id, |
| subfolder=subfolder, |
| trust_remote_code=True, |
| torch_dtype=torch.float16, |
| ).to("cuda") |
| |
| # ImageNet class 207 = golden retriever |
| out = pipe( |
| class_labels=207, |
| num_images_per_label=1, |
| sample_steps=100, |
| cfg_scale=4.6, |
| ) |
| out.images[0].save("bitdance_imagenet.png") |
| ``` |
|
|
| ## Local Path Note |
|
|
| When loading from a local clone, do not point `from_pretrained` to the repo root unless you also provide `subfolder=...`. |
| Each variant folder contains its own `model_index.json`, so the most reliable local usage is to load the variant directory directly: |
|
|
| ```python |
| from diffusers import DiffusionPipeline |
| |
| pipe = DiffusionPipeline.from_pretrained( |
| "/path/to/BitDance-ImageNet-diffusers/BitDance_B_1x", |
| trust_remote_code=True, |
| ) |
| ``` |
|
|
| ## Model Metadata |
|
|
| - Pipeline class: `BitDanceImageNetPipeline` |
| - Diffusers version in configs: `0.36.0` |
| - Resolution: `256x256` |
| - Number of classes: `1000` |
| - Autoencoder class: `BitDanceImageNetAutoencoder` |
|
|
| ## Citation |
|
|
| If you use this model, please cite BitDance and Diffusers: |
|
|
| ```bibtex |
| @article{ai2026bitdance, |
| title = {BitDance: Scaling Autoregressive Generative Models with Binary Tokens}, |
| author = {Ai, Yuang and Han, Jiaming and Zhuang, Shaobin and Hu, Xuefeng and Yang, Ziyan and Yang, Zhenheng and Huang, Huaibo and Yue, Xiangyu and Chen, Hao}, |
| journal = {arXiv preprint arXiv:2602.14041}, |
| year = {2026} |
| } |
| |
| @inproceedings{von-platen-etal-2022-diffusers, |
| title = {Diffusers: State-of-the-art diffusion models}, |
| author = {Patrick von Platen and Suraj Patil and Anton Lozhkov and Damar Jablonski and Hernan Bischof and Thomas Wolf}, |
| booktitle = {GitHub repository}, |
| year = {2022}, |
| url = {https://github.com/huggingface/diffusers} |
| } |
| ``` |
|
|
| ## License |
|
|
| This repository is distributed under the Apache-2.0 license, consistent with the upstream BitDance release. |
|
|