StreamAvatar AROD Student Checkpoint

This repository hosts the public StreamAvatar AROD real-anchor student checkpoint.

AROD stands for Autoregressive One-step Denoising. It is a blockwise student model distilled from a DyStream teacher for faster audio-to-motion inference in the StreamAvatar project.

Files

  • blockwise_latest.pt: AROD real-anchor student checkpoint.
  • config.yaml: sanitized inference/training configuration for the checkpoint.

Download

pip install huggingface-hub

mkdir -p outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k
huggingface-cli download pancx/StreamAvatar-AROD blockwise_latest.pt \
  --local-dir outputs/blockwise_stream_distill_cross_fm_teacher_cache_anchor_pretrain_60k

Expected SHA256:

01893fabb842fcc8e9817a8e2530108d75932aad4f6ac4136e5c22b94702e860

Project

Code and full setup instructions are available at:

https://github.com/CXP-2024/StreamAvatar

The checkpoint requires the StreamAvatar/DyStream codebase, the original DyStream teacher checkpoint, Wav2Vec2 assets, and the renderer checkpoint described in the project README.

Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support