openWakeWord featurization graphs (mirror)

This repo mirrors the two ONNX feature-extractor models that openWakeWord uses as shared frontends for every wake-word DNN it trains:

File Size Purpose
melspectrogram.onnx 1.1 MB Converts 16 kHz int16 audio โ†’ 32-bin mel-spectrogram frames
embedding_model.onnx 1.3 MB Google speech_embedding/1 โ€” mel frames โ†’ 96-dim embeddings

These are NOT trained by Little Bear Labs โ€” they are upstream Apache-2.0 assets from the openWakeWord project, mirrored here so HuggingFace hf_hub_download callers (e.g. witness-wake) can pull them without pip / GitHub-release-API round trips.

When to use this

Use this mirror if you're:

  • Building a Rust wake-word runtime that uses ort + the openwakeword feature pipeline directly (no Python).
  • In an environment where the openwakeword pip package isn't available.
  • Wanting a fixed HuggingFace URL with hf_hub_download semantics.

Otherwise, prefer:

  • pip install openwakeword then python -c "import openwakeword; openwakeword.utils.download_models([])" โ€” populates <site-packages>/openwakeword/resources/models/.
  • The original openWakeWord GitHub release assets.

Pairing with a wake-word DNN

These two graphs are the shared frontend. Pair them with any openwakeword DNN, e.g.:

License

Apache-2.0 (inherited from openWakeWord + Google speech_embedding).

Citation

If you use this in research, cite openWakeWord:

@software{openwakeword,
  author = {David Scripka},
  title = {openWakeWord: A library for training open-source wake word models},
  year = {2024},
  url = {https://github.com/dscripka/openWakeWord}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support