add README
Browse files
README.md
ADDED
|
@@ -0,0 +1,55 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
tags:
|
| 4 |
+
- openwakeword
|
| 5 |
+
- wake-word
|
| 6 |
+
- keyword-spotting
|
| 7 |
+
- feature-extraction
|
| 8 |
+
- onnx
|
| 9 |
+
library_name: openwakeword
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# openWakeWord featurization graphs (mirror)
|
| 13 |
+
|
| 14 |
+
This repo mirrors the two ONNX feature-extractor models that [openWakeWord](https://github.com/dscripka/openWakeWord) uses as shared frontends for every wake-word DNN it trains:
|
| 15 |
+
|
| 16 |
+
| File | Size | Purpose |
|
| 17 |
+
|---|---|---|
|
| 18 |
+
| `melspectrogram.onnx` | 1.1 MB | Converts 16 kHz int16 audio → 32-bin mel-spectrogram frames |
|
| 19 |
+
| `embedding_model.onnx` | 1.3 MB | Google [`speech_embedding/1`](https://tfhub.dev/google/speech_embedding/1) — mel frames → 96-dim embeddings |
|
| 20 |
+
|
| 21 |
+
These are NOT trained by Little Bear Labs — they are upstream Apache-2.0 assets from the openWakeWord project, mirrored here so HuggingFace `hf_hub_download` callers (e.g. [`witness-wake`](https://github.com/gpu-cli/witness)) can pull them without pip / GitHub-release-API round trips.
|
| 22 |
+
|
| 23 |
+
## When to use this
|
| 24 |
+
|
| 25 |
+
Use this mirror if you're:
|
| 26 |
+
- Building a Rust wake-word runtime that uses `ort` + the openwakeword feature pipeline directly (no Python).
|
| 27 |
+
- In an environment where the openwakeword pip package isn't available.
|
| 28 |
+
- Wanting a fixed HuggingFace URL with `hf_hub_download` semantics.
|
| 29 |
+
|
| 30 |
+
Otherwise, prefer:
|
| 31 |
+
- `pip install openwakeword` then `python -c "import openwakeword; openwakeword.utils.download_models([])"` — populates `<site-packages>/openwakeword/resources/models/`.
|
| 32 |
+
- The original [openWakeWord GitHub release](https://github.com/dscripka/openWakeWord/releases) assets.
|
| 33 |
+
|
| 34 |
+
## Pairing with a wake-word DNN
|
| 35 |
+
|
| 36 |
+
These two graphs are the shared frontend. Pair them with any openwakeword DNN, e.g.:
|
| 37 |
+
- [`littlebearlabs/hey-virgil-wake-word`](https://huggingface.co/littlebearlabs/hey-virgil-wake-word) — Little Bear Labs's "hey virgil" detector.
|
| 38 |
+
- Any community-trained wake-word `.onnx` from the openwakeword ecosystem.
|
| 39 |
+
|
| 40 |
+
## License
|
| 41 |
+
|
| 42 |
+
Apache-2.0 (inherited from openWakeWord + Google speech_embedding).
|
| 43 |
+
|
| 44 |
+
## Citation
|
| 45 |
+
|
| 46 |
+
If you use this in research, cite openWakeWord:
|
| 47 |
+
|
| 48 |
+
```bibtex
|
| 49 |
+
@software{openwakeword,
|
| 50 |
+
author = {David Scripka},
|
| 51 |
+
title = {openWakeWord: A library for training open-source wake word models},
|
| 52 |
+
year = {2024},
|
| 53 |
+
url = {https://github.com/dscripka/openWakeWord}
|
| 54 |
+
}
|
| 55 |
+
```
|