lightsofapollo committed on
Commit bc5387a · verified · 1 Parent(s): 991452f

add README

Files changed (1)
  1. README.md +55 -0
README.md ADDED
@@ -0,0 +1,55 @@
+ ---
+ license: apache-2.0
+ tags:
+ - openwakeword
+ - wake-word
+ - keyword-spotting
+ - feature-extraction
+ - onnx
+ library_name: openwakeword
+ ---
+
+ # openWakeWord featurization graphs (mirror)
+
+ This repo mirrors the two ONNX feature-extractor models that [openWakeWord](https://github.com/dscripka/openWakeWord) uses as shared frontends for every wake-word DNN it trains:
+
+ | File | Size | Purpose |
+ |---|---|---|
+ | `melspectrogram.onnx` | 1.1 MB | Converts 16 kHz int16 audio → 32-bin mel-spectrogram frames |
+ | `embedding_model.onnx` | 1.3 MB | Google [`speech_embedding/1`](https://tfhub.dev/google/speech_embedding/1): mel frames → 96-dim embeddings |
+
+ These models were not trained by Little Bear Labs. They are upstream Apache-2.0 assets from the openWakeWord project, mirrored here so that callers of Hugging Face's `hf_hub_download` (e.g. [`witness-wake`](https://github.com/gpu-cli/witness)) can fetch them without installing the pip package or querying the GitHub release API.
+
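As a rough illustration of the `hf_hub_download` path, the sketch below fetches both files from a mirror. The repo id is a hypothetical placeholder, because this README does not state the mirror's own id, and `fetch_frontend` is an illustrative helper name rather than part of any library.

```python
from pathlib import Path

# Hypothetical placeholder: replace with the actual id of this mirror repo.
MIRROR_REPO_ID = "your-org/your-mirror-repo"

# File names taken from the table in this README.
FRONTEND_FILES = ("melspectrogram.onnx", "embedding_model.onnx")


def fetch_frontend(repo_id=MIRROR_REPO_ID, revision=None):
    """Download both frontend graphs and return {filename: local Path}."""
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    return {
        name: Path(hf_hub_download(repo_id=repo_id, filename=name, revision=revision))
        for name in FRONTEND_FILES
    }
```

Passing a commit hash as `revision` pins the download to one snapshot, which is one way to get the fixed-URL behaviour this mirror is meant to provide.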
+ ## When to use this
+
+ Use this mirror if you:
+ - Are building a Rust wake-word runtime that uses `ort` and the openwakeword feature pipeline directly (no Python).
+ - Work in an environment where the openwakeword pip package isn't available.
+ - Want a fixed Hugging Face URL with `hf_hub_download` semantics.
+
+ Otherwise, prefer:
+ - `pip install openwakeword`, then `python -c "import openwakeword; openwakeword.utils.download_models([])"`, which populates `<site-packages>/openwakeword/resources/models/`.
+ - The original [openWakeWord GitHub release](https://github.com/dscripka/openWakeWord/releases) assets.
+
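If the pip route was used, a quick way to confirm that the two frontend files landed in the package's `resources/models/` directory is sketched below. The helper names are hypothetical; the directory layout and file names are the ones given in this README.

```python
import importlib.util
from pathlib import Path

# File names taken from the table in this README.
FRONTEND_FILES = ("melspectrogram.onnx", "embedding_model.onnx")


def expected_model_paths(package_dir):
    """Map each frontend file name to its expected path inside the package."""
    models_dir = Path(package_dir) / "resources" / "models"
    return {name: models_dir / name for name in FRONTEND_FILES}


def find_installed_models(package="openwakeword"):
    """Return the frontend files that actually exist in the installed package.

    Returns an empty dict if the package is not installed or has no files yet.
    """
    spec = importlib.util.find_spec(package)
    if spec is None or spec.origin is None:
        return {}
    package_dir = Path(spec.origin).parent
    return {
        name: path
        for name, path in expected_model_paths(package_dir).items()
        if path.is_file()
    }


if __name__ == "__main__":
    found = find_installed_models()
    for name in FRONTEND_FILES:
        print(name, "->", found.get(name, "not found"))
```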
+ ## Pairing with a wake-word DNN
+
+ These two graphs are the shared frontend. Pair them with any openwakeword DNN, e.g.:
+ - [`littlebearlabs/hey-virgil-wake-word`](https://huggingface.co/littlebearlabs/hey-virgil-wake-word): Little Bear Labs' "hey virgil" detector.
+ - Any community-trained wake-word `.onnx` from the openwakeword ecosystem.
+
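When wiring the frontend to a wake-word DNN, it is worth inspecting each graph's declared inputs and outputs instead of assuming shapes. The sketch below does that with `onnxruntime`; `format_io` and `describe_graph` are hypothetical helper names, and the wake-word file name in the usage note is a placeholder, not a file this repo ships.

```python
def format_io(label, inputs, outputs):
    """Render (name, shape) pairs for one graph as a single readable line."""

    def render(pairs):
        return ", ".join(f"{name}{list(shape)}" for name, shape in pairs)

    return f"{label}: {render(inputs)} -> {render(outputs)}"


def describe_graph(path):
    """Load an ONNX file and summarise its declared input and output tensors."""
    # Lazy import: onnxruntime is only needed when a real model is inspected.
    import onnxruntime as ort

    session = ort.InferenceSession(str(path), providers=["CPUExecutionProvider"])
    inputs = [(arg.name, arg.shape) for arg in session.get_inputs()]
    outputs = [(arg.name, arg.shape) for arg in session.get_outputs()]
    return format_io(str(path), inputs, outputs)
```

Calling `describe_graph` on `melspectrogram.onnx`, `embedding_model.onnx`, and a wake-word model (for example a placeholder `wake_word.onnx`) in that order shows whether each stage's output shape is compatible with the next stage's input.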
+ ## License
+
+ Apache-2.0 (inherited from openWakeWord and Google's speech_embedding).
+
+ ## Citation
+
+ If you use this in research, cite openWakeWord:
+
+ ```bibtex
+ @software{openwakeword,
+   author = {David Scripka},
+   title = {openWakeWord: A library for training open-source wake word models},
+   year = {2024},
+   url = {https://github.com/dscripka/openWakeWord}
+ }
+ ```