Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
antflydb
/
clipclap
like
2
Follow
Antfly, Inc.
6
Feature Extraction
ONNX
GGUF
OpenSound/AudioCaps
onnxruntime
multimodal
clip
clap
audio
image
text
embeddings
antfly
antfly-inference
License:
mit
Model card
Files
Files and versions
xet
Community
3
Copy to bucket
new
main
clipclap
1.03 GB
Ctrl+K
Ctrl+K
3 contributors
History:
13 commits
timkaye
Rename Termite references to Antfly Inference
6d70d3c
verified
5 days ago
.gitattributes
2 kB
Add ClipClap Q4_K GGUF variant (#1)
about 1 month ago
README.md
5.52 kB
Rename Termite references to Antfly Inference
5 days ago
antfly_inference_variants.json
565 Bytes
Canonicalize Antfly inference variants manifest
14 days ago
audio_model.onnx
3.32 MB
xet
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
5 months ago
audio_model.onnx.data
277 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago
audio_projection.onnx
12.7 kB
xet
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
5 months ago
audio_projection.onnx.data
4.26 MB
xet
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
5 months ago
clip_config.json
411 Bytes
Update CLIPCLAP model with trained audio projection
5 months ago
clipclap-clap.Q4_K.gguf
41.3 MB
xet
Add ClipClap Q4_K GGUF variant (#1)
about 1 month ago
clipclap-clip.Q4_K.gguf
95.2 MB
xet
Add ClipClap Q4_K GGUF variant (#1)
about 1 month ago
model_manifest.json
72 Bytes
Mark ClipClap image and audio inputs (#3)
19 days ago
processor_config.json
581 Bytes
Update CLIPCLAP model with trained audio projection
5 months ago
projection_training_metadata.json
294 Bytes
Update CLIPCLAP model: contrastive loss training on AudioCaps audio embeddings
5 months ago
text_model.onnx
1.24 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago
text_model.onnx.data
253 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago
text_projection.onnx
339 Bytes
xet
Update CLIPCLAP model with trained audio projection
5 months ago
text_projection.onnx.data
1.05 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago
tokenizer.json
3.64 MB
Update CLIPCLAP model with trained audio projection
5 months ago
tokenizer_config.json
322 Bytes
Update CLIPCLAP model with trained audio projection
5 months ago
visual_model.onnx
1.14 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago
visual_model.onnx.data
350 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago
visual_projection.onnx
341 Bytes
xet
Update CLIPCLAP model with trained audio projection
5 months ago
visual_projection.onnx.data
1.57 MB
xet
Update CLIPCLAP model with trained audio projection
5 months ago