altndrr
/

cased

Image Classification

feature-extraction

Model card Files Files and versions

altndrr commited on Nov 21, 2023

Commit

e8c0181

·

1 Parent(s): f3261e5

Update README

Files changed (1) hide show

README.md +9 -10

README.md CHANGED Viewed

@@ -1,12 +1,13 @@
 ---
 pipeline_tag: image-classification
 tags:
-- vision
 inference: false
 widget:
-- src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/cat-dog-music.png
-  example_title: Cat & Dog
 ---
 # Category Search from External Databases (CaSED)
 Disclaimer: The model card is taken and modified from the official repository, which can be found [here](https://github.com/altndrr/vic). The paper can be found [here](https://arxiv.org/abs/2306.00917).
@@ -34,7 +35,7 @@ processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")
 # get the model outputs
 images = processor(images=[image], return_tensors="pt", padding=True)
-outputs = model(images, alpha=0.5)
 labels, scores = outputs["vocabularies"][0], outputs["scores"][0]
 # print the top 5 most likely labels for the image
@@ -47,18 +48,16 @@ for value, index in zip(values, indices):
 The model depends on some libraries you have to install manually before execution:
 ```bash
-pip install torch faiss-cpu flair inflect nltk transformers
 ```
 ## Citation
 ```latex
-@misc{conti2023vocabularyfree,
       title={Vocabulary-free Image Classification},
       author={Alessandro Conti and Enrico Fini and Massimiliano Mancini and Paolo Rota and Yiming Wang and Elisa Ricci},
       year={2023},
-      eprint={2306.00917},
-      archivePrefix={arXiv},
-      primaryClass={cs.CV}
 }
-```

 ---
 pipeline_tag: image-classification
 tags:
+  - vision
 inference: false
 widget:
+  - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/cat-dog-music.png
+    example_title: Cat & Dog
 ---
 # Category Search from External Databases (CaSED)
 Disclaimer: The model card is taken and modified from the official repository, which can be found [here](https://github.com/altndrr/vic). The paper can be found [here](https://arxiv.org/abs/2306.00917).
 # get the model outputs
 images = processor(images=[image], return_tensors="pt", padding=True)
+outputs = model(images, alpha=0.7)
 labels, scores = outputs["vocabularies"][0], outputs["scores"][0]
 # print the top 5 most likely labels for the image
 The model depends on some libraries you have to install manually before execution:
 ```bash
+pip install torch faiss-cpu flair inflect nltk pyarrow transformers
 ```
 ## Citation
 ```latex
+@article{conti2023vocabularyfree,
       title={Vocabulary-free Image Classification},
       author={Alessandro Conti and Enrico Fini and Massimiliano Mancini and Paolo Rota and Yiming Wang and Elisa Ricci},
       year={2023},
+      journal={NeurIPS},
 }
+```