bharat-raghunathan/indian-foods-dataset
Viewer β’ Updated β’ 4.77k β’ 167 β’ 7
How to use therealcyberlord/vit-indian-food with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("image-classification", model="therealcyberlord/vit-indian-food")
pipe("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png") # Load model directly
from transformers import AutoImageProcessor, AutoModelForImageClassification
processor = AutoImageProcessor.from_pretrained("therealcyberlord/vit-indian-food")
model = AutoModelForImageClassification.from_pretrained("therealcyberlord/vit-indian-food")This model is a fine-tuned Vision Transformer (ViT) for the task of classifying images of Indian foods. The model was trained on the Indian Foods Dataset from Hugging Face Datasets.
The Indian Foods Dataset contains 4,770 images across 15 different classes of popular Indian dishes. The dataset is split into:
The base model used is the vision transformer (google/vit-base-patch16-224-in21k). The model was fine-tuned on the Indian Foods Dataset for 10 epochs using the AdamW optimizer with a learning rate of 2e-4.
The model was evaluated on the test set and achieved the following metrics:
You can use this pre-trained model directly from Hugging Face