sartifyllc
/

AViLaMa

Zero-Shot Image Classification

vision-text-dual-encoder

image generation

text-image embedding

image-text embedding

visual conversional ai

image semantic retrival

african raw resourced languages

Model card Files Files and versions

innocent-charles commited on Apr 26, 2024

Commit

cac8554

·

verified ·

1 Parent(s): d4e503c

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -30,7 +30,7 @@ library_name: transformers
 ---
 # AViLaMa : African Vision-Languages Aligment Pre-Training Model.
-Learning Visual Concepts Directly From African Languages Supervision. [To be coming]()
 ## Model Details
 AViLaMa is the large open-source text-vision alignment pre-training model in African languages. It brings a way to learn visual concepts directly from African languages supervision. Inspired from OpenAI CLIP, but with more modalities like video, audio, etc.. and other techniques like agnostic languages encoding, data filtering network. All for more than 12 African languages, trained on the #AViLaDa-2B datasets of filtered image, video, audio-text pairs. We are also working to make it usable in directly vision-vision tasks.

 ---
 # AViLaMa : African Vision-Languages Aligment Pre-Training Model.
+Learning Visual Concepts Directly From African Languages Supervision. [Paper is coming]()
 ## Model Details
 AViLaMa is the large open-source text-vision alignment pre-training model in African languages. It brings a way to learn visual concepts directly from African languages supervision. Inspired from OpenAI CLIP, but with more modalities like video, audio, etc.. and other techniques like agnostic languages encoding, data filtering network. All for more than 12 African languages, trained on the #AViLaDa-2B datasets of filtered image, video, audio-text pairs. We are also working to make it usable in directly vision-vision tasks.