Sentence Similarity
Safetensors
sentence-transformers
English
PyLate
modernbert
ColBERT
embeddings
retrieval
feature-extraction
Generated from Trainer
dataset_size:640000
loss:Distillation
Eval Results (legacy)
text-embeddings-inference
🇪🇺 Region: EU
Instructions to use lightonai/ModernColBERT-embed-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use lightonai/ModernColBERT-embed-base with PyLate (built on sentence-transformers):

    from pylate import models

    queries = [
        "Which planet is known as the Red Planet?",
        "What is the largest planet in our solar system?",
    ]

    documents = [
        ["Mars is the Red Planet.", "Venus is Earth's twin."],
        ["Jupiter is the largest planet.", "Saturn has rings."],
    ]

    model = models.ColBERT(model_name_or_path="lightonai/ModernColBERT-embed-base")

    queries_emb = model.encode(queries, is_query=True)
    docs_emb = model.encode(documents, is_query=False)

- Notebooks
- Google Colab
- Kaggle
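
The usage snippet stops after encoding, so it may help to see how the resulting embeddings could be compared. The model's configuration names MaxSim as its similarity function: each query token is matched with its most similar document token, and those maxima are summed. The following is a minimal sketch in plain Python, assuming each query or document is a sequence of L2-normalized token vectors (so a dot product is a cosine similarity). The helpers `dot`, `maxsim_score`, and `rank_documents` are hypothetical names introduced for illustration and are not part of PyLate; the toy vectors stand in for real model output.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_tokens: Sequence[Vector], doc_tokens: Sequence[Vector]) -> float:
    """MaxSim: for every query token take the best-matching document token, then sum."""
    return sum(max(dot(q, d) for d in doc_tokens) for q in query_tokens)


def rank_documents(
    query_tokens: Sequence[Vector], candidates: Sequence[Sequence[Vector]]
) -> List[Tuple[int, float]]:
    """Return (candidate index, score) pairs sorted from best to worst."""
    scored = [(i, maxsim_score(query_tokens, doc)) for i, doc in enumerate(candidates)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data (assumption: two query tokens in a 2-dimensional embedding space).
toy_query = [[1.0, 0.0], [0.0, 1.0]]
toy_docs = [
    [[1.0, 0.0], [0.6, 0.8]],              # matches the first query token exactly
    [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]],  # matches both query tokens exactly
]

toy_ranking = rank_documents(toy_query, toy_docs)
print(toy_ranking)
```

With real embeddings, the same functions could be applied to one entry of `queries_emb` and the matching list of candidate embeddings in `docs_emb`, after converting them to nested sequences if needed.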
{
  "model_type": "ColBERT",
  "__version__": {
    "sentence_transformers": "5.1.1",
    "transformers": "4.48.3",
    "pytorch": "2.6.0"
  },
  "prompts": {
    "query": "search_query: ",
    "document": "search_document: "
  },
  "default_prompt_name": null,
  "similarity_fn_name": "MaxSim",
  "query_prefix": "[Q] ",
  "document_prefix": "[D] ",
  "query_length": 39,
  "document_length": 519,
  "attend_to_expansion_tokens": false,
  "skiplist_words": [
    "!",
    "\"",
    "#",
    "$",
    "%",
    "&",
    "'",
    "(",
    ")",
    "*",
    "+",
    ",",
    "-",
    ".",
    "/",
    ":",
    ";",
    "<",
    "=",
    ">",
    "?",
    "@",
    "[",
    "\\",
    "]",
    "^",
    "_",
    "`",
    "{",
    "|",
    "}",
    "~"
  ],
  "do_query_expansion": false
}
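
The `skiplist_words` entries above are the 32 ASCII punctuation characters, the same set as Python's `string.punctuation`. In ColBERT-style models a skiplist is typically used to leave punctuation tokens out of the document representation so they do not take part in MaxSim scoring. The sketch below illustrates that idea on already-tokenized text; `drop_skiplisted` is a hypothetical helper, the toy tokens and one-dimensional embeddings are made up, and the library's actual masking logic may differ.

```python
import string
from typing import List, Sequence, Tuple

# The 32 "skiplist_words" entries in the config match string.punctuation.
SKIPLIST = set(string.punctuation)


def drop_skiplisted(
    tokens: Sequence[str], embeddings: Sequence[Sequence[float]]
) -> Tuple[List[str], List[Sequence[float]]]:
    """Remove tokens found in the skiplist together with their embeddings."""
    if len(tokens) != len(embeddings):
        raise ValueError("tokens and embeddings must have the same length")
    kept = [(tok, emb) for tok, emb in zip(tokens, embeddings) if tok not in SKIPLIST]
    return [tok for tok, _ in kept], [emb for _, emb in kept]


# Toy document (assumption: one token per word, one-dimensional embeddings).
toy_tokens = ["mars", "is", "the", "red", "planet", "."]
toy_embeddings = [[float(i)] for i in range(len(toy_tokens))]

kept_tokens, kept_embeddings = drop_skiplisted(toy_tokens, toy_embeddings)
print(kept_tokens)
```

Only the final "." is removed here, because it is the only token that appears in the skiplist.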