Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

joe32140
/
gte-en-mlm-base-msmarco

Sentence Similarity
sentence-transformers
Safetensors
English
new
feature-extraction
Generated from Trainer
dataset_size:11662655
loss:CachedMultipleNegativesRankingLoss
custom_code
Eval Results (legacy)
text-embeddings-inference
Model card Files Files and versions
xet
Community

Instructions to use joe32140/gte-en-mlm-base-msmarco with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use joe32140/gte-en-mlm-base-msmarco with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("joe32140/gte-en-mlm-base-msmarco", trust_remote_code=True)
    
    sentences = [
        "what county is lyndhurst, ohio in",
        "This article is about the song written by Kenneth Gamble, Leon Huff and Cary Gilbert. For the Tina Turner song, see Don't Leave Me This Way (Tina Turner song). Don't Leave Me This Way is a song written by Kenneth Gamble, Leon Huff and Cary Gilbert. First charting as a hit for Harold Melvin & the Blue Notes featuring Teddy Pendergrass, an act on Gamble & Huff's Philadelphia International label in 1975, Don't Leave Me This Way was later a huge disco hit for Motown artist Thelma Houston in 1977.",
        "Lyndhurst is a city in Cuyahoga County, Ohio, United States. The population was 14,001 at the 2010 census. Lyndhurst is located in northeastern Ohio, and is a suburb of Cleveland. A small part of Lyndhurst was originally part of Mayfield Township. It used to be called Euclidville before Lyndhurst was chosen. Lyndhurst is located at 41°31′17″N 81°29′25″W / 41.52139°N 81.49028°W / 41.52139; -81.49028 (41.521352, -81.490141).",
        "Welcome to Trumbull County... Trumbull County, the county seat, located in Warren, Ohio, consists of a combination of both urban and rural communities situated in the northeast corner of Ohio. It is situated roughly between the Youngstown, Cleveland and Akron corridors."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Notebooks
  • Google Colab
  • Kaggle
gte-en-mlm-base-msmarco
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
joe32140's picture
joe32140
Add new SentenceTransformer model
608f025 verified over 1 year ago
  • 1_Pooling
    Add new SentenceTransformer model over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    48.9 kB
    Add new SentenceTransformer model over 1 year ago
  • config.json
    1.46 kB
    Add new SentenceTransformer model over 1 year ago
  • config_sentence_transformers.json
    204 Bytes
    Add new SentenceTransformer model over 1 year ago
  • model.safetensors
    547 MB
    xet
    Add new SentenceTransformer model over 1 year ago
  • modules.json
    229 Bytes
    Add new SentenceTransformer model over 1 year ago
  • sentence_bert_config.json
    54 Bytes
    Add new SentenceTransformer model over 1 year ago
  • special_tokens_map.json
    695 Bytes
    Add new SentenceTransformer model over 1 year ago
  • tokenizer.json
    712 kB
    Add new SentenceTransformer model over 1 year ago
  • tokenizer_config.json
    1.22 kB
    Add new SentenceTransformer model over 1 year ago
  • vocab.txt
    232 kB
    Add new SentenceTransformer model over 1 year ago