SentenceTransformer based on CALDISS-AAU/DA-BERT_Old_News_V1

This is a sentence-transformers model finetuned from CALDISS-AAU/DA-BERT_Old_News_V1. It is an early version of a model meant to assist in segmenting ENO.

Model Details

Model Description

Training Details

Training Dataset

Unnamed Dataset

  • Size: 27,869 training samples
  • Columns: sentence_0, sentence_1, and label
  • Approximate statistics based on the first 1000 samples:
    sentence_0 sentence_1 label
    type string string float
    details
    • min: 4 tokens
    • mean: 41.42 tokens
    • max: 401 tokens
    • min: 4 tokens
    • mean: 41.21 tokens
    • max: 286 tokens
    • min: 0.0
    • mean: 0.46
    • max: 1.0
  • Samples:
    sentence_0 sentence_1 label
    6) Paa Hjørnet af Integaden og lilie Kongensgade ligge to store Trækasser, sog afspærre Passagen, og truer dem, der i Mørke kunde løbe paa dem. 7) Paa Østergade er der intet Bræt over den Rendesteen der kommer ud fra Per Madsens Gang og med en dyb Kisterende overskærer Fortoget. 1.0
    Ganske Fattige undervises for intet, og Seminaristerne bibringer han sin Methode. Ved et af ham selv opfundet Tegnsprog giver han sin Undervisning Indgang. 1.0
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for JohanHeinsen/Old_News_Segmentation_SBERT_V0

Finetuned
(5)
this model