Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
proxectonos
's Collections
Domain Specific Corpora
CorpusNÓS: A massive Galician corpus for training LLM
Text Datasets for Fine-tuning and Instruction tuning
Text Datasets for Evaluation
MT
Text Models
TTS Models
ASR Models
Instruction Pretrained Experiments
MT Models (former)
ASR Datasets
TTS Datasets
Domain Specific Corpora
updated
5 days ago
Collection of corpora prepared from specific domains mainly in Galician language.
Upvote
-
proxectonos/corpus_dominio_legal_administrativo
Preview
•
Updated
5 days ago
•
36
proxectonos/corpus_dominio_periodistico
Viewer
•
Updated
4 days ago
•
280k
•
34
proxectonos/corpus_dominio_cientifico
Preview
•
Updated
4 days ago
•
42
proxectonos/corpus_dominio_museistico_patrimonio
Viewer
•
Updated
2 days ago
•
14.5k
•
48
Upvote
-
Share collection
View history
Collection guide
Browse collections