Automatic Speech Recognition
Transformers
PyTorch
TensorBoard
Safetensors
Norwegian
Norwegian Bokmål
wav2vec2
Eval Results (legacy)
Instructions to use NbAiLab/nb-wav2vec2-300m-bokmaal with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use NbAiLab/nb-wav2vec2-300m-bokmaal with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="NbAiLab/nb-wav2vec2-300m-bokmaal")# Load model directly from transformers import AutoProcessor, AutoModelForCTC processor = AutoProcessor.from_pretrained("NbAiLab/nb-wav2vec2-300m-bokmaal") model = AutoModelForCTC.from_pretrained("NbAiLab/nb-wav2vec2-300m-bokmaal") - Notebooks
- Google Colab
- Kaggle
Update eval.py
Browse files
eval.py
CHANGED
|
@@ -130,9 +130,9 @@ def normalize_text(original_text: str, dataset: str) -> str:
|
|
| 130 |
text = re.sub('[úùüû]', 'u', text)
|
| 131 |
text = re.sub('[«»]', '', text)
|
| 132 |
text = re.sub('\s+', ' ', text)
|
| 133 |
-
text = re.sub('<
|
| 134 |
-
text = re.sub('<
|
| 135 |
-
text = re.sub('<
|
| 136 |
text = re.sub('<inaudible>', 'xxx', text)
|
| 137 |
text = re.sub('[<>]', '', text)
|
| 138 |
|
|
|
|
| 130 |
text = re.sub('[úùüû]', 'u', text)
|
| 131 |
text = re.sub('[«»]', '', text)
|
| 132 |
text = re.sub('\s+', ' ', text)
|
| 133 |
+
text = re.sub('<e+h?>', 'eee', text)
|
| 134 |
+
text = re.sub('<m+>', 'mmm', text)
|
| 135 |
+
text = re.sub('<q+>', 'qqq', text)
|
| 136 |
text = re.sub('<inaudible>', 'xxx', text)
|
| 137 |
text = re.sub('[<>]', '', text)
|
| 138 |
|