Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -273,6 +273,45 @@ to highlight words with high or low confidence:
|
|
| 273 |
|
| 274 |
<img width="965" alt="image" src="https://user-images.githubusercontent.com/1991296/197356445-311c8643-9397-4e5e-b46e-0b4b4daa2530.png">
|
| 275 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 276 |
## Implementation details
|
| 277 |
|
| 278 |
- The core tensor operations are implemented in C ([ggml.h](ggml.h) / [ggml.c](ggml.c))
|
|
|
|
| 273 |
|
| 274 |
<img width="965" alt="image" src="https://user-images.githubusercontent.com/1991296/197356445-311c8643-9397-4e5e-b46e-0b4b4daa2530.png">
|
| 275 |
|
| 276 |
+
## Word-level timestamps (experimental)
|
| 277 |
+
|
| 278 |
+
The [main](examples/main) example has experimental support for word-level timestamp generation. The accuracy
|
| 279 |
+
is not great, but might be improved in the future.
|
| 280 |
+
|
| 281 |
+
To use it, simply add the `-owts` command-line argument. There is a free parameter `-wt` that should be around `0.01`.
|
| 282 |
+
|
| 283 |
+
Here are a few *"typical"* examples:
|
| 284 |
+
|
| 285 |
+
```java
|
| 286 |
+
./main -m ./models/ggml-base.en.bin -f ./samples/jfk.wav -owts -wt 0.01
|
| 287 |
+
source ./samples/jfk.wav.wts
|
| 288 |
+
ffplay ./samples/jfk.wav.mp4
|
| 289 |
+
```
|
| 290 |
+
|
| 291 |
+
https://user-images.githubusercontent.com/1991296/198885665-b34b6845-11b8-4449-a255-d9ec2eab1344.mp4
|
| 292 |
+
|
| 293 |
+
---
|
| 294 |
+
|
| 295 |
+
```java
|
| 296 |
+
./main -m ./models/ggml-base.en.bin -f ./samples/mm0.wav -owts -wt 0.1
|
| 297 |
+
source ./samples/mm0.wav.wts
|
| 298 |
+
ffplay ./samples/mm0.wav.mp4
|
| 299 |
+
```
|
| 300 |
+
|
| 301 |
+
https://user-images.githubusercontent.com/1991296/198885703-0547ba17-c288-4827-8361-84cc440f2901.mp4
|
| 302 |
+
|
| 303 |
+
---
|
| 304 |
+
|
| 305 |
+
```java
|
| 306 |
+
./main -m ./models/ggml-base.en.bin -f ./samples/gb0.wav -owts -wt 0.01
|
| 307 |
+
source ./samples/gb0.wav.wts
|
| 308 |
+
ffplay ./samples/gb0.wav.mp4
|
| 309 |
+
```
|
| 310 |
+
|
| 311 |
+
https://user-images.githubusercontent.com/1991296/198885729-3fc9028c-a50c-4549-a11f-3306ef97e0c4.mp4
|
| 312 |
+
|
| 313 |
+
---
|
| 314 |
+
|
| 315 |
## Implementation details
|
| 316 |
|
| 317 |
- The core tensor operations are implemented in C ([ggml.h](ggml.h) / [ggml.c](ggml.c))
|