ggerganov commited on
Commit
e3ee348
·
unverified ·
1 Parent(s): b777dde

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md CHANGED
@@ -273,6 +273,45 @@ to highlight words with high or low confidence:
273
 
274
  <img width="965" alt="image" src="https://user-images.githubusercontent.com/1991296/197356445-311c8643-9397-4e5e-b46e-0b4b4daa2530.png">
275
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
276
  ## Implementation details
277
 
278
  - The core tensor operations are implemented in C ([ggml.h](ggml.h) / [ggml.c](ggml.c))
 
273
 
274
  <img width="965" alt="image" src="https://user-images.githubusercontent.com/1991296/197356445-311c8643-9397-4e5e-b46e-0b4b4daa2530.png">
275
 
276
+ ## Word-level timestamps (experimental)
277
+
278
+ The [main](examples/main) example has experimental support for word-level timestamp generation. The accuracy
279
+ is not great, but might be improved in the future.
280
+
281
+ To use it, simply add the `-owts` command-line argument. There is a free parameter `-wt` that should be around `0.01`.
282
+
283
+ Here are a few *"typical"* examples:
284
+
285
+ ```java
286
+ ./main -m ./models/ggml-base.en.bin -f ./samples/jfk.wav -owts -wt 0.01
287
+ source ./samples/jfk.wav.wts
288
+ ffplay ./samples/jfk.wav.mp4
289
+ ```
290
+
291
+ https://user-images.githubusercontent.com/1991296/198885665-b34b6845-11b8-4449-a255-d9ec2eab1344.mp4
292
+
293
+ ---
294
+
295
+ ```java
296
+ ./main -m ./models/ggml-base.en.bin -f ./samples/mm0.wav -owts -wt 0.1
297
+ source ./samples/mm0.wav.wts
298
+ ffplay ./samples/mm0.wav.mp4
299
+ ```
300
+
301
+ https://user-images.githubusercontent.com/1991296/198885703-0547ba17-c288-4827-8361-84cc440f2901.mp4
302
+
303
+ ---
304
+
305
+ ```java
306
+ ./main -m ./models/ggml-base.en.bin -f ./samples/gb0.wav -owts -wt 0.01
307
+ source ./samples/gb0.wav.wts
308
+ ffplay ./samples/gb0.wav.mp4
309
+ ```
310
+
311
+ https://user-images.githubusercontent.com/1991296/198885729-3fc9028c-a50c-4549-a11f-3306ef97e0c4.mp4
312
+
313
+ ---
314
+
315
  ## Implementation details
316
 
317
  - The core tensor operations are implemented in C ([ggml.h](ggml.h) / [ggml.c](ggml.c))