ggerganov committed on
Commit a8c0955 · unverified · 1 Parent(s): 9a513f5

Update README.md

Files changed (1)
  1. README.md +12 -0
README.md CHANGED
@@ -8,6 +8,7 @@ High-performance inference of [OpenAI's Whisper](https://github.com/openai/whisp
  - Low memory usage (Flash Attention + Flash Forward)
  - Zero memory allocations at runtime
  - Runs on the CPU (Mac and Linux)
+ - [C-style API](https://github.com/ggerganov/whisper.cpp/blob/master/whisper.h)

  Incoming features:
  - [Realtime audio input transcription](https://github.com/ggerganov/whisper.cpp/issues/10#issuecomment-1264665959)
@@ -208,6 +209,17 @@ main: decode time = 14907.22 ms
  main: total time = 57416.63 ms
  ```

+ ## Real-time audio input example
+
+ This is a naive example of performing real-time inference on audio from your microphone.
+ The `stream` tool samples the audio every 3 seconds and runs the transcription continuously. More info is available in [issue #10](https://github.com/ggerganov/whisper.cpp/issues/10).
+
+ ```java
+ $ ./stream -m models/ggml-small.en.bin -t 8
+ ```
+
+ https://user-images.githubusercontent.com/1991296/193465125-c163d304-64f6-4f5d-83e5-72239c9a203e.mp4
+
  ## Limitations

  - Very basic greedy sampling scheme - always pick up the top token