Live Demo - how does it process real-time audio.
Created by: vucuong12
I am wondering how the live demo works. If I say "Today I am going to the supermarket to buy some beer", then does the live demo record the whole sentence then do the inference step (speech to text), or does it do that for smaller chunks (For example, "Today I am", then "going to the supermarket", and so on) ?