I'm looking to perform online, real-time transcription (e.g. with something like whisper-streaming).
For this we need full-duplex capabilities, i.e. the client needs to be able to continuously stream data to the server while the server streams results back.
I see the https://github.com/replicate/replicate-python?tab=readme-ov-file#run-a-model-and-stream-its-output example, which shows that the server can stream results, but the input has to be sent in full up front, and I don't see how the client could keep sending more input data after the prediction has started.
Is full-duplex streaming supported in Replicate, or could you add support for it?
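
To make the request concrete, here is a rough sketch of the interaction pattern I mean. It does not use Replicate at all: the "server" is a local stand-in, and every name in it (`fake_transcriber`, `send_audio`, `receive_results`) is made up for illustration. The point is only that sending and receiving run concurrently, so new input can keep arriving after the first results have come back.

```python
import asyncio


async def fake_transcriber(inbound, outbound):
    """Hypothetical stand-in for a streaming model, not a Replicate API.

    Emits one partial result for every audio chunk it receives.
    """
    while True:
        chunk = await inbound.get()
        if chunk is None:  # end-of-stream sentinel from the client
            await outbound.put(None)
            return
        await outbound.put(f"partial transcript for {len(chunk)} bytes")


async def send_audio(inbound, chunks):
    """Client side, upstream: keep pushing chunks as they become available."""
    for chunk in chunks:
        await inbound.put(chunk)
        await asyncio.sleep(0)  # yield so the other tasks can make progress
    await inbound.put(None)


async def receive_results(outbound):
    """Client side, downstream: collect results while sending is still going on."""
    results = []
    while True:
        result = await outbound.get()
        if result is None:
            return results
        results.append(result)


async def main(chunks):
    inbound, outbound = asyncio.Queue(), asyncio.Queue()
    # All three coroutines run concurrently: this is the full-duplex part.
    _, _, results = await asyncio.gather(
        fake_transcriber(inbound, outbound),
        send_audio(inbound, chunks),
        receive_results(outbound),
    )
    return results


def run_demo():
    # Three dummy "audio" chunks of silence; the sizes are arbitrary.
    return asyncio.run(main([b"\x00" * 320, b"\x00" * 320, b"\x00" * 160]))


if __name__ == "__main__":
    for line in run_demo():
        print(line)
```

In a real deployment the two queues would be replaced by something like a WebSocket or a bidirectional gRPC stream; which transport fits Replicate best is an open question on my side.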