mirror of
https://github.com/ggerganov/whisper.cpp.git
synced 2025-05-28 21:57:43 +02:00
RPC_CMD_SET_TENSOR always returns an empty response and we send this 4 times per token. We can improve TG speed if we don't wait for this empty response. The performance impact of this change depends on the network latency.