I have a video call with my guests here is how the feeds look like:
Feed 1 (recorded locally by guest, records ONLY his/her audio/video)
Feed 2 (recorded locally by me, records ONLY my audio/video)
I can determine the point when both started (i.e. 10 seconds in etc).
I want to create a combined feed where:
a guest is shown/heard oly when guest speaks
I am shown/heard when I speak
I.e. the script should decide based on audio waves which video is to be shown
It should also sync the audio volume so both audio volumes sound the same for listener.
Can ffmpeg do that?