Wait for a longer time (5 seconds) before establishing the first bandwidth estimate.

This reduces the risk of a low initial estimate in combined audio/video BWE when the audio stream is received earlier than the video stream.

In addition, a check is added to make sure a probe can't reduce the BWE.
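
A minimal C++17 sketch of the two behaviors described above, for illustration only. The class name, method names, and constant are hypothetical and not taken from the actual change. The sketch also assumes that the first estimate is taken from the latest measured rate once the wait has elapsed, and that probe results arriving before the first estimate are ignored; the real code may differ on both points.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical sketch; names do not correspond to real WebRTC classes.
class InitialBweSketch {
 public:
  // Wait time before the first estimate (5 seconds, per the description).
  static constexpr int64_t kInitialWaitMs = 5000;

  // Feed a measured incoming bitrate sample observed at now_ms.
  void OnRateSample(int64_t now_ms, int64_t measured_bps) {
    assert(measured_bps >= 0);
    if (!first_sample_ms_) first_sample_ms_ = now_ms;
    // Hold back the first estimate until the wait has elapsed, so that a
    // late-starting video stream has a chance to be included.
    if (!estimate_bps_ && now_ms - *first_sample_ms_ >= kInitialWaitMs) {
      estimate_bps_ = measured_bps;  // Assumption: use the latest sample.
    }
    // Updates after the first estimate are omitted from this sketch.
  }

  // Apply a probe result; a probe may raise the estimate but never lower it.
  void OnProbeResult(int64_t probe_bps) {
    assert(probe_bps >= 0);
    if (!estimate_bps_) return;  // Assumption: ignored before first estimate.
    estimate_bps_ = std::max(*estimate_bps_, probe_bps);
  }

  // Empty until the initial wait has elapsed.
  std::optional<int64_t> estimate_bps() const { return estimate_bps_; }

 private:
  std::optional<int64_t> first_sample_ms_;
  std::optional<int64_t> estimate_bps_;
};
```

With audio only arriving at first, no estimate is produced during the wait; once video is flowing and 5 seconds have passed, the first estimate reflects both streams, and a later probe below that value leaves it unchanged.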


