Yours is playing a LOT less than mine. Mine actually overlaps occasionally, and I can't count 5 seconds without a voiceover, often less than 2.
Anyway, here's my version
Assuming we don't have different definitions of "a LOT less", mine is actually very much the same as yours.
I counted 6 NPC audio calls in mine. There are 27 seconds of "dead air" and 33 seconds of audio chatter from 0 to 60 seconds.
In yours, I counted 6 NPC audio calls as well. There are 26 seconds of "dead air" and 34 seconds of audio chatter from 0 to 60 seconds.
From 61-120 seconds, there are 8 audio calls, but they are shorter statements. The result is pretty similar to the rest of the samples: 25 seconds of "dead air" and 35 seconds of audio chatter.
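For anyone who wants to check the arithmetic, here is a rough Python sketch that turns those counts into a share of each 60-second window filled with chatter. The sample labels and the `chatter_share` helper are just names I made up for illustration; the numbers are my own hand counts from above, so treat them as approximate.

```python
# (dead_air_seconds, chatter_seconds) per 60-second window, as hand-counted above.
# The labels are illustrative names only.
samples = {
    "mine, 0-60s": (27, 33),
    "yours, 0-60s": (26, 34),
    "61-120s": (25, 35),
}

def chatter_share(dead_air, chatter):
    """Fraction of the window that is filled with NPC audio."""
    return chatter / (dead_air + chatter)

shares = {name: chatter_share(*counts) for name, counts in samples.items()}
for name, share in shares.items():
    print(f"{name}: {share:.1%} chatter")

# Gap between the busiest and the quietest window.
spread = max(shares.values()) - min(shares.values())
print(f"spread: {spread:.1%}")
```

By these counts every window lands between 55% and roughly 58% chatter, a gap of about 3 percentage points.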
That you can listen to my video and believe it is "a LOT less" tells me a lot of this is just in people's minds. Assuming you were speaking about the first video, I didn't hear a single overlapping comment. You said "..mine actually overlaps occasionally..", which implies it happened more than once, yet it never happened even once.
Now, your second video (the demon combat), yeah, the amount of overlap happening there is annoying. But Cyseal is not like that (and neither are most encounters). If anything, that particular encounter's audio could use an adjustment, but that doesn't mean the whole game needs to have its audio nerfed (which is what's being called for in this thread).