Different languages, similar encoding efficiency: Comparable information rates across the human communicative niche | Science Advances
In terms of encoded data, humans generally operate at about 39.15 bits/sec regardless of language, which means that languages carrying less information per syllable have to be spoken faster to compensate. Japanese seems to encode about 5 bits/syllable, so it has to be spoken fast. On the other end, it looks like US English is capped at around 9 syllables/sec, so turning audio into information (mental objects) is a possible limiter on the high end.
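The trade-off is easy to check with back-of-the-envelope arithmetic. The sketch below assumes the information rate is simply bits/syllable times syllables/sec, and it treats the roughly 9 syllables/sec figure as if it were a general ceiling, which is my simplification rather than something the paper claims. The function names are made up for illustration.

```python
# Rough arithmetic only: assumes info rate = (bits per syllable) x (syllables per second).
TARGET_BITS_PER_SEC = 39.15  # average rate quoted above


def syllables_per_sec(bits_per_syllable: float,
                      target: float = TARGET_BITS_PER_SEC) -> float:
    """Speech rate needed to reach the target information rate."""
    return target / bits_per_syllable


def min_bits_per_syllable(max_syllables_per_sec: float,
                          target: float = TARGET_BITS_PER_SEC) -> float:
    """Lowest per-syllable density that still reaches the target at a given speech-rate cap."""
    return target / max_syllables_per_sec


if __name__ == "__main__":
    # Japanese at about 5 bits/syllable
    print(f"{syllables_per_sec(5.0):.2f} syllables/sec")      # 7.83 syllables/sec
    # Assumed ceiling of about 9 syllables/sec
    print(f"{min_bits_per_syllable(9.0):.2f} bits/syllable")  # 4.35 bits/syllable
```

So at 5 bits/syllable a speaker needs close to 8 syllables/sec to hit the average rate, and if 9 syllables/sec really is about as fast as speech gets, a language much below roughly 4.35 bits/syllable could not reach 39.15 bits/sec at all.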
