It does sound to me like ingesting all these different formats into a normalized database (aka data warehousing) and then building your tools to report from that centralized warehouse is the way to go. Your warehouse could also track ingestion dates, original format converted from, etc. and then your tools only need to know that one source of truth.
Is there any reason not to build this as a two-step process of 1) ingestion to a central database and 2) reporting from said database?
https://youtu.be/bFc-tlXT56I?si=xjOb6xqKxmWJG8Uw
Most of the ones I can find are react content re-uploads. This one has corny captions on it but at least it’s the raw video.