I would be happy if DT would examine, for upcoming updates, all the technical options that are available to it. Of course, these are only my two cents on this topic, but it really makes sense to look into this in detail.
I have been a journalist and editor for over 30 years, so I work with words and paragraphs all day. Historically, text-based files are the strength of DT, not images or video. But the more research has moved from text to all media types (multimedia) over the last decade or two, even in traditional media markets (newspapers etc.), the more I have found myself searching for ways to get text content out of non-text media (video, images etc.).
Over the years it has become more and more important for my research to have text from non-text media, because text still offers the strongest search possibilities. And of course the most helpful ways of extracting text are those that are as automatic as possible. Automation with text-based files is a key strength of DT.
In the last month there have also been some other posts in this forum that address this topic/problem. Sometimes the answer was that multimedia, video and images aren't the key strengths of DT, and that it may be better to optimize DT's strengths than to go too far into other (new) fields that DT isn't really made for. I agree completely with such opinions, and I like the way DT has been optimized and updated over the last 10 years (I have used DT for 10 years now).
But automatic extraction of text from images or video, automatic transcription of video, and text annotations for video timestamps are nowadays key capabilities for text-based research.
Therefore I would very much appreciate it if all new possibilities for such automatic features were discussed here in depth and with an open mind, because they belong to the present and the future of text-based research.
I am a journalist, not a technician, but maybe Apple's "Live Text" has new technical features that could be helpful for text-based research in images or videos. Or maybe not. Of course, app programmers could answer this better…
Just my opinion from my daily work…