Some questions about audio transcription in DEVONthink 4

I downloaded these two videos:

https://youtu.be/jaI_eOBNWhQ?si=FLp3jCz_d4VNysU3
https://youtu.be/FEVJoIcWj7A?si=9Uz8uYkADdOUzZ99

They were downloaded as MPEG-4 videos at 1280x720 resolution.

I still have the files, but didn’t want to upload them as they total almost 50 MB (which I realize is not a lot these days).

Thank you for the links, we’ll check this.

I just ran another transcription on a different video, 71 minutes long, and the timestamps were pretty good (I used Apple Local).

I’ve set up DT4 beta2 correctly, including the API, but the speech-to-text feature still isn’t working. The PDF Markdown summary function works fine, so the API shouldn’t be the problem.

Which one? There are 3 options.

"Remote GPT-4o transcription", Transcription Language: English

And what did you try to transcribe and how, e.g. via Data > Recognition > Transcribe Speech?

Transcribe Speech. After I restarted the computer, it started working normally.
Thanks!