I’m currently searching for a tool that can handle indexing and searching through very large text files, often containing over 1 million lines within individual documents. I recently downloaded DEVONthink and attempted to index several files, some of which are around 20MB in size and include texts that have more than 1 million lines.
While I successfully set up the index, it didn’t seem like DEVONthink searched through the entirety of the files, especially when dealing with such large texts. I’m unsure if this is a limitation of DEVONthink or if I have configured something incorrectly.
Has anyone encountered similar issues or knows if DEVONthink has specific optimizations for handling and searching long text files like these?
Files sizes are usually no useful indicator. A PDF might contain lots of images and no text at all. Converting the PDF to plain text is the easiest way to figure out the numbers (in case of the manual less than 700k)