I’ve been trying out DEVONthink (Dt) for awhile now, and while I’m continually impressed by its abilities, I also am quickly getting the impression that my initial needs (or rather “wants”) is beyond Dt’s capabilities. I was hoping that if I outline my wants, someone could help me out with how Dt could be used to achieve them. I’ve now spent a lot of time trying to determine an optimal usage scenario for Dt, and either I’m not understanding it 100%, or my needs are beyond the scope of the personal edition of Dt (But the Pro info. on the website hasn’t answered these questions for me).
I’m a pack rat when it comes to data and information, and I have been into PC’s for awhile now, so I’ve amassed a rather sizable collection of files to deal with. I’m also a developer, and want to be able to quickly glean what file would be best for me to peruse when I need to find a solution to a particular problem. This goes for my other interests as well, whether it’s indexing fictional texts, wallpaper images, or my personal and financial documents.
As such, I have about 6.5 gb’s of ebooks I want to index in Dt. The ebooks consist primarily of pdf’s, but also contain a lot of “indexed” html (meaning that it’s a local copy of a website, or a series of html-based manuals, similar to the O’Reilly developer series). These ebooks are arranged, on the hard drive, in manners similar to the following example, and are often mixed in with web clippings, and articles I’ve archived from online sources:
/Manuals and Instructional/Software/Development/PHP/PHP5
I was able to just drag a large series of these over, and into Dt, but somewhere along the way, they stopped copying accurately, meaning that if I dragged a group of files over from my hard drive, to link to Dt, not all of the files would end up linked.
At this point, it appears that I likely have hit the 10000 limit to the personal edition of Dt, but I can’t find anything that’s telling me I’ve hit this limit. Dt simply “beeps” at me (sometimes… Other times it just takes the focus, and does nothing) when I try adding more to the system. I can see that I have “x” number of images in the database, along with a myriad of other information, but I’ve no idea on the PDF’s. Perhaps this is a limitation of the trial version, but again, I can’t find anything specifically telling me what the problem is.
Another problem I’m running into is with the speed of Dt. My database of development related ebooks is not over 41000 objects (a mixture of PDF’s, web pages/sites, images, and related text notes and sych), and Dt really seems to slow down at times. I don’t see the CPU utilization go over 50% on one of my 2 processors, so I don’t see it as a bottleneck with the system… Can I expect similar slowing as my collection continues to grow? I can deal with some slowness… It’s very impressive that Dt is able to do what it does already, and I expect that larger amounts of files will slow it down somewhat, but the increase seems to be really bad at times.
This leads me to thinking about other ways to break down my documents… Ideally, I’d like to keep all my documents with Dt, or a similar program (Dt at this point), from personal documents, to ebooks and reference material (PDFs, saved web pages/sites, etc), to resource material (images, videos & audio files, etc), all arranged under some rather broad root folders (again, “Development”, “Fiction”, “Reference”, “Personal”, etc), which would allow me to quickly narrow my search down by topic.
Anyway… My hope is that Pro will let me do this, as currently the file limitations of the personal edition are simply lower than the number of files I intended on indexing. What will be the hard limits of the Pro version? Any, file size, quantity, etc limitations?
I’ve considered breaking my subjects up into seperate databases. But I really have a problem when it comes to Development, for example. Ideally, I’d like to have my code snippets, notes, refrence material, development related ebooks, images, and so on, all housed in on database, along with information for specific projects, housed in the same database, but clearly segregated from the reference material, resources, etc.
This is certainly possible now, but considering the reference material, what if I’d like to reference the same ebook/reference material in a seperate ebook-only database? There doesn’t appear to be a way for me to share document information between databases, and it seems a big waste of space to duplicate this data.
Again, I’d prefer to do this all within one Dt database, rather than split them up, but is Dt’s speed, and limitations going to allow me to do this? Another problem with this scenario is that there’s no quick and easy way to switch databases. I an redirect the focus of Dt, and then close and re-open the program, but this rather time consuming, especially when considering how slow my start time is on my large database. I’ve thought about trying to achieve this also by pointing Dt’s focus to a linked folder, rather than the folder itself. I can then overwrite this link with differnt links, and then relaunch Dt also, but again, this seems slow and problematic.
I do like Dt a lot, but am leary of spending the time that it will take to correctly arrange, import, and sort my wealth of documents, on a product which isn’t designed for someone with my particular needs.
One more item I was wondering about: I am considering the purchase of an Powerbook, and would like to share “some” data between a Powerbook -based database, and a home-based Dt install, while keeping additional items, both on the powerbook, and on the home PC, seperate from one another. Basically, I’d like to be able to doc my Powerbook up to my home network, and have these shared sections sync up to each other, while keeping their location-specific sections untouched, and intact. I don’t see an obvious way to do something like this, nor any indication that the Pro version will. Is this a really odd request, or am I missing something obvious?
Again, I really do like Dt, and think that there must be some way to tailor it around my needs. Sorry for the long post, but I wanted to be as clear as possible.