Brin (
brin_bellway) wrote2020-09-23 11:18 pm
![[personal profile]](https://www.dreamwidth.org/img/silk/identity/user.png)
(no subject)
I learned about Recoll today while looking for a way to search ~50 chat-log ODT documents for a particular exchange, and oh my god it's amazing.
It totally did handle that situation, but it also handles *so* many other file formats. EPUB, HTML, Thunderbird, *things inside zipped folders* (7-zip too, if you install the right plugin!), just to name a few. I tried a search for "backpack" and got hits from chat logs, fanfics, ebooks, blogs, wikis, emails...
And there's search syntax, to let you exclude stuff with certain words or restrict it to a particular directory and all that.
I have acquired my own private search engine! That is a thing, that exists, right now, that you can just download off Synaptic like it's no big deal and not something beyond Vannevar Bush's wildest dreams!
It totally did handle that situation, but it also handles *so* many other file formats. EPUB, HTML, Thunderbird, *things inside zipped folders* (7-zip too, if you install the right plugin!), just to name a few. I tried a search for "backpack" and got hits from chat logs, fanfics, ebooks, blogs, wikis, emails...
And there's search syntax, to let you exclude stuff with certain words or restrict it to a particular directory and all that.
I have acquired my own private search engine! That is a thing, that exists, right now, that you can just download off Synaptic like it's no big deal and not something beyond Vannevar Bush's wildest dreams!
no subject
(I can run it on my Linux dual-boot to figure out, on a pure UI-and-technical-capabilities level, whether it's likely to work for me; but being usable at all is a much lower bar to meet than being sufficiently useful during normal day-to-day activities to warrant spending money on. I'm not sure how to figure out that second thing, given the relative infrequency with which I do anything on my Linux dual-boot.)
no subject
no subject
(From a first pass, it seems like DocFetcher is the other one most well-optimized for my use case; but plausibly the list will have changed by the time it becomes relevant to me down the line.)