What open source tools are you referring to? Do you just mean the search component?
There'd be two hard parts to this problem I reckon:
- gathering the data
If you make it too cumbersome, it won't be used. If you make it too easy to dump data, the useful info might get drowned out.
- ensuring search gives you good results
We have open source engines like Lucene that let you search extensively, but what happens when you get 200 results back? How do you know which is the best or most useful one? It's likely most users would get exhausted sifting through everything and just default back to Google.
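For what it's worth, the ranking worry can be made concrete without pulling in Lucene at all. Here's a toy pure-Python TF-IDF ranker; the function name, the smoothing, and the scoring details are illustrative only, not Lucene's actual similarity formula:

```python
import math
from collections import Counter

def tfidf_rank(query, docs):
    """Rank docs (a dict of id -> text) against a query with a
    bare-bones TF-IDF score. Toy sketch: no stemming, no phrase
    matching, whitespace tokenization only."""
    tokenized = {d: text.lower().split() for d, text in docs.items()}
    n = len(docs)

    def idf(term):
        # Smoothed inverse document frequency: rarer terms score higher.
        df = sum(1 for toks in tokenized.values() if term in toks)
        return math.log((n + 1) / (df + 1)) + 1

    scores = {}
    for d, toks in tokenized.items():
        counts = Counter(toks)
        scores[d] = sum(
            (counts[t] / len(toks)) * idf(t)
            for t in query.lower().split()
        )
    # Best match first.
    return sorted(scores, key=scores.get, reverse=True)
```

Even something this crude orders results instead of dumping 200 of them unsorted, which is arguably the bar a personal-history search has to clear.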
A script running curl over my browsing history would collect the HTML. I’d solve the 200-result problem if and when it became an actual problem, in a way that addressed the actual problem. There’s a lot of success to be had before too many results becomes a problem.
The idea that it might be more friction than it was worth is why I didn’t build it. Probably why nobody has built it, and perhaps why you just listed a bunch of imagined problems as reasons not to build it.
I mean it would probably be shit if I built it and I liked my idea better than the idea of the work. That’s most things.
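The curl-over-history idea above can be sketched in a few lines. This assumes a Firefox-style `places.sqlite` with the standard `moz_places` table; the function names are mine, and Chrome's `History` database uses a different schema, so treat it as a shape rather than a working tool:

```python
import pathlib
import sqlite3
import urllib.parse
import urllib.request

def history_urls(places_db):
    """Pull visited URLs out of a copy of Firefox's places.sqlite.
    Assumes the standard moz_places schema (url, visit_count,
    last_visit_date). Work on a copy: Firefox locks the live file."""
    con = sqlite3.connect(places_db)
    try:
        rows = con.execute(
            "SELECT url FROM moz_places "
            "WHERE visit_count > 0 "
            "ORDER BY last_visit_date DESC"
        ).fetchall()
    finally:
        con.close()
    return [r[0] for r in rows]

def archive(url, out_dir):
    """Fetch one page and write the raw HTML to disk -- the curl step.
    The URL is percent-encoded to make a filesystem-safe filename."""
    out = pathlib.Path(out_dir) / (urllib.parse.quote(url, safe="") + ".html")
    with urllib.request.urlopen(url, timeout=10) as resp:
        out.write_bytes(resp.read())
    return out
```

Point it at the saved HTML with any full-text indexer afterwards; the gathering step really is this small.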
—
For what it is worth, I would default to Google for the things Google does better and use my personalized historic search when I wanted to see what I had seen before. It's both-and, not either-or.
I've been of the opinion that website content monitoring should be implemented with a browser extension (plus possibly a local agent app)[1]. An extension-based approach would work well and be easy to use IMO.
I've been extremely disappointed by how Chrome in particular likes to forget everything about my browsing history (except for tracking cookies) after three months. I don't see why a link I clicked on a year ago shouldn't turn blue on any given page just because computers from 2004 might have had performance problems keeping a full history.
[1]: Enterprises seem to prefer MITM here instead, but I'd argue it's not truly required, given the overwhelming popularity of agent-based EDR solutions.
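A minimal sketch of the local-agent half of that extension-plus-agent architecture, assuming a (hypothetical) extension POSTs `{"url": ..., "html": ...}` captures to localhost. The endpoint, port, and payload shape are all made up for illustration; a real agent would index the HTML rather than hold it in a dict:

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

class CaptureHandler(BaseHTTPRequestHandler):
    """Accepts JSON page captures from a hypothetical browser extension."""

    # url -> html; stand-in for a real full-text index.
    pages = {}

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        self.pages[payload["url"]] = payload["html"]
        self.send_response(204)  # accepted, nothing to return
        self.end_headers()

    def log_message(self, *args):
        # Keep the console quiet; a real agent would log properly.
        pass

def serve(port=8942):
    """Run the capture endpoint on localhost (port choice is arbitrary)."""
    HTTPServer(("127.0.0.1", port), CaptureHandler).serve_forever()
```

The extension side would just be a content script doing a `fetch` of `document.documentElement.outerHTML` to this endpoint on page load, which keeps the friction near zero once installed.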