(#nlouyba) Yeah it would be! I _think_ we'dย have a lot to complement each other. Problem is it actually is a lot of work to create a generalised search engine. It's much easier to create a search engine for a small domain like Yarn.social / Twtxt. But even then there's still work to be done on the crawling side (_I think_) -- Right now it just re-crawls the space once a day.
matched #23afyqa score:12.26
Search by:
Search by 1 tags:
(#nlouyba)
> The author rightly blames search engines. A similar revelation hit me like a truck after I used Marginalia Search a few times. Give it a try.
Bookmarked! Is this a search engine that's done it's own crawling and indexing like what I've tried to do with spyda.dev? ๐ค
matched #c7jbuvq score:12.26
Search by:
Search by 1 tags:
(#nlouyba) @prologic Yes, it does its own crawling. You can check if a particular website is indexed by searching for a domain like this: `site:mckinley.cc`
matched #net5xta score:12.26
Search by:
Search by 1 mentions:
(#nlouyba) So I had a play with this search engine tonight and read everything about what this guy has done, amazing work! ๐ I've reached out to him via email to see if perhaps he'd be interested in teaming up with me in some way. Anyway I also wanted to point out something rather sad:
> The crawler gets captchad by CDNs like Fastly and CloudFlare. I've prostrated myself before them and pleaded to get listed as a good bot, but they have yet to call back so until then they are blocked on a subnet level.
๐ข ๐ก ๐คฌ #Fastly and #Cloudflare sucks ๐ก
matched #pvbnzka score:7.08
Search by:
Search by 3 tags:
(#nlouyba) @prologic Yes, it does its own crawling. You can check if a particular website is indexed by searching for a domain like this: `site:mckinley.cc`
matched #xa27pna score:12.26
Search by:
Search by 1 mentions:
@mckinley (#nlouyba) In that case it's very similar in spirit to what I've been building at -- What's holding me back at the moment is I need to understand how to better index "web" documents and figure out a crawling strategy so it continues to grow it's index.
matched #xqij7fa score:12.26
Search by:
Search by 1 mentions: