tags:nlouyba Twtxt Search

Tue, 21 Sep 2021 13:08 Z (2 years ago)

(#nlouyba) Yeah it would be! I _think_ we'd have a lot to complement each other. Problem is it actually is a lot of work to create a generalised search engine. It's much easier to create a search engine for a small domain like Yarn.social / Twtxt. But even then there's still work to be done on the crawling side (_I think_) -- Right now it just re-crawls the space once a day.

matched #23afyqa score:12.26

prologic @twtxt.net

Tue, 21 Sep 2021 04:18 Z (2 years ago)

(#nlouyba)
> The author rightly blames search engines. A similar revelation hit me like a truck after I used Marginalia Search a few times. Give it a try.

Bookmarked! Is this a search engine that's done its own crawling and indexing like what I've tried to do with spyda.dev? 🤔

matched #c7jbuvq score:12.26

mckinley @twtxt.net

Tue, 21 Sep 2021 04:36 Z (2 years ago)

(#nlouyba) @prologic Yes, it does its own crawling. You can check if a particular website is indexed by searching for a domain like this: `site:mckinley.cc`

matched #net5xta score:12.26
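The `site:` check mckinley describes can be scripted if you want to test several sites. This is only a rough sketch: `site_query` is a made-up helper name, and it assumes the operator wants a bare hostname, as in the `site:mckinley.cc` example. It only builds the query text. You would still paste the result into the search box yourself.

```python
from urllib.parse import urlparse


def site_query(url_or_domain: str) -> str:
    """Build a `site:` query string from a full URL or a bare domain.

    Hypothetical helper: assumes the search engine expects `site:<hostname>`.
    """
    text = url_or_domain.strip()
    # urlparse only fills in the host when a scheme or leading "//" is present.
    parsed = urlparse(text if "//" in text else "//" + text)
    host = parsed.hostname or ""  # hostname is lowercased and excludes any port
    if not host:
        raise ValueError(f"no hostname found in {url_or_domain!r}")
    return f"site:{host}"


print(site_query("https://mckinley.cc/blog/"))  # prints site:mckinley.cc
```

Passing a bare domain such as `mckinley.cc` gives the same result as passing a full URL.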

prologic @twtxt.net

Tue, 21 Sep 2021 13:03 Z (2 years ago)

(#nlouyba) So I had a play with this search engine tonight and read everything about what this guy has done. Amazing work! 👌 I've reached out to him via email to see if perhaps he'd be interested in teaming up with me in some way. Anyway, I also wanted to point out something rather sad:

> The crawler gets captchad by CDNs like Fastly and CloudFlare. I've prostrated myself before them and pleaded to get listed as a good bot, but they have yet to call back so until then they are blocked on a subnet level.

😢 😡 🤬 #Fastly and #Cloudflare suck 😡

matched #pvbnzka score:7.08

adi @twtxt.net

Tue, 21 Sep 2021 13:05 Z (2 years ago)

@prologic (#nlouyba) Hehe, would be nice for you to team up! 😎

matched #ww5s6xq score:12.26


prologic @twtxt.net

Tue, 21 Sep 2021 04:45 Z (2 years ago)

@mckinley (#nlouyba) In that case it's very similar in spirit to what I've been building at -- What's holding me back at the moment is that I need to understand how to better index "web" documents and figure out a crawling strategy so it continues to grow its index.

matched #xqij7fa score:12.26
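On the "crawling strategy so it continues to grow its index" question, one common starting point is a breadth-first frontier: visit each page once and queue every link not seen before, so the index grows outward from the seeds. The sketch below is a generic illustration, not how spyda.dev or Marginalia Search actually work. `crawl`, `fetch_links` and the `FAKE_WEB` table are all made-up names, and the in-memory table stands in for real HTTP fetching and link extraction.

```python
from collections import deque
from typing import Callable, Iterable, List


def crawl(seeds: Iterable[str],
          fetch_links: Callable[[str], Iterable[str]],
          max_pages: int = 100) -> List[str]:
    """Breadth-first crawl: visit each URL once and queue newly found links."""
    frontier = deque(dict.fromkeys(seeds))  # de-duplicated seeds, order kept
    seen = set(frontier)
    visited: List[str] = []
    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        visited.append(url)  # a real crawler would index the page here
        for link in fetch_links(url):
            if link not in seen:  # only unseen links grow the frontier
                seen.add(link)
                frontier.append(link)
    return visited


# Hypothetical in-memory "web": page -> outgoing links.
FAKE_WEB = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example", "d.example"],
    "c.example": ["a.example"],
    "d.example": [],
}

print(crawl(["a.example"], lambda url: FAKE_WEB.get(url, [])))
# prints ['a.example', 'b.example', 'c.example', 'd.example']
```

The `max_pages` cap is just a budget per run. A real crawler would also need politeness delays, robots.txt handling and a re-crawl schedule, none of which are shown here.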