(#37xr3ra) As a quick experiment, I modified my code to remove the domain restrictions and, lo and behold: ``` All done! Crawled 516 feeds Found 52464 twts Found 736 feeds ``` The Twtxt network is larger than I thought. A significant number of feeds no longer work, obviously, but that's okay, we can prune dead feeds out.

matched #4vuxgtq score: 11.2 (1 tag)
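For illustration only (the actual crawler code isn't shown in this thread), here is a minimal Go sketch of what "removing the domain restrictions" can look like with colly: a collector built with `colly.AllowedDomains(...)` stays inside the listed hosts, while omitting that option lets it fetch any feed URL it is handed. The seed URL and user agent below are hypothetical.

```go
package main

import (
	"fmt"

	"github.com/gocolly/colly/v2"
)

func main() {
	// Passing colly.AllowedDomains("example.com") here would restrict the
	// collector to that host. Omitting it (as below) removes the domain
	// restriction, so every discovered feed URL is fair game.
	c := colly.NewCollector(
		colly.UserAgent("twtxt-crawler-sketch/0.1"), // hypothetical UA
	)

	feeds := 0
	c.OnResponse(func(r *colly.Response) {
		feeds++
		fmt.Printf("fetched %s (%d bytes)\n", r.Request.URL, len(r.Body))
	})

	// Hypothetical seed feed; a real crawler would queue every feed URL
	// it discovers along the way.
	if err := c.Visit("https://example.com/twtxt.txt"); err != nil {
		fmt.Println("visit failed:", err)
	}

	fmt.Printf("All done! Crawled %d feeds\n", feeds)
}
```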
(#37xr3ra) @lyse @prologic Very curious... I worked on a very similar track. I built a spider that traces any `follows = ` comments and mentions from other users and came up with: ``` twters: 744 total: 52073 ```

matched #6ltxv6q score: 11.2 (2 mentions, 1 tag)
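A hedged sketch of the kind of spider described above: fetch a feed, pull feed URLs out of `follows = ` comment lines and `@<nick url>` mentions, and recurse over feeds not seen before. The regexes, the seed URL, and the idea of counting every non-comment line as a twt are assumptions made for this sketch, not the author's actual parsing rules.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"regexp"
	"strings"
)

// Assumed patterns: "# follows = nick https://..." comment lines and
// "@<nick https://...>" mentions; the real spider may parse differently.
var (
	followRe  = regexp.MustCompile(`(?i)follows?\s*=\s*\S*\s*(https?://\S+)`)
	mentionRe = regexp.MustCompile(`@<[^ >]+ (https?://[^>]+)>`)
)

func fetch(url string) (string, error) {
	resp, err := http.Get(url)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	body, err := io.ReadAll(resp.Body)
	return string(body), err
}

// crawl walks feeds depth-first, collecting every feed URL it discovers
// via follow comments or mentions, skipping feeds it has already seen.
func crawl(url string, seen map[string]bool, twts *int) {
	if seen[url] {
		return
	}
	seen[url] = true

	body, err := fetch(url)
	if err != nil {
		return // dead feed; a real spider would record the failure
	}

	var next []string
	for _, line := range strings.Split(body, "\n") {
		if strings.HasPrefix(line, "#") {
			if m := followRe.FindStringSubmatch(line); m != nil {
				next = append(next, m[1])
			}
			continue
		}
		if strings.TrimSpace(line) != "" {
			*twts += 1 // crude: every non-comment line counts as a twt
		}
		for _, m := range mentionRe.FindAllStringSubmatch(line, -1) {
			next = append(next, m[1])
		}
	}
	for _, u := range next {
		crawl(u, seen, twts)
	}
}

func main() {
	seen := map[string]bool{}
	twts := 0
	crawl("https://example.com/twtxt.txt", seen, &twts) // hypothetical seed
	fmt.Printf("twters: %d\ntotal: %d\n", len(seen), twts)
}
```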
(#37xr3ra) Wait... So you actually wrote a more elaborate crawler without taking a shortcut like I did using colly (_not that it really helps much_)? Hmmm 🤔 Can we take it a bit further: make a daemon/server out of it, a web interface to search what it crawls using bleve, and some tools (_API, Web UI_) to let people add more "feeds" to crawl? 🤔

matched #acie3ca score: 11.2 (1 tag)
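On the bleve side of that proposal, the indexing and query loop is small. A minimal sketch, assuming a simple twt record with hash, author, and text fields (the record shape and sample values are invented for illustration); a web UI or API would just wrap the `Search` call.

```go
package main

import (
	"fmt"
	"log"

	"github.com/blevesearch/bleve/v2"
)

// Twt is a hypothetical record shape for the index; the real schema
// (hashes, mentions, subjects, ...) would be richer.
type Twt struct {
	Hash   string `json:"hash"`
	Author string `json:"author"`
	Text   string `json:"text"`
}

func main() {
	// Create an on-disk index with the default mapping.
	mapping := bleve.NewIndexMapping()
	index, err := bleve.New("twts.bleve", mapping)
	if err != nil {
		log.Fatal(err)
	}
	defer index.Close()

	// Index a twt keyed by its hash (sample values, loosely from this thread).
	twt := Twt{Hash: "4vuxgtq", Author: "prologic", Text: "The Twtxt network is larger than I thought."}
	if err := index.Index(twt.Hash, twt); err != nil {
		log.Fatal(err)
	}

	// Query the index; an API endpoint would serialise the results instead.
	query := bleve.NewMatchQuery("network")
	req := bleve.NewSearchRequest(query)
	res, err := index.Search(req)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(res)
}
```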
(#37xr3ra) @prologic Yeah, it reads a seed file (I'm using mine), scans for any mention links, and then scans them recursively. It reads from http/s or gopher. I don't have much of a db yet... it just writes the feed to disk and checks modified dates, but I will add a db that has hashes/mentions/subjects and such.

matched #e5kep3q score: 11.2 (1 mention, 1 tag)
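The "writes to disk and checks modified dates" part above maps naturally onto HTTP conditional requests. A rough sketch, assuming the cached Last-Modified value is carried between runs and feeds are stored as flat files (gopher would need a separate client); the function and file names are hypothetical.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"os"
)

// fetchIfModified re-downloads a feed only when the server says it has
// changed since lastModified; it returns the new Last-Modified value.
func fetchIfModified(url, path, lastModified string) (string, error) {
	req, err := http.NewRequest(http.MethodGet, url, nil)
	if err != nil {
		return lastModified, err
	}
	if lastModified != "" {
		req.Header.Set("If-Modified-Since", lastModified)
	}

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return lastModified, err
	}
	defer resp.Body.Close()

	if resp.StatusCode == http.StatusNotModified {
		fmt.Println("unchanged:", url)
		return lastModified, nil
	}

	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return lastModified, err
	}
	// Persist the raw feed to disk; a later pass can extract hashes,
	// mentions, and subjects into a proper db.
	if err := os.WriteFile(path, body, 0o644); err != nil {
		return lastModified, err
	}
	return resp.Header.Get("Last-Modified"), nil
}

func main() {
	// Hypothetical feed URL and local file name.
	lm, err := fetchIfModified("https://example.com/twtxt.txt", "example.com.twtxt.txt", "")
	if err != nil {
		fmt.Println("fetch failed:", err)
		return
	}
	fmt.Println("stored, Last-Modified:", lm)
}
```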
This is a twtxt search engine and crawler. Please contact Support if you have any questions, concerns, or feedback!