Those of you who have your own sites might want to give this a quick look:
It's just a text file, similar to robots.txt, but for AI crawlers rather than search engine ones. Probably not very effective as of now, but at least it's a way to make it clear you don't consent to your site being used for AI training, without making the site worse for human users in the process.
…#fucv4ya) @mckinley @prologic Yes, I agree the website itself sucks and the company behind it is incompetent at best, even more so with their other websites.
Their first site (haveibeentrained.com)…
…#fucv4ya) @thecanine @prologic @eldersnake @mckinley This page is just a terrible joke. Great writeup, mckinley! Exactly my thoughts, but you forgot to mention that you see zero contents unless you sc…
(#fucv4ya) @lyse Their crawler does not read robots.txt and, to my knowledge, neither do any other AI crawlers. As always, they consider themselves exempt from everything they find inconvenient.
…#fucv4ya) @mckinley Haha, right. They might have figured that everybody is just using `*` anyway. :-D Evidence from logs suggests "Spawning-AI".
Yup, @thecanine, I thought so, too. Reminds me a bit o…
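
For anyone curious how a rule aimed at one crawler differs from the catch-all `*`, here is a minimal sketch using Python's standard `urllib.robotparser`. The rules and the example.org URL are made up for illustration. "Spawning-AI" is only the name suggested by the logs mentioned above, so treating it as the token the crawler would match against is an assumption, and as noted in the thread there is no indication the crawler honors robots.txt at all.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one named crawler, allow everyone else.
# "Spawning-AI" is the name seen in logs; whether the crawler actually
# matches rules against this token is an assumption.
ROBOTS_TXT = """\
User-agent: Spawning-AI
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.org/post.html"  # placeholder URL
    print(is_allowed(ROBOTS_TXT, "Spawning-AI", url))  # False
    print(is_allowed(ROBOTS_TXT, "Mozilla/5.0", url))  # True
```

This only shows what a well-behaved client would conclude from the file; it does nothing to stop a crawler that never requests it.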