On 2023-08-16 at 16:19+02:00, Adolfo Santiago wrote: > I write to the list in order to propose a wide-block > of AI bots scrapping anything from loang services. > > Recently I came accross the GPTBot[1] documentation, > where it says how to block the bot > if you don't want it to scrap anything. > > The idea would be to not allow the bot the posibility > of scrapping anything, regardless of the loang service(s) > you're using. > > [1]: https://platform.openai.com/docs/gptbot Thanks for reminding me of this! I saw this on fedi a few days ago and forgot about it. Personally I don't object scraping, but given OpenAI doing it to mass-launder copyright and its oligopolistic power, I am open to blocking it server-wide. Until anyone wants to opt into the OpenAI dataset, I will set up the firewall to block its IP ranges. Do note that whatever one puts out to the public interwebs will eventually be scraped. A resource could be mirrored by a archiving initiative and OpenAI could scrape from there. Other big tech corps are also building their own LLM. Think of the block more as an act of protest, if anything.