* AI scrapping bot ban
@ 2023-08-16 14:19 Adolfo Santiago
2023-08-17 8:30 ` Nguyễn Gia Phong
0 siblings, 1 reply; 2+ messages in thread
From: Adolfo Santiago @ 2023-08-16 14:19 UTC (permalink / raw)
To: ~cnx/loang
[-- Attachment #1: Type: text/plain, Size: 489 bytes --]
Hello, loang users.
I write to the list in order to propose a wide-block of AI bots scrapping
anything from loang services.
Recently I came accross the GPTBot[1] documentation, where it says how to block
the bot if you don't want it to scrap anything.
The idea would be to not allow the bot the posibility of scrapping anything,
regardless of the loang service(s) you're using.
Hope to hear back from you.
Thank you for your time,
Adolph
[1]: https://platform.openai.com/docs/gptbot
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 228 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: AI scrapping bot ban
2023-08-16 14:19 AI scrapping bot ban Adolfo Santiago
@ 2023-08-17 8:30 ` Nguyễn Gia Phong
0 siblings, 0 replies; 2+ messages in thread
From: Nguyễn Gia Phong @ 2023-08-17 8:30 UTC (permalink / raw)
To: Adolfo Santiago, ~cnx/loang
[-- Attachment #1: Type: text/plain, Size: 1150 bytes --]
On 2023-08-16 at 16:19+02:00, Adolfo Santiago wrote:
> I write to the list in order to propose a wide-block
> of AI bots scrapping anything from loang services.
>
> Recently I came accross the GPTBot[1] documentation,
> where it says how to block the bot
> if you don't want it to scrap anything.
>
> The idea would be to not allow the bot the posibility
> of scrapping anything, regardless of the loang service(s)
> you're using.
>
> [1]: https://platform.openai.com/docs/gptbot
Thanks for reminding me of this!
I saw this on fedi a few days ago and forgot about it.
Personally I don't object scraping, but given OpenAI
doing it to mass-launder copyright and its oligopolistic power,
I am open to blocking it server-wide.
Until anyone wants to opt into the OpenAI dataset,
I will set up the firewall to block its IP ranges.
Do note that whatever one puts out to the public interwebs
will eventually be scraped. A resource could be mirrored
by a archiving initiative and OpenAI could scrape from there.
Other big tech corps are also building their own LLM.
Think of the block more as an act of protest, if anything.
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 248 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2023-08-17 8:30 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-08-16 14:19 AI scrapping bot ban Adolfo Santiago
2023-08-17 8:30 ` Nguyễn Gia Phong
Code repositories for project(s) associated with this public inbox
https://trong.loang.net/nixos-conf
https://trong.loang.net/phylactery
https://trong.loang.net/site
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for IMAP folder(s).