News, requests and patches for loang.net
* AI scrapping bot ban
@ 2023-08-16 14:19 Adolfo Santiago
  2023-08-17  8:30 ` Nguyễn Gia Phong
  0 siblings, 1 reply; 2+ messages in thread
From: Adolfo Santiago @ 2023-08-16 14:19 UTC (permalink / raw)
  To: ~cnx/loang

Hello, loang users.

I am writing to the list to propose a service-wide block on AI bots scraping
anything from loang services.

Recently I came across the GPTBot[1] documentation, which explains how to block
the bot if you don't want it to scrape anything.

The idea would be to deny the bot the possibility of scraping anything,
regardless of the loang service(s) you're using.

Hope to hear back from you.

Thank you for your time,
Adolph

[1]: https://platform.openai.com/docs/gptbot
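
For illustration, the opt-out described in the GPTBot documentation [1] is
a robots.txt rule addressed to the "GPTBot" user agent.  Below is a minimal
sketch in Python that checks what such a rule would permit.  The helper
name is_allowed, the other bot name and the test URLs are assumptions made
for this example; they are not taken from any loang configuration.

```python
from urllib.robotparser import RobotFileParser

# robots.txt content that disallows GPTBot everywhere.
# Agents without a matching entry remain unrestricted.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt lets user_agent fetch url (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(is_allowed("GPTBot", "https://loang.net/"))
    print(is_allowed("SomeOtherBot", "https://loang.net/"))
```

Note that robots.txt is only a request: it works if the crawler chooses
to honor it, and it has to be served by each service separately.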



* Re: AI scrapping bot ban
  2023-08-16 14:19 AI scrapping bot ban Adolfo Santiago
@ 2023-08-17  8:30 ` Nguyễn Gia Phong
  0 siblings, 0 replies; 2+ messages in thread
From: Nguyễn Gia Phong @ 2023-08-17  8:30 UTC (permalink / raw)
  To: Adolfo Santiago, ~cnx/loang

On 2023-08-16 at 16:19+02:00, Adolfo Santiago wrote:
> I am writing to the list to propose a service-wide block
> on AI bots scraping anything from loang services.
>
> Recently I came across the GPTBot[1] documentation,
> which explains how to block the bot
> if you don't want it to scrape anything.
>
> The idea would be to deny the bot the possibility
> of scraping anything, regardless of the loang service(s)
> you're using.
>
> [1]: https://platform.openai.com/docs/gptbot

Thanks for reminding me of this!
I saw this on fedi a few days ago and forgot about it.

Personally I don't object to scraping, but given that OpenAI
does it to mass-launder copyright, and given its oligopolistic power,
I am open to blocking it server-wide.

Until someone wants to opt in to the OpenAI dataset,
I will set up the firewall to block its IP ranges.

Do note that whatever one puts out on the public interwebs
will eventually be scraped.  A resource could be mirrored
by an archiving initiative and OpenAI could scrape from there.
Other big tech corps are also building their own LLMs.
Think of the block more as an act of protest, if anything.
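
As a rough sketch of what blocking by IP range amounts to, the Python
snippet below tests whether a client address falls inside a list of CIDR
ranges.  The ranges shown are documentation placeholders (RFC 5737), not
OpenAI's actual ranges, and is_blocked is a hypothetical helper.  A real
firewall would do this matching itself; this only illustrates the logic.

```python
import ipaddress

# Placeholder CIDR ranges reserved for documentation (RFC 5737).
# These are assumptions for the example, NOT the crawler's real ranges.
BLOCKED_RANGES = ["192.0.2.0/24", "198.51.100.0/24"]


def is_blocked(address: str, ranges=BLOCKED_RANGES) -> bool:
    """Return True if address lies inside any of the given CIDR ranges."""
    ip = ipaddress.ip_address(address)
    return any(ip in ipaddress.ip_network(cidr) for cidr in ranges)


if __name__ == "__main__":
    for addr in ("192.0.2.17", "203.0.113.5"):
        print(addr, "blocked" if is_blocked(addr) else "allowed")
```

Unlike robots.txt, this kind of block does not depend on the crawler's
cooperation, but the list has to be kept in sync with the published ranges.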



Code repositories for project(s) associated with this public inbox

	https://trong.loang.net/nixos-conf
	https://trong.loang.net/phylactery
	https://trong.loang.net/site
