From mboxrd@z Thu Jan 1 00:00:00 1970 Authentication-Results: mail-b.sr.ht; dkim=pass header.d=loang.net header.i=@loang.net Received: from tem.loang.net (tem.loang.net [37.205.11.127]) by mail-b.sr.ht (Postfix) with ESMTPS id 7303B11EFC9 for <~cnx/loang@lists.sr.ht>; Thu, 17 Aug 2023 08:30:21 +0000 (UTC) DKIM-Signature: a=rsa-sha256; bh=QG9fdIj7WUXluw1odz9hDxDQ+w0BfPoa10nRTFKEe2w=; c=relaxed/relaxed; d=loang.net; h=Subject:Subject:Sender:To:To:Cc:From:From:Date:Date:MIME-Version:Content-Type:Content-Type:Content-Transfer-Encoding:Reply-To:In-Reply-To:In-Reply-To:Message-Id:Message-Id:References:References:Autocrypt:Openpgp; i=@loang.net; s=default; t=1692261016; v=1; x=1692693016; b=XbbPyd4Y3u/8l8mxbITV1GYQ9hf93eAwaIJLAwfMUPm1gf6GQd0AiaHTpSGOHmA1v/cfFLw1 Y2L9becGw+ATFOP/pGTCKA9+FUey2XjFXT4ycXyOSR7ff/CC+yBE98ZURsLHXQQLP87fMAbVDp1 2kXYdRZPONm49fJxYNTxiPhhRDp7INf9Ox1LJ53h0/hLHAN8VyzHmvufMuDrwsVFIvfo+G535M/ Qt2p2KdbTXfNIj56aHEcjrieVqYG/YFamESTrHpbxO2MhsoyfaiOAu9hEqQtSabXxdzdSmcoOhT f2Jt1NZZg4DIkSfIKosa60f1yMZhrgV+zbCyQCYIQfgaA== Received: by tem.loang.net (envelope-sender ) with ESMTPS id 5b20a8a2; Thu, 17 Aug 2023 08:30:16 +0000 Content-Type: multipart/signed; boundary=4db76b4132c743b7862d594fc5c18f85fde477afa7d380e8e01533ab72f0; micalg=pgp-sha256; protocol="application/pgp-signature" Date: Thu, 17 Aug 2023 17:30:11 +0900 Subject: Re: AI scrapping bot ban To: "Adolfo Santiago" , <~cnx/loang@lists.sr.ht> From: =?utf-8?q?Nguy=E1=BB=85n_Gia_Phong?= Message-Id: References: In-Reply-To: --4db76b4132c743b7862d594fc5c18f85fde477afa7d380e8e01533ab72f0 Mime-Version: 1.0 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=UTF-8 On 2023-08-16 at 16:19+02:00, Adolfo Santiago wrote: > I write to the list in order to propose a wide-block > of AI bots scrapping anything from loang services. > > Recently I came accross the GPTBot[1] documentation, > where it says how to block the bot > if you don't want it to scrap anything. > > The idea would be to not allow the bot the posibility > of scrapping anything, regardless of the loang service(s) > you're using. > > [1]: https://platform.openai.com/docs/gptbot Thanks for reminding me of this! I saw this on fedi a few days ago and forgot about it. Personally I don't object scraping, but given OpenAI doing it to mass-launder copyright and its oligopolistic power, I am open to blocking it server-wide. Until anyone wants to opt into the OpenAI dataset, I will set up the firewall to block its IP ranges. Do note that whatever one puts out to the public interwebs will eventually be scraped. A resource could be mirrored by a archiving initiative and OpenAI could scrape from there. Other big tech corps are also building their own LLM. Think of the block more as an act of protest, if anything. --4db76b4132c743b7862d594fc5c18f85fde477afa7d380e8e01533ab72f0 Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iIQEABYIACwWIQSDiv4NVdwHTjYPlDqEtpzm8/a3ZwUCZN3alA4cY254QGxvYW5n Lm5ldAAKCRCEtpzm8/a3Z6DuAP0eP0Ct3Kpq19FE93jx2u0puoQOvfNMb/5bgBNQ 8QB4rgEA97+vDqq5iDzkiVw9grouwO42tZTB/0/BkhDWnZy9CgM= =7YW1 -----END PGP SIGNATURE----- --4db76b4132c743b7862d594fc5c18f85fde477afa7d380e8e01533ab72f0--