Implicaciones legales del web scraping en el entrenamiento de modelos de inteligencia artificial generativa

Abstract

Web scraping is a technique for collecting data from the Internet and storing it in a database. Among other uses, it supplies training data for generative artificial intelligence models, and it has generated controversy worldwide because of its legal risks. This article analyzes the legal viability of web scraping techniques and addresses: contractual issues arising from websites' terms of service; the legal risks that follow from these techniques, in particular the use of protected works to train generative artificial intelligence models, personal data protection, and criminal implications; open source, open access, and Creative Commons licenses; and public domain data and data held by the Colombian State. The article aims to provide an initial theoretical framework for discussing web scraping in generative artificial intelligence models, given that, at the time of writing, regulatory and case-law development on the topic is still incipient.

Details

Title

Implicaciones legales del web scraping en el entrenamiento de modelos de inteligencia artificial generativa

Author

Pacheco Chaparro, Juan Manuel; Barrero Ramírez, Laura

Pages

167-189

Section

Artículos

Publication year

2024

Publication date

Jul-Dec 2024

Publisher

Universidad Externado de Colombia

ISSN

1657-1959

e-ISSN

2346-2116

Source type

Scholarly Journal

Language of publication

Spanish; Castilian

DOI

https://doi.org/10.18601/16571959.n38.07

ProQuest document ID

3095698206

© 2024. This work is published under https://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.
