A Deeper Look into the EU Text and Data Mining Exceptions: Harmonisation, Data Ownership, and the Future of Technology

GRUR International, 71(8), 2022, 685–701

17 Pages Posted: 19 Jul 2021 Last revised: 16 Dec 2022

See all articles by Thomas Margoni

Thomas Margoni

Centre for IT & IP Law (CiTiP), Faculty of Law - KU Leuven

Martin Kretschmer

University of Glasgow - School of Law

Date Written: July 14, 2021

Abstract

This paper focuses on the two exceptions for text and data mining (TDM) introduced in the Directive on Copyright in the Digital Single Market (CDSM). While both are mandatory for Member States, Art. 3 is also imperative and finds application in cases of text and data mining for the purpose of scientific research by research and cultural institutions; Art. 4, on the other hand, permits text and data mining by anyone but with rightholders able to ‘contract-out’ (Art. 4). We trace the context of using the lever of copyright law to enable emerging technologies such as AI and the support innovation. Within the EU copyright intervention, elements that may underpin a transparent legal framework for AI are identified, such as the possibility of retention of permanent copies for further verification. On the other hand, we identify several pitfalls, including an exces- sively broad definition of TDM which makes the entire field of data-driven AI development dependent on an exception. We analyse the implications of limiting the scope of the exceptions to the right of reproduction; we argue that the limitation of Art. 3 to certain beneficiaries remains problematic; and that the requirement of lawful access is difficult to operationalize. In conclusion, we argue that there should be no need for a TDM exception for the act of extracting informational value from protected works. The EU’s CDSM provisions para- doxically may favour the development of biased AI systems due to price and accessibility conditions for train- ing data that offer the wrong incentives. To avoid licensing, it may be economically attractive for EU-based developers to train their algorithms on older, less accurate, biased data, or import AI models already trained abroad on unverifiable data.

Keywords: text mining, data mining, TDM, copyright, artificial intelligence, machine learning, EU, Directive on copyright in the digital single market

JEL Classification: D23, K4, H41, K11, L88

Suggested Citation

Margoni, Thomas and Kretschmer, Martin, A Deeper Look into the EU Text and Data Mining Exceptions: Harmonisation, Data Ownership, and the Future of Technology (July 14, 2021). GRUR International, 71(8), 2022, 685–701, Available at SSRN: https://ssrn.com/abstract=3886695 or http://dx.doi.org/10.2139/ssrn.3886695

Thomas Margoni (Contact Author)

Centre for IT & IP Law (CiTiP), Faculty of Law - KU Leuven ( email )

Brussels
Belgium

Martin Kretschmer

University of Glasgow - School of Law ( email )

CREATe, School of Law
10 The Square
Glasgow, G12 8QQ
United Kingdom

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
1,093
Abstract Views
3,125
Rank
39,643
PlumX Metrics