LISTSERV - LIBLICENSE-L Archives

From: "Peter B. Hirtle" <[log in to unmask]>
Date: Wed, 20 Jul 2016 14:00:33 +0000

Joe, on question #1, I am not aware of any court cases that have yet tested TDM.

Perhaps the closest are the Google Books/HathiTrust cases that make it
clear that when you are not using the expressive content of a work,
there is no infringement.  The arguments are well laid-out in Jockers,
Matthew L. and Sag, Matthew and Schultz, Jason, Brief of Digital
Humanities and Law Scholars as Amici Curiae in Support of
Defendant-Appellees and Affirmance, (The Authors Guild, Inc., et al.,
v. Google, Inc., et al.) (Second Circuit) (July 10, 2014).

Available at SSRN: http://ssrn.com/abstract=2465413 or
http://dx.doi.org/10.2139/ssrn.2465413.  I can also recommend Matthew
Sag, Orphan Works As Grist for the Data Mill, 27 Berkeley Tech. L.J.
(2012).

Available at: http://scholarship.law.berkeley.edu/btlj/vol27/iss3/9 or
http://dx.doi.org/doi:10.15779/Z387M5B.

Your second question is a little harder.  While some argued during the
Napster cases that "there is no legal restriction on downloading that
content," I think that is far from established.  I suspect that the
opposite is likely to be the case now - but I haven't been tracking
the cases (primarily with regards to audio and videos) to speak to
this.

Peter B. Hirtle
Affiliate Fellow, Berkman Klein Center for Internet & Society, Harvard
University
[log in to unmask]
[log in to unmask]
[log in to unmask]
http://vivo.cornell.edu/display/individual23436
Copyright and Cultural Institutions: Guidelines for Digitization for
U.S. Libraries, Archives, and Museums:
http://hdl.handle.net/1813/14142


-----Original Message-----
From: Joseph Esposito <[log in to unmask]>
Date: Sat, 16 Jul 2016 15:08:15 -0400

I was puzzling over some of the copyright issues concerning text and
data mining (TDM) this week and wonder if anyone on this list can
offer some guidance. Basically, I have two questions:

1.  My understanding is that there is a growing body of court rulings
to the effect that TDM is not protected by copyright. That is,
machines/robots/spiders can mine full-text databases without
triggering a copyright claim. Are there any summary articles/blog
posts that lay out the current view of this?

2.  Related to this is a question that came up concerning SciHub and
other sites that reproduce scholarly content. While it may be a breach
of contract or illegal to upload content to SciHub and its brethren,
and it may be illegal for SciHub to display that content, there is no
legal restriction on downloading that content. The downloader, on the
other hand, cannot redisplay that content. If this is true, could a
TDM robot download articles from SciHub (or, for that matter, from
ResearchGate or Academia.edu) with impunity?

Can anybody help me here?

Joe Esposito

--
Joseph J. Esposito
Processed Media
[log in to unmask]
@josephjesposito
+Joseph Esposito