“Somewhere at Google there is a database containing 25 million books and nobody is allowed to read them.”
In August 2010, Google put out a blog post announcing that there were 129,864,880 books in the world. The company said they were going to scan them all.
Of course, it didn’t quite turn out that way. This particular moonshot fell about a hundred-million books short of the moon. What happened was complicated but how it started was simple: Google did that thing where you ask for forgiveness rather than permission, and forgiveness was not forthcoming. Upon hearing that Google was taking millions of books out of libraries, scanning them, and returning them as if nothing had happened, authors and publishers filed suit against the company, alleging, as the authors put it simply in their initial complaint, “massive copyright infringement.”
[...]
When Google started scanning, they weren’t actually setting out to build a digital library where you could read books in their entirety; that idea would come later. Their original goal was just to let you search books. For books in copyright, all they would show you were “snippets,” just a few sentences of context around your search terms. They likened their service to a card catalog.
Google thought that creating a card catalog was protected by “fair use,” the same doctrine of copyright law that lets a scholar excerpt someone’s else’s work in order to talk about it. “A key part of the line between what’s fair use and what’s not is transformation,” Google’s lawyer, David Drummond, has said. “Yes, we’re making a copy when we digitize. But surely the ability to find something because a term appears in a book is not the same thing as reading the book. That’s why Google Books is a different product from the book itself.”
It was important for Drummond to be right. Statutory damages for “willful infringement” of a copyright can run as high as $150,000 for each work infringed. Google’s potential liability for copying tens of millions of books could have run into the trillions of dollars. “Google had some reason to fear that it was betting the firm on its fair-use defense,” Pamela Samuelson, a law professor at UC Berkeley, wrote in 2011. Copyright owners pounced.
I'm not a huge fan of the concept of IP. In my work, I've always been in favor of sharing my codes with the community. Otherwise, they would just disappear into oblivion the day I stop using them.
Also, I sometimes end up with interesting snippets from this online google catalog, but to be honest, I never went as far as trying to find the original book. So, without full access to all the books, this catalog is not useful to me. However, I would gladly zap the original author of a book if I get to read their book online.
Do you use Google Books?
HN comments turn out to be the bomb, yet again:
HathiTrust, an organization built from the big academic libraries that participated in early book digitization efforts. The HathiTrust is better about letting the public read books that have actually fallen into the public domain. I have found many books that are "snippet view" only on Google Books but freely visible on HathiTrust.
reply
I used to use Google books, it was amazing, especially for reading out of copyright (pre 1923 or so) older texts.
Last time I looked at it, though, it was very restricted and almost completely unusable.
There's some other sources for these materials that people should know about - annas-archive.org, zlib.
reply
I used to use it all the time. I tend to stay away from google these days. It was great for research, but I sometimes got frustrated because the snippet didn't quite have everything I needed. I don't believe it ever led me to buy the actual book, but I know on a few occasions I then found the book in a library.
reply
“Somewhere at Google there is a database containing 25 million books and nobody is allowed to read them.”
as many as 25 million books should not be read, what is the real problem, these books should not be read.
reply
Not sure I understand your comment. You don't want people to have access to these books, regardless of being online or in their original format?
reply