Archive for the 'copyright' Category

Google books: A great reference tool and nothing more.

As a reference tool, Google Books is pretty good. You can do a normal search, and get as results any matching books that Google has indexed. With the recent burgeoning of Google deals with large and well known libraries (for example The NYPL, Oxford University Library, and Harvard Library), Google Books looks set to include the full text of a decent chunk of published works. This means it is now possible to effectively run a Google search on the content of a very large library, and have the results returned in a relevance ranked order with little snippets of text for context. It’s also possible to add the things you read to a “personal library”, assuming that you have a Google account, meaning that when you just have to find the poem you read in a book that includes the line ‘the stars carried the helpless one ribbed moon away’, you can search specifically in the books you have read.

There are a few implications of this technology, though, that are problematic. The first is that under the current law, Google is being sued for copyright infringement because they have to make a copy of the works they make searchable to create the search index. Normally I would think this was a reasonable use (even though technically it’s legal), but there is a loophole that I discovered yesterday that does make me slightly uneasy on behalf of all poets: The context that Google provides around the search terms in the results allows you to search for the next line of the poem, and for a short poem, it is relatively easy to read the whole thing. Admittedly this is a somewhat cumbersome process, and admittedly it is not likely that any poet will lose a sale out of it, but you see these snippets without direct attribution to the poet, if your search results come from an anthology, and this is a sad loss of a moral right for the poet involved.

The second problem is that this knowledge is tied up in a commercial corporation who by law has first responsibility to their shareholders, but by popular cachet is the source of information on the internet. Libraries are nervous about a monopoly on information, and while some may view this as just one more twist in the historical antipathy between libraries and Google, I think it is in line with the freedom of information principle that it should be available from more than one source, if possible.

The third issue is one that is close to my heart, and one that Sara and some of these comments got me thinking about. Google books are great if you already know what you are looking for, but if you don’t have some search terms already, it’s hopeless. More than that, though, there is no serendipity: you go, you type in some words, you find the book and either read it online, buy it, or reserve it at your local library, and you leave. You never get to see the book on the shelf next to it might also have been useful, or just walked past a display that might have had something interesting for other reasons. Now, chances are that some people wouldn’t have bothered to go find a book if they didn’t have Google books, but some of them would have. Improving serendipitous information encounters (i.e. online browsing of information sources) is something that attracts a lot of research attention (including my own, for a year), and some novel approaches. And to me it is this that is the real user experience failing of Google books — not that I don’t want to actually read online, not the copyright issues, but that their browsing experience is boring and cumbersome and smacks of an afterthought. Until Google can provide me the same rich browsing experience that an actual library or bookstore does, it will only be a reference tool.


Subscribe

Books

Web Pages

See my collection of web resources on del.icio.us

Journals and conferences

Disclaimer on reading materials

My list of blogs, web-pages, journals, conferences and books is by no means exhaustive, If you want more pointers, or if you think your favourite resource should be included, get in touch.

License

by-nc-sa.png
Some rights reserved.

Comment moderation

If it is your first time posting, your comment will automatically be held for my moderation -- I try get to these as soon as possible. After that, your comments will appear automatically. If your comment is on-topic and isn't abusing me or anyone else who comments, chances are I'll leave it alone. That said, I reserve the right to delete (or infinitely moderate) any comments that are abusive, spammy or otherwise irelevant.