Slashdot Log In
Non-Profit Org Claims Rights In Library Catalog Data
Posted by
timothy
on Thu Nov 13, 2008 03:48 PM
from the paging-dr-gracenote dept.
from the paging-dr-gracenote dept.
lamona writes "The main source of the bibliographic records that are carried in library databases is a non-profit organization called OCLC. Over the weekend OCLC 'leaked' its new policy that claims contractual rights in the subsequent uses of the data, uses such as downloading book information into Zotero or other bibliographic software. The policy explicitly forbids any use that would compete with OCLC. This would essentially rule out the creation of free and open databases of library content, such as the Open Library and LibraryThing. The library blogosphere is up in arms . But can our right to say: "Twain, Mark. The adventures of Tom Sawyer" be saved?"
Related Stories
[+]
Your Rights Online: Thomson Reuters Sues Over Open-Source Endnote-Alike Zotero 181 comments
Noksagt writes "Thomson Reuters, the owner of the Endnote reference management software, has filed a $10 million lawsuit and a request for injunction against the Commonwealth of Virginia. Virginia's George Mason University develops Zotero, a free and open source plugin to Mozilla Firefox that researchers may use to manage citations. Thomson alleges that GMU's Center for History and New Media reverse engineered Endnote and that the beta version of Zotero can convert (in violation of the Endnote EULA) the proprietary style files that are used by Endnote to format citations into the open CSL file format."
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
The library blogosphere is up in arms! (Score:5, Funny)
Hey! (Score:4, Funny)
Parent
Re:The library blogosphere is up in arms! (Score:5, Funny)
God is dead.
-Niet~*&%a~~NO CARRIER
Parent
Re:The library blogosphere is up in arms! (Score:4, Funny)
At best we have a long meeting about it.
Lets pray it never comes to that.
Parent
Take back the data! (Score:4, Interesting)
Jeez, has everyone here gone soft? Download it, repackage it, and give it to your friends. To hell with the law! I'm not saying screw over the authors but if it's been out more than 15 years, to hell with corporate interest then. Practice an act of civil disobedience. And as Mark Twain would say, "A man who carries a cat by the tail learns something he can learn in no other way." Tell these corporate bastards we're not going to pay anymore. It's their turn to give something back, rather than just take, take, take.
Re:Take back the data! (Score:5, Interesting)
"Tell these corporate bastards we're not going to pay anymore."
Completely within your rights as they stand now. Don't buy and don't receive -- simple.
"It's their turn to give something back, rather than just take, take, take."
They already do, it's called exchange. What is it you're willing to give them for their work? Oh yeah -- "To hell with the law!".
Parent
They can claim.... (Score:5, Insightful)
They can claim anything that they want, but they can't enforce property rights on something they don't own.
Re:They can claim.... (Score:5, Interesting)
That's like saying the Encyclopedia Brittannica can't enforce copyright on its stories, because they don't own the facts.
In other words: no, OCLC doesn't own the books, or the facts about them, but they do own the database.
But that's not even the issue (although you're forgiven for having to dig around to find the real issue, since the article summary above doesn't really say it). It's the fact that OCLC wants to be the only records database out there, and is trying to use legal force to stop libraries from sharing their records with anybody else.
Parent
Re:They can claim.... (Score:4, Insightful)
That's like saying the Encyclopedia Brittannica can't enforce copyright on its stories, because they don't own the facts.
It owns the "articles," true, but it can't prevent the "facts" from being transfered.
In other words: no, OCLC doesn't own the books, or the facts about them, but they do own the database.
Sort of true, the copyright in this case *only* applies to when original work incorporated in to the collection of facts, making the "collection" a copyrighted entity.
although you're forgiven for having to dig around to find the real issue
The arrogance of Slashdot posters astounds me. Most are all too quick to assume that someone does not know the specifics. You always lead with some a-hole comment intended to ad-hominem rather than rely on your own merit of argument.
Leave *me* out of your debate, address facts and issues alone, thank you very much.
At issue is a court ruling that a database collection of publicly known facts can comprise an original work. However, there must be original work involved, not merely the simple aggregation of data, but original work that augments the data.
A database of books based on the standard library card catalog is not something whose collection would be protected by copyright. If, however, they incorporated original work such as reviews and ratings, then the database could be protected by copyright.
So, when I say "they can claim what they want, but they can't enforce property rights on that which they do not own," the statement is true and accurate, so much so that it is inarguable. The issue is what constitutes an original work worthy of copyright protection.
Parent
Don't Steal My Information! (Score:5, Funny)
Serutan's Fun Factz #22583: Columbus is the capital of Ohio.
Serutan's Fun Factz #57661: The chemical formula for water is H20.
- - - - - - -
Policy for Use and Transfer of Serutan's Fun-Factz Records:
YOU ARE FREE:
1. To use, reproduce, incorporate into works and display Serutan's Fun Factz records.
2. To transfer Serutan's Fun Factz records of your libraryâ(TM)s, archiveâ(TM)s or museumâ(TM)s own holdings.
UNDER THE FOLLOWING CONDITIONS:
1. Noncommercial Use. Use of Serutan's Fun Factz records for commercial purposes requires a separate agreement with OCLC.
2. Noncommercial Transfer. Serutan's Fun Factz records may not be sold, sublicensed, or otherwise transferred for a fee, other economic gain or commercial purposes.
3. Attribution. Serutan encourages you to identify Serutan's Fun Factz as the source of Serutan's Fun Factz-derived records.
4. Reasonable Use. Use must not discourage the contribution of bibliographic and holdings data to Serutan's Fun Factz or substantially replicate the function, purpose, and/or size of Serutan's Fun Factz.
5. Modification. Serutan encourages you not to remove the Serutan's Fun Factz number, the link to the policy, and any other means of attribution from Serutan's Fun Factz-derived records.
6. Conveyance. The policy terms and conditions remain in effect following the transfer of Serutan's Fun Factz records.
Have a Fun-Factz-Filled day!
Import Library of Congress to Evergreen or Koha (Score:5, Interesting)
Some of the problems caused by OCLC can be avoided by using better tools. Evergreen [evergreen-ils.org], Koha [koha.org] are both feature-rich, open source integrated library systems. They're not just competitive, in many cases they are just plain better.
Another danger point is Metalib. The Z39.50 profiles are about the only advantage there, aside from the sales pitch. Those are public anyway and could easily be listed centrally by pooling resources to the tune of a few cents per month per participating organization.
However, all that is about the code and the article is about claims of ownership over database content. Well fortunately enough, data can be imported, exported and shared between systems like Koha or Evergreen without ever having anything to do with OCLC. Most libraries, even many library consortia, no longer have any catalogers. In those cases, import the metadata for the catalog from the Library of Congress, that's what it's there for...
OCLC Didn't Create the Records (Score:5, Informative)
OCLC stores the bibliographic records in its database, but it did not create the vast majority of them. The records were created by catalogers at thousands of libraries. These libraries contribute their records to OCLC so that they can be shared with other libraries, but never do they grant OCLC ownership of the records.
Okay, let me see if I got this right.... (Score:4, Interesting)
...OCLC is a business (sorry, non-profit) that has orchestrated a ginormous database of bibliographic data and summaries, which it then sells to libraries both on- and off-line.
Libraries that use and display these records are expected to indicate that they were provided by OCLC and cannot be re-copied en masse.
So far, I can't blame 'em. That's a huge database to just let slip away for free. However, I imagine that this part of the policy would make a few libraries upset:
Reasonable Use. Use must not discourage the contribution of bibliographic and holdings data to WorldCat or substantially replicate the function, purpose, and/or size of WorldCat.
Which, to me, translates as "If you use our database, you're not allowed to compete with us, period."
Re:Okay, let me see if I got this right.... (Score:5, Insightful)
This guy [aaronsw.com], peeled from the Wikipedia list of comments, seems to summarize the real problem here better than I'd guessed:
At least folks could build an alternative to OCLC. So that's what I and others have been doing -- Open Library provides a free collection of over 20 million book records that anyone can browse, download, contribute to, and reuse for absolutely free. Naturally, OCLC hasn't been a fan. They've been trying to kill it from the beginning -- threatening its funders with lawsuits, insulting it in the press, and putting pressure on member libraries not to cooperate. (Again, notice the reversal: an organization libraries create to help them has now become so powerful that it is forcing libraries to help it.)
But recently, it's gone one step way too far. Not satisfied with controlling the world's largest source of book information, it wants to take over all the smaller ones as well. It's now demanding that every library that uses WorldCat give control over all its catalog records to OCLC. It literally is asking libraries to put an OCLC policy notice on every book record in their catalog. It wants to own every library.
Basically, they're feeling threatened by the Internet, they've locked Google and Yahoo out of their web-based records, and they don't want the records (which member libraries actually paid them to contribute to) being given away to anybody else.
Pooh on them. If this keeps up, it looks like they're liable to be replaced by something smaller, faster, and free-er that uses the Internet. Like the RIAA, they're being dangerously slow to embrace the new technology so widely used by their own customers. Unlike the RIAA, they stand a good chance of being completely circumvented if small libraries decide they'd rather share their records with someone like Google.
Parent
Re:Okay, let me see if I got this right.... (Score:5, Informative)
Parent
other crooked OCLC behaviors (Score:5, Interesting)
This comes as no surprise to me. I work for a small record label that provides a streaming audio service to about 150 colleges and institutions. Many of our clients like to have information about our content stored in their institutional catalog/OPAC.
The thing is, these catalog systems pretty much only accept MARC-formatted records. The MARC format is kind of obscure, and it's nothing we want to generate ourselves, so we provide CSV data to OCLC and they convert it to MARC format for us.
The amazing part of the racket they're running is that we have to *pay* OCLC to make these records for us, and then they turn around and require *another* payment from anyone who wants to use the records.
We aren't even entitled to our own copy of the data they've converted for us. Presumably, if we wanted it, we'd have to purchase it from the people we gave it to in the first place. It's needless to say, but we also don't see any kind of profit sharing from OCLC when 150 libraries each purchase thousands of these records.
Re:other crooked OCLC behaviors (Score:4, Interesting)
Assuming your metadata isn't too complicated (and since you're using CSV, I'm assuming it isn't), it should be very simple to convert it to MARC using MARC4J [tigris.org]. I'm not sure if there are similar libraries for Perl or other languages.
In fact, I'd be happy to help you with this, since it's pretty ridiculous to be charging for such a simple service. You can email me at escowles [at] ucsd.edu.
-Esme
Parent
old story, OCLC at it again (Score:5, Interesting)
I am a systems librarian (librarian who is in charge of the servers and systems) who has dealt with OCLC for thirty years. They tried to do this with libraries as well, claiming ownership of information that has, for the most part, been contributed by libraries themselves. OCLC does very little original cataloguing. It's mostly value-added stuff by little podunk, and a few large, libraries all over the world. They're going to have a hard time asserting their so-called rights here and the quite substantial 'library community' is not going to be on their side.
One note here: Several have already asserted that open source integrated library systems (ILS) projects are 'superior' to OCLC. You are comparing apples and oranges. KOHA is an ILS. It is NOT a bibliographic utility. KOHA (along with Dynix, Sirsi, Gaylord, VTLS, and a few others) provides a suite of programs to manage library collections and inventory, allow the check out and in of books and materials, provide an online public catalog, send overdue notices--that sort of thing. They are, by and large, local to and managed by a library system (which is exactly what I did for years), though there are many libraries which share such systems on a regional basis as well.
OCLC is a BIBLIOGRAPHIC utility, though they also dabble in other things such as acquisitions, collection analyses, and interlibrary loans. They are responsible for keeping records of books and materials in standard formats such as MARC (Machine Readable Cataloguing, a format originally designed to transport bibliographic records via 9-track tape, i.e.: it is a 'serially organized' database making use of tags and sub-tags to parse the data.) which are then made available to other libraries. This provides the kind of centralization that means 16,000 libraries don't have to all individually catalog the same book. Once is sufficient. Every ILS has an interface to OCLC that allows them to grab records and download them to the local system--as well as upload original cataloging to OCLC (a crucial point, I think.) Every library that owns a particular title attached their own identifier to the main record, which is what makes OCLC a good source for interlibrary loan information. In a sense, OCLC is the world's online catalog, but it DOES NOT displace the local OPAC. (Online Public Access catalog).
Now, places like librarything.com get their records from a variety of places, including Amazon, well known for crap-quality bibliographic records, and any number of universities and large library systems around the world. OCLC would be hard-pressed to 'prove' records in place at librarything originated with OCLC, much less that they are 'owned' by OCLC. In other words, OCLC can be easily circumvented.
With the demise of the smaller bibliographic utilities such as WLN (The Washington, then Western Library Network) OCLC has achieved world domination in some sense, but it is also a membership organization with library representation on its board and governing committees. Having seen OCLC try this crap before, my take on it is that it won't fly. I wouldn't worry about it.
ASIN (Score:5, Interesting)
With everything going online, there is no longer a need for a linear sequencing of all human knowledge. It's all hypertext and keyword-based. So when I say "ASIN" database, I mean not just title and author, but also keywords, summaries, and maybe even recommended similar books and customer reviews. Amazon would still retain its well-oiled shipping system, but it would be in a position to define all of human knowledge in a finer way than Google currently does.
Re:DDS (Score:5, Funny)
Parent
ObFuturama (Score:4, Funny)
"Dewey, You Fool! Your Decimal System Has Played Right Into My Hands! Ha Ha Ha Ha!"
Although I guess OCLC is saying that instead of the giant brains.
Parent
Re:ObAl (Score:4, Funny)
[Conan the Librarian lifts the man up with his bare hands]
Conan the Librarian: Don't you know the Dewey Decimal System?
Parent
Re:DDS (Score:5, Insightful)
Um, anybody who knew more than what they learned in elementary school about the DDC (it's actually called the Dewey Decimal Classification) to begin with probably knew that. Admittedly, that's not very many people, unfortunately. I understand why a lot of people question IP laws in general, but I don't understand why so many people are surprised to find out that the DDC is a piece of IP like any other.
Now, the fact that one needs to pay to get the full version of the Library of Congress Classification (LCC) confuses me a little more, since it's actually a governement-created resource. Well, actually I guess I do know. LC, and especially it's under-appreciated traditional services, like cataloging, classification and authority control are so underfunded that they actually need to charge money to libraries to keep those projects alive. Alas.
Parent
Re:DDS (Score:4, Insightful)
Yes there is. A lot of hard work performed, most likely by "member" (read "customer") libraries, goes into those records. Since OCLC is the only game in town and this behavior is clearly anti-competitive, with little benefit to consumers, they could run afoul of anti-trust laws.
Parent
Re:First of all... (Score:5, Funny)
Circuit city seems to be doing its best.
Parent