Thursday, June 23, 2005
Metadata should be free
Here's the lede from Michael Rogers' story at MSNBC:
Several years ago journalist John Lenger told a remarkable story in the Columbia Journalism Review about teaching a journalism class at Harvard's extension school. He asked his young students to write a story about a Harvard land deal that occurred in 1732, but after a week of research, most came back with almost nothing substantial to report. The problem: They had done most of their research using the Internet, walking right past Harvard's library and archives, where the actual information could be found. When Lenger questioned their research methods, one student replied that she assumed that anything that was important in the world was already on the Internet.This connects to the effort in transforming "books into bytes" in Rogers' story, but I think it's about the poor job we've done in putting metadata about books, such as their indexes, online, making the discovery of these resources more likely.
Years ago, people said protection was needed against "theft" of software programs, and it took years for the countervailing attitude to appear in a strong, coherent way. Today, free and open software includes operating systems like Linux and Solaris, and applications for word processing, image manipulation, spreadsheets, presentations, software development and a thousand different uses. IBM, Sun, Nokia and other companies have taken IP rights and software that cost millions of dollars and put them into the public domain. We don't say such software is not creative and not worth "protecting," but its free distribution is more valuable to our society than locking it up. Enough people see this that the cultural effect is so far in advance of the laws it's kind of scary.
I'll skip over the arguments for scrapping patent laws that favor one side of this argument over the other. And there are many sites for discussing the similar issues around copyright. In the book world, people are beginning to see this, such as Cory Doctorow, who has posted his third book on the web in a variety of formats, saying "Share this book! That's what it's for." [1] The battleground goes beyond the legal arena to the cultural aspects, I would say. Just what are our obligations to creators and researchers? The IP pendulum needs to swing back to the middle in our minds as much as in our laws.
Metadata, creation and copyright have a tenuous connection to each other that ought to follow the open software model, to my way of thinking. Sure, there is intellectual effort that goes into the creation of an index, say, just as there is in the writing of a software application. But our society will benefit more — from wider, deeper, more accurate searching — when such information can be readily shared.
Metadata should be free. Metadata about non-electronic resources especially.
[1] Doctorow actually goes much further than simply allowing the electronic version of the book to be read for free, saying: "What's more, if you live in the developing world -- a country not on the World Bank's list of high-income countries -- you can do much more. You can make your own editions, charge money for them, make movies, translations, plays and anything else you care to, and charge whatever you want, without sending me one cent -- you don't even need my permission. See the FAQ for more. The only restriction is that you can't export your versions to the world's high-income countries where all my paying customers are. Deal? Deal."


