Category Archives: library

the open library

A little while back I was musing on the possibility of a People’s Card Catalog, a public access clearinghouse of information on all the world’s books to rival Google’s gated preserve. Well thanks to the Internet Archive and its offshoot the Open Content Alliance, it looks like we might now have it – ?or at least the initial building blocks. On Monday they launched a demo version of the Open Library, a grand project that aims to build a universally accessible and publicly editable directory of all books: one wiki page per book, integrating publisher and library catalogs, metadata, reader reviews, links to retailers and relevant Web content, and a menu of editions in multiple formats, both digital and print.

Imagine a library that collected all the world’s information about all the world’s books and made it available for everyone to view and update. We’re building that library.

The official opening of Open Library isn’t scheduled till October, but they’ve put out the demo now to prove this is more than vaporware and to solicit feedback and rally support. If all goes well, it’s conceivable that this could become the main destination on the Web for people looking for information in and about books: a Wikipedia for libraries. On presentation of public domain texts, they already have Google beat, even with recent upgrades to the GBS system including a plain text viewing option. The Open Library provides TXT, PDF, DjVu (a high-res visual document browser), and its own custom-built Book Viewer tool, a digital page-flip interface that presents scanned public domain books in facing pages that the reader can leaf through, search and (eventually) magnify.
Page turning interfaces have been something of a fad recently, appearing first in the British Library’s Turning the Pages manuscript preservation program (specifically cited as inspiration for the OL Book Viewer) and later proliferating across all manner of digital magazines, comics and brochures (often through companies that you can pay to convert a PDF into a sexy virtual object complete with drag-able page corners that writhe when tickled with a mouse, and a paper-like rustling sound every time a page is turned).
This sort of reenactment of paper functionality is perhaps too literal, opting for imitation rather than innovation, but it does offer some advantages. Having a fixed frame for reading is a relief in the constantly scrolling space of the Web browser, and there are some decent navigation tools that gesture toward the ways we browse paper. To either side of the open area of a book are thin vertical lines denoting the edges of the surrounding pages. Dragging the mouse over the edges brings up scrolling page numbers in a small pop-up. Clicking on any of these takes you quickly and directly to that part of the book. Searching is also neat. Type a query and the book is suddenly interleaved with yellow tabs, with keywords highlighted on the page, like so:

But nice as this looks, functionality is sacrificed for the sake of fetishism. Sticky tabs are certainly a cool feature, but not when they’re at the expense of a straightforward list of search returns showing keywords in their sentence context. These sorts of references to the feel and functionality of the paper book are no doubt comforting to readers stepping tentatively into the digital library, but there’s something that feels disjointed about reading this way: that this is a representation of a book but not a book itself. It is a book avatar. I’ve never understood the appeal of those Second Life libraries where you must guide your virtual self to a virtual shelf, take hold of the virtual book, and then open it up on a virtual table. This strikes me as a failure of imagination, not to mention tedious. Each action is in a sense done twice: you operate a browser within which you operate a book; you move the hand that moves the hand that moves the page. Is this perhaps one too many layers of mediation to actually be able to process the book’s contents? Don’t get me wrong, the Book Viewer and everything the Open Library is doing is a laudable start (cause for celebration in fact), but in the long run we need interfaces that deal with texts as native digital objects while respecting the originals.
What may be more interesting than any of the technology previews is a longish development document outlining ambitious plans for building the Open Library user interface. This covers everything from metadata standards and wiki templates to tagging and OCR proofreading to search and browsing strategies, plus a well thought-out list of user scenarios. Clearly, they’re thinking very hard about every conceivable element of this project, including the sorts of things we frequently focus on here such as the networked aspects of texts. Acolytes of Ted Nelson will be excited to learn that a transclusion feature is in the works: a tool for embedding passages from texts into other texts that automatically track back to the source (hypertext copy-and-pasting). They’re also thinking about collaborative filtering tools like shared annotations, bookmarking and user-defined collections. All very very good, but it will take time.
Building an open source library catalog is a mammoth undertaking and will rely on millions of hours of volunteer labor, and like Wikipedia it has its fair share of built-in contradictions. Jessamyn West of librarian.net put it succinctly:

It’s a weird juxtaposition, the idea of authority and the idea of a collaborative project that anyone can work on and modify.

But the only realistic alternative may well be the library that Google is building, a proprietary database full of low-quality digital copies, a semi-accessible public domain prohibitively difficult to use or repurpose outside the Google reading room, a balkanized landscape of partner libraries and institutions left in its wake, each clutching their small slice of the digitized pie while the whole belongs only to Google, all of it geared ultimately not to readers, researchers and citizens but to consumers. Construed more broadly to include not just books but web pages, videos, images, maps etc., the Google library is a place built by us but not owned by us. We create and upload much of the content, we hand-make the links and run the search queries that program the Google brain. But all of this is captured and funneled into Google dollars and AdSense. If passive labor can build something so powerful, what might active, voluntary labor be able to achieve? Open Library aims to find out.

of shelves and selves

William Drenttel has a lovely post over on Design Observer about the exquisite information of bookshelves, a meditation spurred by 60 photographs of the library of renowned San Francisco designer, typographer, printer and founder of Greenwood Press Jack Stauffacher. Each image (they were taken by Dennis Letbetter) gives a detailed view of one section of Stauffacher’s shelves, a rare glimpse of one individual’s bibliographic DNA, made browseable as a slideshow (unfortunately, the images are not reassembled at the end to give a full view of the collection).

Early evidence suggests that the impulse toward personal mapping through media won’t abate as we go deeper into the digital. Delicious Library and Library Thing are more or less direct transpositions of physical shelves to the computer environment, the latter with an added social dimension (people meeting through their virtual shelves). More generally, social networking sites from Facebook to MySpace are full of self-signification through shelves, or rather lists, of favorite books, movies and music. Social bookmarking sites too bear traces of identity in the websites people save and tag (the tags themselves are a kind of personal signature). Much of the texture and spatial language of the physical may be lost, a new social terrain has opened up, one which we’re only beginning to understand.
But it’s not as though physical bookshelves haven’t always been social. We arrange books not only for our own conceptual orientation, but to give others who venture into our space a sense of our self (or what we’d like to appear as our self), our distinct intellectual algorithm. Browsing a friend’s thoughtfully arranged shelf is like looking through a lens calibrated to their view of the world, especially when those books have played a crucial role, as in Stauffacher’s, in shaping a life’s work. Drenttel savors the idiosyncrasies that inevitably are etched into such a collection:

I have seen many great rare book libraries…. But the libraries I most enjoy are working libraries, where the books have been used and cited and annotated – first editions marred with underlining, notes throughout their pages. (I will always remember the chaos of Susan Sontag’s library, where every book had been touched, read and filled with notes and ephemera.) The organization of a working library is seldom alphabetical…but rather follows some particular mental construct of its owner. Jack Stauffacher’s shelves have some order, one knows. But it is his order, his life.

Or, in Stauffacher’s own words:

Without this working library, I would have no compass, no map, to guide me through the density of our human condition.

six blind men and an elephant

Thomas Mann, author of The Oxford Guide to Library Research, has published an interesting paper (pdf available) examining the shortcomings of search engines and the continued necessity of librarians as guides for scholarly research. It revolves around the case of a graduate student investigating tribute payments and the Peloponnesian War. A Google search turns up nearly 80,000 web pages and 700 books. An overwhelming retrieval with little in the way of conceptual organization and only the crudest of tools for measuring relevance. But, with the help of the LC Catalog and an electronic reference encyclopedia database, Mann manages to guide the student toward a manageable batch of about a dozen highly germane titles.
Summing up the problem, he recalls a charming old fable from India:

Most researchers – at any level, whether undergraduate or professional – who are moving into any new subject area experience the problem of the fabled Six Blind Men of India who were asked to describe an elephant: one grasped a leg and said “the elephant is like a tree”; one felt the side and said “the elephant is like a wall”; one grasped the tail and said “the elephant is like a rope”; and so on with the tusk (“like a spear”), the trunk (“a hose”) and the ear (“a fan”). Each of them discovered something immediately, but none perceived either the existence or the extent of the other important parts – or how they fit together.
Finding “something quickly,” in each case, proved to be seriously misleading to their overall comprehension of the subject.
In a very similar way, Google searching leaves remote scholars, outside the research library, in just the situation of the Blind Men of India: it hides the existence and the extent of relevant sources on most topics (by overlooking many relevant sources to begin with, and also by burying the good sources that it does find within massive and incomprehensible retrievals). It also does nothing to show the interconnections of the important parts (assuming that the important can be distinguished, to begin with, from the unimportant).

Mann believes that books will usually yield the highest quality returns in scholarly research. A search through a well tended library catalog (controlled vocabularies, strong conceptual categorization) will necessarily produce a smaller, and therefore less overwhelming quantity of returns than a search engine (books do not proliferate at the same rate as web pages). And those returns, pound for pound, are more likely to be of relevance to the topic:

Each of these books is substantially about the tribute payments – i.e., these are not just works that happen to have the keywords “tribute” and “Peloponnesian” somewhere near each other, as in the Google retrieval. They are essentially whole books on the desired topic, because cataloging works on the assumption of “scope-match” coverage – that is, the assigned LC headings strive to indicate the contents of the book as a whole….In focusing on these books immediately, there is no need to wade through hundreds of irrelevant sources that simply mention the desired keywords in passing, or in undesired contexts. The works retrieved under the LC subject heading are thus structural parts of “the elephant” – not insignificant toenails or individual hairs.

If nothing else, this is a good illustration of how libraries, if used properly, can still be much more powerful than search engines. But it’s also interesting as a librarian’s perspective on what makes the book uniquely suited for advanced research. That is: a book is substantial enough to be a “structural part” of a body of knowledge. This idea of “whole books” as rungs on a ladder toward knowing something. Books are a kind of conceptual architecture that, until recently, has been distinctly absent on the Web (though from the beginning certain people and services have endeavored to organize the Web meaningfully). Mann’s study captures the anxiety felt at the prospect of the book’s decline (the great coming blindness), and also the librarian’s understandable dread at having to totally reorganize his/her way of organizing things.
It’s possible, however, to agree with the diagnosis and not the prescription. True, librarians have gotten very good at organizing books over time, but that’s not necessarily how scholarship will be produced in the future. David Weinberg ponders this:

As an argument for maintaining human expertise in manually assembling information into meaningful relationships, this paper is convincing. But it rests on supposing that books will continue to be the locus of worthwhile scholarly information. Suppose more and more scholars move onto the Web and do their thinking in public, in conversation with other scholars? Suppose the Web enables scholarship to outstrip the librarians? Manual assemblages of knowledge would retain their value, but they would no longer provide the authoritative guide. Then we will have either of two results: We will have to rely on “‘lowest common denominator'”and ‘one search box/one size fits all’ searching that positively undermines the requirements of scholarly research”…or we will have to innovate to address the distinct needs of scholars….My money is on the latter.

As I think is mine. Although I would not rule out the possibility of scholars actually participating in the manual assemblage of knowledge. Communities like MediaCommons could to some extent become their own libraries, vetting and tagging a wide array of electronic resources, developing their own customized search frameworks.
There’s much more in this paper than I’ve discussed, including a lengthy treatment of folksonomies (Mann sees them as a valuable supplement but not a substitute for controlled taxonomies). Generally speaking, his articulation of the big challenges facing scholarly search and librarianship in the digital age are well worth the read, although I would argue with some of the conclusions.

johannes who?

This is the oldest existing document in the world printed with metal movable type: an anthology of Zen teachings, Goryeo Dynasty, Korea… 1377. It’s a little known fact, at least in the West, that movable type was first developed in Korea circa 1230, over 200 years before that goldsmith from Mainz came on the scene. I saw this today in the National Library of Korea in Seoul (more on that soon). This book is actually a reproduction. The original resides in Paris and is the subject of a bitter dispute between the French and Korean governments.

talk at brooklyn college library

If you’re in the New York area, this Wednesday I’ll be giving a talk at an event organized by the Brooklyn College Library called “It’s All About the Book.” Also speaking will be Jason Epstein, one of the all-time great innovators in print publishing, and founder most recently of On Demand Books. Talks will be followed by a tour of “books as art” installations by a number of local artists, curators and librarians. It promises to be an interesting event, well worth the trek to Flatbush.

the people’s card catalog (a thought)

New partners and new features. Google has been busy lately building up Book Search. On the institutional end, Ghent, Lausanne and Mysore are among the most recent universities to hitch their wagons to the Google library project. On the user end, the GBS feature set continues to expand, with new discovery tools and more extensive “about” pages gathering a range of contextual resources for each individual volume.
Recently, they extended this coverage to books that haven’t yet been digitized, substantially increasing the findability, if not yet the searchability, of thousands of new titles. The about pages are similar to Amazon’s, which supply book browsers with things like concordances, “statistically improbably phrases” (tags generated automatically from distinct phrasings in a text), textual statistics, and, best of all, hot-linked lists of references to and from other titles in the catalog: a rich bibliographic network of interconnected texts (Bob wrote about this fairly recently). Google’s pages do much the same thing but add other valuable links to retailers, library catalogues, reviews, blogs, scholarly resources, Wikipedia entries, and other relevant sites around the net (an example). Again, many of these books are not yet full-text searchable, but collecting these resources in one place is highly useful.
It makes me think, though, how sorely an open source alternative to this is needed. Wikipedia already has reasonably extensive articles about various works of literature. Library Thing has built a terrific social architecture for sharing books. There are a great number of other freely accessible resources around the web, scholarly database projects, public domain e-libraries, CC-licensed collections, library catalogs.
Could this be stitched together into a public, non-proprietary book directory, a People’s Card Catalog? A web page for every book, perhaps in wiki format, wtih detailed bibliographic profiles, history, links, citation indices, social tools, visualizations, and ideally a smart graphical interface for browsing it. In a network of books, each title ought to have a stable node to which resources can be attached and from which discussions can branch. So far Google is leading the way in building this modern bibliographic system, and stands to turn the card catalogue of the future into a major advertising cash nexus. Let them do it. But couldn’t we build something better?

samizdat express

In his latest NY Times column, Edward Rothstein meditates on the vastness of the public domain and the pleasures of skimming it in simple digital editions prepared by B+R Samizdat Express. Since 1993 B+R, run by Barbara and Richard Seltzer of West Roxbury, Massachusetts, has been selling bundles of plain text (ASCII) digital literature scooped from Project Gutenberg and arranged by theme, genre or period into anthologies — first on floppy disc, and now on CD-ROM and DVD. It’s all stuff you can get for free by grazing the web’s various public domain repositories, but B+R have done the work of harvesting and sorting and they’ll ship these multi-shelf-spanning chunks to you for the price of a single print volume. Browse through nearly 200 book collections they’ve assembled so far and you’ll find packages ranging from “Anthropology and Myth” ($19), “Works of Guy de Maupassant” ($12), or “The American Revolution and Early Republic as witnessed by Mercy Warren and Others” ($19). Some works are provided in audio through text-to-voice conversion software.
As Rothstein notes, the bare-bones formatting and sheer volume of the anthologies makes these works hard to digest, but there’s no doubt B+R provides a valuable service, especially for people in places where books are scarce and net access unreliable. All in all, it’s an e-book advocate’s playground but more of a hallucinogenic head trip for the average reader — a way to sample vastness. It does make one’s wheels start to turn, though, on what other elucidating layers could be built on top of the vast murk of the digital library.

the new harpers.org

Harper’s has a new web concept designed by Paul Ford of F Train. History bears heavily on the refurbished site, almost overwhelmingly — especially compared to the stripped-down affair that preceded it. But considering that Harper’s has a more than ordinary amount of history to cart around — at 157 years, it’s the oldest general interest monthly in the United States — it makes sense that Ford and the editors had time on the brain. A journal that has published continuously since before the Civil War, on through Reconstruction, the Gilded Age, WWI, the Great Depression, WWII, civil rights, the 60s, the Cold War, right up to the present carries a hefty chunk of the national memory — and a lot of baggage, good and bad. So it’s fitting that the new design is packed with dates, inviting readers to dig into the past while also surveying the present. I can’t think of another news site in which the archives mingle so promiscuously with the front page spread. The result is a site that feels as much like a library as a periodical.

Directly beneath the title banner and above stories from the current issue is a highly compressed archive navigation, three rows tall. On the top row, Harper’s 16 decades fan out from left to right. Below them are the ten years of a given decade. Below that, the twelve months of a given year. Thus, every issue of Harper’s ever printed is just three clicks away. Of course, you need a subscription to view most of the content. (A hint, though: articles between the 1850 debut issue and 1899 are all available for free at the website of Cornell’s Making of America project, which undertook the task of scanning the first half-century’s worth of Harper’s.)
Clearly, the editors have been thinking a great deal about how to use the web to bring Harper’s‘ long, winding paper trail into the light and into use. The new design may be a little over-freighted, but shine light it does. By placing current events in such close proximity with the past, things are nested in a historical context — a refreshing expansion of scope next to the perpetual present of the 24-hour news cycle. Already there are a few features that help connect the dots. One is “topic pages” that allow readers to track particular subjects through the archive. Take a look, for example, at this trail of links for “South Africa”:

4 Images from 1983 to 2001
67 Articles from 1850 to 2007
2 Cartoons from 1985
44 Events from 2000 to 2007
10 Facts from 1999 to 2006
4 Stories from 1888 to 1983
2 Jokes from 1881 to 1912
4 Photographs from 1987 to 2001
1 Poem from 1883
6 Reviews from 1887 to 2005

A smart next step would be to let readers trace, tag and document their own research trails and share those with other readers. This could be an added incentive for a new generation of Harper’s subscribers: access not only to an invaluable historical archive but to a social architecture in which communities and individuals could interpret that archive and bring it into conversation with the contemporary.

emerging libraries at rice: day one

For the next few days, Bob and I will be at the De Lange “Emerging Libraries” conference hosted by Rice University in Houston, TX, coming to you live with occasional notes, observations and overheard nuggets of wisdom. Representatives from some of the world’s leading libraries are here: the Library of Congress, the British Library, the new Bibliotheca Alexandrina, as well as the architects of recent digital initiatives like the Internet Archive, arXiv.org and the Public Library of Science. A very exciting gathering indeed.
We’re here, at least in part, with our publisher hat on, thinking quite a lot these days about the convergence of scholarly publishing with digital research infrastructure (i.e. MediaCommons). It was fitting then that the morning kicked off with a presentation by Richard Baraniuk, founder of the open access educational publishing platform Connexions. Connexions, which last year merged with the digitally reborn Rice University Press, is an innovative repository of CC-licensed courses and modules, built on an open volunteer basis by educators and freely available to weave into curricula and custom-designed collections, or to remix and recombine into new forms.
Connexions is designed not only as a first-stop resource but as a foundational layer upon which richer and more focused forms of access can be built. Foremost among those layers of course is Rice University Press, which, apart from using the Connexions publishing framework will still operate like a traditional peer review-driven university press. But other scholarly and educational communities are also encouraged to construct portals, or “lenses” as they call them, to specific areas of the Connexions corpus, possibly filtered through post-publication peer review. It will be interesting to see whether Connexions really will end up supporting these complex external warranting processes or if it will continue to serve more as a building block repository — an educational lumber yard for educators around the world.
Constructive crit: there’s no doubt that Connexions is one of the most important and path-breaking scholarly publishing projects out there, though it still feels to me more like backend infrastructure than a fully developed networked press. It has a flat, technical-feeling design and cookie cutter templates that give off a homogenous impression in spite of the great diversity of materials. The social architecture is also quite limited, and what little is there (ways to suggest edits and discussion forums attached to modules) is not well integrated with course materials. There’s an opportunity here to build more tightly knit communities around these offerings — lively feedback loops to improve and expand entries, areas to build pedagogical tutorials and to collect best practices, and generally more ways to build relationships that could lead to further collaboration. I got to chat with some of the Connexions folks and the head of the Rice press about some of these social questions and they were very receptive.

* * * * *

Michael A. Keller of Stanford spoke of emerging “cybraries” and went through some very interesting and very detailed elements of online library search that I’m too exhausted to summarize now. He capped off his talk with a charming tour through the Stanford library’s Second Life campus and the library complex on Information Island. Keller said he ultimately doesn’t believe that purely imitative virtual worlds will become the principal interface to libraries but that they are nonetheless a worthwhile area for experimentation.
Browsing during the talk, I came across an interesting and similarly skeptical comment by Howard Rheingold on a long-running thread on Many 2 Many about Second Life and education:

I’ve lectured in Second Life, complete with slides, and remarked that I didn’t really see the advantage of doing it in SL. Members of the audience pointed out that it enabled people from all over the world to participate and to chat with each other while listening to my voice and watching my slides; again, you don’t need an immersive graphical simulation world to do that. I think the real proof of SL as an educational medium with unique affordances would come into play if an architecture class was able to hold sessions within scale models of the buildings they are studying, if a biochemistry class could manipulate realistic scale-model simulations of protein molecules, or if any kind of lesson involving 3D objects or environments could effectively simulate the behaviors of those objects or the visual-auditory experience of navigating those environments. Just as the techniques of teleoperation that emerged from the first days of VR ended up as valuable components of laparascopic surgery, we might see some surprise spinoffs in the educational arena. A problem there, of course, is that education systems suffer from a great deal more than a lack of immersive environments. I’m not ready to write off the educational potential of SL, although, as noted, the importance of that potential should be seen in context. In this regard, we’re still in the early days of the medium, similar to cinema in the days when filmmakers nailed a camera tripod to a stage and filmed a play; SL needs D.W. Griffiths to come along and invent the equivalent of close-ups, montage, etc.

Rice too has some sort of Second Life presence and apparently was beaming the conference into Linden land.

* * * * *

Next came a truly mind-blowing presentation by Noha Adly of the Bibliotheca Alexandrina in Egypt. Though only five years old, the BA casts itself quite self-consciously as the direct descendant of history’s most legendary library, the one so frequently referenced in contemporary utopian rhetoric about universal digital libraries. The new BA glories in this old-new paradigm, stressing continuity with its illustrious past and at the same time envisioning a breathtakingly modern 21st century institution unencumbered by the old thinking and constrictive legacies that have so many other institutions tripping over themselves into the digital age. Adly surveyed more fascinating-sounding initiatives, collections and research projects than I can possibly recount. I recommend investigating their website to get a sense of the breadth of activity that is going on there. I will, however, note that that they are the only library in the world to house a complete copy of the Internet Archive: 1.5 petabytes of data on nearly 900 computers.
(Speaking of the IA, Brewster Kahle is also here and is closing the conference Wednesday afternoon. He brought with him a test model of the hundred dollar laptop, which he showed off at dinner (pic to the right) in tablet mode sporting an e-book from the Open Content Alliance’s children’s literature collection (a scanned copy of The Owl and the Pussycat)).
And speaking of old thinking and constrictive legacies, following Adly was Deanna B. Marcum, an associate librarian at the Library of Congress. Marcum seemed well aware of the big picture but gave off a strong impression of having hands tied by a change-averse institution that has still not come to grips with the basic fact of the World Wide Web. It was a numbing hour and made one palpably feel the leadership vacuum left by the LOC in the past decade, which among other things has allowed Google to move in and set the agenda for library digitization.
Next came Lynne J. Brindley, Chief Executive of the British Library, which is like apples to the LOC’s oranges. Slick, publicly engaged and with pockets deep enough to really push the technological envelope, the British Library is making a very graceful and sometimes flashy (Turning the Pages) migration to the digital domain. Brindley had many keen insights to offer and described a several BL experiments that really challenge the conventional wisdom on library search and exhibitions. I was particularly impressed by these “creative research” features: short, evocative portraits of a particular expert’s idiosyncratic path through the collections; a clever way of featuring slices of the catalogue through the eyes of impassioned researchers (e.g. here). Next step would be to open this up and allow the public to build their own search profiles.

* * * * *

That more or less covers today with the exception of a final keynote talk by John Seely Brown, which was quite inspiring and included a very kind mention of our work at MediaCommons. It’s been a long day, however, and I’m fading. So I’ll pick that up tomorrow.

AAUP on open access / business as usual?

On Tuesday the Association of American University Presses issued an official statement of its position on open access (literature that is “digital, online, free of charge, and free of most copyright and licensing restrictions” – Suber). They applaud existing OA initiatives, urge more OA in the humanities and social sciences (out of the traditional focus areas of science, technology and medicine), and advocate the development of OA publishing models for monographs and other scholarly formats beyond journals. Yet while endorsing the general open access direction, they warn against “more radical approaches that abandon the market as a viable basis for the recovery of costs in scholarly publishing and instead try to implement a model that has come to be known as the ‘gift economy’ or the ‘subsidy economy.'” “Plunging straight into pure open access,” they argue, “runs the serious risk of destabilizing scholarly communications in ways that would disrupt the progress of scholarship and the advancement of knowledge.”
Peter Suber responds on OA News, showing how many of these so-called risks are overblown and founded on false assumptions about open access. OA, even “pure” OA as originally defined by the Budapest Open Access Initiative in 2001, is not incompatible with a business model. You can have free online editions coupled with priced print editions, or full open access after an embargo period directly following publication. There are many ways to go OA and still generate revenue, many of which we probably haven’t thought up yet.
But this begs the more crucial question: should scholarly presses really be trying to operate as businesses at all? There’s an interesting section toward the end of the AAUP statement that basically acknowledges the adverse effect of market pressures on university presses. It’s a tantalizing moment in which the authors seem to come close to actually denouncing the whole for-profit model of scholarly publishing. But in the end they pull their punch:

For university presses, unlike commercial and society publishers, open access does not necessarily pose a threat to their operation and their pursuit of the mission to “advance knowledge, and to diffuse it…far and wide.” Presses can exist in a gift economy for at least the most scholarly of their publishing functions if costs are internally reallocated (from library purchases to faculty grants and press subsidies). But presses have increasingly been required by their parent universities to operate in the market economy, and the concern that presses have for the erosion of copyright protection directly reflects this pressure.

According to the AAUP’s own figures: “On average, AAUP university-based members receive about 10% of their revenue as subsidies from their parent institution, 85% from sales, and 5% from other sources.” This I think is the crux of the debate. As the above statement reminds us, the purpose of scholarly publishing is to circulate discourse and the fruits of research through the academy and into the world. But today’s commercially structured system runs counter to these aims, restricting access and limiting outlets for publication. The open access movement is just one important response to a general system failure.
But let’s move beyond simply trying to reconcile OA with existing architectures of revenue and begin talking about what it would mean to reconfigure the entire scholarly publishing system away from commerce and back toward infrastructure. It’s obvious to me, given that university presses can barely stay solvent even in restricted access mode, and given how financial pressures continue to tighten the bottleneck through which scholarship must pass, making less of it available and more slowly, that running scholarly presses as profit centers doesn’t make sense. You wouldn’t dream of asking libraries to compete this way. Libraries are basic educational infrastructure and it’s obvious that they should be funded as such. Why shouldn’t scholarly presses also be treated as basic infrastructure?
Publishing libraries?
Here’s one radical young librarian who goes further, suggesting that libraries should usurp the role of publishers (keep in mind that she’s talking primarily about the biggest corporate publishing cartels like Elsevier, Wiley & Sons, and Springer Verlag):

…I consider myself the enemy of right-thinking for-profit publishers everywhere…
I am not the enemy just because I’m an academic librarian. I am not the enemy just because I run an institutional repository. I am not the enemy just because I pay attention to scholarly publishing and data curation and preservation. I am not the enemy because I’m going to stop subscribing to journals–I don’t even make those decisions!
I am the enemy because I will become a publisher. Not just “can” become, will become. And I’ll do it without letting go of librarianship, its mission and its ethics–and publishers may think they have my mission and my ethics, but they’re often wrong. Think I can’t compete? Watch me cut off your air supply over the course of my career (and I have 30-odd years to go, folks; don’t think you’re getting rid of me in any hurry). Just watch.

Rather than outright clash, however, there could be collaboration and merger. As business and distribution models rise and fall, one thing that won’t go away is the need for editorial vision and sensitive stewardship of the peer review process. So for libraries to simply replace publishers seems both unlikely and undesirable. But joining forces, publishers and librarians could work together to deliver a diverse and sustainable range of publishing options including electronic/print dual editions, multimedia networked formats, pedagogical tools, online forums for transparent peer-to-peer review, and other things not yet conceived. All of it by definition open access, and all of it funded as libraries are funded: as core infrastructure.
There are little signs here and there that this press-library convergence may have already begun. I recently came across an open access project called digitalculturebooks, which is described as “a collaborative imprint of the University of Michigan Press and the University of Michigan Library.” I’m not exactly sure how the project is funded, and it seems to have been established on a provisional basis to study whether such arrangements can actually work, but still it seems to carry a hint of things to come.

if:book

A Project of the Institute for the Future of the Book

Category Archives: library

the open library

of shelves and selves

six blind men and an elephant

johannes who?

talk at brooklyn college library

the people’s card catalog (a thought)

samizdat express

the new harpers.org

emerging libraries at rice: day one

AAUP on open access / business as usual?