I. Nelson’s criticism
Ted Nelson (introduced last week by Ben) is a lonely revolutionary marching a lonely march, and whenever he’s in the news mockery is heard. Some of this is with good reason: nobody’s willing to dismantle the Internet we have for his improved version of the Internet (which doesn’t quite work yet). You don’t have to poke around too long on his website to find things that reek of crackpottery. But the problems that Nelson has identified in the electronic world are real, even if the solutions he’s proposing prove to be untenable. I’d like to expand on on one particular aspect of Nelson’s thought prominent in his latest missive: his ideas about the inherent ideologies of document formats. While this sounds very blue sky, I think his ideas do have some repercussions for what we’re doing at the Institute, and it’s worth investigating them, if not necessarily buying off on Xanadu.
Nelson starts from the position that attempting to simulate paper with computers is a mistaken idea. (He’s not talking about e-ink & the idea of electronic paper, though a related criticism could be made of that: e-ink by itself won’t solve the problem of reading on screens.) This is correct: we could do many more things with virtual space than we can with a static page. Look at this Flash demonstration of Jef Raskin’s proposed zooming interface (previously discussed here), for example. But we don’t usually go that far because we tend to think of electronic space in terms of the technology that preceded it – paper space. This has carried over into the way in which we structure documents for online reading.
There are two major types of electronic documents online. In one, the debt to paper space is explicit: PDFs, one of the major formats currently used for electronic books, are a compressed version of Postscript, a specification designed to tell a printer exactly what should be on a printed page. While a PDF has more functionality than a printed page – you can search it, for example, and if you’re tricky you can embed hyperlinks and tables of content in them – it’s built on the same paradigm. A PDF is also like a printed page in that it’s a finalized product: while content in a PDF can be written over with annotations, it’s difficult to make substantial changes to it. A PDF is designed to be an electronic reproduction of the printed page. More functionality has been welded on to it by Adobe, who created the format, but it is, at its heart attempting to maintain fidelity to the printed page.
The other dominant paradigm is that of the markup language. A quick, not too technical introduction: a markup language is a way of encoding instructions for how a text is to be structured and formatted in the text. HTML is a markup language; so is XML. This web page is created in a markup language; if you look at it with the “View Source” option on your browser, you’ll see that it’s a text file divided up by a lot of HTML tags, which are specially designed to format web pages: putting <i> and </i> around a word, for example, makes it italic. XML is a broader concept than HTML: it’s a specification that allows people to create their own tags to do other things: some people are using their own version of XML to represent ebooks.
There’s a lot of excitement about XML – it’s a technology that can be (and is)bent to many different uses. A huge percentage of the system files on your computer, for example, probably use some flavor of XML, even if you’ve never thought of composing an XML documents. Nelson’s point, however, is that there’s a central premise to all XML: that all information can be divided up into a logical hierarchy – an outline, if you will. A lot of documents do work this way: book is divided into chapters; a chapter is divided into paragraphs; paragraphs are divided into words. A newspaper is divided into stories; each story has a headline and body copy; the body copy is divided into paragraphs; a paragraph is divided into sentences; a sentence is divided into words; and words are divided into letters, the atom of the markup universe.
II. A Victorian example
But while this is the dominant way we arrange information, this isn’t necessarily a natural way to arrange things, Nelson points out, or the only way. It’s one way of many possible ones. Consider this spread of pages (double-click to enlarge them):
This is a title page from a book printed by William Morris, another self-identified humanist. We mostly think of William Morris (when we’re not confusing him with the talent agency) as a source of wallpaper, but his work as a book designer can’t be overvalued. The book was printed in 1893; it’s entitled The Tale of King Florus and the Fair Jehane. Like all of Morris’s books, it’s sumptuous to the point of being unreadable: Morris was dead set on bringing beauty back into design’s balance of aesthetics & utility, and maybe over-corrected to offset the Victorian fixation on the latter.
I offer this spread of pages as an example because the elements that make up the page don’t break down easily into hierarchical units. Let’s imagine that we wanted to come up with an outline for what’s on these pages – let’s consider how we would structure them if we wanted to represent them in XML. I’m not interested in how we could represent this on the Web or somewhere else – it’s easy enough to do that as an image. I’m more interested in how we would make something like this if we were starting from scratch & wanted to emulate Morris’s type and woodcuts – a more theoretical proposition.
First, we can look at the elements that comprise the page. We can tell each page is individually important. Each page has a text box, with decorative grapevines around the text box; inside the text box, the title gets its own page; on the second page, there’s the title repeated, followed by two body paragraphs, separated by a fleuron. The first paragraph gets an illustrated dropcap. Each word, if you want to go down that far, is composed of letters.
But if you look closer, you’ll find that the elements on the page don’t decompose into categories quite so neatly. If you look at the left-hand page, you can see that the title’s not all there – this is the second title page in the book. The title isn’t part of the page – as would almost certainly be assumed under XML – rather, they’re overlapping units. And the page backgrounds aren’t mirror images of each other: each has been created uniquely. Look at the title at the top of the right-hand page: it’s followed by seven fleurons because it takes seven of them to nicely fill the space. Everything here’s been minutely adjusted by hand. Notice the characters in the title on the right and how they interact with the flourishes around them: the two A’s are different, as are the two F’s, the two N’s, the two R’s, the two E’s. You couldn’t replicate this lettering with a font. You can’t really build a schema to represent what’s on these two pages. A further argument: to make this spread of pages rigorous, as you’d have to to represent it in XML, would be to ruin them aesthetically. The vines are the way they are because the letters are the way they are: they’ve been created together.
The inability of XML to adequately handle what’s shown on these pages isn’t a function of the screen environment. It’s a function of the way we build electronic documents right now. Morris could build pages this way because he didn’t have to answer to the particular restraints we do now.
III. The ideologies of documents
Let’s go back to Ted:
Nearly every form of electronic document- Word, Acrobat, HTML, XML- represents some business or ideological agenda. Many believe Word and Acrobat are out to entrap users; HTML and XML enact a very limited kind of hypertext with great internal complexity. All imitate paper and (internally) hierarchy.
For years, hierarchy simulation and paper simulation have been imposed throughout the computer world and the world of electronic documents. Falsely portrayed as necessitated by “technology,” these are really just the world-view of those who build software. I believe that for representing human documents and thought, which are parallel and interpenetrating– some like to say “intertwingled”– hierarchy and paper simulation are all wrong.
It’s possible to imagine software that would let us follow our fancy and create on the screen pages that look like William Morris’s – a tool that would let a designer make an electronic woodcut with ease. Certainly there are approximations. But the sort of tool I imagine doesn’t exist right now. This is the sort of tool we should have – there’s no reason not to have it already. Ted again:
I propose a different document agenda: I believe we need new electronic documents which are transparent, public, principled, and freed from the traditions of hierarchy and paper. In that case they can be far more powerful, with deep and rich new interconnections and properties- able to quote dynamically from other documents and buckle sideways to other documents, such as comments or successive versions; able to present third-party links; and much more.
Most urgently: if we have different document structures we can build a new copyright realm, where everything can be freely and legally quoted and remixed in any amount without negotiation.
Ben does a fine job of going into the ramifications of Nelson’s ideas about transclusion, which he proposes as a solution. I think it’s an interesting idea which will probably never be implemented on a grand scale because there’s not enough of an impetus to do so. But again: just because Nelson’s work is unpragmatic doesn’t mean that his critique is baseless.
I feel there’s something similar in the grandiosity of Nelson’s ideas and Morris’s beautiful but unreadable pages. William Morris wasn’t just a designer: he saw his program of arts and crafts (of which his books were a part) as a way to emphasize the beauty of individual creation as a course correction to the increasingly mechanized & dehumanized Victorian world. Walter Benjamin declares (in “The Author as Producer”) that there is “a difference between merely supplying a production apparatus and trying to change the production apparatus”. You don’t have to make books exactly like William Morris’s or implement Ted Nelson’s particular production apparatus to have your thinking changed by them. Morris, like Nelson, was trying to change the production apparatus because he saw that another world was possible.
And a postscript: as mentioned around here occasionally, the Institute’s in the process of creating new tools for electronic book-making. I’m in the process of writing up an introduction to Sophie (which will be posted soon) which does its best to justify the need for something new in an overcrowded world: Nelson’s statement neatly dovetailed with my own thinking on the subject on why we need something new: so that we have the opportunity to make things in other ways. Sophie won’t be quite as radical as Nelson’s vision, but we will have something out next year. It would be nice if Nelson could do the same.
Dan, this is a beautifully conceived post about Nelson. Ted has always been caught between a coterie of sycophantic followers who see him as an un-appreciated genius and most everyone else who consider him a nutcase. you’ve managed to walk a middle line, presenting probably perhaps the most even-handed piece ever written about his work. the use of Morris to explain the problems with XML is inspired.
b.
What’s your ideological justification for creating documents that are “sumptuous to the point of being unreadable”?
Note how much Morris’ design pushes toward a “paper” metaphor, because you can’t lay out things that exactly when you don’t know what size screen the reader has. This is the PDF nightmare.
Well of course Morris pushes towards a “paper” metaphor, because that’s what he was designing on. Good design is suited to the medium its performed on, whether that’s wood when you’re making a chair or paper when you’re making a (print) book. But design suited for paper that’s not being used on paper is bad design.
I can’t presume to speak for Morris (nor does he need me to speak for him), but it seems a natural rejoinder to the overly industrial: it’s pointing out what’s missing. I don’t think it functions as an object in and of itself – though it does, of course – so much as it’s a sign pointing out possibilities that are being neglected. It’s a course correction.
I think markup is overrated and under understood.
I have a little trouble going with Dan’s example that, since words make up sentences, sentences make up paragraphs, and paragraphs make up chapters, words are somehow a lower level hierarchy of chapters. I prefer to think of chapters and such as ‘mileage markers’ on the road you take when you are reading, a road made of words. When you come to a new chapter, you are on a new stretch of the same road – you dont go ‘down’ a level. Getting hung-up on hierarchy seems to be an occupational hazard for the ebook crowd, or maybe its the other way ’round.
Markup is symptomatic of the data-driven mindset, otherwise known as encyclopedestrianization. It has its uses, but is not the best model for books or any other complex information. I believe what we think of as ‘meaning’ is sort of a Moire effect of data, and we see the structure we want to see.
Bob, I’m entirely with you on markup being overrated and under understood. I’m sorry if this erred on the side of being too technical: this is something that isn’t generally talked about besides among people who care deeply about markup, and they’re generally so deep inside it as not to be able to really criticize it usefully.
I completely agree that it’s counterintuitive that words are a lower lever hierarchy of chapters – but that idea is the core of the idea of markup. I think on a basic level it doesn’t make sense when you’re talking about how people read. But markup wasn’t really designed with reading in mind: it’s an easy way to structure information.
Another quick example from book design of how markup doesn’t always work to represent what we see on a page: if you look at just about any (American) book printed before the rise of desktop publishing, you’ll see that not every paragraph is indented: the first paragraph at the start of the chapter or a section is usually unindented. This is because indents were not a distinct feature of a paragraph (as they would be represented under most markup-based scheme) but served to separate paragraphs: they were a sign than one paragraph had ended and another was to begin, which you didn’t need for your first paragraph. Having a first paragraph indented is pointless unless you think that indentation of the first line is a feature of all paragraphs.
You mention chapters as “mileage markers” and I think that’s an apt analogy: I think that’s much closer to how we read. That’s the idea behind indenting between paragraphs. But I think it’s important to remember that that’s not how markup sees a text: in markup, a text is a hierarchy of smaller sections, some sensible (chapters, paragraphs) and some more nonsensical: when you get down to the level of the word or the letter.
I think something’s deeply wrong when we get too hung up on hierarchy – it’s not something we should have to think about, but it becomes something we have to work around. I don’t think we should have to hierarchicalize everything we write: we should be able to, but we should be able to do other things as well.
Dan, I think we can agree to agree!
You can find some good ideas about how visual layout influences perception of data in Tufte’s books. The trick with markup is that by seeing it as an armature for structure, it could be implied that anything not marked up is at the lowest, ‘default’ level, maybe not worth bothering with. Can we markup every idea, person, lilly in the field? Lets get this planet organized!
I think this can be a problem when you look at a wiki encyclopedia-type thing, and see all these words linked. First, you soon get to know that the link doesnt explain the term in context of what you are reading now, as you might hope a friendly indexer would, but pops you off to a whole ‘nother article with a whole new set of hot words. Second, since you are on the web, you can google or otherwise dig up info on any word on the screen. In this case, markup acts to show the boundaries of the maze, rather than bring depth of association. But I’m really sliding off in to the wiki-thread – ouch.
Ok now, my personal wrap up on Mr. Nelson. One of my first cool books was his big b/w two-sided book Computer Lib/Dream Machines – wish I’d kept it. It was a comic book for geeks and I just loved it. I used to hang out at Xerox PARC after school in ’76 (I had the smalltalk-80 book!), and started working at Apple in ’77. The future sure seemed bright and shiny through the eyes of a big color monitor in the late 70s. Here we are raking the ashes of his ideas 30 years on – what you get is not usuallly what you’ve seen.
As an early colleague (NOT sycophant) with Ted Nelson, I commend your treatment, and suggest that you examine:
magicdragon2 LiveJournal
Friday, July 22nd, 2005
10:20 am Intertwingle
I remember 1975 as clearly as if it happened yesterday. Ted Nelson, the mad genius who invented Hypertext and Hypermedia, spake his Revelation from a waterbed, whereon he had been noodling with an electric banjo.
EVERYTHING IS PROFOUNDLY INTERTWINGLED….
The comment thread is at:
Intertwingle.
I knew of Ted since the 1960s, coimplemented software for him early in Xanadu history, and at the world’s first Personal Computer Conference where I demo’d some Xanadu, I presented what is arguably the first ever formal lecture on Literature and Hypertext. I then had several hyptertext poems published in Datamation, arguably the first such published [paper semi-implementations of] hyperpoems.
See also my primary web domain:
http://www.magicdragon.com/ [which gets over 15,000,000 hits per year]
and a biographical slant on my Math professorship identity at:
MATH Pages of Jonathan Vos Post, although it’s two years out of date.