I must admit that I hadn’t thought microformats were all that interesting—cool, yes, but not much more than that—when I first heard of them. But this rather interesting and clearly written article at usabletype on the topic gave me pause to think more about what they could do. So I’ve decided to write up a couple of my thoughts on the matter, with a view to suggesting ways they could be used for academic writing on the web.
So what are Microformats?
Well, I commend the article to which I referred above, but I’ll try and give a brief summary.
Microformats are a way to embed the structure of a given text-based or text-oriented data format—such as an electronic calendar (icalendar) or a electronic business card (vcard)—into an xHTML document. In fact, a microformat should not just embed the structure, it should directly map the existing data structures and semantics of the given format onto the the closest existing semantic elements of xHTML.
What’s it all for, then?
One of the key advantages of doing this, from my perspective, is that you can then immediately take advantage of existing web development tools and skillsets to extract the data in these microformats, and then to manipulate and process them.
Here, One commentor noted a practical implementation:
I?¢‚Ǩ‚Ñ¢ve been thinking about making a Greasemonkey script to automagically detect and extract hCard info from pages. I haven?¢‚Ǩ‚Ñ¢t really researched it much yet, but I was also wondering if there might be some way to automatically add contacts to my Thunderbird addressbook.
Academic Uses
The first use that came to mind was for bibilographic and citation data. HTML already has several elements that allow writers to mark up citations and quotations. There are at least three that I can think of right away. The first is the <blockquote>
element. This element is designed to mark up paragraphs of quoted material. The second is the <q>
(uote) element, which is meant to be used to mark up inline (short phrase or sentence-level) quotations, but it is poorly supported by Internet Explorer. The third element is the <cite>
, which is generally used to mark up the names of cited works, such as the title of a book, or the name of a film. There doesn’t seem to be a way to easily differentiate between say, book titles, which should be rendered in italics, and journal article titles, which are meant to be enclosed in quote marks.
Despite these limitations, I think that these three elements give us something with which to work. The limitations of <cite>
could be worked around thus:
<cite class="journal">Name of Article</cite>
I’ll continue tomorrow with some more thoughts on practical uses of this, and perhaps a more extensive treatment of what I think could form a workable citation microformat.
Update: more here.
Update 2 (17 August, 2005): It looks like there’s already some work going on in developing a citation format. The work seems to be in an inital discovery phase.