16 April 2006

Open Access Books Are Like Buses...

...you wait for ages, and then three turn up at once.

Well, two at least: I wrote recently about Willinsky's The Access Principle, and now here, hard on its heels, comes Yochai Benkler's The Wealth of Networks.

If the name Benkler is unfamiliar, you might want to glance at the suggestively-titled Coase's Penguin (yes, that penguin), which is effectively a sketch of the present book. Both, then, are about how the network changes everything, and how all the opens and the various kinds of commons that are central concerns of this blog lie at the heart of one of the most profound economic, social and political transformations seen in recent years.

But don't take my word for it, listen to what Larry Lessig has to say, with typical generosity:

This is — by far — the most important and powerful book written in the fields that matter most to me in the last ten years.

Then buy/download the thing (CC licence, of course) and read it. I know I will. The fact that I haven't yet finished its 500+ pages is not just another reason not to listen to me: it's also a further hint of why eventually all books will be freely available as digital downloads online. Basically, reading on a screen and reading text placed on a physical object are two quite different experiences, and warrant two quite different business models.

14 April 2006

"The Access Principle" Made Accessible

One of the jibes that the anti-open access lot like to lob is that many of those writing in favour of these ideas often do so in non-open access outlets. But the fact is, we don't always have a choice if we want to reach traditional audiences who aren't yet used to reading open access titles/media. Against this background, it's good to see some traditional publishers proving amenable to releasing open access versions of works dealing with open access alongside the hard-copy versions.

A case in point is The Access Principle, The Case for Open Access to Research and Scholarship, by John Willinsky. I've just discovered (through the indispensable Open Access News) that the enlightened MIT Press has made this freely available (apart from some mild registration): kudos for that.

I've only skimmed through the first few chapters, but already it looks to be about the most important book on open access so far. This is hardly surprising given the author's work as director of the Public Knowledge Project - and the fact that he wrote an essay entitled "The unacknowledged convergence of open source, open access, and open science", which sounds strangely familiar as an idea.

So now there's no excuse for anyone not to rush out and buy it/download it and read it.

End of the World Nigh: VCs Clueful on Patents

And now for some good news about patents - no really. Judging by this article on the excellent Techdirt, a few of the brighter VCs are starting to get the message about software patents. Next thing you know, even the lawyers will join in - then we'll know that the end of the world is really nigh.

13 April 2006

Right-minded Right to Create

As you may have noticed, some of the more outrageous abuses of the patent system tend to make me go a funny colour. So I was interested to discover (via Techdirt) that there is an entire right-minded site called Right to Create offering a concentrated helping of bile directed at the manifold insanities of the patent system.

The only trouble is, having read it, I think I need to go and lie down for a while.

12 April 2006

PageRank on a Stick

Googlejuice: everybody wants it. It's measured in that mysterious coinage of the cyber-realm known as PageRank, and everybody wants to know how much they've got. Enter Webmaster Eyes, which tells you (via Digg). And if you were wondering, Google's GoogleJuice is 10 (and this site is a middling 6).

Windows Live Academic Search Goes Live

Microsoft has rolled out the first beta of its academic search engine. It has some nice Web 2.0-y features that make it look far cooler than Google Scholar (Google, are you listening?). One of the FAQs made me smile:

What about open source repositories? Do you have content from them in your index?

Academic search has implemented the Open Architecture Initiative (OAI) protocol for indexing OAI-compliant repositories. For example, we indexed the content present in ArXiv.org for the launch. We will continue to index more repositories after the launch.

I don't know of anybody except Microsoft that calls these OAI repositories "open source": you don't think that Microsoft's hung up on something?

Another FAQ talks about something new to me: the OpenURL. This turns out to be a wonderful piece of Orwellian double-speak, since it is a way of ensuring that people only get to see the content they are "entitled" to - that is, have paid for. In other words, OpenURL is all about closing off your options.

What a Tangled Bank We Weave

If you've not come across the Tangled Bank miscellanies, take a look at the latest one, hosted on the fine Discovering Biology in a Digital World blog. As the Tangled Bank home site puts it, these collections are a "weekly showcase of good weblog writing, selected by the authors themselves," in the realm of science. And yes, I'm afraid there's something by me.

But it's still worth reading.

11 April 2006

From Blogroll to Searchroll

I confess it: I'm a sucker for analogical thought. So taking the idea of a blogroll - even if I don't actually use the things - and coming up with a "searchroll" - a personalised list of sites across which you can carry out searches - is intrinsically appealing. This is what Rollyo has done: I know it's not exactly new, but the last time I looked there wasn't much to see. Now there is.

For example, I was rather taken with this Russian library searchroll, which somehow makes the idea nicely concrete. I can see that you might well want to search through a group of related sites, rather than wade through several million Google hits.

The only problem, of course, is that you either have to set up the searchrolls yourself, or try to find one that suits your needs. Fortunately Rollyo has done the obvious, and come up with a search engine for search rolls. From here you can enter keywords and try to piggy-back off someone else's collection (all in the spirit of sharing, of course).

I was impressed with the range of open source searchrolls that are already available. Open access searchrolls are thinner on the ground, while open content is waiting for its first example. Any offers?

10 April 2006

Webaroo - Yawnaroo

Convincing proof that Web 2.0 is a replay of Web 1.0 comes in the form of Webaroo. As this piece from Om Malik explains, this start-up aims to offer users a compressed "best of the Web" that they can carry around on their laptops and use even when they're offline.

Sorry, this idea was invented back in 1995, when Frontier Technologies released its SuperHighway Access CyberSearch, a CD-ROM that contained a "best of the Web" based on Lycos - at the time, one of the best search engines. As I wrote in September 1995:

Not all of the Lycos base has been included: contained in the 608 Mbytes on the disc is information on around 500,000 pages. The search engine is also simplified: whereas Lycos possesses a reasonably powerful search language, the CyberSearch tool allows you to enter just a word or phrase.

Only the scale has changed....

Open Peer Review

Open access does not aim to subvert the peer review process that lies at the heart of academic publishing: it just wants to open things up a little. But you know how it is: once you start this subversive stuff, it's really hard to stop.

So what did I come across recently, but this fascinating hint of what opening up peer review might achieve (as for the how, think blogs or wikis). Maybe an idea whose time has (almost) come.

09 April 2006

(Patently) Right

Paul Graham is a master stylist - indeed, one of the best writers on technology around. Reading his latest essay, "Are Software Patents Evil?" is like floating in linguistic cream. And that's the problem. His prose is so seductive that it is too easy to be hypnotised by his gently-rhythmic cadences, too pleasurable to be lulled into a complaisant state, until you find yourself nodding mechanically in agreement - even with ideas that are, alas, fundamentally wrong.

Take his point in this recent essay about algorithms, where he tries to argue that software patents are OK, even when they are essentially algorithms, because hardware is really only an instantiation of an algorithm.

If you allow patents on algorithms, you block anyone from using what is just a mathematical technique. If you allow patents on algorithms of any kind, then you can patent mathematics and its representations of physics (what we loosely call the Laws of Physics are in fact just algorithms for calculating reality).

But let's look at the objection he raises, that hardware is really just an algorithm made physical. Maybe they are; but the point is you have to work out how to make that algorithm physical - and that's what the patent is for, not for the algorithm itself. Note that such a patent does not block anyone else from coming up with different physical manifestations of it. They are simply stopped from copying your particular idea.

It's instructive to look at another area where patents are being hugely abused: in the field of genes. Thanks to a ruling in 1980 that DNA could be patented, there has been a flood of completely insane patent applications, some of which have been granted (mostly in the US, of course). Generally, these concern genes - DNA that codes for particular proteins. The argument is that these proteins do useful things, so the DNA that codes for them can therefore be patented.

The problem is that there is no way of coming up with an alternative to that gene: it is "the" gene for some particular biological function. So the patent on it blocks everyone using that genomic information, for whatever purpose. What should be patentable - because, let me be clear here, patents do serve a useful purpose when granted appropriately - is the particular use of the protein - not the DNA - the physical instantiation of what is effectively a genomic algorithm.

Allowing patents on a particular industrial use for a protein - not a patent on its function in nature - leaves the door open for others to find other chemicals that can do the same job for the industrial application. It also leaves the DNA as information/algorithm, outside the realm of patents.

This test of whether a patent allows alternative implementations of the underlying idea can be applied fruitfully to the equally-vexed questions of business methods. Amazon's famous "one-click" method of online making purchases is clearly total codswallop as a patent. It is a patent on an idea, and blocks everyone else from implementing that (obvious) idea.

The same can be said about an earlier patent that Oracle applied for, which apparently involved the conversion of one markup language into another. As any programmer will tell you, this is essentially trivial, in the mathematical sense that you can define a set of rules - an algorithm - and the whole drops out automatically. And if you apply the test above - does it block other implementations? - this clearly does, since if such a patent were granted, it would stop everyone else coming up with algorithms for conversions. Worse, there would be no other way to do it, since the process is simply a restatement of the problem.

I was heartened to see that a blog posting on this case by John Lambert, a lawyer specialising in intellectual property, called forth a whole series of comments that explored the ideas I've sketched out above. I urge you to read it. What's striking is that the posts - rather like this one - are lacking the polish and poise of Graham's writing, but they more than make up for it in the passion they display, and the fact that they are (patently) right.

08 April 2006

Death to the Podcast

"Podcast" is such a cool word. It manages to be familiar, made up as it is of the odd little "pod" and suffix "-cast", as in "broadcast", and yet cheekily new. Pity, then, that it's completely the wrong term for what it describes.

These are simply downloadable mp3 files. The "pod" bit is a misnomer, because the iPod is but one way to listen to them: any mp3 player will do. And the "-cast" is wrong, too, because they are not broadcast in any sense - you just download them. And if they were broadcast across the Internet, then you'd call them streams - as in "podstream", rather than "podcast".

Given my long-standing dislike of this term - and its unthinking adoption by a mainstream press terrified of looking uncool - I was pleased to come across Jack Schofield's opinion on the subject, where he writes:

[P]odcasting's main appeal at the moment is time-shifting professionally-produced programmes. It's a variant of tape recording, and should probably be called AOD (audio on demand).

AOD: that sounds good to me, Jack.

His wise suggestion comes in piece commenting on the release of a typically-expensive ($249 for six pages) piece of market research on this sector from Forrester Research.

Many people have taken its results - the fact that only 1% of online households in the US regularly download and listen to AOD - to indicate the death of the medium. I don't agree: I think people will continue to enjoy audio on demand in many situations. For example, I regularly return to the excellent Chinesepod site, a shining example of how to use AOD well.

But even if the downloads live on, I do hope that we might see the death of the term "podcast".

07 April 2006

Another Day, Another Open

Open content is an area that I follow quite closely. I've just finished the second of a series of articles for LWN.net that traces the growth of open content and its connections with open source. The first of these is on open access, while the most recent looks at Project Gutenberg and the birth of open content. The next will look at open content in education, including the various open courseware projects.

Here's a report from UNESCO on the area, which it has dubbed open educational resources, defined as

the creation of open source software and development tools, the creation and provision of open course content, and the development of standards and licensing tools.

I'm not quite sure we really needed a new umbrella term for this, but it's good to see the matter being discussed at high levels within the global education community.

A Nod's as Good as a Wink

As I mentioned, I have started playing with Google Analytics for this blog. It's early days yet, but already some fascinating results have dropped out - I'll be reporting on them once the trends become slightly more significant than those based on two days' data....

But one thing just popped up that I thought I'd pass on. Some of my traffic has come from Wink, which describes itself as a social search engine. More specifically:

Wink analyzes tags and submissions from Digg, Furl, Slashdot, Yahoo MyWeb, and other services, plus user-imported tags from del.icio.us, and favorites marked at Wink, and figure[s] out which pages are most relevant.

So basically Wink aims to filter standard Web search results through the grid of social software like Digg, del.icio.us etc. It's a clever idea, although the results at the moment are a little, shall we say, jejune. But I'm grateful for the tip that Google Analytics - and one of my readers - has given me. Duly noted.

UNDPAPDIPIOSN - A Name to Remember?

The UN is such a huge, amorphous organisation that it is no suprise that there are bits of it that rarely make it into the limelight. A case in point is the UN Development Programme (UNDP), "the UN's global development network, an organization advocating for change and connecting countries to knowledge, experience and resources to help people build a better life."

Given its task, and its doubtless limited resources, it is only natural that the UNDP has been promoting free software use around the world longer than most (I first talked to them about it in 1997), and its efforts in this sphere are becoming significant. It now has a separate arm, the United Nations Development Programme's (UNDP) Asia Pacific Development Information Programme (APDIP) International Open Source Network - UNDPAPDIPIOSN for short.

As a quick glance at the home page shows, there's lot of good stuff going on, with all the right buzzwords. For example, news on the Asian Commons, a press release about the UNDPAPDIPIOSN joining the ODF Alliance, which pushes for ODF adoption (and complements the OpenDocument Fellowship I mentioned yesterday), plus some free software primers.

What I like about these is that they take a truly global view of things, providing information about open source adoption around the world that is hard to come by elsewhere, particularly in a consolidated form. They deserve to be better known - as does the UNDPAPDIPIOSN itself - although probably not under that name....(IOSN seems to be the preferred abbreviation).

06 April 2006

Microsoft's Open Source Blog

Not something you see every day: a Web site called Port 25, with the explanatory line "Communications from the Open Source Software Lab @ Microsoft". Yup, you read that right. It will be interesting to see what they do with this apparent attempt to reach out (port 25, right?) - especially if they can get rid of the Russian spam on Bill Hilf's welcoming post....

The Commons Becomes Commoner

I've already written about how the "commons" meme is on the rise, with all that this implies in terms of co-operation, sharing and general open source-iness. Now here are two more.

The first is the Co-operation Commons, "an interdisciplinary framework for understanding cooperation" (an excellent, fuller explanation can be found here). The second is the Credibility Commons, "an experimental environment enabling individuals the opportunity to try out different approaches to improving access to credible information on the World Wide Web."

As the commons becomes, er, commoner, I find that it is all just getting more and more interesting.

ODF Petition Hits 10,000

I'm a big fan of the OpenDocument Fellowship - there's something very Tolkienian about it, and it's one of the best places to keep on top of developments in this important area.

One of its projects is a petition trying to persuade Microsoft to support the Open Document Format. This may be somewhat of a forlorn hope, but then they do say that a gentleman only supports lost causes.

If you wish to add your name to the 10,000 or so who have shown their gentlemanly/ladylike credentials in this way, you can do so here.

Why VOIP Needs Crypto

The ever-wise Bruce Schneier (whom I had the pleasure of interviewing a couple of years ago) spells out in words of one syllable why the hot Voice over IP digital 'phone systems absolutely need encryption. He also links to the perfect solution: Phil Zimmermann's latest wheeze, Zfone - an open source VOIP encryption program.

In Praise of Nakedness

The impudence of Microsoft knows no bounds. According to this report in ZDNet UK, it is now doing the nudge-nudge, wink-wink to PC dealers that selling "naked" systems - those with no operating system installed - might be inadvisable, know what I mean?

According to the original article in its Partner Update magazine, "Microsoft is recruiting two 'feet on the street' personnel whose role will be to provide proactive assistance during customer visits" in other words, threatening to send in the heavy mob to duff up customers, too. Although Microsoft hurriedly "retracted" this part of the story, if it was a slip, it was surely Freudian.

From this and other statements, it's clear that Microsoft sees every PC as its birthright, and naked PCs as, well, positively obscene. I'm an OS naturist, myself.

05 April 2006

After Wikia, Qwika: the Wiki Search Engine

The last time I wrote about Qwika, it seemed to be a solution in search of a problem. A recent press release suggests that it's managed to come up with an answer to that conundrum: Qwika has turned into a dedicated wiki search engine.

At first sight, you might think that's rather redundant. After all, wikis are essentially just Web pages, and one or two search engine companies seem to have that sector sorted out. But if you only want to look in wikis, and don't want the other million hits on ordinary Web pages that common words throw up, a dedicated wiki search engine makes sense.

Moreover, wikis do have some special characteristics, as Qwika's Luke Metcalfe explained to me:

[W]ikis are quite different to html documents - they have a good amount of metadata associated with them - edit histories, user information, and data embedded within the WikiMedia format. They conform also [to] a certain writing style, which makes things easier to parse from a computational linguistic perspective. Other search engines are only interested in them as html documents with links pointing to them, so they miss out on a lot.

It's early days yet - both for Qwika and the wikis it indexes (1,158 at the time of writing). But recent moves like Wales' Wikia relaunch, which I wrote about the other day, mean that the wiki space is starting to hot up.

So, in the "One to Watch" category, to Wikia, add Qwika.

Blender - Star of the First Open Source Film

Blender is one of the jewels in the open source crown. As its home page puts it:

Blender is the open source software for 3D modeling, animation, rendering, post-production, interactive creation and playback. Available for all major operating systems under the GNU General Public License.

It's a great example of how sophisticated free software can be - if you haven't tried it, I urge you to do so. It's also an uplifing story of how going open source can really give wings to a project.

Now Blender is entering an exciting new phase. A few days ago, the premiere of Elephant's Dream, the first animated film made using Blender, took place.

What's remarkable is not just that this was made entirely with open source software, but also that the film and all the Blender files are being released under a Creative Commons licence - making it perhaps the first open source film.

Given that most commercial animation films are already produced on massive GNU/Linux server farms, it seems likely that some companies, at least, will be tempted to dive even deeper into free software and shift from expensive proprietary systems to Blender. Whether using all this zero-cost, luvvy-duvvy GPL'd software makes them any more sympathetic to people sharing their films for free remains to be seen....

United States of Patent Absurdity

If you ever wondered how the US got into such a mess with patents on software and business methods - and wondered how the European Union can avoid making the same mistakes - take a look at this excellent exposition. As far as I can tell, it's a condensed version of the full half-hour argument in the 2004 book Innovation and Its Discontents: How Our Broken Patent System is Endangering Innovation and Progress, and What to Do About It.

Read either - or both; then weep.

Daily Me 2.0

One of the problems with blogs for advertisers is their fragmented nature: to get a reach comparable to mainstream media generally involves faffing around with dozens of sites. The obvious solution is to bundle, and that's precisely what Federated Media Publishing does. As its roster of blogs indicates, it operates mainly in the field of tech blogs, but the model can clearly be extended.

To the average blog-reader on the Clapham Ominibus (probably the 319 these days), more interesting than the business side of things is the possibility of doing blog mashups. And lo and behold, Federated has produced such a thing (note that the URL begins significantly with "tech", hinting of non-tech things to come...).

What struck me about this federated news idea is that it could be extended beyond the bundles. It would be easy - well, easy if you're a skilled programmer - to knock up a tool offering a range of newspage formats that let you drag and drop newsfeeds into predefined slots to produce the same kind of effect as the Federated Media/Tech one.

RSS aggregators already do this crudely, but lack the design element that would help to make the approach more popular. You would also need some mechanism for flagging up which stories had changed on the page, or for allowing new stories from designated key blogs to rise to the top of the dynamically-generated newspage.

The result, of course, is the Daily Me that everyone has been wittering on about for years. But it comes with an important twist. This Daily Me 2.0 is not just a cosmetic mixing of traditional medium news, but a very different kind of online newspage, based on the very different perspective offered by blogs.

One reason why Daily Me 1.0 never took off was because traditional media are simply too greedy to contemplate sharing with anyone else. Blogs have no such qualms - indeed, they have different kinds of sharing (quotations, links, comments) at their core. I think we'll be reading more about this....

Privacy Policy

As part of my exploration of Google, I've signed up for Google Analytics. This means that these pages now collect some anonymous traffic data - nothing personal. For the same reason, the site will ask whether it can set a cookie (one of Netscape's more enduring legacies). If you don't want one on your computer, just refuse: it won't make any difference to your access.