04 March 2006

Tying the Kangaroo Down

If any proof were needed that some people still don't really get the Internet, this article is surely it. Apparently Australia's copyright collection agency wants schools to pay a "browsing fee" every time a teacher tells students to browse a Web site.

Right.

So, don't tell me: the idea is to ensure that students don't use the Web, and that they grow up less skilled in the key enabling technology of the early twenty-first century, that they learn less, etc. etc. etc.?

Of course, the fact that more and more content is freely available under Creative Commons licences, or is simply in the public domain, doesn't enter into the so-called "minds" of those at the copyright collection agency. Nor does the fact that by making this call they not only demonstrate their extraordinary obtuseness, but also handily underline why copyright collection agencies are actually rather irrelevant these days. Or that, rather than wasting schools' time and money on "browsing fees", Australia might perhaps do better to close down said irrelevant, clueless agency, and save some money instead?

03 March 2006

Beyond Parallel Universes

One of the themes of this blog is the commonality between the various opens. In a piece I wrote for the excellent online magazine LWN.net, I've tried to make some of the parallels between open source and open access explicit - to the point where I set up something of a mapping between key individuals and key moments (Peter Suber at Open Access News even drew a little diagram to make this clearer).

My article tries to look at the big picture, largely because I was trying to show those in the open source world why they should care about open access. At the end I talk a little about specific open source software that can be used for open access. Another piece on the Outgoing blog (subtitle: "Library metadata techniques and trends"), takes a closer look at a particular kind of such software, that for repositories (where you can stick your open access materials).

This called forth a typically spirited commentary from Stevan Harnad, which contains a link to yet more interesting words from Richard Poynder, a pioneering journalist in the open access field, with a blog - called "Open and Shut" (could there be a theme, here?) - that is always worth taking a look at. For example, he has a fascinating interview on the subject of the role of open access in the humanities.

Poynder rightly points out that there is something of a contradiction in much journalistic writing about open access, in that it is often not accessible itself (even my LWN.net piece was subscribers-only for a week). And so he's bravely decided to conduct a little experiment by providing the first section of a long essay, and then asking anyone who reads it - it is freely accessible - and finds it useful to make a modest donation. I wish him well, though I fear it may not bring him quite the income he is hoping for.

01 March 2006

There's No INSTEDD without Open Access

An interesting story in eWeek.com. Larry Brilliant, newly-appointed head of the Google.org philanthropic foundation, wants to set up a dedicated search engine that will spot incipient disease outbreaks.

The planned name is INSTEDD: International Networked System for Total Early Disease Detection - a reference to the fact that it represents an alternative option to just waiting for cataclysmic infections - like pandemics - to happen. According to the article:

Brilliant wants to expand an existing web crawler run by the Canadian government. The Global Public Health Intelligence Network monitors about 20,000 Web sites in seven languages, searching for terms that could warn of an outbreak.

What's interesting about this - apart from the novel idea of spotting outbreaks around the physical world by scanning the information shadow they leave in the digital cyberworld - is that to work it depends critically on having free access to as much information and as many scientific and medical reports as possible.
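
Just to make the idea concrete, here is the kind of thing such a crawler does, reduced to a few lines of Python. To be clear, this is my own minimal sketch, not GPHIN's or Google's code: the source URLs and the list of outbreak terms are invented for illustration.

import re
import urllib.request

# Invented examples - a real system monitors ~20,000 sources in seven languages
SOURCES = [
    "http://example.org/health-news",
    "http://example.org/wire-feed",
]
OUTBREAK_TERMS = re.compile(
    r"\b(outbreak|epidemic|haemorrhagic fever|avian influenza|cholera)\b",
    re.IGNORECASE,
)

def scan(url):
    """Fetch one page and return any outbreak-related terms it mentions."""
    with urllib.request.urlopen(url) as page:
        text = page.read().decode("utf-8", errors="replace")
    return sorted({match.group(0).lower() for match in OUTBREAK_TERMS.finditer(text)})

for url in SOURCES:
    hits = scan(url)
    if hits:
        print(url, "->", ", ".join(hits))

The crucial point is hidden in that urlopen call: the scan can only work on pages the crawler is actually allowed to read.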

Indeed, this seems a clear case where it could be claimed that not providing open access in relevant areas - and the range of subjects that are relevant is vast - is actually endangering the lives of millions of people. Something for publishers and their lawyers to think about, perhaps.

Higgins: Social Web, Social Commerce

Identity is a slippery thing at the best of times. On the Internet it's even worse (as the New Yorker cartoon famously encapsulated). But identity still matters, and sorting it out is going to be crucial if the Internet is to continue moving into the heart of our lives.

Of course, defining local solutions is easy: that's why you have to remember 33 different passwords for 33 different user accounts (you do change the password for each account, don't you?) at Amazon.com and the rest. The hard part is creating a unitary system.

The obvious way to do this is for somebody to step forward - hello Microsoft Passport - and to offer to handle everything. There are problems with this approach - including the tasty target that the central identity stores represent for ne'er-do-wells (one reason why the UK Government's proposed ID card scheme is utterly idiotic), and the concentration of power it creates (and Microsoft really needs more power, right?).

Ideally, then, you would want a completely modular, decentralised approach, based on open source software. Why open source? Well, if it's closed source, you never really know what it's doing with your identity - in the same way that you never really know what closed software in general is doing with your system (spyware, anyone?).

Enter Higgins, which not only meets those requirements, but is even an Eclipse project to boot. As the goals page explains:

The Higgins Trust Framework intends to address four challenges: the lack of common interfaces to identity/networking systems, the need for interoperability, the need to manage multiple contexts, and the need to respond to regulatory, public or customer pressure to implement solutions based on trusted infrastructure that offers security and privacy.

Perhaps the most interesting of these is the "multiple contexts" one:

The existence of common identity/networking framework also makes possible new kinds of applications. Applications that manage identities, relationships, reputation and trust across multiple contexts. Of particular interest are applications that work on behalf of a user to manage their own profiles, relationships, and reputation across their various personal and professional groups, teams, and other organizational affiliations while preserving their privacy. These applications could provide users with the ability to: discover new groups through shared affinities; find new team members based on reputation and background; sort, filter and visualize their social networks. Applications could be used by organizations to build and manage their networks of networks.

The idea here seems to be a kind of super-identity - a swirling bundle of different cuts of your identity that can operate according to the context. Although this might lead to fragmentation, it would also enable a richer kind of identity to emerge.
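
The point is easier to see in code than in prose. What follows is emphatically not Higgins's actual API - just a toy sketch of my own, with invented names, showing what "different cuts of your identity per context" might mean in practice:

# Toy illustration only - the names and structure here are my invention
FULL_IDENTITY = {
    "legal_name": "J. Smith",
    "nickname": "jay",
    "employer": "Example Corp",
    "reputation_auctions": 4.9,
    "email_personal": "jay@example.net",
}

# Each context exposes only a chosen "cut" of the full identity
CONTEXTS = {
    "professional": ["legal_name", "employer"],
    "auction_site": ["nickname", "reputation_auctions"],
    "friends": ["nickname", "email_personal"],
}

def identity_for(context):
    """Return the slice of the identity appropriate to one context."""
    return {field: FULL_IDENTITY[field] for field in CONTEXTS[context]}

print(identity_for("auction_site"))  # {'nickname': 'jay', 'reputation_auctions': 4.9}

The privacy win is that no single site ever sees more than the relevant slice.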

As well as cool ideas, Higgins also has going for it the backing of some big names: according to this press release, those involved include IBM, Novell, the startup Parity Communications (Dyson Alert: Esther's in on this one, too) and the Berkman Center for Internet & Society at Harvard Law School.

The latter is also involved in SocialPhysics.org, whose aim is

to help create a new commons, the "social web". The social web is a layer built on top of the Internet to provide a trusted way to link people, organizations, and concepts. It will provide people more control over their digital identities, the ability to more easily find other people and groups, and more control over how they are seen by others across diverse contexts.

There is also a blog, called Social Commerce, defined as "e-commerce + social networking + user-centric identity". There are lots of links here, as well as on the SocialPhysics site. Clearly there's much going on in this area, and I'm sure I'll be returning to it in the future.

28 February 2006

Open Source, Opener Source

Brian Behlendorf is an interesting individual: one of those quietly-spoken but impressive people you meet sometimes. When I talked to him about the birth of Apache - which he informed me was not "a patchy" server, as the folklore would have it, just a cool name he rather liked - he was working at CollabNet, which he had helped to found.

He's still there, even though the company has changed somewhat from those early days. But judging from a characteristically thought-provoking comment reported in one of ZDNet's blogs, he's not changed so much, and is still very much thinking and doing at the leading edge of open source.

In the blog, he was reported as saying that he saw more and more "ordinary people" being enfranchised as coders thanks to the new generation of programming models around - notably Ajax and the mashup - that made the whole process far easier and less intimidating.

If he's right, this is actually a profound shift, since ironically the open source model has been anything but open to the general public. Instead, you had to go through a kind of apprenticeship before you could stalk the hallowed corridors of geek castle.

If open really is becoming opener, it will be no bad thing, because the power of free software depends critically on having lots of programmers, and a good supply of new ones. Anything that increases the pool from which they can be drawn will have powerful knock-on effects.

Blogroll, Drumroll

This fellow Blogger blogger is well worth taking a look at if you're interested in science and technology (well, that's everybody, isn't it?). Not so much for the blog entries - which are interesting enough - but for the astonishing blogroll, which includes several hundred links to a wide variety of interesting-looking sites, all neatly categorised. I have literally never seen anything like it - but maybe I'm just provincial.

I found Al Fin - for such is its suitably gnomic moniker - through another site that is worth investigating. Called Postgenomic (well, with a name like that, I had to take a look), it "collates posts from life science blogs and then does useful and interesting things with that data" according to the site. The "interesting things" seem to amount largely to collation of data from various sources; there is also a good blog list, though not quite as impressive as Al Fin's.

Wanted: a Rosetta for the MegaWikipedia

As I write, Wikipedia has 997,131 articles - close to the magic, if totally arbitrary, one million (if we had eleven fingers, we'd barely be halfway to the equally magic 1,771,561): the MegaWikipedia.

Except that it's not there, really. The English-language Wikipedia may be approaching that number, but there are just eight other languages with more than 100,000 articles (German, French, Italian, Japanese, Dutch, Polish, Portuguese and Swedish), and 28 with more than 10,000. Most have fewer than 10,000.

Viewed globally, then, Wikipedia is nowhere near a million articles on average, across the languages.

The disparity between the holdings in different languages is striking; it is also understandable, given the way Wikipedia - and the Internet - arose. But the question is not so much Where are we? as Where do we go from here? How do we bring most of the other Wikipedias - not just five or six obvious ones - up to the same level of coverage as the English one?

Because if we don't, Wikipedia will never be that grand, freely-available summation of knowledge that so many hope for: instead, it will be a grand, freely-available summation of knowledge for precisely those who already have access to much of it. And the ones who actually need that knowledge most - and who currently have no means of accessing it short of learning another language (for which they probably have neither the time nor the money) - will be excluded once more.

Clearly, this global Wikipedia cannot be achieved simply by hoping that there will be enough volunteers to write all the million articles in all the languages. In any case, this goes against everything that free software has taught us - that the trick is to build on the work of others, rather than re-invent everything each time (as proprietary software is forced to do). This means that most of the articles in non-English tongues should be based on those in English. Not because English is "better" as a language, or even because its articles are "better": but simply because they are there, and they provide the largest foundation on which to build.

A first step towards this is to use machine translations, and the new Wikipedia search engine Qwika shows the advantages and limitations of taking this approach. Qwika lets you search in several languages through the main Wikipedias and through machine-translations of the English version. In effect, it provides a pseudo-conversion of the English Wikipedia to other tongues.

But what is needed is something more thoroughgoing, something formal - a complete system for expediting the translation of all of the English-language articles into other languages. And not just a few: the system needs to be such that any translator can use it to create new content based on the English items. The company behind Ubuntu, Canonical, already has a system that does something similar for people who are translating open source software into other languages. It's called, appropriately enough, Rosetta.
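
I have no idea how Canonical's Rosetta is built internally, but the kind of bookkeeping a Wikipedia equivalent would need is simple enough to sketch. Everything below - the titles, the statuses - is my own invention for illustration:

# Invented sketch of translation bookkeeping - not how Rosetta actually works
articles = {
    "Photosynthesis": {
        "en": "human",          # the English original
        "pl": "machine-draft",  # machine translation awaiting review
        "sw": None,             # not yet started
    },
}

def next_tasks(language):
    """List articles a translator in this language could usefully work on."""
    for title, versions in articles.items():
        status = versions.get(language)
        if status is None:
            yield title, "translate from English"
        elif status == "machine-draft":
            yield title, "review machine draft"

for title, task in next_tasks("sw"):
    print(title, "-", task)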

Now that the MegaWikipedia is in sight - for Anglophones, at least - it would be the perfect time to move beyond the successful but rather ad hoc approach currently taken to creating multilingual Wikipedia content, and to put the Net's great minds to work on the creation of something better - something scalable: a Rosetta for the MegaWikipedia.

What better way to celebrate what is, for all the qualifications above, truly a milestone in the history of open content, than by extending it massively to all the peoples of the world, and not just to those who understand English?

27 February 2006

(B)looking Back

I wondered earlier whether blogified books were bloks or blooks, and the emerging view seems to be the latter, not least because there is now a Blooker Prize, analogous to the (Man)Booker Prize for dead-tree stuff.

I was delighted to find that the Blooker is run by Lulu.com, discussed recently by Vic Keegan in the Guardian. Lulu is essentially micro-publishing, or publishing on demand: you send your digital file, they send the physical book - as many or as few copies as you like. You can also create music CDs, video DVDs and music downloads in the same way; Lulu.com handles the business end of things, and takes a cut for its troubles.

Nonetheless, the prices are extremely reasonable - if you live in the US: as Vic points out, the postage costs for books, for example, tend to nullify the attractiveness of this approach for anyone elsewhere in the world, at least from a financial point of view. But I don't think that this will be a problem for long. For Lulu.com is the brainchild of Bob Young, the marketing brains behind Red Hat, still probably the best-known GNU/Linux distribution for corporates.

I emphasise the marketing side, since the technical brains behind the company was Marc Ewing, who also named the company. As he explained to me when I was writing Rebel Code:

In college I used to wear my grandfather's lacrosse hat, which was red and white striped. It was my favourite hat, and I lost it somewhere in Philadelphia in my last year. I named the company to memorialise the hat. Of course, Red and White Hat Software wasn't very catchy, so I took a little liberty.

Young, a Canadian by birth, was the perfect complement to the hacker Ewing. He is the consummate salesman, constantly on the lookout for opportunities. His method is to get close to his customers, to let them tell him what he should be selling them. The end-result of this hands-on approach was that he found himself in the business of selling a free software product: GNU/Linux. It took him a while to understand this strange, topsy-turvy world he had tumbled into, but being a shrewd chap, and a marketeer of genius, he managed it when he finally realised:

that the one unique benefit that was creating this enthusiasm [for GNU/Linux] was not that this stuff was better or faster or cheaper, although many would argue that it is all three of those things. The one unique benefit that the customer gets for the first time is control over the technology he's being asked to invest in.

Arguably it was this early understanding of what exactly he was selling - freedom - that helped him make Red Hat the first big commercial success story of the open source world: on 11 August 1999, the day of its IPO, Red Hat's share price went from $14 to $52, valuing the company that sold something free at $3.5 billion.

It also made Young a billionaire or thereabouts. He later left Red Hat, but has not lost the knack for pursuing interesting ideas. Even if Lulu.com isn't much good for those of us on the wrong side of the Atlantic, it can only be a matter of time before Bob listens to us Brit users (to say nothing of everyone else in the world outside the US) and puts some infrastructure in place to handle international business too.

26 February 2006

The First Blogger - and His Chaos

Wandering around the Net (as one does) I came across this: certainly one of the least-attractive sites that I've seen in a long time. But as soon as I noticed that familiar face in the top left-hand corner, I knew where I was: back in Chaos Manor.

Now, for younger readers, those two words might not mean much, but for anyone privileged enough to have lived through the early years of the microcomputing revolution, as chronicled in Byte magazine (now a rather anonymous Web site), they call forth a strange kind of appalled awe.

For Jerry Pournelle's columns - which still seem to exist in cyber-versions if you are a subscriber - consisted of the most mind-numbingly precise descriptions of his struggles to install software or add an upgrade board to one of his computers, all of which were endowed with names like "Anastasia" and "Alex".

Along the way he'd invariably drop in references to what he was doing while conducting this epic struggle, the latest goings-on in space exploration (one of his enthusiasms) plus the science-fiction book he was writing at the time (he always seemed to be writing new ones each month - impressive).

The net effect was that his articles ran to pages and pages of utterly irrelevant - but compulsively fascinating - detail about his daily and working life. I half-dreaded and half-longed for the monthly delivery of Byte, since I knew that I would soon be swept away on this irresistible and unstoppable torrent of high-tech logorrhea.

Visiting the site, I noticed the line "The Original Blog", linked to the following text:

I can make some claim to this being The Original Blog and Daybook. I certainly started keeping a day book well before most, and long before the term "blog" or Web Log was invented. BIX, the Byte exchange, preceeded the Web by a lot, and I also had a daily journal on GE Genie.

And in a flash, I realised why I had been mesmerised by but ambivalent about Pournelle's outpourings all those years ago. They did indeed form a proto-blog, with all the blog's virtues - a captivating first-person voice weaving a story over time - and all of its vices - a level of information way beyond what any sane person should really want to know, given the subject-matter.

Pournelle is right: he probably was the first blogger, but working on pre-Internet time - one posting a month, not one a day. However, it is hard to tell whether what we now know as blogs took off all those years later because of his pioneering example - or in spite of it.

JISC for Fun

I've written before about what seems to me the huge missed opportunity for free software in education. Of course, this is a two-way thing: as well as coders making more effort to serve the education sector, it would be nice to see education deploying more open source.

And hey presto, along comes the Joint Information Systems Committee (JISC), a government-funded UK body that offers advice and strategic guidance to further and higher education establishments, with a neat briefing paper on the very subject. What makes this doubly interesting is that last year it published a similar paper on open access: clearly things are beginning to click here.

What makes this trebly interesting - well, to me at least - is that I've been asked to speak at the Open Source and Sustainability conference in Oxford this April, organised by the JISC-funded OSS Watch.

"JISC for fun" as Linus almost said.

24 February 2006

Google's Creeping Cultural Imperialism

Another day, another Google launch.

As the official Google blog announced, the company is launching a pilot programme to digitise national archive content "and offer it to everyone in the world for free."

And what national archives might these be? Well, not just any old common-or-garden national archives, but "the National Archives", which as Google's blog says:

was founded with the express purpose of ... serving America by documenting our government and our nation.

Right, so these documents are fundamentally "serving America". A quick look at what's on offer reveals the United Motion Newsreel Pictures, a series which, according to the accompanying text, "was produced by the Office of War Information and financed by the U. S. government", and was "[d]esigned as a counter-propaganda medium."

So there we have it: this is (literally) vintage propaganda. And nothing wrong with that: everybody did it, and it's useful to be able to view how they did it. But as with the Google Print/Books project, there is a slight problem here.

When Google first started, it did not set out to become a search engine for US Web content: it wanted it all - and went a long way to achieving that, which is part of its power. But when it comes to books, and even more where films are concerned, there is just too much to hope to encompass; of necessity, you have to choose where to start, and where to concentrate your efforts.

Google, quite sensibly, has started with those nearest home, the US National Archives. But I doubt somehow that it will be rushing to add other nations' archives. Of course, those nations could digitise and index their own archives - but the result wouldn't be part of the Google collection, which would always have primacy, even if the indexed content were submitted to Google.

It's a bit like Microsoft's applications: however much governments tell the company to erect Chinese walls between the programmers working on Windows and those working on applications, there is bound to be some leakiness. As a result, Windows programs from Microsoft have always had an advantage over those from other companies. The same will happen with Google's content: anything it produces will inevitably be more tightly integrated into their search engine.

And so, wittingly or not, Google becomes an instrument of cultural imperialism, just like that nice Mr Chirac warned. The problem is that there is nothing so terribly wrong with what Google is doing, or even the way that it is doing it; but it is important to recognise that these little projects that it sporadically announces are not neutral contributions to the sum of the world's open knowledge, but come with very particular biases and knock-on effects.

Watching IP Watch

Another great site revealed by Open Access News and the indefatigable Peter Suber: IP Watch. Intellectual property - the very term is hated by people like Richard Stallman - is one of those areas that can make your toes curl just at the thought of the mind-numbing subtleties and complexities involved. But make no mistake: this is an area that matters, and more than ever as the analogue and digital worlds are becoming fuzzy at the edges, and people are tempted to apply old analogue rules to new digital creations.

This lends one story on IP Watch a particular importance, since it deals with the thorny area of balancing rights of digital access and protection. What makes the story particularly interesting is that it reports on a "side event" of a World Intellectual Property Organization (WIPO) meeting.

Now, this may not seem like a big deal, but traditionally WIPO has been all about protecting the intellectual property rights of big content "owners" - record companies, the film industry, etc. So the idea that there could be some discussion about the other side of the coin - access rights - even at a non-official meeting, was a tiny but momentous step.

The end of the journey, which promises to be both long and arduous, is a recognition that access is just as important as protection, that the open approach is as valid as the proprietary one. Although this might seem milk and water stuff to long-time proponents of free software and related movements, it would be truly revolutionary if WIPO ever reached such a point.

Indeed, I would go so far as to predict that WIPO will be one of the last bastions to fall when it comes to recognising that a fresh wind is blowing through the world of intellectual domains - and that these domains might just be commons rather than property as WIPO's own name so prominently and problematically assumes.

23 February 2006

The Blogification of the Cyber Union

I suppose it was inevitable that Google would go from being regarded as quite the dog's danglies to being written off as a real dog's breakfast, but I think that people are rather missing the point of the latest service, Google Page Creator.

Despite what many think, Google is not about ultra-cool, Ajaxic, Javascripty, XMLifluous Web 2.0 mashups: the company just wants to make it as easy as possible for people to do things online. Because the easier it is, the more people will turn to Google to do these things - and the more the advertising revenue will follow.

Google's search engine is a case in point, and Blogger is another. As Blogger's home page explains, you can:

Create a blog in 3 easy steps: (1) Create an account (2) Name your blog (3) Choose a template

and then start typing.

Google Page Creator is just the same - you don't even have to choose a name, you just start typing into the Web page template. In other words, it has brought the blog's ease of use to the creation of Web sites.

This blogification of the Internet is a by-product of the extraordinary recent rise of blogs. As we know, new blogs are popping up every second (and old ones popping their clogs only slightly more slowly). This means that for many people, the blog is the new face of the Web. There is a certain poetic justice in this, since the original WorldWideWeb created by Tim Berners-Lee was a browser-editor, not simply a read-only application.

For many Net users, then, the grammar of the blog - the way you move round it and interact with its content - is replacing the older grammar of traditional Web pages. These still exist, but they are being shadowed and complemented by a new set of Web 2.0 pages - the blogs that are being bolted on by sites everywhere. They function as a kind of gloss explaining the old, rather incomprehensible language of Web 1.0 to the inhabitants of the brave new blogosphere.

Even books are being blogified. For example, Go It Alone!, by Bruce Judson, is freely available online, supported by Google Ads alongside the text (like a blog), and broken up into small post-like chunks (also like a blog). The only thing missing is the ability to leave comments, and I'm sure that future blogified books (bloks? blooks?) will offer this and many other blog-standard features.

Update: Seems that it's "blook" - and there's even a "Blooker Prize" - about which, more anon.

22 February 2006

Wackypedia

In case you haven't seen this on digg.com yet: a Wikipedia page devoted to "unusual articles" in Wikipedia. What's impressive is not just the sheer wackiness of some of these, but how many have been brought together here.

This is of more than passing interest. As the Edit history shows, the page's length - and richness - is down to the number of people who have contributed. In a way, this page single-handedly demonstrates the real strengths of the collaborative approach when applied to appropriate domains.

10 Things to Build a Blog Readership

1: A clear idea of what you are trying to do

If you want to get and keep an audience for a blog, you need to have a clear idea about a couple of things: what you are writing about, and who for. The word “blog” may derive from “web log”, but if all you do is write a kind of haphazard online diary, only your mother will read it. Don't be fooled by the top bloggers: even though they often seem to be jotting down random thoughts about their day, there is terrific method in their madness – which is why they are top bloggers. Even if they don't articulate it, there is usually a strong underlying theme to their posts. Until you become an A-list blogger, and can do it reflexively, you need to think about what you are trying to say, and to whom, all the time.

2: Strong, interesting and consistent opinions

Having a clear idea of what you are trying to do is not enough: unless you have something interesting to contribute on the subject, people who visit your blog once will not return. The good news is that it doesn't really matter what your opinions are, provided they are strong, well-expressed – and consistent. People like to have a mental image of what a blog is doing, and where it fits in the blogosphere: once you've established a certain tone or approach, don't keep changing it. It will only confuse your readers, who will get annoyed and move on.

3: The ability to think fast

This is related to the previous point. You not only need strong opinions, you need to be able to form them quickly. Blogs are not the right medium if you want to ruminate deeply about something and then post a 10,000-word essay some months later: time is of the essence (the “log” bit in web log). You need to be able to form snap judgements - but sensible snap judgements, and consistent with your previous posts. Although not crucial, it also helps if you can type fast: the sooner you get your opinion out there, the more likely it is to be a fresh viewpoint for your potential audience.

4: A self-critical attitude

So you've had an idea and typed it up: don't press the Publish button just yet. First of all, read through what you've written: check whether it's clear, and whether you could improve it. In particular, cut out anything unnecessary. People tend to skim through blogs, so you've got to make it as quick to read as possible. Once you've checked the whole thing (good grammar and correct spelling are not obligatory – blogs are informal, after all – but they do make it easier for visitors) ask yourself one final question: is it worth posting this? One of the hardest things to learn when blogging is the discipline of deleting sub-standard posts. A weak post leaves a bad impression, and lowers someone's opinion of the whole blog. If in any doubt, scrub the post and write another one.

5: Marketing skills

Once you've posted something you're happy with, you need to let the world know. One common misconception among fledgling bloggers is that quality will out – that the world will somehow guess you've written some deeply witty/profound/amazing post and rush to read it. It doesn't work like that. Instead, you need to get out there and sell your story. The first thing to do is to make sure that all the blog search engines know about your posting (an easy way to do this is to sign up with Ping-o-matic). Then you need to start posting on other people's blogs to drive some traffic back to yours.

There are two ways to do this. One is to post on anything about which you have an opinion: most blogs let you include your main blog address as a matter of course. The other is to include in your comment a direct link to one of your posts – only do this if your post is strictly relevant to what you are commenting on. One technique is to use a blog search engine like Technorati to find other blogs that are writing about the subject matter of your post: then visit them to see if you can make a sensible comment with a link back to your own blog. However, use this approach sparingly or your comments may simply not be posted.
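
As for the Ping-o-matic step mentioned above, the ping itself is just a tiny XML-RPC call, as this minimal Python sketch shows. I'm assuming the conventional weblogUpdates.ping method and endpoint here, so check Ping-o-matic's own site for the current details:

import xmlrpc.client

# The usual Ping-o-matic endpoint - an assumption on my part; check their site
server = xmlrpc.client.ServerProxy("http://rpc.pingomatic.com/")

# The standard weblogUpdates.ping call: your blog's name, then its URL
response = server.weblogUpdates.ping("My Example Blog", "http://example.net/blog")
print(response)  # a struct with an error flag and a human-readable message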

6: Self-confidence

Even if you've left plenty of hints around the blogosphere that you've just put up an interesting post, you may well find that there's little activity on your blog. Relatively few people leave comments on blogs (and many of those are just publicising their own postings), so the absence of comments doesn't mean that nobody's visited (see below for a good way of tracking all visitors). But you mustn't give up: around 50% of all new bloggers throw in the towel within three months of starting, so if you can last longer than this you're already ahead of the field. Moreover, the longer you keep going, the more posts there will be on your site, and the more interesting material for anyone when they do visit. Rather than viewing all the unvisited posts as a waste of effort, consider them as an investment for the future.

7: Thick skin/Self-restraint

After you've waited what can seem an age to get a few comments on your posts, you may be disappointed to find that some of them are, shall we say, less than complimentary. It is a sad fact that among those most likely to take the time and trouble to write a comment on your posts are people who feel strongly that you are a complete idiot. Assuming their post is not libellous or obscene (in which case, just delete it) the best thing to do is to reply as sensibly as possible. Do not return in kind, do not retaliate, do not mock: if you descend to their level, you just look as stupid as them. If, on the other hand, you are seen to be responding in a mature and intelligent fashion, visitors will value your blog all the more highly because they will attribute those same characteristics to the rest of your site.

8: Stamina

If you are serious about blogging, you are making a major, long-term commitment. At the beginning, you will need stamina (as well as the self-confidence mentioned above) to keep churning out posts even though few seem to be reading them. But it's even worse, later on, as you gradually gain a readership. Because now there is an implicit contract between you and your visitors: they will keep reading you, and give some of their valuable time and attention if – and it's a big “if” – you continue to post. This doesn't mean every day without fail – though that would be ideal – but it does mean several posts a week, every week. Above all, you need to establish a rhythm that visitors can depend on.

9: Humility

One of the most daunting aspects of blogging is that whatever rank you achieve on Technorati or elsewhere, you are only as good as your future posts: if you begin to post less, or start skimping on your posts, you will inevitably lose the readership you have built up so laboriously. This is obviously related to the last point: you will need stamina to keep any position you have earned. But another danger is that as you rise through the blogging ranks you might start to believe in your own importance: after all, the A-list bloggers do indeed wield great power. But that power comes from the readers, and as soon as arrogance starts to creep into posts, that audience will diminish, and with it the power. It is noticeable how humble many of the top bloggers are towards their audience, constantly thanking them for their attention and loyalty.

10: A day job

Don't delude yourself: you will never make enough money from your blog to give up the day job. Just look at the A-list: almost all of them do something else as well as blog (though quite how they find the time is one of the blogosphere's great mysteries). By all means use Google's AdSense on your site – its online Reports are invaluable, because they give an up-to-the-minute figure for the number of visitors your site has had each day. But as these will show, the sad fact is you are only likely to get one click-through per thousand visitors: even with valuable keywords paying up to a dollar a time, you will need absolutely vast traffic to make a living from this.
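
If you doubt it, run the numbers yourself. Every figure in this little sketch is an assumption of mine rather than anything AdSense publishes, but the conclusion survives a lot of tweaking:

# Every figure here is my own assumption, for illustration only
visitors_per_click = 1000    # one click-through per thousand visitors
dollars_per_click = 1.00     # at the optimistic end of the scale
target_income = 30000        # a modest annual living, in dollars

clicks_needed = target_income / dollars_per_click
visitors_needed = clicks_needed * visitors_per_click
print(f"{visitors_needed:,.0f} visitors a year, or about {visitors_needed / 365:,.0f} a day")
# 30,000,000 visitors a year, or about 82,192 a day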

Blogging won't ever be your main job, but it might well help you get a better one. The thing to remember about blogs is that they are a great way of marketing yourself to the world – especially the parts you never knew might be interested. This is something else that you need to keep in mind when you write your blog: the fact that at any moment a future employer may be reading it.

(These comments are based on twenty-five years of journalism, including fifteen covering the Net, colliding with a few months of active blogging. Obviously, they are very early thoughts on the matter, based on limited experience. I'd be interested in the views of other people - especially those with more blogging under their belt - on what are the things you need to build up an audience for your blog.)

21 February 2006

A Question of Value

Although not quite in the same class as Open Access News in terms of signal-to-noise ratio, Slashdot does have its uses, not least as raw entertainment (it's also quite useful for a bit of blog boosting...). And sometimes it throws up something that is pure gold.

A case in point tonight, with a link to a story by James Boyle in today's FT. Since that story will more likely than not sink irrevocably into the limbo of subscriber-only content, I won't even bother wasting angled brackets and Href= on it. Happily, though - and this, perhaps, is the real value of Slashdot - one of the comments linked to a much fuller version of the article's underlying arguments that is freely available on Boyle's rich if horribly-designed Website (frames, in the 21st century: can you believe it?)

The item is called, intriguingly, "A Politics of Intellectual Property: Environmentalism For the Net?" Here's a short excerpt from the opening section:

The theme of cyberpunk is that the information age means the homologisation of all forms of information - whether genetic, electronic, or demographic. I grew up believing that genes had to do with biology, petri dishes and cells and that computers had to do with punch cards and magnetic disks. It would be hard to imagine two more disparate fields. In contrast cyberpunk sees only one issue - code - expressed in binary digits or the C's, G's, A's and T's on a gene map.

This should give a hint of just how spot-on the whole piece is.

I won't attempt to summarise the whole thing here - partly because I've not digested it fully myself, and it seems too important to vitiate with my own ham-fisted approximations, and partly because Boyle's essay is, in any case, precisely the length it needs to be for a deep analysis of a complicated domain. You might read it now, or wait until I come up with some vaguely coherent thoughts in due course: this, I most certainly will do, since the issues it touches on are central to much of what I am writing about here.

You have been warned.

20 February 2006

Open Business on Open Content

Once more, the indispensable Open Access News takes me somewhere I didn't know I wanted to go. This time it's to a site called Open Business. According to its home page:

OpenBusiness is a platform to share and develop innovative Open Business ideas - entrepreneurial ideas which are built around openness, free services and free access. The two main aims of the project are to build an online resource of innovative business models, ideas and tools, and to publish an OpenBusiness Guidebook.

At the moment it seems to be another tripod, with legs in the UK, Brazil and South Africa. Its basic form is a blog, topped off with a dash of wiki.

The link that brought me here led to an interview with Esther Dyson. I have to confess that I tend to find her a little, er, light, shall we say? But this interview was an exception, and she had some interesting background to give on Del.icio.us, in which she was an angel investor.

I'm still not entirely clear what the site is doing - either strategically or structurally - but it has pointers to stuff I wasn't aware of, so it gets brownie points for that if for nothing else. One to return to.

Freedom, in Other Words

Recently the blogosphere went slightly bonkers over a story that "the Korean government plans to select a city and a university late next month where open-source software like Linux will become the mainstream operating programs." It seems to have been the concept of a "Linux city" that really caught people's attention (though surely "Linux city" must refer to Helsinki?). But the significance of this announcement is perhaps not quite what most people think.

As the article rightly pointed out, there's nothing new in the idea of a city powered by free software: the much-ballyhooed Munich conversion to open source was along the same lines. And there are plenty of other, smaller-scale deployments that show that free software is up to the job. But there is another element in this story that is in some ways more interesting.

Yes, open source can now offer all the usual elements - operating system, browser, email, office apps - that people need for their daily computing; but even more impressively, it can do this in many languages. In other words, there is now a multi-dimensional depth to free software: not only in terms of the apps that have been created, but also with regard to the languages in which many of those apps are available natively.

For example, Firefox boasts versions in languages such as Asturian, Basque, Macedonian and Slovenian, with several others - Armenian, Gujarati and Mongolian - listed as "Not Yet Available", implying that they will be. OpenOffice.org does even better, offering versions in languages such as Albanian, Azerbaijani, Galician, Khmer and Sango (new to me). There is an even longer list of languages that includes others that are being worked on.

And if that doesn't impress you, well, you must be pretty jaded linguistically. So consider this: for many of these languages, you can download the program for three platforms: GNU/Linux, Windows and MacOS X. "Get The Facts", as Microsoft loves to say; but the ones it cares to give are partial, and they strangely omit any mention of this factual strength-in-depth that only free software can offer.

Indeed, there have been various cases where national governments of smaller nations have all but begged Microsoft to port some of its products to their language, only to be refused on the grounds that it wasn't economically "viable" to do so. Which just goes to show how complicated is the warp and weft of software and power, culture and money.

Viability has never been much of an issue for free software; if it had been, Richard Stallman would never have bothered starting it all off in the first place. As far as he is concerned, there's only one word that matters, whatever the language, and that's freedom. As the sterling work underway to produce localised versions of open source software emphasises, that includes the freedom to work in your own tongue.

17 February 2006

The Economics of Open Access Books

I've written before about open access books; but such is my sad state of excitement when I come across good examples that I feel obliged to pass on another one. The home page for the book Introductory Economic Analysis by R. Preston McAfee declares itself to be "the open source introduction to microeconomics" no less, and the online blurb explains why, and also offers some interesting thoughts on the economics of academic book publishing:

Why open source? Academics do an enormous amount of work editing journals and writing articles and now publishers have broken an implicit contract with academics, in which we gave our time and they weren't too greedy. Sometimes articles cost $20 to download, and principles books regularly sell for over $100. They issue new editions frequently to kill off the used book market, and the rapidity of new editions contributes to errors and bloat. Moreover, textbooks have gotten dumb and dumber as publishers seek to satisfy the student who prefers to learn nothing. Many have gotten so dumb ("simplified") so as to be simply incorrect. And they want $100 for this schlock? Where is the attempt to show the students what economics is actually about, and how it actually works? Why aren't we trying to teach the students more, rather than less?

(This closely mirrors Linus' own feelings about the high cost and low quality of proprietary software.)

As a consequence of this unholy alliance of greed and shoddiness, McAfee suggests:

The publishers are vulnerable to an open source project: rather than criticize the text, we will be better off picking and choosing from a free set of materials. Many of us write our own notes for the course anyway and just assign a book to give the students an alternate approach. How much nicer if in addition that "for further reading" book is free.

Introductory Economic Analysis is truly open access, as the author explains: "You are free to use any subset of this work provided you don't charge for it, and you make any additions or improvements to it available under the same terms." To be precise, it is published under the Creative Commons Attribution-NonCommercial-ShareAlike 2.0 licence. Although I've only just started working through the text, I can heartily recommend it for its pervasive clarity and gentle wit - not qualities that you normally associate with economics textbooks.

As well as for the generous gift of the book itself, I'm also grateful to McAfee for the link he gives to a site that's new to me, called The Assayer, which describes itself as "the web's largest catalog of free books." It turns out the vast majority of these are in the fields of science, maths and computing. It's not a huge collection, but since it includes titles as fine as McAfee's, things are clearly looking up in the world of open access books.

Goodbye Goobuntu, We Hardly Knew Ye

Mark Shuttleworth has finally put the ridiculous Goobuntu rumour out of its misery. The idea that Google might come out with an operating system - even one based on the very-wonderful Ubuntu - was just so crazy that I find it really hard to understand why everyone - especially the blogosphere - swallowed it.

There is absolutely no fit between the commoditised operating systems market and the core competencies of Google - even if Google itself depends on GNU/Linux to keep its server farms humming. There is a world of difference between using something, and trying to sell it to everybody and their dog.

16 February 2006

There's No Such Thing as a Free...Culture

Well, Becky Hogge probably wouldn't agree. She's written a useful summary of all the IP ins and outs this year, wondering whether this will be "the year of free culture?"

I'm not holding my breath, but I was grateful to be told about the Adelphi Charter on "creativity, innovation and intellectual property", which I'd not heard of before.

Hogge concludes:

What is crucial now is that defenders of the public good vested in the democratic dissemination of information step forward to make their voices heard. Expect this columnist to return to the matter, and to amplify these voices.

I look forward to it.