10 December 2007

Towards Open Government Data

One of the premises of this blog is that openness - radiating out from open source through open content, open access, open data and the rest - is more than a technical issue. Ultimately it is something that will touch every aspect of our lives.

One manifestation of this is the move to obtain free access to government data. In the UK, the Guardian has been a leading campaigner, and here's some news of what going on the other side of the pond:

I got a sense for the importance of the task talking with Dan O’Neil, who is “people person” for Everyblock.com, a remarkable project headed by Adrian Holovaty designed to be a “one-stop shop” for information about urban neighborhoods, including building permits, crime reports, planned improvements, school information, etc. Dan’s job is to negotiate with government officials in the twenty cities Everyblock seeks to map, and gain access to vast geocoded data sets. Armed with a set of principles and best practices that government geeks can show to their bosses, his job would be a lot easier than it is right now.

Most significant, perhaps, is the definition of what constitutes open government data:

Government data shall be considered open if it is made public in a way that complies with the principles below:

1. Complete
All public data is made available. Public data is data that is not subject to valid privacy, security or privilege limitations.

2. Primary
Data is as collected at the source, with the highest possible level of granularity, not in aggregate or modified forms.

3. Timely
Data is made available as quickly as necessary to preserve the value of the data.

4. Accessible
Data is available to the widest range of users for the widest range of purposes.

5. Machine processable
Data is reasonably structured to allow automated processing.

6. Non-discriminatory
Data is available to anyone, with no requirement of registration.

7. Non-proprietary
Data is available in a format over which no entity has exclusive control.

8. License-free
Data is not subject to any copyright, patent, trademark or trade secret regulation. Reasonable privacy, security and privilege restrictions may be allowed.

Nokia: Hollywood's Lapdog, and People's Enemy

Somewhat naively I thought that Nokia was a savvy company on the side of light - maybe because it's Finnish; but I was wrong, it seems:

Nokia has filed a submission with the World Wide Web Consortium (W3C) objecting to the use of Ogg Theora as the baseline video standard for the Web. Ogg is an open encoding scheme (On2, the company that developed it, gave it and a free, perpetual unlimited license to its patents to the nonprofit Xiph foundation), but Nokia called it "proprietary" and argued for the inclusion of standards that can be used in conjunction with DRM, because "from our viewpoint, any DRM-incompatible video related mechanism is a non-starter with the content industry (Hollywood). There is in our opinion no need to make DRM support mandatory, though."

...

Nokia intervention here is nothing short of bizarre. Ogg is not proprietary, DRM is, and DRM-free may be a "non-starter" for Hollywood today, but that was true of music two years ago and today, most of the labels are lining up to release their catalogs without DRM. The Web, and Web-based video, are bigger than Hollywood. The Web is not a place for proprietary technology or systems that take over your computer. For Nokia (and Apple, who also lobbied hard for DRM inclusion) to get the Web this badly wrong, this many years into the game, is really sad: if you haven't figured out that the Web is open by 2007, you just haven't been paying attention.

Time to cross Nokia off the Christmas card list, then.

Apple the Imitator

Seems like Microsoft isn't the only company copying the innovations of the open source world:

Apple plans at Macworld to introduce a 12-inch Mac laptop with flash memory in place of a hard drive.

Wow, how original....

OpenID Becomes Enterprising

On Open Enterprise blog.

Open Source Society

One of the themes of this blog is how the principles behind open source can be applied to other domains. Here's someone with the same idea:

Just say the words quietly to yourself: open…source….society…. A society where the inner workings of the government, the economy, every aspect of everyday life, are placed under the spotlight for every citizen to see, examine, and have an impact on.

These are my goals: to present ways you can improve your own life and the lives of your friends and family through the benefits of the open source movement; to present ways you can give back to the movement through your own ideas, labor, or financial support; to present ways you can have a positive impact on other aspects of your life completely unrelated to technology through the use of an open source philosophy; to create a movement hellbent on remaking the world into a more cooperative, friendly, honest, and above all equitable place to live, work, and play.

Deutschland 2.0 Über Alles

One of the besetting faults of the online world is a certain anglocentricity in its reporting: we tend not to hear much about the goings-on in other parts of the world - even other parts of Europe. So for all those of you who were wondering, here's a list of the top 100 Web 2.0 sites in Germany, complete with quick notes explaining what they do.

09 December 2007

Source(Forge) of Strength or Weakness?

On the Open Enterprise blog.

Sell the Analogue

This is what the film industry *really* makes its money from:

new research suggests that the presence of other people may enhance our movie-watching experiences. Over the course of the film, movie-watchers influence one another and gradually synchronize their emotional responses. This mutual mimicry also affects each participant's evaluation of the overall experience -- the more in sync we are with the people around us, the more we like the movie.

Note, too, that this is not something you can download and copy....

08 December 2007

YouTube, the Government's NewTube

In the light of this:


Keelan says, “YouTube is increasingly a resource people consult for health information, including vaccination. Our study shows that a significant amount of immunization content on YouTube contradicts the best scientific evidence at large. From a public health perspective, this is very concerning.”

Clearly, we need to start seeing YouTube for what it is: a communications medium that governments should be employing routinely to get messages - about health, for example - across:

According to Wilson, “The findings also indicate that public health officials should consider how to effectively communicate their viewpoints through Internet video portals.”

With one important caveat: that governments must learn to use YouTube on its own terms - not trying to impose traditional formats, which will simply be ignored. That's going to be hard...

Sage Does the Maths of Free Software

One of the persistent myths about free software is that successes like Linux are one-offs, and that the open source methodology can't be applied easily to tackle complex software challenges. In the early days of free software, the relative paucity of end-user apps was trotted out as proof of this idea - The GIMP stood in splendid isolation back then.

Things have change, though; today, there is a wide range of high-quality open source apps, and the list keeps on growing. Here's there latest, and it's a biggie:

Until recently, a student solving a calculus problem, a physicist modeling a galaxy or a mathematician studying a complex equation had to use powerful computer programs that cost hundreds or thousands of dollars. But an open-source tool based at the University of Washington won first prize in the scientific software division of Les Trophées du Libre, an international competition for free software.

The tool, called Sage, faced initial skepticism from the mathematics and education communities.

"I've had a surprisingly large number of people tell me that something like Sage couldn't be done -- that it just wasn't possible," said William Stein, associate professor of mathematics and lead developer of the tool. "I'm hearing that less now."

Open-source software, which distributes programs and all their underlying code for free, is increasingly used in everyday applications. Firefox, Linux and Open Office are well-known examples.

But until recently, nobody had done the same for the everyday tools used in mathematics. Over the past three years, more than a hundred mathematicians from around the world have worked with Stein to build a user-friendly tool that combines powerful number-crunching with new features, such as collaborative online worksheets.

"A lot of people said: 'Wow, I've been waiting forever for something like this,'" Stein said. "People are excited about it."

Sage can take the place of commercial software commonly used in mathematics education, in large government laboratories and in math-intensive research. The program can do anything from mapping a 12-dimensional object to calculating rainfall patterns under global warming.

The benefits of using free software for maths extend far beyond the usual ones:

The frustrations weren't only financial. Commercial programs don't always reveal how the calculations are performed. This means that other mathematicians can't scrutinize the code to see how a computer-based calculation arrived at a result.

"Not being able to check the code of a computer-based calculation is like not publishing proofs for a mathematical theorem," Stein said. "It's ludicrous."

(Via A Blog Around the Clock.)

07 December 2007

A Moot of Folksonomies

Being a rigorous sort of chap, I was sceptical about folksonomies - ad-hoc tags. But over time I've come to appreciate their power - and the fact that once people start using them routinely, the combined body of folksonomic knowledge becomes quite impressive.

But the obvious question is: what lies beyond the simple tag? Myabe this kind of thing:


GroupMe! extends the idea of social tagging systems like del.icio.us, Flickr or BibSonomy by introducing the group dimension. The foundation of social tagging systems are so-called folksonomies, which describe how users (folks) tag resources (e.g. photos, videos, publications, etc.). In technical terms a folksonomy is just a collection of tag assignments:

(User, Tag, Resource) = User has tagged Resource with Tag at a particular time.

Over time it is likely that semantics emerge, e.g. tags that are often assigned to same resources may be synonyms. Hence, folksonomies are promising to improve (web) search, etc. With GroupMe!'s approach of taggable groups we extend tag assignments with a group dimension:

(User, Tag, Group, Resource) = User has tagged Resource with Tag in a certain Group at a particular time.

This prompts the next question: what do we call these groups? I vote a "moot".

What Next: Copyright Tax on Potato Stamps?

Printer manufacturer Hewlett-Packard has announced that the German Supreme Court ruled in a hearing that the firm will not have to pay a flat fee to German copyright collective VG Wort to cover copyrights.

And I should damn well hope so: why should people have to pay a copyright tax to print out their own documents - which, surprisingly, is what most people do most of the time. If a printer, why not on pens? Pencils? Potato stamps? - They can all be used to commit heinous acts of copyright infringement that would doubtless bring civilisation as we know it to its knees....

Interview with OSC's Mark Taylor

On the Open Enterprise blog.

Likewise Does Likewise

On the Open Enterprise blog.

(Henceforth, for those who might be interested, I'll be posting simple links to entries on my new Open Entreprise blog in this way. If you're not, just ignore them.)

Seagate All at Sea

Here's a company with a death-wish:

SEAGATE'S latest batch of drives are not compatible with the Open Sauce operating system Linux.

...


There are a few work-arounds but Seagate Tech Support says they do not know what they are. Instead they are telling man plus dog that their latest drives do not support Linux.

How to run a 21st-century computer company. Not.

06 December 2007

Fixing ISO...with OpenISO

As I noted below, ISO has some serious problems. The solution? Open up:

The vision of OpenISO.org is to become a truly open international standards organization. In particular,

* Decisions between conflicting opinions or interests should always be made in a fact-oriented manner based on sound engineering and openness principles.
* Participation in OpenISO.org work should be open to everyone who is willing and able to work according to a reasonable set of procedural guidelines. Draft OpenISO.org Core Guidelines.
* In addition to facilitating a conversation among experts (by means of which standards are developed and evaluated, etc), OpenISO.org should also solicit, actively consider, and respond to feedback from the general public.
* OpenISO.org will be active both in the area of developing technical specifications which are suitable as standards and in the area of reviewing documents published by other organizations for compliance with principles of good engineering, openness and economic fairness.
* All work documents of OpenISO.org will be made freely available to everyone via the internet, free of charge.
* Currently OpenISO.org is simply a personal initiative with the commitment that at the very least, a good and appropriate problem report regarding the OOXML specification will be produced. However, a serious attempt is being made to establish a sustainable business model for OpenISO.org, this could be an industry consortium composed of companies which are interested in ensuring that fact-oriented review of standardization proposals is done in a credible manner, or alternatively a start-up business of some kind might be established for which it makes sense as part of its marketing plan to operate OpenISO.org.

(Via Planet FSFE.)

Dysfunctional ISO - Courtesy of Microsoft

This is an extraordinary testimony to the havoc wrought by Microsoft on parts of ISO through its attempts to get OOXML (aka ECMA 376) fast-tracked:

This year WG1 have had another major development that has made it almost impossible to continue with our work within ISO. The influx of P members whose only interest is the fast-tracking of ECMA 376 as ISO 29500 has led to the failure of a number of key ballots. Though P members are required to vote, 50% of our current members, and some 66% of our new members, blatantly ignore this rule despite weekly email reminders and reminders on our website. As ISO require at least 50% of P members to vote before they start to count the votes we have had to reballot standards that should have been passed and completed their publication stages at Kyoto. This delay will mean that these standards will appear on the list of WG1 standards that have not been produced within the time limits set by ISO, despite our best efforts.

Unless ISO tightens up on its rules, and removes or demotes, P members who do not vote as required by ISO rules I would recommend my successor that it is perhaps time to pass WG1’s outstanding standards over to OASIS, where they can get approval in less than a year and then do a PAS submission to ISO, which will get a lot more attention and be approved much faster than standards currently can be within WG1. The disparity of rules for PAS, Fast-Track and ISO committee generated standards is fast making ISO a laughing stock in IT circles. The days of open standards development are fast disappearing. Instead we are getting “standardization by corporation”, something I have been fighting against for the 20 years I have served on ISO committees. I am glad to be retiring before the situation becomes impossible. I wish my colleagues every success for their future efforts, which I sincerely hope will not prove to be as wasted as I fear they could be.

(Via The Open Sourcerer.)

Reneging on Gowers

When the Gowers review of copyright in the UK came out I was ambivalent: there was some good things, but also areas that were potentially problematic, notably in terms of one-side strengthening copyright enforcement. One year on, it looks like my fears were justified:


If enforcement and flexibility are two sides of the same coin, then one year on it looks like the toss has definitely gone to enforcement. This means that Government is in effect making the situation worse: concentrating on strengthening enforcement measures while failing to address the inherent inflexibility of copyright law that Gowers identified as a key factor in the general public’s disrespect for the law.

Given what is happening in France, with its idiotic "three strikes and you're out" approach - this isn't a game, you know - and the fact that the UK industry is beginning to salivate at the prospect of bringing in something similar here, this is not good.

Behold! The New Anti-Open Access FUD

As I've noted before, I'm something of a connoisseur of FUD, and I really like coming across new examples. Here's one, directed at the burgeoning open access movement, which wants to make publicly-paid for scientific papers freely available (and others, too):


'The idea of public access to research information is a little bit specious,' added Robert Parker, managing director of RSC publishing. 'The UK government will be funding the London Olympics in 2012, but that doesn't mean that everybody can have free tickets - there is a big difference between funding something and having it be freely available.'

Nice sleight of hand there, Robbie. Except that the UK government is funding the Olympics in the (probably mistaken) belief that everyone will benefit from the knock-on effects on the economy, world prestige, blah-blah-blah: so there *is* an expectation of getting something in return for the public funds. And of course no one expects free seats - because there is a finite number of them - whereas the larger benefits, if they materialise, can be shared.

Open access is different because taxpayers can benefit from it directly. Most importantly, though, open access is digital in nature, and therefore can be copied and distributed for effectively zero cost - it is non-scarce and non-rivalrous. There is no way of giving away seats at the Olympics for zero cost, because they are scarce, rivalrous resources. The economics are completely different, as any managing director should understand. (Via Peter Murray-Rust.)

Wired Uses the 'B'-word

I write about commons a lot here - digital commons, analogue commons - and about how we can nurture them. Whales form a commons, and one that came perilously close to becoming a tragedy. Which is why Japan's resumption of commercial whaling under a flimsy pretext of "scientific" whaling sticks in my craw. Obviously, I'm not the only one; here's the Chief Copy at Wired:

But more and more the Japanese are turning to the cultural-tradition defense, a blatant if clumsy attempt to portray themselves as the victims of cultural prejudice. That, too, is bilge water. This is no time for the world to cave in to some misguided sense of political correctness. On the contrary, pressure should be applied to stop. If Japan won't stop, a boycott of Japanese goods would not be unreasonable.

Oooh, look: there's the "b"-word: I predict we'll be hearing a lot more of it if Japan persists in this selfish destruction of a global commons.

Microsoft: Gone in a Flash

I have commented here and elsewhere that the new category of ultra-light portables using flash memory represents a huge opportunity for GNU/Linux - and a potential bloody nose for Microsoft. It seems I'm not the only one to see it that way:


Microsoft announced plans today to expand support for Windows XP on budget flash-storage computing devices with an eye towards getting Windows XP running on the OLPC. The software giant will publish design guidelines next year that will make it possible for manufacturers of low-cost mobile devices to build hardware that provides optimal compatibility with Microsoft's legacy operating system. The company also announced plans for field trials next month that will put Windows XP to the test on One Laptop Per Child (OLPC) project's XO laptop.

This is interesting from two points of view.

First, it confirms that Windows Vista is a complete dog: it will be Windows XP, rather than Vista, that forms the basis for most desktop computing - which means less revenue for Microsoft, and trouble with its entire ratchet pricing model.

Secondly, it shows - contrary to all the usual whining - that GNU/Linux is innovating, and Microsoft is following. In fact, this is far from the only such case of open source being the leader - remember, the Internet is open at all levels - but it's nice that Microsoft is once again seen to be trotting behind the open pack with its tail between its legs.

05 December 2007

DRM in the Analogue World

DRM is normally viewed as an issue in the world of digital content, which can be duplicated losslessly. But in this virtuosic post, Mike Masnick points out that it also exists in an analogue context in the form of noncompete agreements, which seek to prevent ideas being copied perfectly:


just think of noncompetes as the "DRM" of human capital. Just as DRM tries to restrict the spread of content, a noncompete seeks to restrict the spread of a human's ideas for a particular industry within the labor arena. Both concepts are based on the faulty assumption that doing so "protects" the original creator or company -- but in both cases this is incorrect. What it actually does is set up an artificial barrier, limiting the overall potential of a market. It may not be easy to see that from the position of the content creator or company management (or investors). It's natural to want to "protect," but it's actually quite damaging.

...


While it may seem easier to "protect" your ideas and your people, what you really end up doing is blocking off your own access to many of the ideas that you need to continue to innovate. You limit the vital mix of ideas to build not just decent products, but great products. Just as DRM has helped to destroy the record labels when competing against more nimble, more open technology -- noncompetes destroy businesses when competing against more nimble, more open technology clusters.

Brilliant.

Sun Gives Prizes to Teacher's Pet (Projects)

Sun has released some more details of its forthcoming Open Source Community Innovation Awards Program:

which will foster innovation and recognize some of the most interesting initiatives within Sun-sponsored open source communities worldwide. To participate in the program's first year, Sun has selected six communities: GlassFish, NetBeans, OpenJDK, OpenOffice.org, OpenSolaris and OpenSPARC. Prizes are expected to total at least $1 million (USD) a year.

Beginning in mid-January 2008, Sun and the six open source communities will announce details on how developers can participate in the individual programs. Each community will have its own contest rules and judging criteria. Prize winners will be announced in August 2008.

So, unlike Google's Summer of Code programme, which is basically to foster generic open source among young hackers, Sun's effort is targeted at its own projects. And nothing wrong with that, especially when one of them, OpenOffice.org, is a critical component of the free software stack. But Sun should't expect to get as many brownie points as Google, which, for all its faults, has been is playing the open source card very well (about which more later.)

A Real Carrot to Use Open Source

This was new to me:

Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize (cluster) search results into thematic categories

This is actually very useful, since it groups similar results together, and lets you sort through results by theme. There is an associated company, Carrot Search. (Via eHub.)