There was a small ripple around the internet this morning caused by the Home office opening up the Beta terrorist reporting tool.

To what extent the reports from this tool are monitored is unclear, but I suspect this will cause more problems that it solves.

Even before we consider the rather broad definition the government has for illegal material (which on the face of it could cover a number of science and religious texts), I can see the tool quickly becoming buried under false positives – whether through over sensitive citizens or through plain vindictiveness – which would need to be investigated.

Even if no further action is taken after the investigation, the cost in both time and resources must surely represent a significant risk that things that are actually a threat will be missed.

Today saw the release of Data.gov.uk, the government data website spearheaded by Tim Berners-Lee which hopes to collate government data and make it available for people to build on.

Although it is clearly aimed at developers, it is my hope that innovative and genuinely useful tools will quickly start popping up as entrepreneurs get to grips with this new wealth of information.

The launch has triggered a fair amount of buzz, and a flurry of blog posts elsewhere which do a much better job at explaining the ins and outs of the site than I have time to.

Personally, I think this is a good step in the right direction. It is also good to see that they have opted to go ugly early – publishing the raw data so we can begin hacking straight away – rather than wait until their cathedral-like semantic web interface is perfect.

True, while the data is in this state it is not so useful to the wider world – yet. Projects such as Scores on the doors have proven that turning raw data into something useful can be a useful and profitable undertaking, so I’ve no doubt that this will change.

The biggest disappointment is the choice to release much of the data under crown copyright. While this was almost certainly a compromise to get anything to happen at all, it would have been nice if the government had taken the bolder step and released it unencumbered and let the economy profit from it.

I would also like to see more local authorities opening up their data, moving away from the idea that everything has to be centralised.

Still, the new site follows a general positive trend of data glasnost which has already seen the promise to open up the postcode database, and in that spirit I welcome it.

Image “New, Improved *Semantic* Web!” by Duncan Hull

I have just written a very small Akismet plugin for Elgg.

When enabled and configured, this plugin will scan newly submitted comments of the ‘generic_comment’ annotation class.

While spam comments are rarer on Elgg due to the fact that most sites don’t allow anonymous comments, this could be useful for people who are getting spam comments from people who have signed up.

This plugin comes into its own when you allow anonymous comments, such as on a site I recently built for a client.

Extending this plugin to scan other content should be fairly straight forward for even a novice coder, but if I have time I’ll provide an interface to do so.

Anyway, go get it here, or check out the project page on google code!

Image “Spam! [don’t buy]” by David Trattnig