Raw Data Now in the UK

Good news from the Raw Data Now front. Tim Berners-Lee has been asked by Gordon Brown to help open up access to government data in the UK. Let’s hope this is not just a publicity stunt to boost Mr Brown’s plummeting popularity, but a true effort in creating a more transparent government. If this works, it should point more governments towards this direction.

In the meantime, critics in the UK point out the great obstacles such an attempt will stumble upon. BBC technology correspondent Rory Cellan-Jones comments:

I fear Sir Tim may be in for a bruising few months, as he tries to convince Sir Humphrey et al to let it all hang out.

Clearly, a lot has to change in the idiosyncrasy of state officials for Sir Tim’s vision to fully come to life, but at least he’s been given this chance. Let’s just see how this goes…

BBC adopting the Semantic Web

Being a major content owner, the BBC is using Semantic Web technologies to efficiently manipulate this content and improve the services offered to the public.

I met Richard Wright of the BBC Archive in the AXMEDIS conference in Barcelona on November 2007. He was showing then a demo of their system, which uses the GATE web services for NLP (Natural Language Processing) on textual news items. News items are analyzed to extract named entities according to an ontology, such as person names, companies, locations, etc. The attributes of the entities are also extracted, e.g. the position of a person within a company. The demo also showed video indexing with the use of various techniques.

The BBC Artists pages were recently launched, using Semantic Web technologies to enrich artists’ profiles and link them to external resources, such as Wikipedia entries. Matthew Shorter, BBC’s interactive editor for music, told CNET UK that “this is part of a general movement that’s going on at the BBC to move away from pages that are built in a variety of legacy content production systems to actually publishing data that we can use in a more dynamic way across the Web.”