Douglas Purdy

“We need a Wikipedia for data”

with 2 comments

The title of this post is not mine.

It is Bret Taylor’s.

Bret, of Google Maps and more importantly FriendFeed fame, is now at Facebook working closely with some of the best Microsoft alums I know.

Back in 2008, he was on to something, something important.

How do you discover a given dataset, particularly a common dataset that should be like “air” for developers?

Once you find it, what are the legal requirements to access it?

Once you can legally access it, what is the mechanism to access it?  Do you have to screen scrape it?  You would be surprised at the amount of screenscraping you need to do for even datasets you pay for.  Jon Udell captured some of my personal frustration around this in 2006 here.

Of course, if you are a dataset provider, you have the inverse of these questions.

Bret called his solution to these problems, DataWiki.

I call it “Dallas”.

There is, however, a key difference between Bret’s concept of the DataWiki and “Dallas” that is best highlighted by a Steward Brand quote:

Information Wants To Be Free. Information also wants to be expensive.

I do not think you can ignore this tension and any “data as a service” like “Dallas” needs to internalize this deeply in both its technical architecture and business strategy.

With that said, I think of “Dallas” as an important example and (I hope) success story of the Open Data vision that many of us at Microsoft share.

Maybe Bret will get his DataWiki after all…

February 6th, 2010 at 5:34 am

Posted in Data, Microsoft, OData, Software Development

2 Responses to '“We need a Wikipedia for data”'

Subscribe to comments with RSS or TrackBack to '“We need a Wikipedia for data”'.

  1. bad data is worse than no data. you don’t waste so much of your time no data. the internet is full of bad data.

    Kris

    6 Feb 10 at 20:39

  2. Do you think Freebase meets this need?

    http://www.freebase.com

    Charlie Robbins

    10 Feb 10 at 03:21

Leave a Reply