“We need a Wikipedia for data”
The title of this post is not mine.
It is Bret Taylor’s.
Bret, of Google Maps and more importantly FriendFeed fame, is now at Facebook working closely with some of the best Microsoft alums I know.
Back in 2008, he was on to something, something important.
How do you discover a given dataset, particularly a common dataset that should be like “air” for developers?
Once you find it, what are the legal requirements to access it?
Once you can legally access it, what is the mechanism to access it? Do you have to screen scrape it? You would be surprised at the amount of screenscraping you need to do for even datasets you pay for. Jon Udell captured some of my personal frustration around this in 2006 here.
Of course, if you are a dataset provider, you have the inverse of these questions.
Bret called his solution to these problems, DataWiki.
I call it “Dallas”.
There is, however, a key difference between Bret’s concept of the DataWiki and “Dallas” that is best highlighted by a Steward Brand quote:
Information Wants To Be Free. Information also wants to be expensive.
I do not think you can ignore this tension and any “data as a service” like “Dallas” needs to internalize this deeply in both its technical architecture and business strategy.
With that said, I think of “Dallas” as an important example and (I hope) success story of the Open Data vision that many of us at Microsoft share.
Maybe Bret will get his DataWiki after all…
bad data is worse than no data. you don’t waste so much of your time no data. the internet is full of bad data.
Kris
6 Feb 10 at 20:39
Do you think Freebase meets this need?
http://www.freebase.com
Charlie Robbins
10 Feb 10 at 03:21