
All the Perl that's Practical to Extract and Report


Alias (5735)

http://ali.as/

Journal of Alias (5735)

Wednesday November 11, 2009
10:03 PM

The Database That Ate the Census

[ #39886 ]

http://ali.as/census

Following up from my previous post, and some concerns raised about the legalities of using the Google Maps geocoder, I've managed to find a way to qualify for a free unlimited subscription to Microsoft Bing Geocoder, formerly Microsoft MSN Geocoder.

So if we do hit problems with Google, I have a workable fallback option.

Moving onwards, my biggest problem with the http://geo2gov.com.au/ service has been that I link to census keys which aren't particularly usable for most people.

The Census provides a huge range of information for these keys, but behind layers and layers of web interface. This is fine for basic situations, but what if I want to go and watch the Lithuania vs Argentina World Cup match to cheer against Argentina? How do I search for pubs and clubs located in areas or enclaves with high numbers of Lithuanian immigrants, so I can maximise my chance of bumping into others to cheer with?

(This is, of course, highly notional)

Fortunately, at many university libraries you can find copies of this ABS online information compiled into a single file tree. This, of course, still isn't particularly useful, because it isn't in easy-to-use file formats.

So I've fixed that problem with The Database That Ate the Census.

This is a single-file download, in either PostgreSQL backup or SQLite format, of the entire 2006 Australian Census data set. Or more specifically, the CD (Collection District) layer of the BCP (Basic Community Profile) subset of the census compilation.
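To give a rough idea of the kind of question this makes easy, here is a minimal Python sketch of the "where are the Lithuanians?" search. Everything in it is an assumption made for illustration: the table name birthplace, its columns cd_code, country and persons, and the handful of rows are invented placeholders, not the real schema or real census figures. Check the actual table and column names in the download before adapting it.

```python
import sqlite3


def top_districts(conn, country, limit=5):
    """Return (cd_code, persons) for the districts with the most
    residents born in `country`, largest first."""
    query = """
        SELECT cd_code, persons
        FROM birthplace          -- hypothetical table name
        WHERE country = ?
        ORDER BY persons DESC, cd_code ASC
        LIMIT ?
    """
    return conn.execute(query, (country, limit)).fetchall()


def build_demo_db():
    """Build a tiny in-memory database with made-up rows so the
    sketch runs without the real census file."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE birthplace (cd_code TEXT, country TEXT, persons INTEGER)"
    )
    # Placeholder data only; these are not census figures.
    conn.executemany(
        "INSERT INTO birthplace VALUES (?, ?, ?)",
        [
            ("CD0001", "Lithuania", 12),
            ("CD0002", "Lithuania", 48),
            ("CD0003", "Lithuania", 3),
            ("CD0002", "Argentina", 7),
        ],
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    for cd_code, persons in top_districts(conn, "Lithuania", limit=2):
        print(cd_code, persons)
```

Against the real file you would swap build_demo_db() for sqlite3.connect() on the downloaded database and adjust the query to whatever the tables are actually called. The collection district codes that come back could then be matched against the geocoder output to find the nearest pubs.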

The timing makes it too late to be used in the Mashup Australia competition, but just in time for the apps4nsw competition, which was launched today.

  • Hi Adam,

    very nice idea, but I think the sqlite version is truncated/corrupted. I've downloaded twice and obtained a 165.3MB file both times (rather than the advertised 174MB).

    sqlite3 produces "Error: database disk image is malformed" when it tries to open the unzipped db.

    The md5 sum of my downloaded version (census-20091111.sqlite) is

    cb3b4f03ffab095df265f4c9dc6ad6c5

    if that's any help.

    Cheers,
    Paul
       

  • Adam -- Thanks for your thoughtful comments on my mashup. Note that http://ali.as/contact.html gives a 404, alas, so I am responding here in case you don't go back and read the followup comments on the mashup site... we actually DID a lot of statistical analysis, but it got lost in an update. It's on the site now, enjoy, have fun. (And there is a bias. It isn't subtle. However, we don't know how monies are getting spent in the REST of the stim programs.) Looks like you are doing cool stuff here -- I w