Monday October 16, 2006
08:10 PM
Perl catches a pedophile on Myspace.
Wired Magazine's Kevin Poulsen has shown that at least part of the problem of policing MySpace is a SMOP.
http://www.wired.com/news/technology/1,71948-0.html My road to this New York police unit began in Perl.
In May, I began an automated search of MySpace's membership rolls for 385,932 registered sex offenders in 46 states, mined from the Department of Justice's National Sex Offender Registry website -- a gateway to the state-run Megan's Law websites around the country. I searched on first and last names, limiting results to a five mile radius of the offender's registered ZIP code.
Wired News will publish the code under an open-source license later this week.
The code swept in a vast number of false or unverifiable matches. Working part time for several months, I sifted the data and manually compared photographs, ages and other data, until enhanced privacy features MySpace launched in June began frustrating the analysis.
Excluding a handful of obvious fakes, I confirmed 744 sex offenders with MySpace profiles, after an examination of about a third of the data. Of those, 497 are registered for sex crimes against children. In this group, six of them are listed as repeat offenders, though Lubrano's previous convictions were not in the registry, so this number may be low. At least 243 of the 497 have convictions in 2000 or later.
Excellent; now what about the flipside? (Score:1)
That’s good; but think about the other side of the coin too. The Semantic Web and The Quilombo [zedshaw.com] makes this point…
You have got to be kidding!?!?! (Score:1)
To start with, the person doing this was using a public list of known criminals and searching and cross referencing that list though a site membership which is a known hot spot for sexual predators. You only need to do a quick search of the news from the
Re: (Score:1)
I didn’t criticise him for identifying sex offenders on Myspace, did I?
I am worried about the trend of putting more and more ever more easily minable personal records online. In this case it was used for good, but what might happen when someone els
Re: (Score:1)
No, you equated it with IBM sorting out Jews in the German census data. Of course you did it indirectly, but you still did it. This was my issue.
Re: (Score:1)
You don’t have much control over your public data either. Just try to cancel your account and remove all of its traces with most online services. Or try to get everything you have put online out of Google.
The only difference is that with public dat
Re: (Score:1)
Some more points (sorry):
Well, of course. Both cases are, in fact, exactly the same act. But you failed to notice that I made a disctinction about the intent. I did not say it was a ba