All the Perl that's Practical to Extract and Report

  • I think web scraping the Odeon website is not a problem. The problem would be (or was) turning a web-scraping script into a CGI that's available to everyone (and that Odeon users might mistake for Odeon's original site).

    So, as long as you can do whatever you want with the data you downloaded from the website, why not go for the web proxy option?

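    use strict;
    use warnings;
    use HTTP::Proxy;
    use HTTP::Proxy::BodyFilter::Odeon; # coding this module is left as an exercise for the reader ;-)

    my $proxy = HTTP::Proxy->new;    # listens on localhost:8080 by default

    # only responses coming from the Odeon site go through the filter
    $proxy->push_filter(
        response => HTTP::Proxy::BodyFilter::Odeon->new,
        host     => 'www.odeon.co.uk',
        # maybe a few more options here
    );

    $proxy->start;    # now point your browser at the proxy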

    And voilà, you can now surf as usual, infringing no copyright whatsoever and munging the web data for your own (and possibly your users') use. I don't think distributing a hypothetical HTTP::Proxy::BodyFilter::Odeon module would cause any trouble with Odeon (except for the name, maybe).
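
    For the curious, here is a rough sketch of what the skeleton of such a filter could look like. It only assumes the usual HTTP::Proxy::BodyFilter interface (subclass it and provide a filter() method that edits the body chunk in place). The actual munging is a placeholder: I'm not describing Odeon's real markup here, so the regex below just strips <script> elements as a stand-in.

    ```perl
    package HTTP::Proxy::BodyFilter::Odeon;    # hypothetical module, name taken from the snippet above

    use strict;
    use warnings;
    use base qw( HTTP::Proxy::BodyFilter );

    # HTTP::Proxy calls filter() for each chunk of the response body.
    # $dataref is a reference to the chunk; change it in place.
    sub filter {
        my ( $self, $dataref, $message, $protocol, $buffer ) = @_;

        # Placeholder munging (an assumption, not Odeon's real markup):
        # drop every <script>...</script> element found in this chunk.
        $$dataref =~ s{<script\b.*?</script>}{}gis;
    }

    1;
    ```

    Since an element can straddle two chunks, you would probably want to push HTTP::Proxy::BodyFilter::complete in front of this filter, so that it sees the whole body in one go.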

    I think I've found the tagline for my YAPC::Europe talk:

    HTTP::Proxy: surfing your version of the world-wide web.