acme (email not shown publicly)
http://www.astray.com/
Leon Brocard (aka acme) is an orange-loving Perl eurohacker with many varied contributions to the Perl community, including the GraphViz module on the CPAN. YAPC::Europe was all his fault. He is still looking for a Perl Monger group he can start which begins with the letter 'D'.
Fascinating, anyone for bz2? (Score:2)
I'm amazed by your finding that an uncompressed CPAN is only 13% larger than the compressed version. I would have thought that anything text-based like a Perl module should compress very well, even with ZIP or tar.gz.
I wonder what is taking up all the space and is incompressible?
I know in the cygwin [cygwin.com] world bzip2 [redhat.com] is very popular, and I've wondered whether it would be useful for CPAN, or a future CPAN, to support it as well, to squeeze out a little more compression.
-- "It's not magic, it's work..."
Re:Fascinating, anyone for bz2? (Score:3, Informative)
thousands of tiny little files...
Re:Fascinating, anyone for bz2? (Score:2)
Re:Fascinating, anyone for bz2? (Score:2)
No, but there isn't much compression to be had in a 3k file, no matter what algorithm you use. I recall that the average file size is around 50k... gz, bz, Z... it'll all be much the same result.
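To put rough numbers on the small-file point, here is a minimal sketch using Python's standard gzip and bz2 modules. The sample inputs are made up for illustration and are not measurements of CPAN; the only thing it tries to show is that every compressed stream carries a fixed header and trailer, so a tiny input can come out larger than it went in.

```python
import bz2
import gzip


def compressed_sizes(data: bytes) -> dict:
    """Return the raw size and the gzip/bz2 compressed sizes of data."""
    return {
        "raw": len(data),
        "gzip": len(gzip.compress(data)),
        "bz2": len(bz2.compress(data)),
    }


# Synthetic inputs (assumptions, not real CPAN files).
tiny = b"use strict;\n"
large = b"use strict;\nuse warnings;\nmy $x = shift;\n" * 2000

for label, data in (("tiny", tiny), ("large", large)):
    print(label, compressed_sizes(data))
```

The large input is deliberately repetitive, so it overstates what real source code would achieve; the tiny one is the interesting case.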
Re:Fascinating, anyone for bz2? (Score:2, Insightful)
If you wanted to compress perl modules better, you'd want a denser file packing scheme than tar, and build a compression scheme that was prepopulated with a lot of the common perl substri
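One existing mechanism that resembles the "prepopulated" idea is zlib's preset dictionary. The following is only a sketch of that technique, not something CPAN does: the PERL_DICT contents and the deflate/inflate helper names are hypothetical, chosen here for illustration. Both sides must hold the same dictionary for decompression to work.

```python
import zlib

# Hypothetical dictionary of substrings assumed to be common in Perl source.
PERL_DICT = (
    b"use strict;\nuse warnings;\npackage sub my $self = shift;\n"
    b"return 1;\n__END__\n=head1 NAME\n"
)


def deflate(data: bytes, zdict: bytes = None) -> bytes:
    """Compress data with zlib, optionally seeding it with a preset dictionary."""
    if zdict is None:
        comp = zlib.compressobj(level=9)
    else:
        comp = zlib.compressobj(level=9, zdict=zdict)
    return comp.compress(data) + comp.flush()


def inflate(blob: bytes, zdict: bytes = None) -> bytes:
    """Decompress a zlib stream; the same dictionary must be supplied if one was used."""
    if zdict is None:
        decomp = zlib.decompressobj()
    else:
        decomp = zlib.decompressobj(zdict=zdict)
    return decomp.decompress(blob) + decomp.flush()


# A made-up small module body, the kind of file where a dictionary helps most.
small = (
    b"package Foo;\nuse strict;\nuse warnings;\n"
    b"sub new { my $self = shift; return 1; }\n"
)

print("plain:", len(deflate(small)), "with dict:", len(deflate(small, PERL_DICT)))
```

The gain comes from the compressor being able to reference the dictionary from the first byte, which matters mainly for files too short to build up their own history.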
Re:Fascinating, anyone for bz2? (Score:2)
While I agree that bz2 or some other compressor isn't going to fix the problem, I do find that on a tar of text files, it's quite a bit more than 1% more efficient than gzip.
I can't comment on replacing the tar structure, but I've seen comments on its weaknesses in other places too.
I'm still amazed at how little compression there is in CPAN. The latest module I've uploaded, for example, shrank from 90kb to 24kb with gzip (22kb with bz2). What is in there that doesn't compress?
-- "It's not magic, it's work..."
Re:Fascinating, anyone for bz2? (Score:1)
It's not that the data on CPAN is oddly uncompressible. It's that t
To be fair... (Score:3, Informative)
In particular, the size of a moderately complicated binary (XS module) is significantly larger than the source.
Also, what if you only take the latest (or latest two) versions of any given module? A lot of authors haven't heard that BackPAN exists, and that the Master Librarian would like to see things under 700 MB.
Were that I say, pancakes?
Blame (Score:2)