ajt (2546)

UK based. Perl, XML/HTTP, SAP, Debian hacker.

  • PerlMonks: ajt
  • Local LUG: AdamTrickett
  • Debian Administration: ajt
  • LinkedIn: drajt

Journal of ajt (2546)

Tuesday July 25, 2006
03:03 PM

Finding Dupes

[ #30422 ]

Last Friday was my local mini-LUG meeting. Someone asked to borrow a small Perl script I hacked up that finds duplicated files on a server.

At work my Windows admins are running out of hours in the night to back things up. Adding more disk isn't a solution; we just can't get it all off to tape before the users are back on the systems. By forcing users to take ownership of their mess we were able to delete many megabytes of duplicated binary files scattered all over the place. It hasn't fixed the chaos problems, but it's put off the end of the world for a short while.

It turns out that my friend has the same problem: disk space is cheap, but being able to back it up isn't...
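
The script itself isn't reproduced in this entry, so what follows is only a minimal sketch of one way such a tool could work, not the original code. It assumes duplicates are detected by comparing MD5 digests, and that files are first grouped by size so that only files sharing a size are ever read; both are assumptions, as are the names find_duplicates and file_digest. Empty files are skipped.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use File::Find;
use Digest::MD5;

# Hypothetical helper: return the hex MD5 digest of a file's contents,
# or undef if the file cannot be opened.
sub file_digest {
    my ($path) = @_;
    open my $fh, '<', $path or return;
    binmode $fh;
    my $digest = Digest::MD5->new->addfile($fh)->hexdigest;
    close $fh;
    return $digest;
}

# Hypothetical helper: walk the given directories and return a list of
# array refs, each holding the paths of files with identical contents.
sub find_duplicates {
    my (@dirs) = @_;

    # Pass 1: bucket regular files by size. A file whose size is unique
    # cannot have a duplicate, so it never needs to be read.
    my %by_size;
    find(
        {
            no_chdir => 1,    # $_ holds the full path of each entry
            wanted   => sub {
                return if -l $_;        # ignore symlinks
                return unless -f $_;    # regular files only
                my $size = -s _;        # reuse the stat from -f
                return unless $size;    # skip empty files
                push @{ $by_size{$size} }, $_;
            },
        },
        @dirs
    );

    # Pass 2: within each size bucket, group by digest (assumed MD5).
    my @groups;
    for my $paths (values %by_size) {
        next if @$paths < 2;
        my %by_digest;
        for my $path (@$paths) {
            my $digest = file_digest($path);
            next unless defined $digest;    # unreadable file
            push @{ $by_digest{$digest} }, $path;
        }
        push @groups, grep { @$_ > 1 } values %by_digest;
    }
    return @groups;
}

# Print each group of duplicates, one path per line, with a blank line
# between groups. Does nothing when no directories are given.
if (@ARGV) {
    for my $group (find_duplicates(@ARGV)) {
        print "$_\n" for sort @$group;
        print "\n";
    }
}
```

Saved as, say, finddupes.pl (a made-up name), it would be run as `perl finddupes.pl /some/dir /another/dir`. MD5 serves here only as a cheap content fingerprint; a stronger digest module such as Digest::SHA could be swapped in the same way.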

  • Someone asked to borrow a small Perl script I hacked up that finds duplicated files on a server.

    Has he given it back yet?

    • I must confess I've not lent it to him yet... ;-)

      -- "It's not magic, it's work..."
  • Regarding the backup problem: why not get some of those cheap disks and create a staging box where all of the "data to be backed up" is stored, and then have the tape back up from that?
    • For our Unix systems we back up to disk using IBM's FlashCopy, which is our preferred backup/restore method for the SAP systems. The Windows systems are a bit primitive: they don't do LVM or any fancy file systems, just old-fashioned NTFS on hardware-mirrored disks. While you can do smart things with the disk, we don't...

      I've now completed my re-write of my find-duplicate-files tool, which works okay on my Linux box at home. It's a bit faster than the last version I wrote, as this one only checksums files

      -- "It's not magic, it's work..."