1

I have a large image gallery, and a few years later I realized that I get more and more duplicates. :) Basically, my gallery consists of 7-Zip archives containing the image sets. I saw there are some commercial image duplicate scanners for Windows, but, unfortunately, they deal with RAR and ZIP formats only.

So I would like to find a tool that can scan the regular directories and such archives like they were a regular directories. (Perhaps, there could be some kind of a virtual file system mount if possible, that could make something like a file system mirror where all archives are expanded to regular files.) The target system is Windows, though Cygwin-related ideas are also really welcome.

Can anyone help? Your help would be very appreciated.

1 Answers1

2

I don't know any tool which does that but if you're interested to try for yourself a way to achieve that would be to write a script wrapping the 7z l -slt archive.7z which for each file would append to a file:

  • the name of the file
  • the size of the file
  • its CRC

From here you just have to find duplicates (defined as 'Equal CRC and size' to limit collisions) and delete them using 7z d archive.7z filename.

edit:

You may want to look at this project, even though it doesn't support writing yet it could evolve:
fuse-7z

Many other solutions exist for other archive formats, here is a list: http://sourceforge.net/apps/mediawiki/fuse/index.php?title=ArchiveFileSystems

Shadok
  • 4,070