Batch delete duplicates?

classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

Batch delete duplicates?

Johnny
After having used the "find duplicates" tool, I end up with a long list
of photos that have duplicates. How would I go about to tell Digikam to
delete all except one copy (e.g. the first, or whichever really) of each
photo?

The only way I have found out is to go through the list for each photo
individually, but that becomes tedious for 700 some images that have
duplicates...

Thanks for any help!
--
Johnny
_______________________________________________
Digikam-users mailing list
[hidden email]
https://mail.kde.org/mailman/listinfo/digikam-users
Reply | Threaded
Open this post in threaded view
|

Re: Batch delete duplicates?

Rinus
I have an idea, but can not test it because I have no fingerprints, and
making them would take much too long.

This works alone if you have real duplicates, with same name not for
look alikes and duplcates with different name.

If digikam is able to show all the duplicates in the browser (I do not
know) you could select them all, move them all to a tempory directory,
if they have the same name it will ask if overwrite is ok select yes and
apply to all.

The only problem left is they are no longer in the original place.

Does this help?

Rinus

Op 28-08-11 16:19, Johnny schreef:

> After having used the "find duplicates" tool, I end up with a long list
> of photos that have duplicates. How would I go about to tell Digikam to
> delete all except one copy (e.g. the first, or whichever really) of each
> photo?
>
> The only way I have found out is to go through the list for each photo
> individually, but that becomes tedious for 700 some images that have
> duplicates...
>
> Thanks for any help!

_______________________________________________
Digikam-users mailing list
[hidden email]
https://mail.kde.org/mailman/listinfo/digikam-users
Reply | Threaded
Open this post in threaded view
|

Re: Batch delete duplicates?

jdd@dodin.org
Le 28/08/2011 18:53, sleepless a écrit :

>> After having used the "find duplicates" tool, I end up with a long list

problem is many photos can have the same name still no be true
duplicates (edited one versus original ones...)

jdd

--
http://www.dodin.net
http://www.youtube.com/user/jdddodinorg
http://jdd.blip.tv/
_______________________________________________
Digikam-users mailing list
[hidden email]
https://mail.kde.org/mailman/listinfo/digikam-users
Reply | Threaded
Open this post in threaded view
|

Re: Batch delete duplicates?

Johnny
In reply to this post by Rinus
sleepless <[hidden email]> writes:
> If digikam is able to show all the duplicates in the browser (I do not
> know) you could select them all, move them all to a tempory directory,
> if they have the same name it will ask if overwrite is ok select yes
> and apply to all.
This is the solution I went for in the end; from a terminal I move all
my uppercased files into a new directory, downcased them and moved them
back in the right dir (with digikam open, so not to confuse it).

Some research leads me to belive that a oneliner using find and rename
should be able to do it, but there seems to be several versions of the
rename utility, and the one on Fedora appears different than the Debian
which was referenced, so I couldn't use it.

,----
| find . -regex .*IM.*\.JPG -exec rename {} 's/(.*)\/([^\/]*)/$1\/\L$2/' {} \;
`----

In theory this would be nice if one has to do it often and avoid any
manual moving around files or looping, but alas, this is a quite
infrequent procedure so I am happy to move files for now. :)
(if anyone on Debian knows if my guess of different 'rename' tools is
correct and can make it work, that would be interesting knowledge
though!)

Thanks!
--
Johnny
_______________________________________________
Digikam-users mailing list
[hidden email]
https://mail.kde.org/mailman/listinfo/digikam-users
Reply | Threaded
Open this post in threaded view
|

Re: Batch delete duplicates?

Johnny
In reply to this post by jdd@dodin.org
jdd <[hidden email]> writes:

> Le 28/08/2011 18:53, sleepless a écrit :
>
>>> After having used the "find duplicates" tool, I end up with a long list
>
> problem is many photos can have the same name still no be true
> duplicates (edited one versus original ones...)
>
I wouldn't see it as a problem but rather a user decision of how to work
with images that are either edited or nearly identical for other
reasons, e.g. playing with camera settings and sequential shooting
somtimes give very similar images that can be picked up by the duplicate
tool. This will depend on the threshold setting of the "match" criteria,
but it would still be useful to be able to choose e.g. the first or last
version of a photo, or even keep first and last, and skip everything in
between to e.g. enable quick deletion of intermediary edits, revert to
"originals" or keep latest version only. Of course, one could get false
matches depending on the settings used, and what type of pictures the
album/s contains, but that I assume would be a user decision?

An issue for me (maybe others?) in adapting the digital world is that I
like to try to get organised now when the tools are available / I
(well, sometimes at least) understand how to use it, and many legacy
photos I have copied and moved around in a fairly random fashion (a
sporadic backup on a CD, maybe som image on a 3.5" disk still, some on
an old HDD etc.). Some images may be in several places, sometimes with
different names, in particular random caps (upper or lower). I want to
reel these in in a structured way to a common structure (i.e. Digikam),
but will clearly have some duplicates around. I really do not want to
blatantly lowercase all photos (as I now ended up doing), as there is a
risk, albeit small (however, if two cameras would for some reason start
at the same numbering at some point, a significant number of images
would be lost!), that two different cameras have used the same name for
different images which means I would lose images! As I know these images
are sufficiently different not to get false positives, using the
duplicates tools seems to be the right choice.

I think this may be achievable by using 'fdupes' from the command line,
but haven't tried it yet for this and am unsure what Digikam is using
for this? Surely it must be something similar?

Regards,
--
Johnny
_______________________________________________
Digikam-users mailing list
[hidden email]
https://mail.kde.org/mailman/listinfo/digikam-users