this post was submitted on 11 Jan 2026
9 points (100.0% liked)

Data Hoarder


I don't know if this is the right place, but I figured the Hoarder Community would have a good idea on software.

I'm looking for an app that will scan an audio library and pick out duplicates. It has to do this by some means other than just filename, file size, or audio tags. Ideally it would use all of those criteria and also do an audio analysis. I do have all my music sorted, collated, and tagged correctly, though. Open source would be awesome. Barring that, free is also acceptable. LOL

'presh

top 11 comments
[–] Tippon@lemmy.dbzer0.com 3 points 1 month ago (1 children)

Which OS are you using? MusicBrainz Picard can scan your music and organise it on Windows and Linux. I use it on Mint to organise music into a root folder for Clementine to pick up.

I'm pretty sure that one of them has duplication detection, but I'm not at home to check.

[–] irmadlad@lemmy.world 3 points 1 month ago (1 children)

Which OS are you using?

I use them all. LOL

I was not aware that Picard could do that. I'll put it on the list. Currently I am running a scan with Czkawka comparing hashes. I'll see how it goes.

Thanks for the input.
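
For anyone curious what "comparing hashes" boils down to, here is a minimal Python sketch. It is not Czkawka's actual implementation, just the general idea, and the function names `file_digest` and `find_exact_duplicates` are made up for illustration. Note that this only catches byte-identical files; the same song ripped to two different formats, or the same file with different tags, will hash differently, which is why audio-content matching is a separate feature.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_exact_duplicates(root):
    """Return lists of paths under root whose contents are byte-identical."""
    # Files of different sizes cannot be identical, so bucket by size first
    # and only hash the files that share a size with at least one other file.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            by_size[os.path.getsize(path)].append(path)

    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        for path in paths:
            by_hash[file_digest(path)].append(path)

    # Keep only the groups that actually contain duplicates.
    return [sorted(paths) for paths in by_hash.values() if len(paths) > 1]
```

Calling `find_exact_duplicates("/path/to/music")` returns a list of groups, each group being the paths that share identical contents. Nothing is deleted; deciding which copy to keep is left to you.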

[–] Tippon@lemmy.dbzer0.com 3 points 1 month ago (1 children)

Ah, sorry, I've led you astray 😫

I checked, and I can't find anything for duplicates in either of them. I tried a few different programs before I settled on these, so it must have been one of the others.

[–] irmadlad@lemmy.world 3 points 1 month ago (1 children)

No worries, mate. I am grateful for any input or recommendations, and thank you for taking the time to do so. I deployed Czkawka, did a couple of test runs, then pulled the trigger. Czkawka is quite fast considering I have a fairly large audio collection. It has many options, and I am now duplicate free.

[–] Tippon@lemmy.dbzer0.com 3 points 1 month ago (1 children)

I'll have to have a look at that myself. I'm pretty sure that my only duplicates now are different formats, but I could do with checking. I could do with sorting through my photos too, but that would take ages.

[–] irmadlad@lemmy.world 3 points 1 month ago (1 children)

Oof! Sorting photos. Back in the day Picasa was pretty good. The downside is it's a Google product. Not sure how you feel about Google. As far as the audio collection, I ripped everything to FLAC long ago when I ran a licensed internet radio station, in the pre-Napster era when audio on the internet was mostly cheesy MIDI files you were forced to listen to when you visited someone's MySpace or Geocities. My mail carrier at the time got so very angry with me because I would solicit indie bands for their CDs to promote them, free of charge. I am just a huge music fan. But I had to put a big box out by the mailbox for him to dump them all in every day. Good times.

[–] Tippon@lemmy.dbzer0.com 3 points 1 month ago

I sorted out my phone photos a while back, and arranged them all properly, but then I found an old backup with loads of missing photos and dodgy file names. Dropbox automatically renamed them during a backup about ten years ago, so I would have had to rename them all and check for duplicates again, and I just couldn't be bothered at that point.

I'm an amateur photographer, so I've got tens of thousands of photos going back at least 20 years, so I really do need to sort them out.

I've got thousands of music files too, mostly as mp3. I've just finished organising those, so the next job is getting rid of the crap and getting better quality copies of what's left.

Funnily enough though, my father was really into the local music scene in the 90s and got me into it, so I've got a load of music from small unsigned bands too. I help with a small music festival here, also for unsigned acts. I haven't upset the postman here though, I've only had posters delivered. So far... 😁

[–] Zachariah@lemmy.world 2 points 1 month ago (1 children)

If they’re all tagged correctly, then wouldn’t you be able to merely match duplicates based on tags?

https://dupeguru.com/does-dupeguru-support-scanning-music-files/ (dupeGuru is open source) can help with that.

Or filter for open source among the alternatives to dupeGuru (such as Czkawka) here: https://alternativeto.net/software/dupeguru/

Oh, I just found a table of options on https://github.com/qarmin/czkawka with some saying they support audio content matching.

Weird: https://czkawka.com/ has cancer; do not visit it. Seems https://github.com/qarmin/czkawka is the real website.
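
On the tag-matching idea from the top of this comment, here is a rough Python sketch of what "match duplicates based on tags" could look like. It assumes the tags have already been read into plain dicts (for example with a tag library such as mutagen); the `path`, `artist`, `album`, and `title` keys and the `group_by_tags` function are hypothetical names for illustration, not any tool's real API.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences don't matter."""
    return " ".join(str(value).lower().split())


def group_by_tags(tracks, keys=("artist", "album", "title")):
    """Return lists of paths whose normalized tags are identical.

    `tracks` is an iterable of dicts with a "path" entry plus tag fields.
    """
    groups = defaultdict(list)
    for track in tracks:
        key = tuple(normalize(track.get(k, "")) for k in keys)
        # Skip completely untagged files so they are not lumped together.
        if not any(key):
            continue
        groups[key].append(track["path"])
    return [paths for paths in groups.values() if len(paths) > 1]
```

This would also catch the same track stored in two formats (say FLAC and mp3), which a content hash cannot, but it is only as good as the tags: a live version tagged identically to the studio version would be flagged as a duplicate.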

[–] irmadlad@lemmy.world 2 points 1 month ago (1 children)

Weird: https://czkawka.com/ has cancer

Elaborate?

Thanks. I'll check these out.

[–] Zachariah@lemmy.world 2 points 1 month ago (1 children)

Ads and “infected” messages all over it. It was first hit on DuckDuckGo. Not sure it was ever legit, or if it was compromised.

[–] irmadlad@lemmy.world 2 points 1 month ago