Comparing MP3 files...
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
Same "content"? Do a voice to text of the files; then compare the text. Probably need to run the import at normal speed, or less.
"Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
I've used Duplicate Cleaner for many years now, not just for audio, and I've liked it enough to buy the pro version. I'm pretty sure it will do what you're looking for.
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
Depending on the exact use case, you could decode the mp3 files, align both that they are in-phase (don't know if that's the correct word, mainly finding the point where both signals have the same waveform, works good for digital stuff, and less good if you have different recordings from tape), then phase invert one and adding them up (or simply subtract one of the other..). then calculate the peaks, if these are below a certain threshold (around < -50dB for audio from CD-Masters is a good value, but depends on the audio and encoding) chances are good it's the same audio. getting the audio programmatically in-phase could be tricky it think, again, depending on the audio / conversions / encodings. maybe that helps.
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
Dandy72, I just come with an idea, not a clear and tested solution. You could try to extract the fourrier transform of your signal, and then compare your two frequencies graphs. If they are close you can assume the two signals are similar. As you manipulate signals, of course you certainly encounter all the issues related to unwanted noise, bit rate...etc.
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
Just found Audio DiffMaker[^] but never used it. What I do use is Mp3Tag with its build-in md5 audio checksum, but I think, it will fail for different compression rates. Some usage idea in How-to: Copy the MD5 checksum to a tag and compare it[^] Second find - sounds too good to believe (english landing page) AllDup - Freeware zum Suchen und Löschen von Dateiduplikaten[^] foobar 2000 with a plugin was also mentioned somewhere ...
-
Is there any tool out there that can take two MP3 files, and compare the audio portion of it (only) to determine whether it's the same content or not? That is, if you have the same file that has been copied and then had its ID3 tags or embedded images modified - is there something that could tell me they are still, essentially, "the same"...? Anything using hashes on the file as a whole is obviously not a contender. (Way) more complex scenario: How about comparing two files, but also ignoring the encoding bit rate? That is, if it's the same track, but one's been encoded at 128kbps vs another at 320kbps - then I'd want to get rid of the lower-bitrate version...I believe something like MusicBrainz could do that sort of thing - as I understand it, it "listens" to the audio regardless of encoding and tries to identify what it is...but it's not exactly suited for my purposes (comparing files in bulk). But I have to think because this exists, it is technically possible...so what could be used automate this and organize the results in a way that makes sense?
Just to follow up on my own thread. Thanks to those who took the time to respond. This isn't some programming project I want to take on. And of those programs that exist...it sounds iffy at best. I wrote them all down and will check out later exactly what I can get done...