Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
Music Media

Audio Fingerprinting Via Cell Phones 89

aruil writes: "MSNBC has a story reporting on yet another audio fingerprinting application. Next year, Royal Philips Electronics will begin selling licenses to allow users to identify songs using their cell phones. Similar technology has already been open-sourced in FreeAmp, which uses the Relatable engine."
This discussion has been archived. No new comments can be posted.

Audio Fingerprinting Via Cell Phones

Comments Filter:
  • Aside from what's covered in the article, what possible applications can anyone see for this?
    • No doubt the RIAA could think of some way to use this to their advantage. Maybe require all new radios to have some sort of hardware added that would prevent playback of songs without this fingerprint? Also, it could be used for phone authentication, have a hardware fingerprinting system to ensure the person on the other end of the line is really from where they say they are.
    • in the "Name that tune" section of bar quizzes.
  • Kind of hopeless... (Score:4, Interesting)

    by MiTEG ( 234467 ) on Sunday December 16, 2001 @07:33AM (#2710874) Homepage Journal
    It seems kind of hopeless for all these companies and the audio fingerprinting. About a year and a half ago some company came out with a similar idea, except is was a separate device and you would have to wait until you got home and connect it to your computer. And it only worked with 2 radio stations. Obviously this idea was found to be a dud rather quickly. So now you can get the same information with a cell phone, this wouldn't by any chance be easier than actually calling the radio station and asking them what's playing? The only really effective method I've seen to do this are those giant billboards by the freeway with the screen that tells you what song is playing.

    And what about the distortion this will cause in the audio? It's not like FM radio is great quality, but embedding some sort of watermark/fingerprint that only requires 3 seconds of playback has got to have some sort of negative impact on the sound.

    Anyway, what the hell is up with all these trolls tonight? Time to start browsing at +2 now I guess.
    • you don't get it: (Score:2, Informative)

      by TheM0cktor ( 536124 )
      no watermark/fingerprint/anything in the music - just an algorithm that samples it, creates a small chunk of data describing it and looks that up against a database for matches. So no loss of quality involved, and if the algorithms as good as they say it'll cope with quite badly broken up music (AM radio? Noisy club?) as input.
      • The article is quite vague - I can't figure out whether it uses watermarking or actually identifies the song by audible content.

        If it's the latter, then that'd be very interesting - they've essentially implemented 'grep' for audio!

        It's "interesting" but I can't think of many uses of it apart from copyright enforcement.

        I'd like to feed this system three seconds of "baby baby baby baby" from a generic pop song, and see how confused it gets.
        • ;)

          nice idea... will probably return SONG BY BRITNEYSTEPSA-TEENSBRYANADAMSZONEMALFUNNNNNNNCTION ZZZZZT

          definitely points toward non-watermarking:
          All that?s required is to hold a mobile phone to a radio?s speaker for three seconds ? long enough to record a digital fingerprint ? when a song is playing.
        • It's really useful.

          For example, it can be used to determine whether or not songs are similar, and therefore be used to find songs similar to what you listen to a lot and would likely enjoy. Or to create good mixes on the fly.

          It can be used for 'radio tivo' applications, editing ads out of broadcast radio. (because it knows what bit of the pre-recorded radio was ad, and what wasn't.) And of course, it's handy for automatic station changing.

          Naturally, it's useful to be able to ID a song that's playing where you wouldn't otherwise know what it is. And a great tool for ID'ing mp3s and looking them up on a CDDB type database to fill the ID3 tags. (if for example, you hadn't done so when you ripped it)

          I was working on some mp3 projects for a while -- we were definately hoping to make use of this kind of stuff.
      • Sorry, my mistake. The idea of watermarking jumped into my mind after reading about fingerprinting, I guess I just ignored the stuff about the algorithm mentioned later in the article.
    • It seems kind of hopeless for all these companies and the audio fingerprinting. About a year and a half ago some company came out with a similar idea, except is was a separate device and you would have to wait until you got home and connect it to your computer. And it only worked with 2 radio stations

      It was Sony. It was basically a clock with a USB interface. When you heard a song you wanted to identify, you pressed a button, and it stored a timestamp. When you got home, you plugged it into the USB port, and the software read the timestamps, and then reported what was playing on the participating radio stations in your area at those times.

      They had quite a few more than two radio stations.

  • yet another wonderful technological innovation that will revolutionize the world

    please. is it really that hard to ask ya friend (you know ... the one with 10,000 mp3's) what song that is?

    yet another technological instrument that if implemented properly along with many other features could contrubute to a useful device, but instead will probably be marketed as a gimmick (just like that blue led "flashlight" you have at the bottom of your desk drawer, don't deny it) and eventually be dismissed as useless.

    btw this wasn't meant as a flame. go ahead and -1 it but basically waht i'm trying to say is that these devices have uses which could make them successful, who wouldn't want to know in the middle of a tv show or movie or music vid what song they're listening to. i just get annoyed at the poor implementations and over hype of such ideas

    and just disregard this if it makes no sense ... i'm tired and feel like crap

    -john
  • This could be a dastardly test-bed for keeping track of pepople using their voiceprints, since that technology, linked up to a global GPS satellite network, might just be "the end of privacy" as people are tracked down as soon as they speak on any telephone on the planet. Quick, call in Drew Barrymore, Lucy Liu and Cameron Diaz to foil them :)
    • I think we're safe for now. THEY'd need samples of everyone's voice. Hang on, my voicemail message provides my voice! But then THEY'd have to remove any accents I might put on to foil THEIR evil plans. And I'm hoping they can't do that yet.

      But until they do, we're safe from THEM spying and tracking us through the phone.
      Oh dear, I hope those Angels do something to avert this looming crisis.

      ~Duane
  • by mlinksva ( 1755 ) on Sunday December 16, 2001 @08:08AM (#2710905) Homepage Journal
    Bitzi [bitzi.com] (the "another" link in the article's "yet another" statement) isn't primarily an audio fingerprinting application. It's a file metadata catalog, audio fingerprints being just one sort of metadata collected. File metadata is keyed by a "bitprint" [bitzi.com] composed of two cryptographic hashes. The code [sourceforge.net] for generating bitprints and contributing metadata to the catalog is in the public domain and the catalog itself is available for free reuse and redistribution under a dmoz-like license [bitzi.com]. Disclaimer: I work for Bitzi.
    • Is there any plans to combine the Bitzi database with FreeDB?
      • Bitzi & FreeDB attack different problems: Bitzi does discrete digital files (not just audio), while FreeDB does physical CDs.

        So while there are some similarities, and if you could get perfect bit-for-bit CD rips then Bitzi would duplicate a lot of what FreeDB does, there's no pressing need for a combined offering. If there was, it wouldn't be too hard for either project to use whatever data it needed from the other, within the free licenses.

        (FYI: I am the CTO of Bitzi.)
        • Yes, but even though Bitzi is a general file catalogueing project, taking information from other, more specific sources can help (when their license allows it, of course)

          For music files, this would of course be FreeDB. For movies, a good choice would be IMDB etc.

          For instance, if I look up one of my Nightwish songs, Bitzi gives me the following information:
          http://bitzi.com/lookup/TTGZBRZLZ2HLXDHSQYBTEJD33M Y4OA2X.46IPPFIFT353PXN2BWBZEMYBF3ASZXTCWJN43RY [bitzi.com]
          the data from FreeDB is more accurate, giving the album the song belongs to, the playtime etc.
          http://www.freedb.org/freedb_search_fmt.php?cat=ro ck&id=c011130f [freedb.org]
          • You mean the FreeDB record is more comprehensive (though the Bitzi record for that file does include playtime). Bitzi records can accomodate much more information than FreeDB, but will generally have less for a newly reported file. There are two obstacles to integrating FreeDB data with Bitzi:
            • You need a discid (or the track offsets used to calculate discid), and generally someone with a file won't have access to the discid, so there really isn't any way to look up FreeDB information given just a file, which is what Bitzi has to work with.
            • FreeDB is GPL, Bitzi is dmoz-like (OpenBits).
            Those caveats aside, in the longer term I do hope to see collaboration and cross-pollination among all free catalog projects.
  • Any 3 seconds!? (Score:3, Interesting)

    by mlinksva ( 1755 ) on Sunday December 16, 2001 @08:39AM (#2710923) Homepage Journal
    I'm impressed if the service really does accurately identify songs after only 3 seconds, and any 3 seconds of a song. Presumably you'd need to fingerprint second or two chunks of every song to have this capability. This is quite different from what I understand Relatable does, which is to fingerprint the first 30 seconds of a song, meaning a song can only be identified after 30 seconds, and only the first 30 seconds.
  • by hovik ( 257174 ) on Sunday December 16, 2001 @09:58AM (#2710968)
    Fast Search & Transfer [fastsearch.com] has developed a "whistle sreach". Just whistle a few notes from the song you want, and their searchengine finds the songs mathcing.

    I have tried it and can confirm it works really well.

    Story from newscientist.com here [google.com]. (cache)
    Also a article from GEMENI here [www.ntnu.no].
    • Wow, searching on whistling would even be more unbelievable! One would fear lots of duplicates, problems with unpure/shifted whistling - it really sounds like vaporware and the site is down as well :) But it would really help to find out whether Michael Jackson really ripped that number one hit.. ;)

      Offtopic: * 2001-11-30 12:33:57 Mobile Phone System Identifies Music (science,news) (rejected) -- This was originally a New Scientist article indeed. All my submitted stories so far have been rejected, only to be accepted weeks later in a worse write-up by someone else. I will submit no more stories - it seems a huge waste of effort to me. No thanks.
      • yeah, same here, I submitted somethign about a week ago on bioinformatics on the US/Canada border to identify people, but it never got selected, a quick reject tho....I wonder if it has to do with Karma?
        • No, karma hasn't got to do with it - my karma is 37 now. Read the faq [slashdot.org]. The "omelette" part sounds the least convincing to me - if it's interesting nerd news, why not post it *now*? Do the math: it wouldn't generate more stories, but posting news sooner would shorten the submission queue. Plus it's less a disgrace to us users.</sulk>
  • ... as for one thing it needs a radio to work. So you need to be physically beside a radio playing a song that you don't know, which isn't very useful, as the announcer would probably announce the name of the song before or after they played it.

    What would be more useful is where you could hum a song into the phone, and it could tell you what it was. I personally need something like this as i've got this song in my head (with no lyrics so you can't use Google [google.com] to find it) that is driving me nuts as I try looking for it. The ability to hum it in and give it further parameters to search for would be good.
    • I don't know about the radio stations where you live, but out here, the DJ will announce the name of a song the first time he plays it and that's it. Took me two months to track down the name of Tracy Chapman's "Telling Stories" by the lyrics.
  • The first mental image I got from the head line was a picture of someone rolling a cell phone across a black ink pad... :)

    And, correct me if I am wrong but this kinda says "digital watermark":
    "The fingerprint might contain small mistakes. The technology is so robust that it can handle that," said Jaap Haitsma, a Philips research scientist.

    Haven't we gone thru this already?

    Seriously. The only thing missing was an SMDI challenge and the RIAA. Even though Microsoft is involved in this, I'm quite sure the RIAA will (pardon the pun) chime in Very Soon Now(TM).

    Ok, quit possibly I am missing the point, but read this:
    As well, a legitimate online music services running on the Napster model could use the technology to stop copyright-protected material from being shared.

    Legitimate, Napster and stop sharing all in the same sentence?

    Eh? The whole point of Napster was to share (leave out the legit or not) songs.

    {Pitr voice}
    Someone please to be explainink this to me.
    {/end voice}

    Moose.

    .
  • Can this actually tell two different Britney Spears songs apart? Even most humans can't do that...
  • Say, a weird Al song and the original.
  • I would think that the quality of music over a cell phone would be absolutely horrible.

    Remember that the way digital cell phones work is that they analyze the sound input, and send it not as audio data, but as coefficients to a human-voice synthesis DSP to save bandwidth. Digital cell phones are really only good for transmitting and recieving human voices, and are really really bad at music.

    So it is rather suprising to me that such as technology is feasible.

    • Remember that the way digital cell phones work is that they analyze the sound input, and send it not as audio data, but as coefficients to a human-voice synthesis DSP to save bandwidth.

      Err, what?

      You're talking about GSM encoding [bdti.com]?

      "The quality of the algorithm is good enough for reliable speaker recognition; even music often survives transcoding in recognizable form (given the bandwidth limitations of 8 kHz sampling rate)."

      What's this about human voice synthesis DSP? Sounds a bit black-helicopters-faked-the-moon-landing to me. (or else I'm totally out of it - it happens! my cellphone here's still analog)
    • No, I'm talking about CELP.

      Check it out [selbyventures.com]. Or do a google search.
  • I've seen this.... (Score:2, Informative)

    by kinko ( 82040 )
    At the recent ISMIR conference in Indiana, I saw a demonstration of a very similar thing from the Franhoffer (sp) people - it was pretty cool. Just to clarify some of the questions posed by above comments:

    1) No tampering is done to the audio - ie there is no watermark, it is "just" signal processing.

    2) The system I saw could take any 3 or 4 seconds, so that means that a fingerprint was calculated over the whole song. This means they must have some clever algorithms to make sure that the hash is calculated using the same time slices (or something...)

    3) The song has to be in the database. So that means that the fingerprint has already been calculated, and probably had metadata assigned to it either by hand or using mp3 id3 tags (this is a guess). The fingerprint size is about 16KB per song, which seems pretty reasonable.

    4) The technique only works on a per-recording basis, so even the same performer doing a slightly different version or another recording won't match if that recording was not already fingerprinted.

    5) The version I saw was standard PC software, using audio input through the microphone.

    My suspicion is that this technology is more likely to be of value to copyright holders looking to automatically identify violations (eg public airings, radio stations not paying royalties) than it will be useful to joe sixpack (or even people like us...)
  • Show me the money (Score:2, Interesting)

    by oflanigan ( 524770 )
    Including metadata in a digital stream is trivial, and radio should go digital soon enough. (There is not enough room for all the would-be radio stations to broadcast in the scant airwave real estate the way it is parceled now, but digital broadcasting would make more room for reluctant web-based radio stations.) Soon, the only non-digital (and therefore non meta-tagged) music will be stored in the brain. As others have stated, technology that can only identify playback isn't really that useful, but a program that could identify a song by a few hummed or whistled notes would be Really Cool.

    As a musician, I value authenticity and would love to have an app that would prevent my brain from tricking me into thinking I had composed a remembered melody. Also, many people would love to know what that song stuck in their head is. However, is this enough consumer interest to sustain such a product? Furthermore, is there a chance that copyright lawyers will get out of control with this kind of power?
  • by Tyrall ( 191862 ) on Monday December 17, 2001 @12:18PM (#2715051) Homepage
    The RIAA must be rubbing its proverbial hands with glee. Gone are the legal defenses [slashdot.org] that music/file-sharing systems have used.
    Gone are the methods of avoiding detection [slashdot.org] used to date.

    Even if this detection has no way to discern between the original and a cover of the song, I can see the RIAA and major labels nailing a bunch of people, and using this system as proof.

Top Ten Things Overheard At The ANSI C Draft Committee Meetings: (5) All right, who's the wiseguy who stuck this trigraph stuff in here?

Working...