Please create an account to participate in the Slashdot moderation system

 



Forgot your password?
typodupeerror
×
AI Mozilla Open Source

Mozilla's New Open Source Voice-Recognition Project Wants Your Voice (mashable.com) 55

An anonymous reader quotes Mashable: Mozilla is building a massive repository of voice recordings for the voice apps of the future -- and it wants you to add yours to the collection. The organization behind the Firefox browser is launching Common Voice, a project to crowdsource audio samples from the public. The goal is to collect about 10,000 hours of audio in various accents and make it publicly available for everyone... Mozilla hopes to hand over the public dataset to independent developers so they can harness the crowdsourced audio to build the next generation of voice-powered apps and speech-to-text programs... You can also help train the speech-to-text capabilities by validating the recordings already submitted to the project. Just listen to a short clip, and report back if text on the screen matches what you heard... Mozilla says it aims is to expand the tech beyond just a standard voice recognition experience, including multiple accents, demographics and eventually languages for more accessible programs. Past open source voice-recognition projects have included Sphinx 4 and VoxForge, but unfortunately most of today's systems are still "locked up behind proprietary code at various companies, such as Amazon, Apple, and Microsoft."
This discussion has been archived. No new comments can be posted.

Mozilla's New Open Source Voice-Recognition Project Wants Your Voice

Comments Filter:
  • Sounds good if they make the corpus freely available. Having lots of free high quality audio recorded from modern digital microphones would be useful. Voxforge recordings tend to be poor quality, TIMIT is still proprietary despite being over 30 years old now, and the TEDLIUM corpus recordings seem to have a horrible amount of reverb/echo in them.

    • Sounds good if they make the corpus freely available. Having lots of free high quality audio ...

      I agree, but from a quick look at their page, I see a lot of problems with reaching that goal.

      1: Most computers I've seen have pretty wretched audio inputs: tiny microphones near the screen, so not anywhere near the speaker's mouth. So we can expect lots of noise, echo, and other stuff. Good for simulating the real world (because it basically is the real world), but not what I would call high quality. Some gamers and others probably use good quality headsets, but I doubt they will make up the majority of

      • When you set up an account (as I did), you specify what region and type of english you are speaking.

    • by starless ( 60879 )

      high quality audio recorded from modern digital microphones would be useful. .

      What is a "digital microphone"...?
      Does that term actually mean something?

    • My employer is also building up a massive repository of voice recordings [gov1.info], and we'd also be keen to get everyone's voiceprints on file. If people are interested in contributing, please contact bulkcollectionoffice@nsa.gov.
    • The are already open source projects doing this, in fact they are even linked in the summary. VoxForge already exists so instead of yet another NIH syndrome project why not work together to improve VoxForge?
  • by WaffleMonster ( 969671 ) on Saturday July 22, 2017 @07:20PM (#54859501)

    Thanks to Nuance voice recognition industry is effectively dead. If Mozilla can make this work in offline mode it would be awesome. Not requiring your every word to be recorded shipped off to third parties would be very useful.

    • by Anonymous Coward
      Hey, at least someone here acknowledges the actual market leader in speech technology: Nuance. I love how they always quote companies like Apple, Microsoft and Amazon. Actually I'm quite surprised they didn't mention Google or Facebook. But it's Nuance who has its speech and transcription services in 85% of US hospitals, in plenty of household equipment like Samsung TVs, the first generation of Siri (you know, of Apple that they mention here), most car systems ( BMW, Audi, Jaguar, Mercedes, Porsche, Volkswa
      • Hey, at least someone here acknowledges the actual market leader in speech technology: Nuance.

        TVs, the first generation of Siri (you know, of Apple that they mention here), most car systems ( BMW, Audi, Jaguar, Mercedes, Porsche, Volkswagen, Fiat, Peugeot, Citroen, Ford, ...), call centers

        Holy shit, my shill meter has gone to plaid.

        The only point I was making is Nuance is a terrible company. They either bought out or sued their competition to the point where there is no longer a functioning market leaving Nuance as a defacto monopoly. My remarks were never intended to assign praise or acknowledge the "greatness" of Nuance.

        I strongly believe the world would be in a much better place in terms of current commercially available voice recognition capabilities had Nuance never existed.

  • In soviet Russia the domain extension autopilots you.

The Tao is like a glob pattern: used but never used up. It is like the extern void: filled with infinite possibilities.

Working...