Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Youtube Technology

YouTube Has 1 Billion Videos With Closed-Captioning (But Not All of Them Are Accurate) (variety.com) 52

Over a billion videos on YouTube are accessible to viewers with difficulties in hearing, thanks to the video giant's automated captions, it said Thursday. From a report on Variety: That certainly sounds impressive -- except when you realize that many of the site's automatically generated captions aren't completely right. The Google-owned video giant first launched captions back in 2006, and three years later introduced automatic speech recognition to add closed-captioning to YouTube content. Today, YouTube users watch video with auto-generated captions more than 15 million times per day. But the system is prone to errors. For example, the trailer for Amazon Studio's Oscar-nominated "Manchester by the Sea" (at this link) includes numerous inaccuracies in the auto-transcribed captions, sometimes to hilarious -- not to mention frustrating -- effect.
This discussion has been archived. No new comments can be posted.

YouTube Has 1 Billion Videos With Closed-Captioning (But Not All of Them Are Accurate)

Comments Filter:
  • > thanks to the video giant's automated captions, > That certainly sounds impressive -- except when you realize that many of the site's automatically generated captions aren't completely right.

    I know robots are taking over jobs. But put those two statements together and this sounds like auto-generated bad lip reading.

    Now if someone could only implement all possible bad lip readings, and then auto-rate them for hilarity, we would be onto something.
  • Something is clearly wrong with the translations of the Downfall [youtube.com] videos. Sometimes it's about SAP, sometimes it's about the World Cup, but my limited German tells me it's about the fall of the Third Reich.

    • Something is clearly wrong with the translations of the Downfall videos. Sometimes it's about SAP, sometimes it's about the World Cup, but my limited German tells me it's about the fall of the Third Reich.

      We're living in the postmodern era. The interpretation is left up to the viewer!

  • site's automatically generated captions aren't completely right

    Maybe they are generating the comments as well

  • And I don't need to pay extra money for that.
  • by fish_in_the_c ( 577259 ) on Thursday February 16, 2017 @02:47PM (#53881499)

    I recently watched a video with closed captioning on.
    'stan fortuna school of the eucharist'
    lets just say google search doesn't think eucharist is a common term and has an especially hard time with it when it is a quickly spoken rap song with a Hispanic accent.

    It was pretty funny what they translated it too.

    It did leave me wondering if there should be a mechanism to tell them the words are wrong and really wrong.

  • I grew up in Boston, and when I go back to the old neighborhood it makes me wonder how people understand me at all. Speech recognition programs never work for me.

    • Don't worry. You are a special, unique snowflake. If people can't understand you, they are racist.

      • by hey! ( 33014 )

        Don't worry. You are a special, unique snowflake. If people can't understand you, they are racist.

        Actually people who don't understand me are usually idiots. Some of them are blockheads. Funnily enough racists understand me fine, they just don't like what I have to say.

  • I've lost the link, but someone recently mentioned an intentionally humorous duo who:
    1) write a skit, perform it, and upload to Google
    2) let captioning take its best stab
    3) use the captioning as a new script, and re-record the scene
    4) upload and re-caption
    5) record a third time, with even weirder dialogue

    Then they splice it all together, and you get to watch the degeneration of language as iterative captioning makes everything nonsensical.

    My wife and I also tend to watch a lot of TV when the other wants qui

    • by aevan ( 903814 )
      Sounds like a variation on those 'bad translation' humor videos: use online translators to translate through several languages, before ending back in English and using that. Like Bohemian Rapsody [youtube.com].
  • From my personal experience, I can't help but wonder if there is even ONE out of all the auto-generated captions that is accurate at all. :P
    Have you guys ever seen one? I mean, a few mistakes are ok... but so far I haven't seen any video that had auto-generated captions that was even understandable at all. More like a mish mash of guesses.
    Which is great for comedic effect I guess, but not so much for viewers with difficulties in hearing.

    • I dunno I use the auto CC all the time. Some of the guesses are terrible but normally seems to have the biggest issues around random nouns. So assuming that I know what the topic of the video is I can use the CC, I just have to substitute the proper noun at the right time.

      I tend to use it when watching a technical video with a single talker and I don't actually want the sound on for what ever reason.

  • Accuracy is not that important to me. We all are pretty used to inaccuracies while texting now. What is important to me is the synchronization. If the captions follow the actual speech by more than just a bit, it makes it hard for me to follow as I lip read in addition to reading the captions. Lip reading is often ok by itself, but with movies and TV, the speakers face is not always pointed to the camera or there might be something covering the speakers lips. That's when cc comes in handy.
    • I really don't think this thing is using lip reading ( I could be wrong does anyone have a source). I'd guess it is using the same servers as google voice search and the type of errors it has seems consistent with that.

  • Yeah, the Swedish Chef video was quite wrong...
  • by 140Mandak262Jamuna ( 970587 ) on Thursday February 16, 2017 @03:42PM (#53881793) Journal
    It is true in software engineering, and it is true everywhere.

    Perfection is the goal. But doing better than current version is the shipping criterion.

    Auto captioning is better than no captioning for hearing impaired.

    And human captioning is not perfect. I remember watching Lion King with closed captioning turned on and they had missed a crucial "o" in some dialog that had the word "count".

  • Not always a bad thing.

    Bad Lip Reading are far more entertaining than the actual text of the presidential debates, for example.

  • Anyone who's actually watched YouTube with captions will know almost immediately what I'm getting at in the subject. Saying the system "is prone to errors" as above is being very kind. Amongst its many errors are frequent phantom occurrences of the word "yeah". While the phantom "yeah" instances are more funny than anything else, many of the other errors are much worse. Amongst other problems they have been known to convert a family-friendly video into something that no longer fits that description.
  • Are any of them accurate? Can you manually enter captions?

    Undertake this discourse, actually types. [youtube.com]

  • by ukoda ( 537183 ) on Thursday February 16, 2017 @06:49PM (#53882833) Homepage
    For some reason Youtube thinks that people speaking with a New Zealand accent swear a lot. I was testing the Youtube product tutorial on an Android product which, unlike PC browsers, has the closed captioning on by default. A lot of the technical terms, spoken with a Kiwi accent, were being captioned with obscene words. When I recovered from laughing at just how rude it was being I warned our marketing team that made the video. They were mortified and suddenly had a large task of checking and removing the computer generated captions. It turns out all of our SFW videos had NSFW captions.
  • Clearly, these algorithms don't know what a bubblah or a blinkah or a clickah is.

"Nuclear war can ruin your whole compile." -- Karl Lehenbauer

Working...