Ask Slashdot: Are YouTube's Subtitles 'Appallingly Bad'? 100
Long-time Slashdot reader Anne Thwacks frequently uses YouTube's subtitles "not to disturb others in the room, or because my hearing is not very good." But they say there's a new problem.
"The subtitling is terrible!" Almost every sentence has a huge error. Proper names are more often wrong than right. Non-English place names are almost always mangled to barely recognizable. And no effort whatsoever is made to use context to figure out whether a place name is Russian or Arabic, and often complete garbage is used in place of a common French, Spanish or Italian name!
If AI actually works (I have my doubts about this), surely it would be possible to figure out language contexts. If it is about an event in Italy, then expect a lot of Italian names! If it is about the Russia-Ukraine war, then expect places in Russia or Ukraine to be more plausible than mindless gobbledygook! Does YouTube not know that there are places in the world that are not in America? (However, plenty of names of people and places famous in America are also regularly screwed up.)
They argue the subtitles are "appallingly bad" — and that "the situation seems to be getting worse," wondering why the problem isn't addressed with some basic spell-checking. ("I'm sure that the vast majority of foul-ups could be fixed by the use of a dictionary.") Have any Slashdot readers seen similar problems? A friend of mine noticed that YouTube's subtitles even bungled this innocuous song from the 1966.
ANNETTE FUNICELLO: "If your love is true love, you can tell by his touch."
YOUTUBE SUBTITLE: "If your love is too lava, you can tell by his touch..."
Share your own experiences and thoughts in the comments. And do you think YouTube's subtitles are "appallingly bad"?
"The subtitling is terrible!" Almost every sentence has a huge error. Proper names are more often wrong than right. Non-English place names are almost always mangled to barely recognizable. And no effort whatsoever is made to use context to figure out whether a place name is Russian or Arabic, and often complete garbage is used in place of a common French, Spanish or Italian name!
If AI actually works (I have my doubts about this), surely it would be possible to figure out language contexts. If it is about an event in Italy, then expect a lot of Italian names! If it is about the Russia-Ukraine war, then expect places in Russia or Ukraine to be more plausible than mindless gobbledygook! Does YouTube not know that there are places in the world that are not in America? (However, plenty of names of people and places famous in America are also regularly screwed up.)
They argue the subtitles are "appallingly bad" — and that "the situation seems to be getting worse," wondering why the problem isn't addressed with some basic spell-checking. ("I'm sure that the vast majority of foul-ups could be fixed by the use of a dictionary.") Have any Slashdot readers seen similar problems? A friend of mine noticed that YouTube's subtitles even bungled this innocuous song from the 1966.
ANNETTE FUNICELLO: "If your love is true love, you can tell by his touch."
YOUTUBE SUBTITLE: "If your love is too lava, you can tell by his touch..."
Share your own experiences and thoughts in the comments. And do you think YouTube's subtitles are "appallingly bad"?
Nah (Score:3, Insightful)
It's just that the entire YouTube is appallingly bad.
Re: (Score:3)
Youtube has plenty of good content. The caveat being what is pushed on their main page is utter garbage so you have to actively search for it. Take Adam Savage for example. He toured a facility that scans imax film and showed the original one off film scanning machine built 25 years ago. It's an engineering work of art. https://www.youtube.com/watch?... [youtube.com]
Re: Nah (Score:2, Insightful)
As it should be.
Good News! (Score:1)
Excellent work.
Re: (Score:2)
Youtube has plenty of good content. The caveat being what is pushed on their main page is utter garbage so you have to actively search for it. Take Adam Savage for example. He toured a facility that scans imax film and showed the original one off film scanning machine built 25 years ago. It's an engineering work of art. https://www.youtube.com/watch?... [youtube.com]
I like Savage's content as well. His workshop vids are like siding down with an old friend, and I find most of them relaxing.
YouTube Audio Quality - Bad Production (Score:2)
It's just that the entire YouTube is appallingly bad.
A lot of the audio production in individual videos is really bad. This isn't anything to do with YouTube per se, not their compression algorithms or other features. A lot of YouTubers have absolutely no concept of microphone placement, of using audio compression, of reducing background noise. All of which are things which will drastically affect audio quality and the ability of a speech-to-text model to create subtitles.
It would be nice if YouTube would normalize all the uploaded videos to one set standard.
Re: (Score:2)
YouTube does already do something to the audio. I notice it when streaming guitar and vocals: the sound which comes out of YouTube emphasises the guitar, so if I balance listening to the mixer feed I have to balance the guitar low.
YouTube has scam ads (Score:3)
The number of medical misinformation ads is crazy.
Re: YouTube has scam ads (Score:2)
Like Dr Gundry's " water makes you fat" claims. What dumbass believes that shit?
The one that needs outlawed is the AI lady hocking free grocery cards to Medicare recipients.
Re: (Score:1)
Do you suffer from STUCK POOP?
Re: (Score:2)
LOL and I thought it was just my feed serving up ads like that!
Re: (Score:2)
More the branch I was looking for. Business model makes EVIL.
But the story is actually related to a topic I wanted to AskSlahdot about... It would be interesting if there was a video website where you paid by helping to train the AIs. As it applies here, you would earn credit for correcting errors in the subtitles. Easily leveraged against the users by checking each participant's corrections and suggestions against other participants' work.
But for the greater glory and never-sufficient profits of the ever-m
Yes Google Is Bad. (Score:2)
Google's subtitles are terrible, and what's worse is that a lot of AI crap trains off it.
Like all my videos have perfect subtitles because it comes directly off the input. When I've let youtube "auto" caption, it can't tell what is speaking most of the time. So you get really worthless captions on anything that is not a one-person podcast with no music and an American accent. If they have an Aussie accent, not a chance. If they have a Quebecois accent, ZERO.
Re: (Score:2)
Re: (Score:2)
It's just that the entire YouTube is appallingly bad.
Youtube has a lot of good material. Perhaps your interests are appallingly bad? I use Youtube to watch woodworking videos, to watch repair videos - like if I'm trying to figure out a problem on my son's car. Radio technology videos - computer teardown videos. I replaced a battery and hard drive on a laptop recently, so I previewed what it was going to be like before I tore into it. Even some programming videos. I recently became interested in Node-Red, so looked up some of the basic operations to get starte
Time was, image search had like 20% accuracy (Score:1)
But 20% of all the internet's traffic is still more than enough to make more money than God.
Different set of incentives at play here.
especially bad translations of pronouns (Score:2)
For many years, my friends and I have noticed that Google Translate (and YouTube) have a better chance of getting a European language pronoun wrong than correct.
We've noticed this: French, Italian, German, Spanish ... and most of the Scandinavian languages all suffer from this.
It must somehow be related to how those LLMs are trained.
Re: especially bad translations of pronouns (Score:2, Funny)
Too woke for strongly gendered languages?
Maybe that's why my emails keep getting lost: it can't tell a mail server from a femail server!
Thank you! I'll be here all week!
Re: especially bad translations of pronouns (Score:4, Interesting)
yes, they are terrible (Score:2)
Re: (Score:1)
uploaders can put in their own captions, if the video is considered incomplete that's where you point the finger
as a stand-in auto CC is a courtesy feature and has been around longer than the septembers complaining
i imagine this particular tourist will be complaining about AI not "realizing" context for a long time, how is deleting things MORE productive???!?!?
Re: (Score:2)
As a musician I can promise you trying to figure out the "foreign" context wont help you understand Angine de Poitrine. The whacky costumes and the like are part of a very english speaking tradition, punk absurdism. Likewise musically, they are coming from a group of musical genres thats about as english speaking as you get , prog rock and jam bands. That they are french speaking canadians is almost irrelevant, other than the fact that they've been known amonst qubecans longer than the rest of the anglosphe
Re: (Score:2)
Re: (Score:2)
Yes, they're terrible, but until someone forces all the creators to upload their scripts or create the captions themselves, that's the best we got - speech to text is never 100% accurate, and fails the more strongly an accent is.
That said, some YouTubers do actually properly caption their video, and you can tell because it no longer says "auto generated".
And the only reason YouTube has captions is because of ADA compliance demanding it.
It's just a natural consequence of a site where production can range fro
Re: (Score:2)
Then perhaps what Google has failed to add on YouTube is a checkbox to exclude videos with only auto-generated captions from search results and recommendations.
Sure (Score:2)
It's quite hilarious for me watching foreign videos where I can't tell what they might have actually said because I lack any context, all I can do is boggle.
I have seen improvement, but it's mostly in the software's ability to pick speech out of noise. The comprehension part is still nonexistent.
Funky Forest! (Score:1)
If you've never seen it; Watch some clips of Funky Forest in the original Japanese.
I watched them and wouldn't find translations for years, I had a strong suspicion that there was no story at all and it was just nonsense video art but it turned out to mostly not be the case.
I guess I should also mention this is not anime
Re: (Score:2)
Go back to the previous pre-LLM technology. Happy now?
Re: (Score:2)
Go back to the previous pre-LLM technology. Happy now?
Except pre-LLM technology Youtube's subtitles sucked equally but only in one language?
Re: (Score:2)
Go back to the previous pre-LLM technology. Happy now?
Except pre-LLM technology Youtube's subtitles sucked equally but only in one language?
This! I used to watch the subtitles because they were hilarious. And once upon a time, you could turn on subs on non-English videos, which the guesses their sub software made were even funnier.
lots of AI shorts (Score:2)
Re: (Score:2)
What's your beef with Betteridge? (Score:2)
There was no need to phrase this as a question: if you have ever used YouTube subtitles, you'll know they are bad.
The real question is why. While I think AI is overhyped in general, I don't believe that this is the best Google could do if they put in some effort. It seems the model is just picking the most likely word on an individual word level, not considering any context like the video title or description (which can contain the very names it gets wrong), the type of content the channel typically publish
Favorite subtitle (Score:2)
Re: (Score:1)
I find them pretty danged good (Score:2)
Somewhere between the 25th and 50th percentile of a stenographer that hasn't pre programmed names.
Not all (Score:3)
Some [youtube.com] are OK.
Re: (Score:2)
Murph D'Egg is such a boss.
Appallingly bad (Score:3)
I rarely see _any_ subtitles that are not appallingly bad, including YouTube.
Re: (Score:2)
Beat me to it. Yes, Youtube subtitles are generally appallingly bad, but when I watch movies with subtitles on DVD or Bluray they're often also garbage. In some cases it appears the person creating the subtitles is working from a different script to the movie as shot because the words bear no relation at all to what they're saying on the screen.
Re: Appallingly bad (Score:2)
Re: (Score:3, Interesting)
It's possible. But in some cases we've been watching with English subtitles on an English TV show with people speaking in English because my girlfriend doesn't understand English English. The words in the subtitles still don't necessarily match the words they're saying in the same language.
Re: Appallingly bad (Score:1)
Re: (Score:2)
Not sure what movies you're talking about, but closed caption subtitles on a typical Bluray are word for word transcribed in literally every movie in my vast collection. There is of course the occasional hiccup, a typo or word missing, but if that happens once in any feature film it's already worse than normal.
Are you talking about foreign language films or something?
Re: (Score:3)
Here's one current example. My girlfriend is watching a detective show on Amazon Prime.
This is an American show where the characters are speaking English and the subtitles are English. Not even sure what show it is, but it clearly has a decent budge that could afford to spend a few bucks creating the subtitles.
Main character's daughter is fixing main character's computer and says "I exported all your data."
Subtitle says: "I extorted all your data."
Now, the girl's accent is a bit heavy, but it's clear from t
Re: (Score:2)
I mentioned last week, that such processors can't handle homonyms: Always writing "brake" instead of "break".
Nah (Score:1, Funny)
Does YouTube not know that there are places in the world that are not in America?
Why should YouTube be any better than the average American?
Re: (Score:2)
But the subtitles are wrong! [youtube.com]
Captions Plus! (Score:2)
BUT if you Subscribe! We will fix the captions. ;-)
Heat (Score:2)
Every single stupid time there's just music playing and the stupid subtitler prints "Heat. Heat" and I'd like to see the whole of the YouTube/Google management complex to experience flashpoint/ignition heat.
Humans are terrible at hearing lyrics (Score:2)
Over the years there have been many lists published of lyrics people thought they heard but were completely nonsensical. To some apparently Abba sings, "feel the beat of the tangerine."
So I'm not surprised youtube can't understand song lyrics. Regular speaking, it's fairly good. And not too bad at translating the subtitles.
Re: (Score:3)
Over the years there have been many lists published of lyrics people thought they heard but were completely nonsensical. To some apparently Abba sings, "feel the beat of the tangerine."
If you were ever beat by a tangerine, you'd think differently, you insensitive clod! ;^)
Insensitive Claude (Score:2)
you insensitive clod!
When has Claude, or Gemini or ChatGPT for that matter, ever been sensitive?
They're Broken (Score:2)
Re: They're Broken (Score:2)
YouTube has ads?
I don't think they're really any worse. (Score:2)
I use it pretty much every time I'm writing something (like right now)
And it is pretty darn High error rate.
For instance why did it capitalize High and why did it make has into is?
Some of it, I'm sure, has to do with working with really poor pronunciation. Most people tend to blur and slur when they're speaking.
And then depending on the listener to straighten out what it was they said.
Re: (Score:1)
I don't think they're really any worse.
Than Google's voice to text.
Fun fact, youtube auto-subtitles are produced with none other than googles first gen voice to text!
So you're exactly right, they are not worse and not better, it is the same thing.
It's also been in place a lot longer than most people think.
Over 10 years ago, about when they made their v3 api available, all uploads had their audio transcribed to text and stored with the other metadata. It was accessible and searchable via API.
It was many years later before they started using it for subtitles.
Other fun facts
tiny model? (Score:3)
Re: (Score:2)
That makes sense. What doesn't make sense to me is that I feel that it has gotten worse. I remember thinking that a few years ago that live transliteration seemed to be getting pretty good. I remember testing out early Dragon speech-to-text software and not being impressed, so maybe I had set my expectations so low that that even muddling translations were a real improvement, and I misremember how well it was really doing.
Accents (Score:2)
AI has always been and will continue to be terrible with accents. Each new accent is like a whole new language it has to learn. Niche accents will never have enough training data.
Crunchyroll Too (Score:2)
Same thing happens with some newer animes on Crunchyroll. The CC is sometimes complete nonsense. If the content is live, I can totally understand the CC being a little rough. But the content I'm watching is pre-recorded and has a script. The script could easily be provided, stripped of everything not needed for CC, and then synced to the video. Hell, use the AI to sync it, but there should never be any reason an AI is just making up random, wrong shit for pre-recorded content that has a script available. It
Hitler's plan for food (Score:2)
https://imgur.com/a/tQ1OboP [imgur.com]
Re: (Score:1)
Subtitles are bad but (Score:2)
Closed Captions are even worse. It's in many news outlet Youtube channel.
It's like it was translated to some other language and back to English with a 90's translator, with lot of missing words, in all capital letters and with a 15 seconds delay.
Mind you, are the authors that provides the content of these hateful CC's.
They're supposed to be at least decent, because they're (supposedly) human-made and directed to hearing impaired people.
Oh and Youtube don't let you fallback to regular automatic subtitles. It
No idea why they aren't using SOA TTS (Score:2)
I happen to know a lot about this, and am familiar with the state-of-the-art text-to-speech models, and for some reason, Google definately isn't using one. My guess is that they have so many youtube videos to transcribe each day that they just dont have the compute to use a state-of-the-art model to do it, and have to use a fast cheap one with a higher error rate. If that is the reason, then Google should provide the option to people posting videos to pay for having their video transcribed by a SOA model.
It's everywhere (Score:1)
Not just YouTube. Instagram is almost unusable. YouTube shilling their shorts and auto-dubbing is pushing me to the edge.
More than one problem. (Score:2)
The modern trend of mumble acting doesn't help. When actors transitioned from stage to screen they came armed to annunciate. They could even speak clearly and act at the same time.
Admittedly my hearing isn't great, but it pisses me off when actors mumble and slur their way through dialog because they and the director both think that makes the performance brooding or edgy.
The AI can be forgiven for getting it wrong when regular people also struggle to hear it.
Watch great actors speak. Listen to Hopkins as Le
They COULD be good (Score:1)
Burned-in subtitles are the real AI shite (Score:2)
AI works pretty well (Score:2)
It depends on how much money you are willing to burn per token. For offering free subtitles on YT? The quality is going to be "budget" tier AI. We have a massive scalability, capacity, and energy problem to overcome before AI actually does a decent job at average people's mundane tasks.
Of course capitalism is not generally going you give away a higher quality service than it needs to. Especially if the end user is not a customer but the product.
Re: (Score:2)
My Pixel phone has on-device voice recognition, and it's extremely accurate. Like better than a human's first pass accurate. There are some things it struggles on, like odd place names, but even then if it's something like Google Maps it tends to figure out what I meant anyway.
So I'm not sure why YouTube is so bad, because clearly Google has better technology than this and with YouTube has the luxury of being able to take its time, rather than having to produce real-time output.
Re: (Score:2)
For starters, there are a lot of tricks used in devices. You don't need inference to do decent voice recognition for dialing and commands.
Offline inference is pretty spotty for me on my Pixel 9A. Like unable to understand me well enough to take down an address. But on the other hand when it has good signal it is good enough for sniff conversations in the room and start recording products my wife and I are discussing. Probably because the engines are far better trained on product names than on place names. J
Auto-generated subs bad for UK lottery (Score:2)
The UK ball-drawn lotteries are shown live on YouTube and have auto-gen'ed subtitles. With the official UK lottery site scrapping a list of the balls in drawn order in Jan 2026 (they only have them in ascending order now), it's remarkable that the only official text form of the drawn order of the balls is that subtitle text stream.
Yep, that official subtitle stream has regular parsing errors such as "[music]" instead of a ball number (yes, there's video background music that can interfere with the narrator'
Re: Auto-generated subs bad for UK lottery (Score:2)
Re: (Score:2)
lottery.co.uk is an unofficial UK lottery site - my point was there is no longer a reliable official text feed of the balls in drawn order. lottery.co.uk probably have to watch each individual draw video to get the drawn order because parsing the video subtitle streams has about a 20% error rate!
Fix 'em? (Score:2)
Not everything needs to be autogenerated. Upload something with subtitles.
Not a clue of context (Score:1)
So if there are pro bikers speaking: it does not now the names of teams, brands, bikers, routes or climbs. It just tries to hear, but still plain stupid. Aliens visiting this planet are laughing their asses off.
Why is there not a checkbox called context in the video upload form, check cycling and he should learn that for this video. Is that so hard to come up with?
The balance of appalingly bad and good enough (Score:1)
Auto-generated subtitles on all platforms are appallingly bad. The trouble with them is that they do generate some text that is good enough for most people not to bother to watch the video and correct them before posting.
Appallingly bad doesnt come close (Score:2)
The ones I love are the animated ones where there is text printed on the frames and the automatic comments textify the spoken portions. Its entertainment on its own, the mangling of proper names are even random so the same botched rendering hardly ever repeats. An entertainment all by itself. I routinely play youtube at 1.5x, tried 2.0 but the speach compression totally fails at that rate. Just watching stuff for entertainment so really dont care about the accuracy. I prefer words in a row to hearing a talk
Scale (Score:2)
They could use a better model and spend 50x the cost to generate 30% better subtitles, no doubt.
It's good enough for their "summarize the video" feature which I now use to avoid listening to AI narrators. Especially the ones programmed to maximize runtime with nonsense, repetition, and restatements.
At least they aren't hard-coded. (Score:2)
A ton of videos have big, colored, hard-coded subtitles, especially for voice-overs. Those are incredibly distracting and the absolute worst. At least you can turn these off.
We're fortunate they work at all (Score:2)
No, they're much, much better than having no generated titles at all, and by a lot.
mpv won't generate any subtitles for me; it can only show what's included in the data.
Cardassian/Kardashian (Score:2)
Whenever I watch a Star Trek-related video that happens to touch on the species that oppressed Bajor, Youtube will almost always render the name as "Kardashian".
Yeah, automatic subtitles are pretty bad (Score:2)
I did some video where I had a script. So it was easy to extract subtitle and compare to script. Yeah, its pretty bad. Probably every other sentence had some obviously incorrect subtitling. And these videos were on clean audio inputs: no background noise, good mic.
I ended up having to correct the subtitles using the transcript.
I have seen a study recently on noisy audio in a group setting, so the speakers maybe not be perfectly mic-ed and all. The speech-to-text was done with a couple of Whisper models. The
Auto translation is worse (Score:2)
Opening a French video out of the blue and hearing some weird English translation no one asked for instead of simply adding subtitles is such an awful example of American exceptionalism...
I'm not necessarily going to set my browser to every language I want to hear with subtitles, but auto-voice translation is just wrong.
If you managed to get to Youtube, you can read and choose to activate it if you want, but stop with this English-centric view.
I Dearly Love Youtube, But ... (Score:2)
Yeah, those subtitles can be pretty 'orrible, no question. I've often wondered if there was work there for humans, until I remember that most humans' spelling, grammar, punctuation are pretty weak too. Still, better than nothing, and I do miss it when the subtitles are not available. And it's always a surprise when I turn them on for a particular thing, and find them to be in Russian (!) or Spanish or German or whatever. I leave them on (well, except for the Russian ones because I can't read the script
Lazy creators (Score:2)
Final Cut Pro (Score:2)
YouTube is really bad at auto-generation, and in particular gets punctuation wrong much of the time. There are better solutions out there than YouTube, and it should be something every creator does by default rather than leaving it to YouTube.
Spot on (Score:1)
They're actually pretty good (Score:2)
Yes, they are sometimes inaccurate, but I'm using them to supplement what my ears are hearing. For example, someone might say, "50 CAL" and the subtitles say, "50 COW" but even though I might not hear "CAL" properly, the subtitles help me figure it out, despite being wrong.
It's like th