Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
×
Youtube AI Google

Why YouTube Could Give Google an Edge in AI (theinformation.com) 30

Google last month upgraded its Bard chatbot with a new machine-learning model that can better understand conversational language and compete with OpenAI's ChatGPT. As Google develops a sequel to that model, it may hold a trump card: YouTube. From a report: The video site, which Google owns, is the single biggest and richest source of imagery, audio and text transcripts on the internet. And Google's researchers have been using YouTube to develop its next large-language model, Gemini, according to a person with knowledge of the situation. The value of YouTube hasn't been lost on OpenAI, either: The startup has secretly used data from the site to train some of its artificial intelligence models, said one person with direct knowledge of the effort. AI practitioners who compete with Google say the company may gain an edge from owning YouTube, which gives it more complete access to the video data than rivals that scrape the videos. That's especially important as AI developers face new obstacles to finding high-quality data on which to train and improve their models. Major website publishers from Reddit to Stack Exchange to DeviantArt are increasingly blocking developers from downloading data for that purpose. Before those walls came up, AI startups used data from such sites to develop AI models, according to the publishers and disclosures from the startups.

The advantage that Google gains in AI from owning YouTube may reinforce concerns among antitrust regulators about Google's power. On Wednesday, the European Commission kicked off a complaint about Google's power in the ad tech world, contending that Google favors its "own online display advertising technology services to the detriment of competing providers." The U.S. Department of Justice in January sued Google over similar issues. Google could use audio transcriptions or descriptions of YouTube videos as another source of text for training Gemini, leading to more-sophisticated language understanding and the ability to generate more-realistic conversational responses. It could also integrate video and audio into the model itself, giving it the multimodal capabilities many researchers believe are the next frontier in AI, according to interviews with nearly a dozen people who work on these types of machine-learning models. Google CEO Sundar Pichai told investors earlier this month that Gemini, which is still in development, is exhibiting multimodal capabilities not seen in any other model, though he didn't elaborate.

This discussion has been archived. No new comments can be posted.

Why YouTube Could Give Google an Edge in AI

Comments Filter:
  • Are all the answers going to sound like pewdiepie now, no thanks.
    • Nah, they're going to train it on the comments.

      I can't wait...

    • Funny, but YT has a shit ton of tutorials, from using software tools to wood working. This will train the base models for robotics and UI automation. UI automation affects all office workers, basically it means AI operating the UI of a computer to complete tasks.
    • Don't worry. Not everyone is pewdiepie on YouTube.

  • by oldgraybeard ( 2939809 ) on Thursday June 15, 2023 @12:34PM (#63605604)
    AI more about information storage, handling, retrieval within a required context and the proper actions, display, etc. What I see being touted is just gimmicks and glitzy marketing fluff. Not much real value there.
  • Noooo...you don't say.

  • they better somehow filter all the spiderman/elsa crap and all the already prevalent generated random bullshit

    maybe they can train an AI to flag such content

    • Re:except that (Score:5, Interesting)

      by VeryFluffyBunny ( 5037285 ) on Thursday June 15, 2023 @01:21PM (#63605732)
      Youtube already flags copyright content so that covers a lot of stuff from media companies already.

      Using natural, unscripted, spontaneous, spoken, conversational language to train models is actually a really smart move. This is the language that forms the foundation for more "academic" & formal language uses. Its development in children & adults (learning another language) always precedes academic/formal language.

      My impression of the output from LLMs so far is that they've been trained on a lot of academic/formal/written language & not nearly enough on unscripted/spontaneous/spoken, making it sound overly formal & inauthentic when you try to actually converse/chat with it. I get the feeling of being "ChatGPT-splained." Let's see how this works out for Google.
      • It's not that, they already used reddit, twitter and other datasets with more natural language. There is plenty of simple English text online. For me the great advantage is having aligned video+audio+text transcription, the size of this dataset is just unprecedented. It could allow much smarter robotics, finally AI is coming for the blue collar and office worker.
  • In the US...whomever clicks the photo button on a still, or runs the camera...owns the copyright on the image or video file.

    That means, even though Google/YouTube holds and shows the content, they do NOT own the content and it is not theirs to use to train AI.

    We've already seen cases of this....

    I signed up for YT before Google owned them and I don't recall signing over any rights to my content to them to use as they wish....

    • by DarkRookie2 ( 5551422 ) on Thursday June 15, 2023 @01:27PM (#63605758)
      From YouTube:

      License to YouTube By providing Content to the Service, you grant to YouTube a worldwide, non-exclusive, royalty-free, sublicensable and transferable license to use that Content (including to reproduce, distribute, prepare derivative works, display and perform it) in connection with the Service and YouTube’s (and its successors' and Affiliates') business, including for the purpose of promoting and redistributing part or all of the Service.

      You put anything on YouTube, or didn't take down content when the EULA changed, they can use it as they see fit and not pay you for it.

    • I signed up for YT before Google owned them and I don't recall signing over any rights to my content to them to use as they wish....

      And yet you'll have as much luck stopping them as you'd have stopping a troll from dismembering and digesting you on a foolish adventure into an unexplored mountain cave.

    • by ranton ( 36917 )

      That means, even though Google/YouTube holds and shows the content, they do NOT own the content and it is not theirs to use to train AI. [...] I signed up for YT before Google owned them and I don't recall signing over any rights to my content to them to use as they wish....

      You don't need to own the content to use it to train AI. If you posted it to the public, then anyone in the public has the ability to view it and use the memory of viewing it in any way they wish. Just like an AI "viewing" it can use it to modify its model.

      • Yup. It's the outputs of the AI systems that could potentially have copyright infringement issues (most likely to occur from over-training), not the inputs.

    • You absolutely give up your rights and it's one reason why I never uploaded more than one or two videos to YouTube.

  • by nightflameauto ( 6607976 ) on Thursday June 15, 2023 @01:08PM (#63605700)

    If the chatbots start gathering comments off Youtube, say goodbye to anything positive ever coming out of them again. Holy wow. Talk about a cesspool. It makes slashdot's nazi moron and "beat up a liberal" folks look like ubermensch.

    That would probably be one way to kill off this fascination with LLM chatbots for a bit though. I say, full steam ahead!

    • An LGBT church was hit by lightning and burned down abour a week ago. There was a video posted of the fire. You can guess what almost every comment was. Of course I had to rational-thinkersplain that more "straght" churches were burned down by lightning than "gay" churches, and that conservative areas of the US are just as prone to disaster as the liberal areas. I really hope Google does not train it's AI on these Youtube comment sections because it's intellegence will drop to such a negative number that
  • by jetkust ( 596906 ) on Thursday June 15, 2023 @01:09PM (#63605706)
    Google has had the entire internet at their disposal for years, yet their search has been getting worse if anything. They profit immensely out of the inefficiency of the world wide web. The more you have to click and search the better. They're probably more worried about how to make money off Gemini than how they can improve it.
  • by nospam007 ( 722110 ) * on Thursday June 15, 2023 @01:40PM (#63605800)

    If they admit that their AI actually CAN watch all the videos that are loaded up real-time, they'll have no excuse anymore to wait until somebody complains.

  • You are all singing cats now. Sing, cats, sing!

  • I've uploaded 3400 videos in the last 10 years to a very large Youtube channel. After all that time, using the same mic, and speaking native English extremely clearly, I can tell you with 100% certainty that the auto-closed captioning engine has no idea what I'm saying. But it will confidently lie to people about what I'm saying. So maybe it is a good match for AI then actually, since they both seem keen on doing that.
    • I've too noticed its inaccurate but lie is the wrong word. The words or meaning it uses often sounds like the same word but different meaning. Does not seem intentional. If someone is saying something not true but they are unintentionally doing it, its just being wrong, not lying.

    • by kiore ( 734594 )

      Are you a native speaker of American English, a British English or another variety of the English language? When watching English TV programmes & documentaries, I've noticed that the auto generated captioning is laughably wrong.

      If Google wants to mine their uploaded videos to train their AI, they'll need to fix this first.

  • Google could use audio transcriptions or descriptions of YouTube videos as another source of text for training Gemini.

    But audio transcription are generated by AI, and we were told yesterday that feeding AI with AI-generated content leads to model collapse [venturebeat.com], or in other words, reinforcement of garbage in, garbage out.

  • I'm currently using a Google AI project to transcribe my videos, because the YouTube automated transcription is ... pants. The Google hegemony, it seems, is weakened by a lack of coordinated buy-in across it's various research projects.

Ignorance is bliss. -- Thomas Gray Fortune updates the great quotes, #42: BLISS is ignorance.

Working...