Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
AI Wikipedia

Meta AI and Wikimedia Foundation Build an ML-Powered, Citation-Checking Bot (digitaltrends.com) 17

Digital Trends reports: Working with the Wikimedia Foundation, Meta AI (that's the AI research and development research lab for the social media giant) has developed what it claims is the first machine learning model able to automatically scan hundreds of thousands of citations at once to check if they support the corresponding claims....

"I think we were driven by curiosity at the end of the day," Fabio Petroni, research tech lead manager for the FAIR (Fundamental AI Research) team of Meta AI, told Digital Trends. "We wanted to see what was the limit of this technology. We were absolutely not sure if [this AI] could do anything meaningful in this context. No one had ever tried to do something similar [before]."

Trained using a dataset consisting of 4 million Wikipedia citations, Meta's new tool is able to effectively analyze the information linked to a citation and then cross-reference it with the supporting evidence.... Just as impressive as the ability to spot fraudulent citations, however, is the tool's potential for suggesting better references. Deployed as a production model, this tool could helpfully suggest references that would best illustrate a certain point. While Petroni balks at it being likened to a factual spellcheck, flagging errors and suggesting improvements, that's an easy way to think about what it might do.

This discussion has been archived. No new comments can be posted.

Meta AI and Wikimedia Foundation Build an ML-Powered, Citation-Checking Bot

Comments Filter:
  • Citogenesis (Score:4, Insightful)

    by Dwedit ( 232252 ) on Sunday August 21, 2022 @06:52PM (#62809605) Homepage

    Seems like something like this might have difficulty dealing with Citogenesis (circular citing, media articles with facts originating from Wikipedia, then they get cited)

    • Re:Citogenesis (Score:5, Insightful)

      by narcc ( 412956 ) on Sunday August 21, 2022 @07:44PM (#62809737) Journal

      It's a bit worse that that. A system like this will undoubtedly support something like "proof texting", an unethical practice by which you start with the conclusion you want and then find sources to support it. It's common among undergrads who don't know how to properly write a research paper ... and on Wikipedia.

         

      • Re: (Score:2, Insightful)

        "you start with the conclusion you want and then find sources to support it"

        Did you realize that you are describing how the Republican Party operates these days? Of course their version of "facts" include ravings that at one point would have led to incarceration in a mental ward.

        • by gweihir ( 88907 )

          All fanatics operate like that. That is one of the core reasons why they cannot get anything right: They do not look at reality.

    • Yep, the current work does not attempt to independently grade the quality of sources cited, unlike Google's PageRank.
  • by Bruce66423 ( 1678196 ) on Sunday August 21, 2022 @06:56PM (#62809613)

    Checking references is one of the major banes of writing an academic paper. The idea of an extra level of support that will mitigate the prospect of getting one wrong is attractive. However if the system enforces rules, banning certain sources and imposing certain interpretations on material, then we will have a problem...

    • by ls671 ( 1122017 ) on Sunday August 21, 2022 @07:01PM (#62809633) Homepage

      However if the system enforces rules, banning certain sources and imposing certain interpretations on material, then we will have a problem...

      How can you possibly imagine it would be otherwise?

    • I support anything that'll make it easier to improve &/or rate the quality of research papers. I doubt this FAIR algorithm would be sufficiently reliable to work unaided but if it's used as computer assisted checking, i.e. helping an expert in the field evaluate papers, which they do frequently for literature reviews & metastudies, I think it'll be a great help & greatly reduce researchers' workloads.
  • Because Facebook can't even check thier own content
  • It took me forever to figure out why I couldn't draw a "wizard shooting fireballs" in DALL-E. It contains the controversial word "shooting". AI still fails miserably at semantics and while this could be a nice tool, there will be a whole new set of failure modes for people to blindly slam into and flailingly work around. Also, Slashdot and Google's spellcheck don't think "flailingly" is a word, so I'm pretty sure we're still too early for computers to be checking facts.
    • It would be nice if you could submit a link like "Wizards shooting [thesaurus.com] fireballs" with an anchor to "cast". Providing a little context would be good step forward.
  • You might argue that the reason a lot of current results especially in ML aren't reproducible, is because they are based on earlier results that also aren't reproducible. So a very useful additional feature would be to flag the quality of references.

Each new user of a new system uncovers a new class of bugs. -- Kernighan

Working...