Meta AI and Wikimedia Foundation Build an ML-Powered, Citation-Checking Bot (digitaltrends.com) 17
Digital Trends reports:
Working with the Wikimedia Foundation, Meta AI (the AI research and development lab for the social media giant) has developed what it claims is the first machine learning model able to automatically scan hundreds of thousands of citations at once to check whether they support the corresponding claims....
"I think we were driven by curiosity at the end of the day," Fabio Petroni, research tech lead manager for the FAIR (Fundamental AI Research) team of Meta AI, told Digital Trends. "We wanted to see what was the limit of this technology. We were absolutely not sure if [this AI] could do anything meaningful in this context. No one had ever tried to do something similar [before]."
Trained using a dataset consisting of 4 million Wikipedia citations, Meta's new tool is able to effectively analyze the information linked to a citation and then cross-reference it with the supporting evidence.... Just as impressive as the ability to spot fraudulent citations, however, is the tool's potential for suggesting better references. Deployed as a production model, this tool could helpfully suggest references that would best illustrate a certain point. While Petroni balks at it being likened to a factual spellcheck, flagging errors and suggesting improvements, that's an easy way to think about what it might do.
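The core idea, scoring how well a source passage supports a claim and suggesting the best candidate reference, can be sketched in miniature. This is a hypothetical toy using word overlap as a stand-in for the learned model; the real system uses neural representations, and the function names and threshold here are invented for illustration:

```python
def support_score(claim: str, evidence: str) -> float:
    """Fraction of the claim's words found in the evidence passage.

    A deliberately crude stand-in for the neural verification model
    described in the article; the actual tool does not work this way.
    """
    claim_words = set(claim.lower().split())
    evidence_words = set(evidence.lower().split())
    if not claim_words:
        return 0.0
    return len(claim_words & evidence_words) / len(claim_words)


def best_reference(claim, candidates, threshold=0.5):
    """Return the best-supporting candidate, or None to flag the citation.

    Mirrors the two behaviors described above: flagging citations that
    do not support the claim, and suggesting a better reference.
    """
    top = max(candidates, key=lambda c: support_score(claim, c))
    if support_score(claim, top) < threshold:
        return None  # no candidate supports the claim well enough
    return top
```

In this sketch, a claim with no adequately supporting candidate comes back as `None`, which is the "flagged for review" case; otherwise the highest-scoring passage is the suggested reference.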
Citogenesis (Score:4, Insightful)
Seems like something like this might have difficulty dealing with citogenesis: circular citation, where media articles pick up facts that originated on Wikipedia and are then cited on Wikipedia as sources for those same facts.
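The citogenesis problem the commenter describes is structurally a cycle in the citation graph: the "evidence" for a claim eventually traces back to the article that made the claim. A hypothetical check (not part of Meta's tool) could look for such cycles with a depth-first search:

```python
def find_citation_cycle(graph, start):
    """Return a cycle reachable from `start`, or None if there is none.

    `graph` maps each source to the sources it cites. A cycle that
    includes the starting article is the citogenesis pattern: the
    supporting evidence ultimately cites the article it supports.
    """
    path, on_path = [], set()

    def dfs(node):
        if node in on_path:
            # We reached a source already on the current citation chain.
            return path[path.index(node):] + [node]
        path.append(node)
        on_path.add(node)
        for cited in graph.get(node, []):
            cycle = dfs(cited)
            if cycle:
                return cycle
        path.pop()
        on_path.remove(node)
        return None

    return dfs(start)
```

A returned chain like `["wiki_article", "news_story", "wiki_article"]` would show exactly the loop described above; an honest citation chain bottoms out in primary sources and returns `None`.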
Re:Citogenesis (Score:5, Insightful)
It's a bit worse than that. A system like this will undoubtedly support something like "proof texting", an unethical practice by which you start with the conclusion you want and then find sources to support it. It's common among undergrads who don't know how to properly write a research paper ... and on Wikipedia.
Re: (Score:2, Insightful)
Did you realize that you are describing how the Republican Party operates these days? Of course, their version of "facts" includes ravings that at one point would have led to incarceration in a mental ward.
Re: (Score:2)
All fanatics operate like that. That is one of the core reasons why they cannot get anything right: They do not look at reality.
Checker or enforcer (Score:3)
Checking references is one of the major banes of writing an academic paper. The idea of an extra level of support that would mitigate the prospect of getting one wrong is attractive. However, if the system enforces rules, banning certain sources and imposing certain interpretations on material, then we will have a problem...
Re:Checker or enforcer (Score:4, Funny)
However if the system enforces rules, banning certain sources and imposing certain interpretations on material, then we will have a problem...
How can you possibly imagine it would be otherwise?
References with reproducible results? (Score:2)
You might argue that the reason a lot of current results, especially in ML, aren't reproducible is that they are based on earlier results that also aren't reproducible. So a very useful additional feature would be to flag the quality of references.
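The suggestion above implies that reproducibility problems propagate: a result is only as solid as the weakest thing it builds on. A hypothetical quality flag could push scores through the citation graph; everything here, from the scoring rule to the names, is invented for illustration and assumes an acyclic citation graph:

```python
def reference_quality(paper, cites, base_quality, memo=None):
    """Score a paper discounted by the weakest result it relies on.

    Hypothetical rule: a paper's effective quality is its own base
    score multiplied by the minimum effective quality of the papers
    it cites, so one irreproducible ancestor drags down everything
    built on it. `cites` maps each paper to the papers it relies on;
    `base_quality` maps each paper to a score in [0, 1].
    """
    if memo is None:
        memo = {}
    if paper in memo:
        return memo[paper]
    score = base_quality[paper]
    deps = cites.get(paper, [])
    if deps:
        score *= min(reference_quality(p, cites, base_quality, memo) for p in deps)
    memo[paper] = score
    return score
```

With this rule, a pristine paper citing a shaky chain still gets a low effective score, which is the flag the commenter is asking for.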