Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
AI Open Source

Hugging Face Clones OpenAI's Deep Research In 24 Hours 17

An anonymous reader quotes a report from Ars Technica: On Tuesday, Hugging Face researchers released an open source AI research agent called "Open Deep Research," created by an in-house team as a challenge 24 hours after the launch of OpenAI's Deep Research feature, which can autonomously browse the web and create research reports. The project seeks to match Deep Research's performance while making the technology freely available to developers. "While powerful LLMs are now freely available in open-source, OpenAI didn't disclose much about the agentic framework underlying Deep Research," writes Hugging Face on its announcement page. "So we decided to embark on a 24-hour mission to reproduce their results and open-source the needed framework along the way!"

Similar to both OpenAI's Deep Research and Google's implementation of its own "Deep Research" using Gemini (first introduced in December -- before OpenAI), Hugging Face's solution adds an "agent" framework to an existing AI model to allow it to perform multi-step tasks, such as collecting information and building the report as it goes along that it presents to the user at the end. The open source clone is already racking up comparable benchmark results. After only a day's work, Hugging Face's Open Deep Research has reached 55.15 percent accuracy on the General AI Assistants (GAIA) benchmark, which tests an AI model's ability to gather and synthesize information from multiple sources. OpenAI's Deep Research scored 67.36 percent accuracy on the same benchmark with a single-pass response (OpenAI's score went up to 72.57 percent when 64 responses were combined using a consensus mechanism).

As Hugging Face points out in its post, GAIA includes complex multi-step questions such as this one: "Which of the fruits shown in the 2008 painting 'Embroidery from Uzbekistan' were served as part of the October 1949 breakfast menu for the ocean liner that was later used as a floating prop for the film 'The Last Voyage'? Give the items as a comma-separated list, ordering them in clockwise order based on their arrangement in the painting starting from the 12 o'clock position. Use the plural form of each fruit." To correctly answer that type of question, the AI agent must seek out multiple disparate sources and assemble them into a coherent answer. Many of the questions in GAIA represent no easy task, even for a human, so they test agentic AI's mettle quite well.
Open Deep Research "builds on OpenAI's large language models (such as GPT-4o) or simulated reasoning models (such as o1 and o3-mini) through an API," notes Ars. "But it can also be adapted to open-weights AI models. The novel part here is the agentic structure that holds it all together and allows an AI language model to autonomously complete a research task."

The code has been made public on GitHub.
This discussion has been archived. No new comments can be posted.

Hugging Face Clones OpenAI's Deep Research In 24 Hours

Comments Filter:
  • by Growlley ( 6732614 ) on Thursday February 06, 2025 @04:13PM (#65147955)
    Sam Altman going to compete with Elon if peoiple keep doing that - they are literallyy taking the money out of his pockets
  • by Anonymous Coward
    With rivals open sourcing everything and destroying "OpenAI's" business model we might finally see an end to the AI bubble. At last, 2025 will be the year of the Linux desktop!
  • by Big Hairy Gorilla ( 9839972 ) on Thursday February 06, 2025 @05:03PM (#65148095)
    AI's are training on other AI's. Makes sense. It's like going to school. The teacher teaches you stuff that has already been figured out. So sorry to hear about anyone's copyrights being violated.

    I'm announcing my new AI model: Deeper Open Face Hugging Deeper Seeker.
    It's more open and goes deeper than any AI has gone yet.
    Seriously, It's an AI model, not a porn movie. For real.
    • Too bad, because there is probably a huge demand for an AI that can do decent porn movies on your home computer with modest graphic card.

      • oh, for sure! I almost think I read that article already...
        South Korean middle schoolers have been slapped down for something nearly like that. Using generative AI to face swap classmates into porn movies.. I think middle schoolers everywhere are working on it ... more fun than homework :-)

        Seriously, apologies to the girls who don't deserve this, though, but you can't stop water from running down a hill.
        Don't yell at me, I didn't do it.
        • Were they exclusively swapping faces of female classmates? No male face swapping at all?

          Interesting datapoint, if so.

  • by SlashbotAgent ( 6477336 ) on Thursday February 06, 2025 @05:16PM (#65148121)

    Here's hoping that we don't see any Xenomorphs bursting forth.

  • This is great work, but scoring 55.15% to OpenAI's 67.36% is not what I would call "cloning." (Not in the normal sense of being an equivalent drop-in replacement, and also not in the narrow technical sense using the 'cloning' approach to using OpenAI as a teacher for their own model - the article doesn't mention doing that).

    But it is true and increasingly clear that you can stay "pretty close" to the frontier with orders of magnitude fewer resources than it took to blaze the trail in the first place.

    If

  • Isn't it OpenAI?

    So is it worse to have OpenAI take my words and artwork to use as their own?

    Or for Someone else to take OpenAI's work and make it available to everyone.

    I'm not feeling much sympathy for OpenAI.

    • by ceoyoyo ( 59147 )

      I'm not sure you're making the point you think you are. If you make something and someone else says "that's a neat idea, I'm going to make one too" that's a good thing. If you object, you're an entitled dick.

      • My point is OpenAI ignored robots.txt and violated websites terms of service and scraped the entire internet taking many people's copyrighted guides, how-to's, stories.

        And now OpenAI is complaining that Deepseek basically did the same thing to OpenAI (using OpenAI to train Deepseek)...and OpenAI is complaining that Deepseek violated their terms of service.

        Which is hypocritical.

        And if given a choice between a hypocritical actor who violated terms of service and common norms or an actor who violated terms of

        • by ceoyoyo ( 59147 )

          This is a story about a bunch of people on Hugging Face making something that's similar to something OpenAI has described. There are no terms of service violations, alledged or otherwise, and DeepSeek isn't involved.

          Maybe you're posting on the wrong story?

"Nuclear war can ruin your whole compile." -- Karl Lehenbauer

Working...