AI Open Source

Databricks Claims Its Open Source Foundational LLM Outsmarts GPT-3.5 (theregister.com)

Lindsay Clark reports via The Register: Analytics platform Databricks has launched an open source foundational large language model, hoping enterprises will opt to use its tools to jump on the LLM bandwagon. The biz, founded around Apache Spark, published a slew of benchmarks claiming its general-purpose LLM -- dubbed DBRX -- beat open source rivals on language understanding, programming, and math. The developer also claimed it beat OpenAI's proprietary GPT-3.5 across the same measures.

DBRX was developed by Mosaic AI, which Databricks acquired for $1.3 billion, and trained on Nvidia DGX Cloud. Databricks claims it optimized DBRX for efficiency with what it calls a mixture-of-experts (MoE) architecture -- where multiple expert networks or learners divide up a problem. Databricks explained that the model possesses 132 billion parameters, but only 36 billion are active on any one input. Joel Minnick, Databricks marketing vice president, told The Register: "That is a big reason why the model is able to run as efficiently as it does, but also runs blazingly fast. In practical terms, if you use any kind of major chatbots that are out there today, you're probably used to waiting and watching the answer get generated. With DBRX it is near instantaneous."
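That 132B/36B split is the essence of MoE routing, and it is easier to see in code. Below is a minimal, hypothetical sketch of top-k expert routing in PyTorch: the layer sizes, expert count, and top_k are invented for illustration (DBRX reportedly routes each token through 4 of 16 experts), but the principle -- only the routed experts run for a given token -- is what keeps the active parameter count at 36 billion out of 132 billion.

    # Illustrative top-k mixture-of-experts routing -- not DBRX's code.
    # All sizes below are made up for the sketch.
    import torch
    import torch.nn as nn

    class TinyMoELayer(nn.Module):
        def __init__(self, d_model=64, d_hidden=256, n_experts=8, top_k=2):
            super().__init__()
            self.top_k = top_k
            self.router = nn.Linear(d_model, n_experts)   # gating network
            self.experts = nn.ModuleList(
                nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                              nn.Linear(d_hidden, d_model))
                for _ in range(n_experts))

        def forward(self, x):                  # x: (n_tokens, d_model)
            weights, idx = self.router(x).topk(self.top_k, dim=-1)
            weights = weights.softmax(dim=-1)
            out = torch.zeros_like(x)
            # Only the chosen experts run for each token; the rest stay
            # idle, so "active" parameters are a fraction of the total.
            for t in range(x.size(0)):
                for w, e in zip(weights[t], idx[t]):
                    out[t] += w * self.experts[int(e)](x[t])
            return out

    layer = TinyMoELayer()
    print(layer(torch.randn(5, 64)).shape)     # torch.Size([5, 64])

In a real MoE model each expert is a full feed-forward block inside every transformer layer, so total parameters grow with the expert count while per-token compute tracks only the top_k routed experts.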

But the performance of the model itself is not the point for Databricks. The biz is, after all, making DBRX available for free on GitHub and Hugging Face. Databricks is hoping customers use the model as the basis for their own LLMs. If that happens it might improve customer chatbots or internal question answering, while also showing how DBRX was built using Databricks's proprietary tools. Databricks put together the dataset from which DBRX was developed using Apache Spark and Databricks notebooks for data processing, Unity Catalog for data management and governance, and MLflow for experiment tracking.
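For readers who haven't used that stack, here is a heavily simplified, hypothetical sketch of what such a pipeline can look like. The paths, names, and numbers are invented, and this is not Databricks's actual DBRX pipeline; the Spark and MLflow calls themselves are standard public APIs.

    # Hypothetical sketch: Spark for data prep, MLflow for run tracking.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F
    import mlflow

    spark = SparkSession.builder.appName("pretrain-data-prep").getOrCreate()

    # Clean and deduplicate raw text before it goes into training.
    docs = (spark.read.parquet("s3://example-bucket/raw_text/")  # made-up path
                 .dropDuplicates(["text"])
                 .filter(F.length("text") > 200))
    docs.write.mode("overwrite").parquet("s3://example-bucket/clean_text/")

    # Record a training run's configuration and metrics.
    with mlflow.start_run(run_name="moe-pretrain-demo"):
        mlflow.log_param("n_experts", 16)
        mlflow.log_param("active_experts", 4)
        mlflow.log_metric("train_loss", 2.31, step=1000)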



Comments Filter:
  • but I don't brag about it.

  • by Tom ( 822 )

    The model requires ~264GB of RAM

    (from the GitHub link)

    Bit much for my local setup, so we'll have to wait a bit before we can test this properly.

    • ... should be in quotes - if anything, it's even more restrictive than the LLaMA license. In particular, banning the use of its outputs to train anything that's not DBRX-related.

      That's on top of it being, as you note, too large to run locally. Saying "only 36 billion are active on any one input" isn't that useful with MoEs, because the gating decision isn't made until you reach that layer / token, based on the inputs leading up to it, and loading experts on demand is simply not practical, speed-wise. You have to have the whole model resident in memory (see the back-of-the-envelope numbers after the comments).

      • by Rei ( 128717 )

        ** The "in particular" is a commonality with the LLaMA license, not an example of a new restriction

  • by zmollusc ( 763634 ) on Tuesday April 02, 2024 @05:22AM (#64363174)

    I am running an LLM instance on drastically underclocked hardware and it is outperforming everything else, in that it produces fewer errors and hallucinations per hour than any other chatbot.

  • Emacs doctor [emacswiki.org] was ahead of its time.
    Still better than GPT, because it asks you to think for yourself and find the answer.
  • We see constant headlines about better performance, but they offer only a foggy notion of what that means. We have a need for speed. Real-world users aren't going to wait 15 to 60 seconds for a response to a prompt, even if it is a good answer. And if the current crop of AI really gains traction, quotas for GPU time will become a very real problem, and time to response will stunt adoption of the technology.
  • It may be source-available, but this is a shitty closed license resembling Facebook's Llama license.

    • by Rei ( 128717 )

      Yep. If anything, it's even worse.

      Mixtral is still the best that's truly open (and also not, e.g., something that claims to be open but was clearly trained on closed inputs, which the ancestral model(s)' author(s) could make claims to).

      • I thought Stable Diffusion was open source?
        • by Rei ( 128717 )

          This discussion is about LLMs, not diffusion models.

          But for the record, Stability uses a wide range of different licenses for their different products, some of which are entirely proprietary, and others of which are quite open. They've been trending towards increasingly closed, though.

  • Define your niche, find a good 7B-30B model, and you outsmart GPT-3.5 (from 70B up, sometimes even GPT-4).
    Most claims to outsmart GPT-3.5 in all disciplines with a single model are false.
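On the memory point raised in the thread above: Tom's ~264GB figure is consistent with a simple back-of-the-envelope estimate, since at 16-bit precision each parameter takes two bytes and, as Rei notes, MoE routing still requires all 132 billion parameters to be resident even though only 36 billion fire per token. A rough sanity check (ignoring activations, KV cache, and framework overhead):

    params_total = 132e9      # every expert must stay loaded for MoE routing
    bytes_per_param = 2       # bf16/fp16 weights
    print(f"{params_total * bytes_per_param / 1e9:.0f} GB")  # -> 264 GB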
