Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Open Source Microsoft The Internet Technology

Microsoft Open-Sources a Crucial Algorithm Behind Its Bing Search Services (techcrunch.com) 55

An anonymous reader quotes a report from TechCrunch: Microsoft today announced that it has open-sourced a key piece of what makes its Bing search services able to quickly return search results to its users. By making this technology open, the company hopes that developers will be able to build similar experiences for their users in other domains where users search through vast data troves, including in retail, though in this age of abundant data, chances are developers will find plenty of other enterprise and consumer use cases, too. The piece of software the company open-sourced today is a library Microsoft developed to make better use of all the data it collected and AI models it built for Bing .

With the Space Partition Tree and Graph (SPTAG) algorithm that is at the core of the open-sourced Python library, Microsoft is able to search through billions of pieces of information in milliseconds. Vector search itself isn't a new idea, of course. What Microsoft has done, though, is apply this concept to working with deep learning models. First, the team takes a pre-trained model and encodes that data into vectors, where every vector represents a word or pixel. Using the new SPTAG library, it then generates a vector index. As queries come in, the deep learning model translates that text or image into a vector and the library finds the most related vectors in that index. The library is now available under the MIT license and provides all of the tools to build and search these distributed vector indexes. You can find more details about how to get started with using this library -- as well as application samples -- here.

This discussion has been archived. No new comments can be posted.

Microsoft Open-Sources a Crucial Algorithm Behind Its Bing Search Services

Comments Filter:
  • ...with Microdick's Open Source VectorBang search algorithm. Just type in your desires. VectorBang will find them!
  • And now it is open it will be used to Spam Bing.
  • by Shaitan ( 22585 ) on Wednesday May 15, 2019 @06:16PM (#58599102)

    I'm not a Microsoft fan in general but code is code and this sounds like a solid contribution and with a bit of modification could be useful in certain deep learning applications.

    All the usual disclaimers apply, beware any azure or other MS service, platform hooks, or back doors. In general Kudos MS.

    • I'm not a Microsoft fan in general but code is code and this sounds like a solid contribution and with a bit of modification could be useful in certain deep learning applications.

      Really? What kind of deep learning applications can you think of, here?

      • by Shaitan ( 22585 )

        Anything that is storing and indexing a large volume of arbitrary data really could potentially make use of this algorithm. A number of deep learning techniques are starting to make use of a memory of previous inputs. For instance if you are training on a massive catalog of the Gutenberg works etc. This isn't just pigeon-holed to the web itself.

        Though that works as well. IBM's AI won jeopardy utilizing a technique that indexed and looked up results relating to words in the questions. This provides an effic

  • by AlanObject ( 3603453 ) on Wednesday May 15, 2019 @06:18PM (#58599110)

    It might seem a bit nitpicky not bring this up, but quotes such as:

    .. able to search through billions of pieces of information in milliseconds ...

    should be responded to.

    Uh, no. What you might be able to do in milliseconds is search through an index (excuse me, a vector index. It sounds more science-y) that represents billions or "pieces of information." Not the information itself. What that means is if the overall data set wasn't inverted with search terms you wanted to use in mind you aren't finding what you want out of billions of records in milliseconds.

    Having said that it is amazing how far search engines have come. But I just find over-hype tiresome.

  • var results = websites.Select(s => s.ToString().Contains(query));
    • by Anonymous Coward

      You've got some real balls, you know that? Do you know how many software patents that code you just posted violates?
      I hope you've got a good lawyer.

  • by Anonymous Coward

    Never been a fan of Cortana but Bing works just fine for most of search uses. Occasionally I go back to Google search for comparison. I definitely do not see much advantage in DuckDuckGo, unless your a privacy focused person its really not great, I would use Bing before DDG. Google stuff is sort of habitual for most people, its what many got used to.

  • function fast_search($query) { return curl("https://www.google.com/search/q=$query"); }
  • by sad_ ( 7868 )

    As much as i dislike MS (i'm an old guy, i've seen their most ugly side), i can't complain about any company making software available under an approved open source license, even though i will probably never use it myself.

"What man has done, man can aspire to do." -- Jerry Pournelle, about space flight

Working...