Yahoo! Boss – An Insider View

By Vik

Disclaimer: This is my personal blog. The views expressed on these pages are mine alone and not those of my employer.

Boss stands for Build your Own Search Service. The goal of Boss is to open up search to enable third parties to build incredibly useful and powerful search-based applications. Several months ago I pitched this idea to the executives on how Yahoo! can specifically open up its search assets to fragment the market. It’s remarkable to finally see some of the vision (with the help of many talented people) reach the public today.

Web search is a tough business to get into. $300+ Million capex, amazing talent, infrastructure, a prayer, etc. just to get close to basic parity. Only 3 companies have really pulled it off. However, I strongly believe we need to find innovative, incremental ways to spread the search love in order to encourage fragmentation and help promising companies get to basic parity instantly so that they can leverage their unique assets (new algorithm, user data, talent) to push their search solution beyond the current baseline.

Search is all about understanding the user’s intent. If we can nail the intent, then search is pretty much a solved problem. However, the current model of a single search box for everything loses an intent focus as it aims to cater to all people and queries. Albeit, a single search box definitely makes our lives easier, but I have a hard time believing this is the *right* approach.

In my online experience, I typically visit a variety of sites: Techmeme, Digg, Techcrunch, eBay, Amazon, del.icio.us, etc. While on these pages, something almost always catches my eye, and so I proceed to the search box in my browser to find out more on the web. Why do we have this disconnected experience? I think it’s because these sites do not provide web-level comprehensiveness. It’s unfortunate, because the page that I’m on may have additional information about my intent (maybe I’m logged in so it has my user info, or it’s a techy shopping site).

The biggest goal of Boss is to help bootstrap sites like these to get comprehensiveness and basic ranking for free, as well as offer tools to re-rank, blend, and overlay the results in a way that revolutionizes the search experience.

When I’m on del.icio.us, why can’t I search in their box, get relevant del.icio.us results at the top, and also have web results backfill below? I think users should be confident that if they searched in a search box on any page in the whole wide web that they’ll get results that are just as good as Yahoo/Google and only better.

The first milestone of Boss is a simple one: Make available a clean search API that turns off the traditional restrictions so that developers can totally control presentation, re-rank results, run an unlimited number of queries, and blend in external content all without having to include any Yahoo! attribution in the resulting product(s). Want to build the example above or put news search results on a map – go for it!

Here’s a link to the API:

http://developer.yahoo.com/search/boss/

Also, check out the Boss Mashup Framework:

http://developer.yahoo.com/search/boss/mashup.html

The Boss Mashup Framework in my opinion makes the Boss Search API really useful. It lets developers use SQL like syntax for operating on heterogeneous web data sources. The idea came up as I was working on examples to showcase Boss, and realized the operations I was developing imperatively followed closely to declarative SQL like constructs. Since it’s a recent idea and implementation, there may be some bugs or weird designs lurking in there, but I strongly recommend playing around with it and viewing the examples included in the package. I’m biased of course but do think it’s a fun framework for remixing online data. One can rank web results by digg and youtube favorite counts, remove duplicates, and publish the results using a provided search results page template in less than 30 lines of code and without having to specify any parsing logic of the data sources/API’s as the framework can infer the structure and unify the data formats automatically in most cases.

The next couple of milestones for Boss I think are even more interesting and disruptive – server side services, monetization, blending ranking models, more features exposure, query classifiers, open source … so stay tuned.

Tags: , , , , ,

43 Responses to “Yahoo! Boss – An Insider View”

  1. Yahoo! Opens Search and Supports Developers | Nate Koechley's Blog Says:

    [...] Vik Singh had the idea for BOSS, and posted Yahoo! Boss – An Insider’s View. It’s money line is this, and describes the big idea succinctly: “I think users should [...]

  2. jack Says:

    Wow, this is really cool. Thanks for the additional insight. It is great to see that there are nearly no restrictions for BOSS. I have to try it out now.

  3. » Blog Archiv » BOSS - Yahoo bietet seinen Suchdienst per API an ErkenntnisWerk Says:

    [...] Quellen: Yahoo Blog, CNET, TechCrunch, GigaOm, zooie’s, Webware, ReadWriteWeb, Futuristic [...]

  4. Duc Nguyen Says:

    Oh, inevitably Yahoo! becomes more and more my favourite company on the web! I love Yahoo! Pipes, and now – BOSS! Must try it out…

  5. Sérgio Nunes Says:

    This is a great announcement. Thanks!

    It seems that it is not possible to make a vertical search engine based on selected sites (something like Google CSE). Are you considering this as a next feature? If not, please do. :)

  6. Adrian Bye Says:

    You’re a rockstar, this is tremendous vision and really deserves to be congratulated! You may have changed the entire web by getting this implemented.

  7. zooie Says:

    thanks adrian – too kind. lots of people helped to make this happen. there’s plenty left to do – it’s the next couple of milestones that i think will really define open search.

  8. Jody Says:

    I agree, this is tremendous vision. I couldn’t agree with you more about user intent. I concur that this will change Search as we know it.

  9. Yahoo! Cool thing of the Day » Blog Archive » Who’s the BOSS? Says:

    [...] I like this post, which discusses some of the origins behind and future plans for [...]

  10. Media-Blog » Blog Archive » Yahoo BOSS: Yahoo öffnet API für Entwickler Says:

    [...] Hier gibt es einige weitere Infos zu BOSS, von dem Entwickler, der’s erfunden hat. [...]

  11. ProjectX Blog » Blog Archive » Xlinks Digest - 11 / 07 / 2008 Says:

    [...] Xlinks digest is a interestig collection of links as discovered by the ProjectX team. Yahoo Boss – an insiders view Added on 07/11/2008 at 10:12AM Google Lively Added on 07/11/2008 at [...]

  12. Brett Says:

    @Sergio. I haven’t looked at the framework, or the API (yet!), so I’m really out on a limb here, but couldn’t you just include the site(s) names as parameters in the search string?
    BOSS – awesome concept, can’t wait to play with it. Thx all.

  13. Peter J. Says:

    Very fast. Very useful. Very cool.

  14. zooie Says:

    @Sergio: in the query you can include site: terms to do restrictions. also stay tuned for an upcoming operator that should help you here.

  15. DevWebPro » Post Topic » Yahoo! Releases Build Your Own Search Service Says:

    [...] To get an in-depth perspective of yahoo! Search Boss, you can also visit Yahoo! Boss – An Insider View. [...]

  16. Yahoo! debuts Search BOSS : Tech Blog Philippines - TechBlog.ph Says:

    [...] there’s Yodel Anecdotal’s launch post, Yahoo! Search Blog’s launch post, an insider’s view on Search BOSS, and of course one should check out Yahoo! Developer Network’s section on Yahoo! Search BOSS. [...]

  17. andy Says:

    Great stuff zooie. Congratulations and thanks. My only fear is that if MS was going to acquire Yahoo it could kill off this wise and bold concept. How likely to happen do you consider this in case of a takeover? I guess this uncertainty doesn’t allow for the fastest and widest adoption possible.

  18. zooie Says:

    it’s a good question. don’t want to represent yahoo on this one but i’ll answer a part of it personally if that’s ok. i think once you go open (say boss not only provided web services but also the source code) it’s really hard to go back – whether a company later becomes a $100 billion dollars or part of microsoft. personally, i think going open can give a company a lot of leverage in these type of negotiations …

  19. andy Says:

    thanks zooie. I’m feeling better now ;)

  20. sudhakarmishra Says:

    i m also feeling better

  21. Arjun Rathore Says:

    Hi Vik,

    Why do we have a restriction of using a search box. I wanted to build a web site to help domaineers find valuable domain names out of thousands of domain names that expire daily based on certain factors most prominent being the search hits. I have hit a wall since none of the api’s from any search providers will let me do this in a script. This would result in probably max of 10,000 searches a day. All i care is what is the total number of search hits for a given keyword. Screen scraping is illegal and would result in banning of IP. There should be a way to allow using search api in scripts ( non web based) provided the api user advertises the BOSS api on the web site or any other condition which might be beneficial to both the api users and yahoo. Any Input?

    Thanks.

  22. zooie Says:

    hey arjun – just got off a plane (sorry for the delay). i agree with you. i’ll check with legal.

  23. Michael Says:

    Hello.

    Are you going to allow to search for multiple domains at the same time via BOSS? Something like (site:site1.com OR site:site2.com OR site:site3.com).

  24. Vik Says:

    @Michael: Yes we added better support for this recently.

    Check out the ’sites’ parameter, which takes in a comma separated string of domains

    For ex.

    http://boss.yahooapis.com/ysearch/web/v1/query?sites=yahoo.com,disney.com

  25. The Idée Blog » Blog Archive » Being BOSSed around Says:

    [...] pretty snazzy and quite amusing too, and oh yes, made with just 50 lines of code. Singh’s insider’s view on BOSS is an interesting read as [...]

  26. Remmrit Bookmarking Says:

    Programs Bookmarks…

    Remmrit.com user has just tagged your post as programs!…

  27. My daily readings 11/17/2008 « Strange Kite Says:

    [...] Yahoo! Boss – An Insider View « zooie’s blog [...]

  28. Sam Says:

    Vik,

    Is is possible to search within sub-directories in multiple domains?

    for example: ?sites=disney.com/toys, cnn.com/news, garden.com/green/trees …

    I tried doing this with no success.

  29. Vik Says:

    one way to maybe approximate this is to use “inurl:” inside the query

    for ex.

    /web/v1/query+inurl:disney.com/toys+OR+inurl:cnn.com/news …

    you could also play around with using the sites parameter in conjunction with the inurl query syntax

    hope this helps
    and if you can, please post your questions to the ysearchboss yahoo group so others could also benefit from this discussion

    best

  30. William Says:

    What’s the limit on how many domains can be added? We probably need 10,000 or so for our application.

  31. dinkler Says:

    I echo william, and I think growing that limit is very important to allowing people to create truly useful vertical search engines with BOSS.

  32. amit Says:

    Vic,
    I tried to develop search results such as provided by kosmix.com via BOSS API and couldnt do this due to structure of search kosmix use. any suggestions???
    amit

  33. Vik Says:

    @amit

    To approximate kosmix results, you’ll probably need to issue multiple BOSS API calls doing site restricts on different domains (blogs, web, flickr, etc.). These can be loaded fast via multithreading/processing and/or AJAXed modules for each domain of results.

    Check out the “sites” parameter in BOSS for more details.

    You might also find the “keyterms” and “searchmonkey_rss” parameters very useful. More to come for BOSS so stay tuned.

  34. Vik Says:

    @William and @dinkler

    We hear you. Working on something for you guys. Check out search.techcrunch.com (which is powered by a future BOSS platform) for a pseudo sneak peak.

  35. TweetNews (Real-Time Search) Is Back « zooie’s blog Says:

    [...] the openness and power of Yahoo! BOSS (you can read more about it in my previous posts here and here). Remarkably, many users found the service useful despite its slow performance, barebones UI, lack [...]

  36. A Comparison of Open Source Search Engines « zooie’s blog Says:

    [...] example, one non-search application of BOSS leveraged the Spelling service to spell correct video comments before handing them off to their [...]

  37. Shraga Says:

    From what I see, Yahoo Boss does not include a way to search for videos? Is this correct? Wouldn’t any search startup have to be able to offer video search in order to be competitive?

  38. Vik Says:

    @Shraga

    We’re looking into it. There are ways of doing video search via web search (&sites=youtube.com,metacafe.com) but agree a video search solution would be great.

    Appreciate the feedback.

  39. Shraga Says:

    Vic,

    The big difference would be that the results would be text only, without thumbnails. Is this correct?

    Aside from that, I must say it’s a really great initiative!

  40. Shraga Says:

    Sorry, that should be Vik with a “k”, my mistake!

  41. Vik Says:

    @Shraga

    Probably not but you could try using the &view=searchmonkey_rss feature to obtain structured abstracts if they are available (which might include thumbnail links).

    Best

  42. Sushil Says:

    I want to search certain keywords within about 5000 domains using yahoo boss, and I do not want to get multiple results from the same domain, instead I want only one result from each website. So that I can use some custom reordering of the results. Can anybody suggest how can I achieve this? Actually I want to get the ranks of my listed domains with respect to certain keywords.

    • Vik Says:

      This cannot be done in a single query with BOSS. Might be possible with many queries via individual or grouped site restricts (can do ~30 domains in a single query) with post-search filtering.

Leave a Reply