Disclaimer: This is my personal blog. The views expressed on these pages are mine alone and not those of my employer.
Boss stands for Build your Own Search Service. The goal of Boss is to open up search to enable third parties to build incredibly useful and powerful search-based applications. Several months ago I pitched this idea to the executives on how Yahoo! can specifically open up its search assets to fragment the market. It’s remarkable to finally see some of the vision (with the help of many talented people) reach the public today.
Web search is a tough business to get into. $300+ Million capex, amazing talent, infrastructure, a prayer, etc. just to get close to basic parity. Only 3 companies have really pulled it off. However, I strongly believe we need to find innovative, incremental ways to spread the search love in order to encourage fragmentation and help promising companies get to basic parity instantly so that they can leverage their unique assets (new algorithm, user data, talent) to push their search solution beyond the current baseline.
Search is all about understanding the user’s intent. If we can nail the intent, then search is pretty much a solved problem. However, the current model of a single search box for everything loses an intent focus as it aims to cater to all people and queries. Albeit, a single search box definitely makes our lives easier, but I have a hard time believing this is the *right* approach.
In my online experience, I typically visit a variety of sites: Techmeme, Digg, Techcrunch, eBay, Amazon, del.icio.us, etc. While on these pages, something almost always catches my eye, and so I proceed to the search box in my browser to find out more on the web. Why do we have this disconnected experience? I think it’s because these sites do not provide web-level comprehensiveness. It’s unfortunate, because the page that I’m on may have additional information about my intent (maybe I’m logged in so it has my user info, or it’s a techy shopping site).
The biggest goal of Boss is to help bootstrap sites like these to get comprehensiveness and basic ranking for free, as well as offer tools to re-rank, blend, and overlay the results in a way that revolutionizes the search experience.
When I’m on del.icio.us, why can’t I search in their box, get relevant del.icio.us results at the top, and also have web results backfill below? I think users should be confident that if they searched in a search box on any page in the whole wide web that they’ll get results that are just as good as Yahoo/Google and only better.
The first milestone of Boss is a simple one: Make available a clean search API that turns off the traditional restrictions so that developers can totally control presentation, re-rank results, run an unlimited number of queries, and blend in external content all without having to include any Yahoo! attribution in the resulting product(s). Want to build the example above or put news search results on a map – go for it!
Here’s a link to the API:
http://developer.yahoo.com/search/boss/
Also, check out the Boss Mashup Framework:
http://developer.yahoo.com/search/boss/mashup.html
The Boss Mashup Framework in my opinion makes the Boss Search API really useful. It lets developers use SQL like syntax for operating on heterogeneous web data sources. The idea came up as I was working on examples to showcase Boss, and realized the operations I was developing imperatively followed closely to declarative SQL like constructs. Since it’s a recent idea and implementation, there may be some bugs or weird designs lurking in there, but I strongly recommend playing around with it and viewing the examples included in the package. I’m biased of course but do think it’s a fun framework for remixing online data. One can rank web results by digg and youtube favorite counts, remove duplicates, and publish the results using a provided search results page template in less than 30 lines of code and without having to specify any parsing logic of the data sources/API’s as the framework can infer the structure and unify the data formats automatically in most cases.
The next couple of milestones for Boss I think are even more interesting and disruptive – server side services, monetization, blending ranking models, more features exposure, query classifiers, open source … so stay tuned.
July 10, 2008 at 6:53 am |
[...] Vik Singh had the idea for BOSS, and posted Yahoo! Boss – An Insider’s View. It’s money line is this, and describes the big idea succinctly: “I think users should [...]
July 10, 2008 at 8:50 am |
Wow, this is really cool. Thanks for the additional insight. It is great to see that there are nearly no restrictions for BOSS. I have to try it out now.
July 10, 2008 at 10:30 am |
[...] Quellen: Yahoo Blog, CNET, TechCrunch, GigaOm, zooie’s, Webware, ReadWriteWeb, Futuristic [...]
July 10, 2008 at 1:14 pm |
Oh, inevitably Yahoo! becomes more and more my favourite company on the web! I love Yahoo! Pipes, and now – BOSS! Must try it out…
July 10, 2008 at 1:29 pm |
This is a great announcement. Thanks!
It seems that it is not possible to make a vertical search engine based on selected sites (something like Google CSE). Are you considering this as a next feature? If not, please do.
July 11, 2008 at 12:29 am |
You’re a rockstar, this is tremendous vision and really deserves to be congratulated! You may have changed the entire web by getting this implemented.
July 11, 2008 at 1:14 am |
thanks adrian – too kind. lots of people helped to make this happen. there’s plenty left to do – it’s the next couple of milestones that i think will really define open search.
July 11, 2008 at 4:20 am |
I agree, this is tremendous vision. I couldn’t agree with you more about user intent. I concur that this will change Search as we know it.
July 11, 2008 at 6:36 am |
[...] I like this post, which discusses some of the origins behind and future plans for [...]
July 11, 2008 at 7:53 am |
[...] Hier gibt es einige weitere Infos zu BOSS, von dem Entwickler, der’s erfunden hat. [...]
July 11, 2008 at 11:33 am |
[...] Xlinks digest is a interestig collection of links as discovered by the ProjectX team. Yahoo Boss – an insiders view Added on 07/11/2008 at 10:12AM Google Lively Added on 07/11/2008 at [...]
July 11, 2008 at 12:22 pm |
@Sergio. I haven’t looked at the framework, or the API (yet!), so I’m really out on a limb here, but couldn’t you just include the site(s) names as parameters in the search string?
BOSS – awesome concept, can’t wait to play with it. Thx all.
July 11, 2008 at 12:38 pm |
Very fast. Very useful. Very cool.
July 11, 2008 at 1:40 pm |
@Sergio: in the query you can include site: terms to do restrictions. also stay tuned for an upcoming operator that should help you here.
July 11, 2008 at 2:29 pm |
[...] To get an in-depth perspective of yahoo! Search Boss, you can also visit Yahoo! Boss – An Insider View. [...]
July 13, 2008 at 8:01 am |
[...] there’s Yodel Anecdotal’s launch post, Yahoo! Search Blog’s launch post, an insider’s view on Search BOSS, and of course one should check out Yahoo! Developer Network’s section on Yahoo! Search BOSS. [...]
July 14, 2008 at 12:57 am |
Great stuff zooie. Congratulations and thanks. My only fear is that if MS was going to acquire Yahoo it could kill off this wise and bold concept. How likely to happen do you consider this in case of a takeover? I guess this uncertainty doesn’t allow for the fastest and widest adoption possible.
July 14, 2008 at 4:45 am |
it’s a good question. don’t want to represent yahoo on this one but i’ll answer a part of it personally if that’s ok. i think once you go open (say boss not only provided web services but also the source code) it’s really hard to go back – whether a company later becomes a $100 billion dollars or part of microsoft. personally, i think going open can give a company a lot of leverage in these type of negotiations …
July 14, 2008 at 10:57 pm |
thanks zooie. I’m feeling better now
July 17, 2008 at 5:49 pm |
i m also feeling better
July 27, 2008 at 4:25 pm |
Hi Vik,
Why do we have a restriction of using a search box. I wanted to build a web site to help domaineers find valuable domain names out of thousands of domain names that expire daily based on certain factors most prominent being the search hits. I have hit a wall since none of the api’s from any search providers will let me do this in a script. This would result in probably max of 10,000 searches a day. All i care is what is the total number of search hits for a given keyword. Screen scraping is illegal and would result in banning of IP. There should be a way to allow using search api in scripts ( non web based) provided the api user advertises the BOSS api on the web site or any other condition which might be beneficial to both the api users and yahoo. Any Input?
Thanks.
July 28, 2008 at 11:14 pm |
hey arjun – just got off a plane (sorry for the delay). i agree with you. i’ll check with legal.
August 6, 2008 at 7:23 am |
Hello.
Are you going to allow to search for multiple domains at the same time via BOSS? Something like (site:site1.com OR site:site2.com OR site:site3.com).
August 6, 2008 at 4:40 pm |
@Michael: Yes we added better support for this recently.
Check out the ’sites’ parameter, which takes in a comma separated string of domains
For ex.
http://boss.yahooapis.com/ysearch/web/v1/query?sites=yahoo.com,disney.com
August 11, 2008 at 9:30 pm |
[...] pretty snazzy and quite amusing too, and oh yes, made with just 50 lines of code. Singh’s insider’s view on BOSS is an interesting read as [...]
August 16, 2008 at 5:15 pm |
Programs Bookmarks…
Remmrit.com user has just tagged your post as programs!…
November 17, 2008 at 11:35 am |
[...] Yahoo! Boss – An Insider View « zooie’s blog [...]
February 9, 2009 at 11:05 pm |
Vik,
Is is possible to search within sub-directories in multiple domains?
for example: ?sites=disney.com/toys, cnn.com/news, garden.com/green/trees …
I tried doing this with no success.
February 10, 2009 at 1:49 am |
one way to maybe approximate this is to use “inurl:” inside the query
for ex.
/web/v1/query+inurl:disney.com/toys+OR+inurl:cnn.com/news …
you could also play around with using the sites parameter in conjunction with the inurl query syntax
hope this helps
and if you can, please post your questions to the ysearchboss yahoo group so others could also benefit from this discussion
best
March 20, 2009 at 2:03 pm |
What’s the limit on how many domains can be added? We probably need 10,000 or so for our application.
March 29, 2009 at 1:55 am |
I echo william, and I think growing that limit is very important to allowing people to create truly useful vertical search engines with BOSS.
April 5, 2009 at 5:44 am |
Vic,
I tried to develop search results such as provided by kosmix.com via BOSS API and couldnt do this due to structure of search kosmix use. any suggestions???
amit
April 5, 2009 at 12:16 pm |
@amit
To approximate kosmix results, you’ll probably need to issue multiple BOSS API calls doing site restricts on different domains (blogs, web, flickr, etc.). These can be loaded fast via multithreading/processing and/or AJAXed modules for each domain of results.
Check out the “sites” parameter in BOSS for more details.
You might also find the “keyterms” and “searchmonkey_rss” parameters very useful. More to come for BOSS so stay tuned.
April 5, 2009 at 12:17 pm |
@William and @dinkler
We hear you. Working on something for you guys. Check out search.techcrunch.com (which is powered by a future BOSS platform) for a pseudo sneak peak.
June 25, 2009 at 5:19 pm |
[...] the openness and power of Yahoo! BOSS (you can read more about it in my previous posts here and here). Remarkably, many users found the service useful despite its slow performance, barebones UI, lack [...]
July 6, 2009 at 10:46 am |
[...] example, one non-search application of BOSS leveraged the Spelling service to spell correct video comments before handing them off to their [...]