<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>zooie&#039;s blog &#187; CSE</title>
	<atom:link href="http://zooie.wordpress.com/category/cse/feed/" rel="self" type="application/rss+xml" />
	<link>http://zooie.wordpress.com</link>
	<description>vik singh&#039;s (mainly techy) thoughts</description>
	<lastBuildDate>Sun, 18 Oct 2009 21:55:36 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='zooie.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/fbfd4e3186e2ecd3a7ad448bf907c50f?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>zooie&#039;s blog &#187; CSE</title>
		<link>http://zooie.wordpress.com</link>
	</image>
			<item>
		<title>Google Co-op just got del.icio.us!</title>
		<link>http://zooie.wordpress.com/2007/01/03/google-co-op-just-got-delicious/</link>
		<comments>http://zooie.wordpress.com/2007/01/03/google-co-op-just-got-delicious/#comments</comments>
		<pubDate>Wed, 03 Jan 2007 02:01:30 +0000</pubDate>
		<dc:creator>Vik</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[CS]]></category>
		<category><![CDATA[CSE]]></category>
		<category><![CDATA[Co-op]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Machine Learning]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Tagging]]></category>

		<guid isPermaLink="false">http://zooie.wordpress.com/2007/01/03/google-co-op-just-got-delicious/</guid>
		<description><![CDATA[Update: Sorry, link is going up and down. Worth trying, but will try to find a more stable option when time cycles free up. 
This past week I decided to cook up a service (link in bold near the middle of this post) I feel will greatly assist users in developing advanced Google Custom Search [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=zooie.wordpress.com&blog=31469&post=24&subd=zooie&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><i>Update: Sorry, link is going up and down. Worth trying, but will try to find a more stable option when time cycles free up. </i></p>
<p>This past week I decided to cook up a service (link in bold near the middle of this post) I feel will greatly assist users in developing advanced Google Custom Search Engines (CSE&#8217;s). I read through the Co-op discussion posts, digg/blog comments, reviews, emails, etc. and learned many of our users are fascinated by the refinements feature &#8211; in particular, building search engines that produce results like this:</p>
<p><a href="http://vik.singh.googlepages.com/mlsearchresults3?cx=007096099364872597928%3Aml_context&amp;q=linear+regression&amp;sa=Search&amp;cof=FORID%3A11#1411">&#8216;linear regression&#8221; on my Machine Learning Search Engine</a></p>
<p>&#8230; but unfortunately, many do not know how to do this nor understand/want to hack up the XML. Additionally, I think it&#8217;s fair to say many users interested in building advanced CSE&#8217;s have already done similar site tagging/bookmarking through services like <a href="http://del.icio.us/" target="_blank">del.icio.us</a>. <a href="http://del.icio.us/" target="_blank"> del.icio.us</a> really is great. Here are a couple of reasons why people should (and do) use <a href="http://del.icio.us/" target="_blank">del.icio.us</a>:</p>
<ul>
<li>It&#8217;s simple and clean</li>
<li>You can multi-tag a site quickly (comma separated field; don&#8217;t have to keep reopening the bookmarklet like with Google&#8217;s)</li>
<li>You can create new tags on the fly (don&#8217;t choose the labels from a fixed drop-down like with Google&#8217;s)</li>
<li>The bookmarklet provides auto-complete tag suggestions; shows you the popular tags others have used for that current site</li>
<li>Can have bundles (two level tag hierarchies)</li>
<li>Can see who else has bookmarked the site (can also view their comments); builds a user community</li>
<li>Generates a public page serving all your bookmarks</li>
</ul>
<p>Understandably, we received several requests to support <a href="http://del.icio.us/" target="_blank"> del.icio.us</a> bookmark importing. My part-time role with Google just ended last Friday, so, as a non-Googler, I decided to build this project. Initially, I was planning to write a simple service to convert <a href="http://del.icio.us/" target="_blank"> del.icio.us</a> bookmarks into CSE annotations &#8211; and that&#8217;s it &#8211; but realized, as I learned more about <a href="http://del.icio.us/" target="_blank">del.icio.us</a>, that there were several additional features I could develop that would make our users&#8217; lives even easier. Instead of just generating the annotations, I decided to also generate the CSE contexts as well.</p>
<p><b><font size="3">Ok, enough talk, here&#8217;s the final product:</font><br />
<a href="http://basundi.com:8000/login.html" target="_blank">http://basundi.com:8000/login.html</a></b></p>
<p>If you don&#8217;t have a <a href="http://del.icio.us/" target="_blank">del.icio.us</a> account, and just want to see how it works, then shoot me an email (check the bottom of the Bio page) and I&#8217;ll send you a dummy account to play with (can&#8217;t publicize it or else people might spam it or change the password).</p>
<p>Here&#8217;s a quick feature list:</p>
<ul>
<li>Can build a full search engine (like the machine learning one above) in two steps, without having to edit any XML, and in less than two minutes</li>
<li>Auto-generates the CSE annotations XML from your <a href="http://del.icio.us/" target="_blank">del.icio.us</a> bookmarks and tags</li>
<li>Provides an option to auto-generate CSE annotations just for <a href="http://del.icio.us/" target="_blank">del.icio.us</a> bookmarks that have a particular tag</li>
<li>Provides an option to Auto-calculate each annotation&#8217;s boost score (log normalizes over the max # of Others per bookmark)</li>
<li>Provides an option to Auto-expand links (appends a wildcard * to any links that point to a directory)</li>
<li>Auto-generates the CSE context XML</li>
<li>Auto-generates facet titles</li>
<li>Since there&#8217;s a four facet by five labels restriction (that&#8217;s the max that one can fit in the refinements display on the search results page), I provide two options for automatic facet/refinement generation:
<ul>
<li>The first uses a machine learning algorithm to find the four most frequent disjoint 5-item-sets (based on the # of <a href="http://del.icio.us/" target="_blank">del.icio.us</a> tag co-occurrences; it then does query-expansion over the tag sets to determine good facet titles)</li>
<li>The other option returns the user&#8217;s most popular <a href="http://del.ico.us/" target="_blank">del.ico.us</a> bundles and corresponding tags</li>
<li>Any refinements that do not make it in the top 4 facets are dumped in a fifth facet in order of popularity. If you don&#8217;t understand this then don&#8217;t worry, you don&#8217;t need to! The point is all of this is automated for you (just use the default Cluster option). If you want control over which refinements/facets get displayed, then just choose Bundle.</li>
</ul>
</li>
<li>Provides help documentation links at key steps</li>
<li>And best of all &#8230; You don&#8217;t need to understand the advanced options of Google CSE/Co-op to build an advanced CSE! This seriously does all the hard, tedious work for you!</li>
</ul>
<p>In my opinion, there&#8217;s no question that this is the easiest way to make a fancy search engine. If I make any future examples I&#8217;m using this &#8211; I can simply use <a href="http://del.icio.us/" target="_blank">del.icio.us</a>, sign-in to this service, and voila I have a search engine with facets and multi-label support.</p>
<p><font size="1"><br />
Please note that this tool is not officially endorsed by nor affiliated with Google or Yahoo! It was just something I wanted to work on for fun that I think will benefit many users (including myself). Also, send your feedback/issues/bugs to me or post them on this blog.</font></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/zooie.wordpress.com/24/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/zooie.wordpress.com/24/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/zooie.wordpress.com/24/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/zooie.wordpress.com/24/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/zooie.wordpress.com/24/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/zooie.wordpress.com/24/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/zooie.wordpress.com/24/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/zooie.wordpress.com/24/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/zooie.wordpress.com/24/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/zooie.wordpress.com/24/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/zooie.wordpress.com/24/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/zooie.wordpress.com/24/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=zooie.wordpress.com&blog=31469&post=24&subd=zooie&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://zooie.wordpress.com/2007/01/03/google-co-op-just-got-delicious/feed/</wfw:commentRss>
		<slash:comments>73</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/17518bf0a462f22fc174f2df8e464e69?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">zooie</media:title>
		</media:content>
	</item>
	</channel>
</rss>