<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Chaoticity &#187; Web</title>
	<atom:link href="http://chaoticity.com/category/web/feed/" rel="self" type="application/rss+xml" />
	<link>http://chaoticity.com</link>
	<description>a state of things in which chance is supreme</description>
	<lastBuildDate>Mon, 23 Jan 2012 20:05:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>How to change font on the BBC Urdu website</title>
		<link>http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/</link>
		<comments>http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/#comments</comments>
		<pubDate>Tue, 17 May 2011 23:50:14 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[chaos]]></category>
		<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Urdu]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/</guid>
		<description><![CDATA[Let’s face it. The font on the BBC Urdu website sucks. When a friend complained about it on our alumni list, I thought of writing a small greasemonkey script to take&#160; care of the problem. The results are pretty good, as visible in the image below. The left part is the site after installing the [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share"><div style="width:100px;" class="really_simple_share_facebook_like"> 
				<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fhow-to-change-font-on-the-bbc-urdu-website%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
					scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe>
			</div><div style="width:110px;" class="really_simple_share_twitter"> 
				<a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
					data-text="How to change font on the BBC Urdu website" data-url="http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/"></a> 
			</div></div>
		<div style="clear:both;"></div><p align="justify">Let’s face it. The font on the <a href="http://www.bbc.co.uk/urdu/" target="_blank">BBC Urdu website</a> sucks. When a friend complained about it on our alumni list, I thought of writing a small greasemonkey script to take&#160; care of the problem. The results are pretty good, as visible in the image below. The left part is the site after installing the Urdu Naskh Asia type font provided by BBC and before installing the script (and I maintain, Aijaz, it sucks). The right part is after installing the script. Click on the image and you’ll get an un-scaled version. </p>
<p align="justify">To install the script, click the link below and follow the installation instructions given there. Currently, it works only on Chrome and Firefox.</p>
<p align="center"><a href="http://userscripts.org/scripts/show/103069" target="_blank">BBC Urdu Font Changer @ userscripts.org</a></p>
<p>and the world is a bit better now…</p>
<p><a href="http://chaoticity.com/images/change1.png" target="_blank"><img style="background-image: none; border-bottom: 0px; border-left: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top: 0px; border-right: 0px; padding-top: 0px" title="change" border="0" alt="change" src="http://chaoticity.com/images/change_thumb1.png" width="600" height="313" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>A challenge in time-limited search</title>
		<link>http://chaoticity.com/a-challenge-in-time-limited-search/</link>
		<comments>http://chaoticity.com/a-challenge-in-time-limited-search/#comments</comments>
		<pubDate>Tue, 23 Nov 2010 17:57:23 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[chaos]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[header]]></category>
		<category><![CDATA[http]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[time]]></category>

		<guid isPermaLink="false">http://chaoticity.com/a-challenge-in-time-limited-search/</guid>
		<description><![CDATA[I was trying to trace the source of the quote “Any sufficiently advanced financial instrument is indistinguishable from fraud.”. If you do a quoted google search on a custom date range, an interesting problem can be seen. The results contain pages originally published in 2005 but re-indexed recently. While re-indexing, the current tweets of the [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share"><div style="width:100px;" class="really_simple_share_facebook_like"> 
				<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fa-challenge-in-time-limited-search%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
					scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe>
			</div><div style="width:110px;" class="really_simple_share_twitter"> 
				<a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
					data-text="A challenge in time-limited search" data-url="http://chaoticity.com/a-challenge-in-time-limited-search/"></a> 
			</div></div>
		<div style="clear:both;"></div><p>I was trying to trace the source of the quote “Any sufficiently advanced financial instrument is indistinguishable from fraud.”. If you do a quoted <a href="http://www.google.co.uk/search?q=%22Any+sufficiently+advanced+financial+instrument+is+indistinguishable+from+fraud.%22&amp;tbs=cdr:1,cd_min:1/1/2005,cd_max:1/1/2006" target="_blank">google search on a custom date range</a>, an interesting problem can be seen.</p>
<p><a href="http://chaoticity.com/images/timesearch.jpg"><img style="display: inline; border-width: 0px;" title="timesearch" src="http://chaoticity.com/images/timesearch_thumb.jpg" border="0" alt="timesearch" width="561" height="275" /></a></p>
<p>The results contain pages originally published in 2005 but re-indexed recently. While re-indexing, the current tweets of the author were visible to the crawler and got indexed along with the original article. This makes it seem like the quoted text was mentioned first in 2005 where as originally it’s only a recent meme.</p>
<p>One way to avoid this might focus on identifying dynamic widgets like twitter/news/weather feeds and eliminating them from the index.  The HTTP Header (pasted below) lists the last-updated date which probably means that google is either getting the date from the first time it indexed the post or from the URL itself. Whatever the case is, it’s an interesting problem to distinguish between the ‘original’ content and other dynamically added elements on a page.</p>
<p><code>HTTP/1.1 200 OK<br />
</code><code>Server: nginx Date: Tue, 23 Nov 2010 17:50:20 GMT<br />
Content-Type: text/html; charset=UTF-8<br />
Transfer-Encoding: chunked<br />
Connection: close<br />
Vary: Cookie, Accept-Encoding<br />
X-hacker: If you're reading this, you should visit automattic.com/jobs and apply to join the fun, mention this header. X-Pingback: </code></p>
<p>P.S. An interesting way to advertise. You have read this header anyway so you might want to apply for the job :)</p>
<p>P.P.S. On second thought, it&#8217;s not much of a &#8216;challenge&#8217; per se. It&#8217;s just an interesting problem.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/a-challenge-in-time-limited-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google as a Question Answering System</title>
		<link>http://chaoticity.com/google-as-a-question-answering-system/</link>
		<comments>http://chaoticity.com/google-as-a-question-answering-system/#comments</comments>
		<pubDate>Sat, 06 Feb 2010 04:34:58 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[Language]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[MRR]]></category>
		<category><![CDATA[Question Answer]]></category>
		<category><![CDATA[Summary]]></category>

		<guid isPermaLink="false">http://chaoticity.com/google-as-a-question-answering-system/</guid>
		<description><![CDATA[A Question Answering (QA) system is an Information Retrieval system which gives the answer to a question posed in natural language. For example, if you ask it Who wrote Hamlet?, it should answer Shakespeare. A few years ago (don’t ask me how many), search engines did not focus on language queries. Recently [sic], Google has [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share"><div style="width:100px;" class="really_simple_share_facebook_like"> 
				<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fgoogle-as-a-question-answering-system%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
					scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe>
			</div><div style="width:110px;" class="really_simple_share_twitter"> 
				<a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
					data-text="Google as a Question Answering System" data-url="http://chaoticity.com/google-as-a-question-answering-system/"></a> 
			</div></div>
		<div style="clear:both;"></div><p>A <a href="http://en.wikipedia.org/wiki/Question_answering">Question Answering</a> (QA) system is an Information Retrieval system which gives the answer to a question posed in natural language. For example, if you ask it <i>Who wrote Hamlet?</i>, it should answer <i>Shakespeare</i>. A few years ago (don’t ask me how many), search engines did not focus on language queries. Recently [sic], Google has started incorporating some NLP (Natural Language Processing) in their results. You can try it out by typing the same question in the search box yourself ( <a href="http://www.google.co.uk/search?q=Who+wrote+Hamlet">or clicking here</a> ). </p>
<p><a href="http://chaoticity.com/images/image9.png"><img title="image" style="border-right: 0px; border-top: 0px; display: inline; margin-left: 0px; border-left: 0px; margin-right: 0px; border-bottom: 0px" height="199" alt="image" src="http://chaoticity.com/images/image_thumb9.png" width="330" align="right" border="0" /></a> </p>
<p>During my <a href="http://www.cl.cam.ac.uk/admissions/cstit/">M.Phil. course</a>, one of the tasks was to build a basic QA system and extend it however we liked. We used the <a href="http://trec.nist.gov/data/qa/t8_qadata.html">TREC 8 dataset</a> for evaluations. While building the system, I evaluated how current search engines (read Google) performed on this task. For this, I just queried the exact question and used the summaries of the top five results as answers. Evaluating at that time (2008), I got a <a href="http://en.wikipedia.org/wiki/Mean_reciprocal_rank">Mean Reciprocal Rank</a> (MRR) score of <b>0.212</b> over 198 questions. 156 questions had no answers found in top 5 responses.</p>
<p>This term, I am demonstrating for the same task. Demonstrators are usually PhD students who provide help and guidance to junior students. For pure geek fun and lack of better things to do while taking a break, I decided to quickly jolt down a JavaScript (read&#160; <a href="http://chaoticity.com/wp-admin/jquery.com">JQuery</a> ) based QA system. This time,&#160; the resulting MRR score over 198 questions was <b>0.384 </b>while only 79 questions had no answers found in top 5 responses.</p>
<p>The results show clearly that during the last two years, Google has significantly improved on answering NLP queries. In fact (IIRC), my baseline system back in 2008 (based on <a href="http://www.cl.cam.ac.uk/~aac10/papers/rmrsdraft.pdf" target="_blank">RMRS</a> based matching of sentences from the top 100 documents returned by an IR system) could only achieve an MRR score of approximately <b>0.290, </b>showing that the current results are much better than that baseline. I hope this decade sees some more developments/improvements in QA systems and I can ask a system <em>What do you get if you multiply</em> <em>six by nine?</em></p>
<blockquote><p>I&#8217;ve always said there was something fundamentally wrong with the universe. <strong>~Arthur Dent</strong></p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/google-as-a-question-answering-system/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>outwit</title>
		<link>http://chaoticity.com/outwit/</link>
		<comments>http://chaoticity.com/outwit/#comments</comments>
		<pubDate>Fri, 26 Jun 2009 20:26:02 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[Customized]]></category>
		<category><![CDATA[Outwit]]></category>
		<category><![CDATA[Twitter Client]]></category>

		<guid isPermaLink="false">http://chaoticity.com/outwit/</guid>
		<description><![CDATA[I am playing around with a customized twitter client, temporarily named ‘outwit’. I’ll try to add up the features as I need them but its strictly an experiment for the time being. Let’s see if things go smoothly from here.]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share"><div style="width:100px;" class="really_simple_share_facebook_like"> 
				<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Foutwit%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
					scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe>
			</div><div style="width:110px;" class="really_simple_share_twitter"> 
				<a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
					data-text="outwit" data-url="http://chaoticity.com/outwit/"></a> 
			</div></div>
		<div style="clear:both;"></div><p>I am playing around with a customized twitter client, temporarily named ‘outwit’. I’ll try to add up the features as I need them but its strictly an experiment for the time being. Let’s see if things go smoothly from here.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/outwit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Creating Canonical / Normalized Links and URLs</title>
		<link>http://chaoticity.com/canonical_links/</link>
		<comments>http://chaoticity.com/canonical_links/#comments</comments>
		<pubDate>Mon, 16 Feb 2009 21:52:21 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Web]]></category>
		<category><![CDATA[canonical]]></category>
		<category><![CDATA[normalized]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[url]]></category>

		<guid isPermaLink="false">http://chaoticity.com/?p=36</guid>
		<description><![CDATA[Here is a short intro on how to make sure that major search engines (Google, Yahoo, Microsoft) can be directed to see different URLs with the same content as a single &#8216;conanical&#8217; URL. For example, the following links point to the same page but have different URLs http://nu.edu.pk/default.aspx http://nu.edu.pk http://www.nu.edu.pk/default.aspx http://www.nu.edu.pk/ http://nu.edu.pk/default.ASPX The solution is [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share"><div style="width:100px;" class="really_simple_share_facebook_like"> 
				<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fcanonical_links%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
					scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe>
			</div><div style="width:110px;" class="really_simple_share_twitter"> 
				<a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
					data-text="Creating Canonical / Normalized Links and URLs" data-url="http://chaoticity.com/canonical_links/"></a> 
			</div></div>
		<div style="clear:both;"></div><p><a href="http://www.mattcutts.com/blog/canonical-link-tag/">Here</a> is a short intro on how to make sure that major search engines (Google, Yahoo, Microsoft) can be directed to see different URLs with the same content as a single &#8216;conanical&#8217; URL. For example, the following links point to the same page but have different URLs</p>
<ul style="margin-left: 1.5em;">
<li><a href="http://nu.edu.pk/default.aspx">http://nu.edu.pk/default.aspx</a></li>
<li><a href="http://nu.edu.pk">http://nu.edu.pk</a></li>
<li><a href="http://www.nu.edu.pk/default.aspx">http://www.nu.edu.pk/default.aspx</a></li>
<li><a href="http://www.nu.edu.pk/">http://www.nu.edu.pk/</a></li>
<li><a href="http://nu.edu.pk/default.ASPX">http://nu.edu.pk/default.ASPX</a></li>
</ul>
<p>The solution is to select a single point as your representative URL and include this line in the HTML code.</p>
<p>Although a standard 301 redirect should work too, but this would be a bit easier for the non-techie designers and SEO enthusiasts to implement.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/canonical_links/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Am I asking the right question?</title>
		<link>http://chaoticity.com/am-i-asking-the-right-question/</link>
		<comments>http://chaoticity.com/am-i-asking-the-right-question/#comments</comments>
		<pubDate>Tue, 10 Feb 2009 18:35:22 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Web]]></category>
		<category><![CDATA[Ask]]></category>
		<category><![CDATA[Niche]]></category>
		<category><![CDATA[Question]]></category>
		<category><![CDATA[Web Development]]></category>

		<guid isPermaLink="false">http://chaoticity.com/?p=34</guid>
		<description><![CDATA[Twitter asks &#8216;What are you doing?&#8217; Google Latitude asks &#8216;Where are you?&#8217; del.icio.us asks &#8216;What are you browsing?&#8217; As in science, one of the most important things in web application development is thus asking the right question (at the right time). A properly phrased question can identify your niche along with giving you a guideline [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share"><div style="width:100px;" class="really_simple_share_facebook_like"> 
				<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fam-i-asking-the-right-question%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
					scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe>
			</div><div style="width:110px;" class="really_simple_share_twitter"> 
				<a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
					data-text="Am I asking the right question?" data-url="http://chaoticity.com/am-i-asking-the-right-question/"></a> 
			</div></div>
		<div style="clear:both;"></div><p><a title="Twitter" href="http://twitter.com">Twitter</a> asks &#8216;What are you doing?&#8217; <a title="Google Latitude" href="http://http://www.google.com/latitude/"><br />
Google Latitude</a> asks &#8216;Where are you?&#8217;<br />
<a title="deli.cio.us" href="http://deli.cio.us">del.icio.us</a> asks &#8216;What are you browsing?&#8217;</p>
<p>As in science, one of the most important things in web application development is thus asking the right question (at the right time). A properly phrased question can identify your niche along with giving you a guideline for keeping your focus on a single problem.<br />
As some people might notice, I tend NOT to write much anyway. So until the writer in me is back (if there _is_ one at all), I&#8217;ll try to write something even if its a one liner.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/am-i-asking-the-right-question/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

