<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Chaoticity &#187; Software</title>
	<atom:link href="http://chaoticity.com/category/software/feed/" rel="self" type="application/rss+xml" />
	<link>http://chaoticity.com</link>
	<description>a state of things in which chance is supreme</description>
	<lastBuildDate>Wed, 09 May 2012 17:35:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Twingual: A twitter client for bilingual tweeple</title>
		<link>http://chaoticity.com/twingual-a-twitter-client-for-bilingual-tweeple/</link>
		<comments>http://chaoticity.com/twingual-a-twitter-client-for-bilingual-tweeple/#comments</comments>
		<pubDate>Wed, 09 May 2012 17:35:50 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[Language]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Web]]></category>
		<category><![CDATA[bilingual]]></category>
		<category><![CDATA[client]]></category>
		<category><![CDATA[twingual]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://chaoticity.com/?p=518</guid>
		<description><![CDATA[In my last post, I highlighted some problems that I face daily while using twitter in Urdu as well in English. A few days ago, I decided to experiment with the Twitter API and write my own client to fix some of these problems. You can see the result at www.twingual.com. It is a javascript [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Ftwingual-a-twitter-client-for-bilingual-tweeple%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="Twingual: A twitter client for bilingual tweeple" data-url="http://chaoticity.com/twingual-a-twitter-client-for-bilingual-tweeple/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p align="justify"><a title="Twingual" href="http://www.twingual.com" target="_blank"><img style="background-image: none; border-bottom: 0px; border-left: 0px; margin: 0px 5px 0px 0px; padding-left: 0px; padding-right: 0px; display: inline; float: left; border-top: 0px; border-right: 0px; padding-top: 0px" title="Twingual" border="0" alt="Twingual" align="left" src="http://chaoticity.com/images/bg.png" width="138" height="140" /></a>In my <a href="http://chaoticity.com/nastaleeq-urdu-typesetting-when-will-they-get-it-right/" target="_blank">last post</a>, I highlighted some problems that I face daily while using twitter in Urdu as well in English. A few days ago, I decided to experiment with the Twitter API and write my own client to fix some of these problems. You can see the result at <a href="http://www.twingual.com" target="_blank">www.twingual.com</a>. It is a javascript only twitter client which supports neat Nastaleeq urdu fonts as well as transliteration.</p>
<p align="justify">It’s a work in progress and does not implement all twitter features. If you like it and want to see something you need everyday implemented, feel free to send a tweet. </p>
<p align="justify">Meanwhile, tweet away! </p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/twingual-a-twitter-client-for-bilingual-tweeple/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to change font on the BBC Urdu website</title>
		<link>http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/</link>
		<comments>http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/#comments</comments>
		<pubDate>Tue, 17 May 2011 23:50:14 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[chaos]]></category>
		<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Urdu]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/</guid>
		<description><![CDATA[Update: BBC has now embedded a new font (BBCNasim) on their website which is quite good. In fact I don&#8217;t use this plugin any more myself. It&#8217;s not Nastaleeq but it is good enough. Let’s face it. The font on the BBC Urdu website is not that good. When a friend complained about it on [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fhow-to-change-font-on-the-bbc-urdu-website%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="How to change font on the BBC Urdu website" data-url="http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p><strong>Update: BBC has now embedded a new font (BBCNasim) on their website which is quite good. In fact I don&#8217;t use this plugin any more myself. It&#8217;s not Nastaleeq but it is good enough.</strong></p>
<p align="justify">Let’s face it. The font on the <a href="http://www.bbc.co.uk/urdu/" target="_blank">BBC Urdu website</a> is not that good. When a friend complained about it on our alumni list, I thought of writing a small greasemonkey script to take  care of the problem. The results are pretty good, as visible in the image below. The left part is the site after installing the Urdu Naskh Asia type font provided by BBC and before installing the script (and I maintain, Aijaz, it is not good). The right part is after installing the script. Click on the image and you’ll get an un-scaled version.</p>
<p align="justify">To install the script, click the link below and follow the installation instructions given there. Currently, it works only on Chrome and Firefox.</p>
<p align="center"><a href="http://userscripts.org/scripts/show/103069" target="_blank">BBC Urdu Font Changer @ userscripts.org</a></p>
<p>and the world is a bit better now…</p>
<p><a href="http://chaoticity.com/images/change1.png" target="_blank"><img style="background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;" title="change" src="http://chaoticity.com/images/change_thumb1.png" alt="change" width="600" height="313" border="0" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/how-to-change-font-on-the-bbc-urdu-website/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>DependenSee: A Dependency Parse Visualisation/Visualization Tool</title>
		<link>http://chaoticity.com/dependensee-a-dependency-parse-visualisation-tool/</link>
		<comments>http://chaoticity.com/dependensee-a-dependency-parse-visualisation-tool/#comments</comments>
		<pubDate>Sat, 28 Aug 2010 18:36:21 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Dependency]]></category>
		<category><![CDATA[Graph]]></category>
		<category><![CDATA[Parser]]></category>
		<category><![CDATA[Stanford]]></category>
		<category><![CDATA[Tree]]></category>
		<category><![CDATA[Typed]]></category>
		<category><![CDATA[Visualisation]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://chaoticity.com/dependensee-a-dependency-parse-visualisation-tool/</guid>
		<description><![CDATA[&#160; There aren’t many tools which allow you to visualise sentences parsed with dependency grammars. Here’s a small tool which generates a PNG of the dependency graph of a given sentence using the Stanford Parser. You can generate the image for Einey’s quote below by following these steps. Click here to download DependenSee.jar. Download the [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fdependensee-a-dependency-parse-visualisation-tool%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="DependenSee: A Dependency Parse Visualisation/Visualization Tool" data-url="http://chaoticity.com/dependensee-a-dependency-parse-visualisation-tool/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p>&nbsp;</p>
<p>There aren’t many tools which allow you to visualise sentences parsed with <a href="http://en.wikipedia.org/wiki/Dependency_grammar" target="_blank">dependency grammars</a>. Here’s a small tool which generates a PNG of the dependency graph of a given sentence using the <a href="http://nlp.stanford.edu/software/lex-parser.shtml" target="_blank">Stanford Parser</a>. You can generate the image for Einey’s quote below by following these steps.</p>
<p><a href="http://chaoticity.com/images/out.png"><img style="display: inline; border: 0px;" title="out" src="http://chaoticity.com/images/out_thumb.png" border="0" alt="out" width="596" height="212" /></a></p>
<ol>
<li>Click here to download <code><a href="http://chaoticity.com/software/DependenSee.jar" target="_blank">DependenSee.jar</a></code>.</li>
<li>Download the <a href="http://nlp.stanford.edu/software/lex-parser.shtml#Download" target="_blank">latest version of the Stanford Parser</a>.  I am using version 2.0.1 (For older versions, drop me an email)</li>
<li>Extract <code>stanford-parser.jar</code> and <code>stanford-parser-2012-03-09-models.jar</code> in the same folder as <code>DependenSee.jar</code>.</li>
<li>On the command prompt, run <br/><code>java -cp DependenSee.jar;stanford-parser.jar;stanford-parser-2012-03-09-models.jar com.chaoticity.dependensee.Main "Example isn't another way to teach, it is the only way to teach." out.png</code> <br/>(If you are on *nix, replace the semicolon by a colon and make sure you have Arial installed. If you have an already parsed dependency output file, replace the sentence by <code>-t input.txt</code> .)</li>
<li>Open <code>out.png</code> and admire :)</li>
</ol>
<p>I have added Part-of-Speech tags and very basic edge overlap management and might add more eye candy later (curved/coloured edges ?). You can link the library in your code as well. An example is given below. Comments and queries are welcome.</p>
<p><code><br />
import edu.stanford.nlp.trees.*;<br />
import edu.stanford.nlp.parser.lexparser.*;<br />
import com.chaoticity.dependensee.*;<br />
import java.util.Collection;<br />
class Test {<br />
   public static void main(String []args) throws Exception {<br />
      String text = "A quick brown fox jumped over the lazy dog.";<br />
      TreebankLanguagePack tlp = new PennTreebankLanguagePack();<br />
      GrammaticalStructureFactory gsf = tlp.grammaticalStructureFactory();<br />
      LexicalizedParser lp = LexicalizedParser.loadModel();<br />
      Tree tree = lp.apply(text);<br />
      GrammaticalStructure gs = gsf.newGrammaticalStructure(tree);<br />
      Collection tdl = gs.typedDependenciesCCprocessed(true);<br />
      Main.writeImage(tree,tdl, "image.png",3);<br />
  }<br />
}<br />
</code></p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/dependensee-a-dependency-parse-visualisation-tool/feed/</wfw:commentRss>
		<slash:comments>77</slash:comments>
		</item>
		<item>
		<title>Making a copy of WEKA Instances</title>
		<link>http://chaoticity.com/making-a-copy-of-weka-instances/</link>
		<comments>http://chaoticity.com/making-a-copy-of-weka-instances/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 21:54:09 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[attribute]]></category>
		<category><![CDATA[classification]]></category>
		<category><![CDATA[dataset]]></category>
		<category><![CDATA[instances]]></category>
		<category><![CDATA[string]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[weka]]></category>

		<guid isPermaLink="false">http://chaoticity.com/making-a-copy-of-weka-instances/</guid>
		<description><![CDATA[This ‘thing’ took about 30 minutes to figure out. According to the WEKA documentation, if  you add a new Instance to an existing Instances object, String values are not transferred ! In case you are working on copying a dataset with a string attribute, you need to transfer the string manually. The code segment below [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fmaking-a-copy-of-weka-instances%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="Making a copy of WEKA Instances" data-url="http://chaoticity.com/making-a-copy-of-weka-instances/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p><a href="http://chaoticity.com/images/image11.png"><img style="display: inline; margin-left: 0px; margin-right: 0px; border-width: 0px;" title="image" src="http://chaoticity.com/images/image_thumb11.png" border="0" alt="image" width="128" height="149" align="right" /></a>This ‘thing’ took about 30 minutes to figure out. According to the <a href="http://weka.sourceforge.net/doc/weka/core/Instances.html#add(weka.core.Instance)">WEKA documentation</a>, if  you add a new Instance to an existing Instances object, <strong>String values are not transferred</strong> <strong>! </strong>In case you are working on copying a dataset with a string attribute, you need to transfer the string manually. The code segment below copies the i^th instance from source to dest where the first attribute (at index 0) is a string attribute.</p>
<p><code><br />
dest.add(source.instance(i));<br />
dest.instance(dest.numInstances()-1)<br />
    .setValue(0,source.instance(i).toString(0));<br />
</code></p>
<p>This should come in handy for text classification using WEKA (and hopefully save your time).</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/making-a-copy-of-weka-instances/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google and Urdu Stemming</title>
		<link>http://chaoticity.com/google-and-urdu-stemming/</link>
		<comments>http://chaoticity.com/google-and-urdu-stemming/#comments</comments>
		<pubDate>Fri, 05 Mar 2010 02:30:25 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Urdu]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[stemming]]></category>

		<guid isPermaLink="false">http://chaoticity.com/google-and-urdu-stemming/</guid>
		<description><![CDATA[&#160; Is google (finally) stemming Urdu? The last time I checked, there were doing something like a transliteration based search but in the screenshot below, you can see that searching for the phrase ان پڑھ چٹا shows some stemming is being used. Does anyone know anything?&#160; Oh, and while I’m on this topic, I would [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fgoogle-and-urdu-stemming%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="Google and Urdu Stemming" data-url="http://chaoticity.com/google-and-urdu-stemming/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p>&#160;</p>
<p>Is google (finally) stemming Urdu? <a href="http://scalar.wordpress.com/2008/06/02/stemming-in-urdu-and-google/" target="_blank">The last time I checked</a>, there were doing something like a transliteration based search but in the screenshot below, you can see that <a href="http://www.google.co.uk/search?q=+%D8%A7%D9%86+%D9%BE%DA%91%DA%BE+%DA%86%D9%B9%D8%A7" target="_blank">searching for the phrase ان پڑھ چٹا</a> shows some stemming is being used. Does anyone know anything?&#160; Oh, and while I’m on this topic, I would also like to know why is it called چٹا ان پڑھ ?</p>
<p><a href="http://chaoticity.com/images/image10.png"><img title="image" style="border-right: 0px; border-top: 0px; display: inline; border-left: 0px; border-bottom: 0px" height="770" alt="image" src="http://chaoticity.com/images/image_thumb10.png" width="514" border="0" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/google-and-urdu-stemming/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Visualizing Citation Networks</title>
		<link>http://chaoticity.com/visualizing-citation-networks/</link>
		<comments>http://chaoticity.com/visualizing-citation-networks/#comments</comments>
		<pubDate>Thu, 04 Feb 2010 04:59:07 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[ACL]]></category>
		<category><![CDATA[Anthology]]></category>
		<category><![CDATA[Citation]]></category>
		<category><![CDATA[GraphViz]]></category>
		<category><![CDATA[Network]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://chaoticity.com/visualizing-citation-networks/</guid>
		<description><![CDATA[For techies: I’ve been working on citation networks lately. You can visualize such a network as a graph. In this graph, the nodes represent publications (papers,articles etc) and the edges represent citations between them. The graph above was produced using the GraphViz. The data is from the ACL Anthology Network which contains publications from the [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fvisualizing-citation-networks%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="Visualizing Citation Networks" data-url="http://chaoticity.com/visualizing-citation-networks/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p><a href="http://chaoticity.com/images/aclnet.jpg"><img title="aclnet" style="border-right: 0px; border-top: 0px; display: inline; border-left: 0px; border-bottom: 0px" height="500" alt="aclnet" src="http://chaoticity.com/images/aclnet_thumb.jpg" width="545" border="0" /></a> </p>
<p><strong>For techies: </strong>I’ve been working on citation networks lately. You can visualize such a network as a graph. In this graph, the nodes represent publications (papers,articles etc) and the edges represent citations between them. The graph above was produced using the <a href="http://www.graphviz.org/" target="_blank">GraphViz</a>. The data is from the <a href="http://clair.si.umich.edu/clair/anthology/" target="_blank">ACL Anthology Network</a> which contains publications from the publicly available <a href="http://aclweb.org/" target="_blank">ACL Anthology</a>. </p>
<p><strong>For non-techies: </strong>Oooooo! pretty picture!</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/visualizing-citation-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online English to Urdu Translator</title>
		<link>http://chaoticity.com/online-english-to-urdu-translator/</link>
		<comments>http://chaoticity.com/online-english-to-urdu-translator/#comments</comments>
		<pubDate>Sat, 23 Jan 2010 19:39:33 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Urdu]]></category>
		<category><![CDATA[english]]></category>
		<category><![CDATA[hindi]]></category>
		<category><![CDATA[translation]]></category>
		<category><![CDATA[transliteration]]></category>

		<guid isPermaLink="false">http://chaoticity.com/online-english-to-urdu-translator/</guid>
		<description><![CDATA[While all the online English to Urdu translators that I have seen don’t really work that well (read suck), if we make use the overlapping vocabulary and grammar of Hindi and Urdu along with using Google’s translation API, things come out pretty decent (as mentioned in my previous post). Here’s a small 15 min first [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fonline-english-to-urdu-translator%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="Online English to Urdu Translator" data-url="http://chaoticity.com/online-english-to-urdu-translator/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p>While all the online English to Urdu translators that I have seen don’t really work that well (read suck), if we make use the overlapping vocabulary and grammar of Hindi and Urdu along with using Google’s translation API, things come out pretty decent (<a href="http://chaoticity.com/how-do-you-transliterate-that/" target="_blank">as mentioned in my previous post</a>). Here’s a small 15 min first cut script which just uses English to Hindi translation and then transliterates from Hindi to Urdu. Feel free to use the code and do ping me if you improve something. This works as a Hindi to Urdu transliterator as well.</p>
<p><script src="http://www.google.com/jsapi" type="text/javascript"></script><script type="text/javascript">google.load("language", "1");var conv=[];
	conv['ऀ']='';//'ऀ';
	conv['ँ']='ن'; 
	conv['ं']='ن';
	conv['ः']='ہ';
	conv['ऄ']='';//'ऄ';
	conv['अ']='اَ';
	conv['आ']='آ';
	conv['इ']='اِ';
	conv['ई']='اِی';
	conv['उ']='اُ';
	conv['ऊ']='اُو';
	conv['ऋ']='';//'ऋ';
	conv['ऌ']='';//'ऌ';
	conv['ऍ']='ای';
	conv['ऎ']='ऎ';
	conv['ए']='';//'ِ';
	conv['ऐ']='ائے';
	conv['ऑ']='';//'ऑ';
	conv['ऒ']='ؤ';
	conv['ओ']='او';
	conv['औ']='اؤ';
	conv['क']='ک';
	conv['ख']='کھ';
	conv['ग']='گ';
	conv['घ']='گھ';
	conv['ङ']='ن';
	conv['च']='چ';
	conv['छ']='چھ';
	conv['ज']='ج';
	conv['झ']='جھ';
	conv['ञ']='ن';
	conv['ट']='ٹ';
	conv['ठ']='ٹھ';
	conv['ड']='ڈ';
	conv['ढ']='ڈھ';
	conv['ण']='ن';
	conv['त']='ت';
	conv['थ']='تھ';
	conv['द']='د';
	conv['ध']='دھ';
	conv['न']='ن';
	conv['ऩ']='';//'ऩ';
	conv['प']='پ';
	conv['फ']='پھ';
	conv['ब']='ب';
	conv['भ']='بھ';
	conv['म']='م';
	//conv['य']='ے';
	conv['य']='ی';
	conv['र']='ر';
	conv['ऱ']='ऱ';
	conv['ल']='ل';
	conv['ळ']='';//ळ';
	conv['ऴ']='';//'ऴ';
	conv['व']='و';
	conv['श']='ش';
	conv['ष']='ش';
	conv['स']='س';
	conv['ह']='ہ';
	conv['ऺ']='';//'ऺ';
	conv['ऻ']='';//'ऻ';
	conv['़']='';//'़';
	conv['ऽ']='';//'ऽ';
	conv['ा']='ا';
	conv['ि']='ِ';
	conv['ी']='ی';
	conv['ु']='ُ';
	conv['ू']='وُ';
	conv['ृ']='ر';
	conv['ॄ']='';//'ॄ';
	conv['ॅ']='ی';
	conv['ॆ']='ء';
	conv['ै']='ی';
	//conv['े']='ے';
	conv['े']='ی';
	conv['ॉ']=''//'ا';
	conv['ॊ']='';//'ॊ';
	conv['ो']='و';
	conv['ौ']='و';
	conv['्']='';
	conv['ॎ']='';//'ॎ';
	conv['ॏ']='';//'ॏ';
	conv['ॐ']='';//'ॐ';
	conv['॑']='॑';
	conv['॒']='॒';
	conv['॓']='॓';
	conv['॔']='॔';
	conv['ॕ']='';//'ॕ';
	conv['ॖ']='';//'ॖ';
	conv['ॗ']='';//'ॗ';
	conv['क़']='ق';
	conv['ख़']='خ';
	conv['ग़']='غ';
	conv['ज़']='ز';
	conv['ड़']='ڑ';
	conv['ढ़']='ڑھ';
	conv['फ़']='ف';
	conv['य़']='';//य़';
	conv['ॠ']='';//'ॠ';
	conv['ॡ']='';//'ॡ';
	conv['ॢ']='';//'ॢ';
	conv['ॣ']='';//'ॣ';
	conv['।']='۔';
	conv['॥']='';//'॥';
	conv['0']='۰';
	conv['1']='۱';
	conv['2']='۲';
	conv['3']='۳';
	conv['4']='۴';
	conv['5']='۵';
	conv['6']='۶';
	conv['7']='۷';
	conv['8']='۸';
	conv['9']='۹';
	conv['॰']='॰'
	conv['ॱ']='';//'ॱ';
	conv['ॲ']='';//'ॲ';
	conv['ॳ']='';//'ॳ';
	conv['ॴ']='';//'ॴ';
	conv['ॵ']='';//'ॵ';
	conv['ॶ']='';//'ॶ';
	conv['ॷ']='';//'ॷ';
	conv['ॸ']='';//'ॸ';
	conv['ॹ']='';//'ॹ';
	conv['ॺ']='';//'ॺ';
	conv['ॻ']='';//'ॻ';
	conv['ॼ']='';//'ॼ';
	conv['ॽ']='';//'ॽ';
	conv['ॾ']='';//'ॾ';
	conv['ॿ']='';//'ॿ';
	conv['?']='؟';
	conv['.']='۔';
	function xliterate() {
		src = document.getElementById('src').value;	
		google.language.translate(src, "en", "hi", function(result) {
				if (!result.error) {
					mid = result.translation;
					dest='';
					for(i=0; i<mid.length;i++) {
						dest=dest+ ((conv[mid[i]]!=undefined) ? conv[mid[i]] : mid[i]);
					}
					document.getElementById('dest').value=dest;
				} else {
					alert(result.error);
				}
			});			
	}
</script><br />
<textarea id="src" style="font-family: arial unicode;" cols="80" rows="5">How are you?</textarea><br />
<textarea id="dest" style="direction: rtl; font-family: arial unicode;"  cols="80" rows="5">آپ کیسی ہیں؟</textarea></p>
<input id="tx" onclick="xliterate(); return false;" type="button" value=" Translate "/>
<p>(Thanks to عزت مآب جناب آغا علی رضا قزلباش رحمتہ اللہ علیہ who graciously sent me his term report on Hindi to Urdu transliteration, from where I’ve copied (and modified) the character mapping.)</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/online-english-to-urdu-translator/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How do you transliterate that?</title>
		<link>http://chaoticity.com/how-do-you-transliterate-that/</link>
		<comments>http://chaoticity.com/how-do-you-transliterate-that/#comments</comments>
		<pubDate>Thu, 21 Jan 2010 12:18:27 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://chaoticity.com/how-do-you-transliterate-that/</guid>
		<description><![CDATA[I am thinking of using google’s English to Hindi translation and hooking it to a Hindi to Urdu transliterator to get an approximate English to Urdu translation. The Hindi to English transliteration provided by google has some errors which might not be there if we convert directly to Urdu. For example, on translating the sentence [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fhow-do-you-transliterate-that%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="How do you transliterate that?" data-url="http://chaoticity.com/how-do-you-transliterate-that/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p> I am thinking of using google’s English to Hindi translation and hooking it to a Hindi to Urdu transliterator to get an approximate English to Urdu translation. The Hindi to English transliteration provided by google has some errors which might not be there if we convert directly to Urdu. For example, on translating the sentence </p>
<p><strong>It can be used in Urdu too</strong>, <a href="http://chaoticity.com/images/image6.png"><img title="image" style="border-right: 0px; border-top: 0px; display: inline; margin-left: 0px; border-left: 0px; margin-right: 0px; border-bottom: 0px" height="142" alt="image" src="http://chaoticity.com/images/image_thumb6.png" width="240" align="right" border="0" /></a> </p>
<p>we get the Hindi translation</p>
<p><strong>यह उर्दू में इस्तेमाल किया जा सकता है</strong> </p>
<p>and the Roman transliteration of the Hindi translation</p>
<p>&#160;<em><strong>yaha urdū mēṁ istēmāla kiyā jā sakatā hai</strong></em>. </p>
<p>If you notice the first word, it should have been transliterated to “yeh”. Instead, we get a phonetic transliteration which is made up of two letters <em>ya</em> and <em>ha. </em>Transliteration from Hindi to Urdu directly would have avoided that error. There’s a nice paper titled “<a href="http://www.crulp.org/clt09/download/Papers/Paper4.pdf" target="_blank">Hindi to Urdu Conversion: Beyond Simple Transliteration</a>”&#160; which lists problems faced in simple character-to-character transliteration fromm Hindi to Urdu. Whenever I get some time, I’ll try to cook some javascript code quickly. Until then, the idea is open. Any takers?</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/how-do-you-transliterate-that/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Custom Resolution in Remote Desktop</title>
		<link>http://chaoticity.com/custom-resolution-in-remote-desktop/</link>
		<comments>http://chaoticity.com/custom-resolution-in-remote-desktop/#comments</comments>
		<pubDate>Tue, 05 Jan 2010 15:06:49 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[chaos]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://chaoticity.com/custom-resolution-in-remote-desktop/</guid>
		<description><![CDATA[I have a 1920&#215;1080 desktop at work but when I use remote desktop to connect to home, it automatically resizes to my compact 1024*768 desktop. Most programs don’t seem to have a problem but I was working on Weka KnowledgeFlow and one of my flows, originally designed on the higher resolution, never showed a horizontal [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fcustom-resolution-in-remote-desktop%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="Custom Resolution in Remote Desktop" data-url="http://chaoticity.com/custom-resolution-in-remote-desktop/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p><a href="http://chaoticity.com/images/horimon.jpg"><img title="horimon" style="border-right: 0px; border-top: 0px; display: inline; margin: 0px 10px 0px 0px; border-left: 0px; border-bottom: 0px" height="260" alt="horimon" src="http://chaoticity.com/images/horimon_thumb.jpg" width="200" align="left" border="0" /></a>I have a 1920&#215;1080 desktop at work but when I use remote desktop to connect to home, it automatically resizes to my compact 1024*768 desktop. Most programs don’t seem to have a problem but I was working on Weka KnowledgeFlow and one of my flows, originally designed on the higher resolution, never showed a horizontal scroll. It might just be a java thing. In short, I had to look for a method to remote using a higher resolution than that of the local machine. Luckily, you can specify a custom resolution for the RDC using a command line switch ( m<a title="http://www.microsoft.com/resources/documentation/windows/xp/all/proddocs/en-us/ts_cmd_mstsc.mspx?mfr=true" href="http://www.microsoft.com/resources/documentation/windows/xp/all/proddocs/en-us/ts_cmd_mstsc.mspx?mfr=true">ore here</a> ). The command line below gave be enough space to fix the flow. I hope this helps someone out there. </p>
<blockquote><p><strong>mstsc /w:1280 /h:1024</strong></p>
</blockquote>
<p>The picture above is my office machine when i was trying a horizontal flip. It works when you have many consoles open but the bottom part gets for browsing/coding, it’s not that great.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/custom-resolution-in-remote-desktop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What do you tweet about? : A shell script for getting most frequent words for twitter</title>
		<link>http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/</link>
		<comments>http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/#comments</comments>
		<pubDate>Fri, 18 Dec 2009 20:29:51 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[chaos]]></category>
		<category><![CDATA[Language]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/</guid>
		<description><![CDATA[There are a lot of web apps around which report your twitter stats. But at times, it&#8217;s better to do things yourself. I haven’t done any fun coding for ages now so last night, I finally got around to making a small program to gather twitter word statistics. The fun part was to do everything [...]]]></description>
			<content:encoded><![CDATA[<div style="height:33px;" class="really_simple_share robots-nocontent snap_nopreview"><div class="really_simple_share_facebook_like" style="width:100px;"><iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fchaoticity.com%2Fwhat-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter%2F&amp;layout=button_count&amp;show_faces=false&amp;width=100&amp;action=like&amp;colorscheme=light&amp;send=false&amp;height=27" 
						scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:100px; height:27px;" allowTransparency="true"></iframe></div><div class="really_simple_share_twitter" style="width:110px;"><a href="http://twitter.com/share" class="twitter-share-button" data-count="horizontal" 
						data-text="What do you tweet about? : A shell script for getting most frequent words for twitter" data-url="http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/" 
						data-via=""  ></a></div></div>
		<div style="clear:both;"></div><p>There are a lot of web apps around which report your twitter stats. But at times, it&#8217;s better to do things yourself. I haven’t done any fun coding for ages now so last night, I finally got around to making a small program to gather twitter word statistics. The fun part was to do everything using&#160; unix tools.&#160; <a href="http://chaoticity.com/software/tword.zip" target="_blank">Here’s a small script file</a> which displays the 10 most used words in the tweets for any twitter id.&#160; I have only tested it under cygwin so this is probably the best place to say “USE AT YOUR OWN RISK”. </p>
<p>Here’s how it works.</p>
<ol>
<li>downloads all status information in a directory </li>
<li>extracts the status message lines</li>
<li>does some regex magic and filters <a href="http://en.wikipedia.org/wiki/Stop_words" target="_blank">stop words</a> like the, a, an etc. ( haven’t&#160; seen this done earlier anywhere but the <strong>join </strong>command comes in handy for processing stopwords)</li>
<li>displays the top 10 most frequent words (and emoticons) </li>
</ol>
<p>Twitter assigns a limit to the number of messages that you can download (3200). Also, the twitter id timeline has to be public for this script to work. All you need to do is <a href="http://chaoticity.com/software/tword.zip" target="_blank">download the script file and stop word list</a>, keep them in the same directory, run it with the twitter id in the command line and you’ll get the list of words with the frequency at the start of each line. For example,</p>
<blockquote><p>$ ./tword.sh barackobama      <br />161 watch       <br />119 live       <br />92 http://mybarackobamacom/livestream       <br />81 health       <br />63 reform       <br />55 today       <br />52 rally       <br />48 #hc09       <br />47 &amp;amp;       <br />38 vote</p>
</blockquote>
<p>The script takes time to complete so be patient. As you may have noticed, there are still html tags inside. You can remove them by piping in any html2text program. There’s a small perl script in the zipfile which does this processing. The output now brings in a new word “change”. You will, however, need to pipe this in the script after installing <a href="http://search.cpan.org/~gaas/HTML-Parser-3.64/lib/HTML/Entities.pm" target="_blank">HTML::Entities</a> though CPAN. </p>
<blockquote><p>$ ./tword.sh barackobama      <br />161 watch       <br />119 live       <br />92 http://mybarackobamacom/livestream       <br />83 health       <br />68 change       <br />63 reform       <br />55 today       <br />55 rally       <br />48 #hc09       <br />39 vote</p>
</blockquote>
<p>My list toppers as <strong>good, :D, time, day, twitter, read, hope, back, :p</strong> and <strong>make. </strong>I wonder if this makes me a happy person :)</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

