<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Chaoticity &#187; Software</title>
	<atom:link href="http://chaoticity.com/category/software/feed/" rel="self" type="application/rss+xml" />
	<link>http://chaoticity.com</link>
	<description>a state of things in which chance is supreme</description>
	<lastBuildDate>Tue, 06 Jul 2010 20:13:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Making a copy of WEKA Instances</title>
		<link>http://chaoticity.com/making-a-copy-of-weka-instances/</link>
		<comments>http://chaoticity.com/making-a-copy-of-weka-instances/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 21:54:09 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[attribute]]></category>
		<category><![CDATA[classification]]></category>
		<category><![CDATA[dataset]]></category>
		<category><![CDATA[instances]]></category>
		<category><![CDATA[string]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[weka]]></category>

		<guid isPermaLink="false">http://chaoticity.com/making-a-copy-of-weka-instances/</guid>
		<description><![CDATA[This ‘thing’ took about 30 minutes to figure out. According to the WEKA documentation, if  you add a new Instance to an existing Instances object, String values are not transferred ! In case you are working on copying a dataset with a string attribute, you need to transfer the string manually. The code segment below [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://chaoticity.com/images/image11.png"><img style="display: inline; margin-left: 0px; margin-right: 0px; border-width: 0px;" title="image" src="http://chaoticity.com/images/image_thumb11.png" border="0" alt="image" width="128" height="149" align="right" /></a>This ‘thing’ took about 30 minutes to figure out. According to the <a href="http://weka.sourceforge.net/doc/weka/core/Instances.html#add(weka.core.Instance)">WEKA documentation</a>, if  you add a new Instance to an existing Instances object, <strong>String values are not transferred</strong> <strong>! </strong>In case you are working on copying a dataset with a string attribute, you need to transfer the string manually. The code segment below copies the i^th instance from source to dest where the first attribute (at index 0) is a string attribute.</p>
<div class="codecolorer-container text default" style="overflow:auto;white-space:nowrap;border:1px solid #9F9F9F;width:435px;"><table cellspacing="0" cellpadding="0"><tbody><tr><td style="padding:5px;text-align:center;color:#888888;background-color:#EEEEEE;border-right: 1px solid #9F9F9F;font: normal 12px/1.4em Monaco, Lucida Console, monospace;"><div>1<br />2<br />3<br /></div></td><td><div class="text codecolorer" style="padding:5px;font:normal 12px/1.4em Monaco, Lucida Console, monospace;white-space:nowrap">dest.add(source.instance(i));<br />
dest.instance(dest.numInstances()-1)<br />
&nbsp; &nbsp; .setValue(0,source.instance(i).toString(0));</div></td></tr></tbody></table></div>
<p>This should come in handy for text classification using WEKA (and hopefully save your time).</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/making-a-copy-of-weka-instances/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google and Urdu Stemming</title>
		<link>http://chaoticity.com/google-and-urdu-stemming/</link>
		<comments>http://chaoticity.com/google-and-urdu-stemming/#comments</comments>
		<pubDate>Fri, 05 Mar 2010 02:30:25 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Urdu]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[stemming]]></category>

		<guid isPermaLink="false">http://chaoticity.com/google-and-urdu-stemming/</guid>
		<description><![CDATA[&#160; Is google (finally) stemming Urdu? The last time I checked, there were doing something like a transliteration based search but in the screenshot below, you can see that searching for the phrase ان پڑھ چٹا shows some stemming is being used. Does anyone know anything?&#160; Oh, and while I’m on this topic, I would [...]]]></description>
			<content:encoded><![CDATA[<p>&#160;</p>
<p>Is google (finally) stemming Urdu? <a href="http://scalar.wordpress.com/2008/06/02/stemming-in-urdu-and-google/" target="_blank">The last time I checked</a>, there were doing something like a transliteration based search but in the screenshot below, you can see that <a href="http://www.google.co.uk/search?q=+%D8%A7%D9%86+%D9%BE%DA%91%DA%BE+%DA%86%D9%B9%D8%A7" target="_blank">searching for the phrase ان پڑھ چٹا</a> shows some stemming is being used. Does anyone know anything?&#160; Oh, and while I’m on this topic, I would also like to know why is it called چٹا ان پڑھ ?</p>
<p><a href="http://chaoticity.com/images/image10.png"><img title="image" style="border-right: 0px; border-top: 0px; display: inline; border-left: 0px; border-bottom: 0px" height="770" alt="image" src="http://chaoticity.com/images/image_thumb10.png" width="514" border="0" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/google-and-urdu-stemming/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Visualizing Citation Networks</title>
		<link>http://chaoticity.com/visualizing-citation-networks/</link>
		<comments>http://chaoticity.com/visualizing-citation-networks/#comments</comments>
		<pubDate>Thu, 04 Feb 2010 04:59:07 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Art]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[chaos]]></category>
		<category><![CDATA[ACL]]></category>
		<category><![CDATA[Anthology]]></category>
		<category><![CDATA[Citation]]></category>
		<category><![CDATA[GraphViz]]></category>
		<category><![CDATA[Network]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://chaoticity.com/visualizing-citation-networks/</guid>
		<description><![CDATA[For techies: I’ve been working on citation networks lately. You can visualize such a network as a graph. In this graph, the nodes represent publications (papers,articles etc) and the edges represent citations between them. The graph above was produced using the GraphViz. The data is from the ACL Anthology Network which contains publications from the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://chaoticity.com/images/aclnet.jpg"><img title="aclnet" style="border-right: 0px; border-top: 0px; display: inline; border-left: 0px; border-bottom: 0px" height="500" alt="aclnet" src="http://chaoticity.com/images/aclnet_thumb.jpg" width="545" border="0" /></a> </p>
<p><strong>For techies: </strong>I’ve been working on citation networks lately. You can visualize such a network as a graph. In this graph, the nodes represent publications (papers,articles etc) and the edges represent citations between them. The graph above was produced using the <a href="http://www.graphviz.org/" target="_blank">GraphViz</a>. The data is from the <a href="http://clair.si.umich.edu/clair/anthology/" target="_blank">ACL Anthology Network</a> which contains publications from the publicly available <a href="http://aclweb.org/" target="_blank">ACL Anthology</a>. </p>
<p><strong>For non-techies: </strong>Oooooo! pretty picture!</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/visualizing-citation-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online English to Urdu Translator</title>
		<link>http://chaoticity.com/online-english-to-urdu-translator/</link>
		<comments>http://chaoticity.com/online-english-to-urdu-translator/#comments</comments>
		<pubDate>Sat, 23 Jan 2010 19:39:33 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Urdu]]></category>
		<category><![CDATA[english]]></category>
		<category><![CDATA[hindi]]></category>
		<category><![CDATA[translation]]></category>
		<category><![CDATA[transliteration]]></category>

		<guid isPermaLink="false">http://chaoticity.com/online-english-to-urdu-translator/</guid>
		<description><![CDATA[While all the online English to Urdu translators that I have seen don’t really work that well (read suck), if we make use the overlapping vocabulary and grammar of Hindi and Urdu along with using Google’s translation API, things come out pretty decent (as mentioned in my previous post). Here’s a small 15 min first [...]]]></description>
			<content:encoded><![CDATA[<p>While all the online English to Urdu translators that I have seen don’t really work that well (read suck), if we make use the overlapping vocabulary and grammar of Hindi and Urdu along with using Google’s translation API, things come out pretty decent (<a href="http://chaoticity.com/how-do-you-transliterate-that/" target="_blank">as mentioned in my previous post</a>). Here’s a small 15 min first cut script which just uses English to Hindi translation and then transliterates from Hindi to Urdu. Feel free to use the code and do ping me if you improve something. This works as a Hindi to Urdu transliterator as well.</p>
<p><script src="http://www.google.com/jsapi" type="text/javascript"></script><script type="text/javascript">google.load("language", "1");var conv=[];
	conv['ऀ']='';//'ऀ';
	conv['ँ']='ن'; 
	conv['ं']='ن';
	conv['ः']='ہ';
	conv['ऄ']='';//'ऄ';
	conv['अ']='اَ';
	conv['आ']='آ';
	conv['इ']='اِ';
	conv['ई']='اِی';
	conv['उ']='اُ';
	conv['ऊ']='اُو';
	conv['ऋ']='';//'ऋ';
	conv['ऌ']='';//'ऌ';
	conv['ऍ']='ای';
	conv['ऎ']='ऎ';
	conv['ए']='';//'ِ';
	conv['ऐ']='ائے';
	conv['ऑ']='';//'ऑ';
	conv['ऒ']='ؤ';
	conv['ओ']='او';
	conv['औ']='اؤ';
	conv['क']='ک';
	conv['ख']='کھ';
	conv['ग']='گ';
	conv['घ']='گھ';
	conv['ङ']='ن';
	conv['च']='چ';
	conv['छ']='چھ';
	conv['ज']='ج';
	conv['झ']='جھ';
	conv['ञ']='ن';
	conv['ट']='ٹ';
	conv['ठ']='ٹھ';
	conv['ड']='ڈ';
	conv['ढ']='ڈھ';
	conv['ण']='ن';
	conv['त']='ت';
	conv['थ']='تھ';
	conv['द']='د';
	conv['ध']='دھ';
	conv['न']='ن';
	conv['ऩ']='';//'ऩ';
	conv['प']='پ';
	conv['फ']='پھ';
	conv['ब']='ب';
	conv['भ']='بھ';
	conv['म']='م';
	//conv['य']='ے';
	conv['य']='ی';
	conv['र']='ر';
	conv['ऱ']='ऱ';
	conv['ल']='ل';
	conv['ळ']='';//ळ';
	conv['ऴ']='';//'ऴ';
	conv['व']='و';
	conv['श']='ش';
	conv['ष']='ش';
	conv['स']='س';
	conv['ह']='ہ';
	conv['ऺ']='';//'ऺ';
	conv['ऻ']='';//'ऻ';
	conv['़']='';//'़';
	conv['ऽ']='';//'ऽ';
	conv['ा']='ا';
	conv['ि']='ِ';
	conv['ी']='ی';
	conv['ु']='ُ';
	conv['ू']='وُ';
	conv['ृ']='ر';
	conv['ॄ']='';//'ॄ';
	conv['ॅ']='ی';
	conv['ॆ']='ء';
	conv['ै']='ی';
	//conv['े']='ے';
	conv['े']='ی';
	conv['ॉ']=''//'ا';
	conv['ॊ']='';//'ॊ';
	conv['ो']='و';
	conv['ौ']='و';
	conv['्']='';
	conv['ॎ']='';//'ॎ';
	conv['ॏ']='';//'ॏ';
	conv['ॐ']='';//'ॐ';
	conv['॑']='॑';
	conv['॒']='॒';
	conv['॓']='॓';
	conv['॔']='॔';
	conv['ॕ']='';//'ॕ';
	conv['ॖ']='';//'ॖ';
	conv['ॗ']='';//'ॗ';
	conv['क़']='ق';
	conv['ख़']='خ';
	conv['ग़']='غ';
	conv['ज़']='ز';
	conv['ड़']='ڑ';
	conv['ढ़']='ڑھ';
	conv['फ़']='ف';
	conv['य़']='';//य़';
	conv['ॠ']='';//'ॠ';
	conv['ॡ']='';//'ॡ';
	conv['ॢ']='';//'ॢ';
	conv['ॣ']='';//'ॣ';
	conv['।']='۔';
	conv['॥']='';//'॥';
	conv['0']='۰';
	conv['1']='۱';
	conv['2']='۲';
	conv['3']='۳';
	conv['4']='۴';
	conv['5']='۵';
	conv['6']='۶';
	conv['7']='۷';
	conv['8']='۸';
	conv['9']='۹';
	conv['॰']='॰'
	conv['ॱ']='';//'ॱ';
	conv['ॲ']='';//'ॲ';
	conv['ॳ']='';//'ॳ';
	conv['ॴ']='';//'ॴ';
	conv['ॵ']='';//'ॵ';
	conv['ॶ']='';//'ॶ';
	conv['ॷ']='';//'ॷ';
	conv['ॸ']='';//'ॸ';
	conv['ॹ']='';//'ॹ';
	conv['ॺ']='';//'ॺ';
	conv['ॻ']='';//'ॻ';
	conv['ॼ']='';//'ॼ';
	conv['ॽ']='';//'ॽ';
	conv['ॾ']='';//'ॾ';
	conv['ॿ']='';//'ॿ';
	conv['?']='؟';
	conv['.']='۔';
	function xliterate() {
		src = document.getElementById('src').value;	
		google.language.translate(src, "en", "hi", function(result) {
				if (!result.error) {
					mid = result.translation;
					dest='';
					for(i=0; i<mid.length;i++) {
						dest=dest+ ((conv[mid[i]]!=undefined) ? conv[mid[i]] : mid[i]);
					}
					document.getElementById('dest').value=dest;
				} else {
					alert(result.error);
				}
			});			
	}
</script><br />
<textarea id="src" style="font-family: arial unicode;" cols="80" rows="5">How are you?</textarea><br />
<textarea id="dest" style="direction: rtl; font-family: arial unicode;"  cols="80" rows="5">آپ کیسی ہیں؟</textarea></p>
<input id="tx" onclick="xliterate(); return false;" type="button" value=" Translate "/>
<p>(Thanks to عزت مآب جناب آغا علی رضا قزلباش رحمتہ اللہ علیہ who graciously sent me his term report on Hindi to Urdu transliteration, from where I’ve copied (and modified) the character mapping.)</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/online-english-to-urdu-translator/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How do you transliterate that?</title>
		<link>http://chaoticity.com/how-do-you-transliterate-that/</link>
		<comments>http://chaoticity.com/how-do-you-transliterate-that/#comments</comments>
		<pubDate>Thu, 21 Jan 2010 12:18:27 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://chaoticity.com/how-do-you-transliterate-that/</guid>
		<description><![CDATA[I am thinking of using google’s English to Hindi translation and hooking it to a Hindi to Urdu transliterator to get an approximate English to Urdu translation. The Hindi to English transliteration provided by google has some errors which might not be there if we convert directly to Urdu. For example, on translating the sentence [...]]]></description>
			<content:encoded><![CDATA[<p> I am thinking of using google’s English to Hindi translation and hooking it to a Hindi to Urdu transliterator to get an approximate English to Urdu translation. The Hindi to English transliteration provided by google has some errors which might not be there if we convert directly to Urdu. For example, on translating the sentence </p>
<p><strong>It can be used in Urdu too</strong>, <a href="http://chaoticity.com/images/image6.png"><img title="image" style="border-right: 0px; border-top: 0px; display: inline; margin-left: 0px; border-left: 0px; margin-right: 0px; border-bottom: 0px" height="142" alt="image" src="http://chaoticity.com/images/image_thumb6.png" width="240" align="right" border="0" /></a> </p>
<p>we get the Hindi translation</p>
<p><strong>यह उर्दू में इस्तेमाल किया जा सकता है</strong> </p>
<p>and the Roman transliteration of the Hindi translation</p>
<p>&#160;<em><strong>yaha urdū mēṁ istēmāla kiyā jā sakatā hai</strong></em>. </p>
<p>If you notice the first word, it should have been transliterated to “yeh”. Instead, we get a phonetic transliteration which is made up of two letters <em>ya</em> and <em>ha. </em>Transliteration from Hindi to Urdu directly would have avoided that error. There’s a nice paper titled “<a href="http://www.crulp.org/clt09/download/Papers/Paper4.pdf" target="_blank">Hindi to Urdu Conversion: Beyond Simple Transliteration</a>”&#160; which lists problems faced in simple character-to-character transliteration fromm Hindi to Urdu. Whenever I get some time, I’ll try to cook some javascript code quickly. Until then, the idea is open. Any takers?</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/how-do-you-transliterate-that/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Custom Resolution in Remote Desktop</title>
		<link>http://chaoticity.com/custom-resolution-in-remote-desktop/</link>
		<comments>http://chaoticity.com/custom-resolution-in-remote-desktop/#comments</comments>
		<pubDate>Tue, 05 Jan 2010 15:06:49 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[chaos]]></category>

		<guid isPermaLink="false">http://chaoticity.com/custom-resolution-in-remote-desktop/</guid>
		<description><![CDATA[I have a 1920&#215;1080 desktop at work but when I use remote desktop to connect to home, it automatically resizes to my compact 1024*768 desktop. Most programs don’t seem to have a problem but I was working on Weka KnowledgeFlow and one of my flows, originally designed on the higher resolution, never showed a horizontal [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://chaoticity.com/images/horimon.jpg"><img title="horimon" style="border-right: 0px; border-top: 0px; display: inline; margin: 0px 10px 0px 0px; border-left: 0px; border-bottom: 0px" height="260" alt="horimon" src="http://chaoticity.com/images/horimon_thumb.jpg" width="200" align="left" border="0" /></a>I have a 1920&#215;1080 desktop at work but when I use remote desktop to connect to home, it automatically resizes to my compact 1024*768 desktop. Most programs don’t seem to have a problem but I was working on Weka KnowledgeFlow and one of my flows, originally designed on the higher resolution, never showed a horizontal scroll. It might just be a java thing. In short, I had to look for a method to remote using a higher resolution than that of the local machine. Luckily, you can specify a custom resolution for the RDC using a command line switch ( m<a title="http://www.microsoft.com/resources/documentation/windows/xp/all/proddocs/en-us/ts_cmd_mstsc.mspx?mfr=true" href="http://www.microsoft.com/resources/documentation/windows/xp/all/proddocs/en-us/ts_cmd_mstsc.mspx?mfr=true">ore here</a> ). The command line below gave be enough space to fix the flow. I hope this helps someone out there. </p>
<blockquote><p><strong>mstsc /w:1280 /h:1024</strong></p>
</blockquote>
<p>The picture above is my office machine when i was trying a horizontal flip. It works when you have many consoles open but the bottom part gets for browsing/coding, it’s not that great.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/custom-resolution-in-remote-desktop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What do you tweet about? : A shell script for getting most frequent words for twitter</title>
		<link>http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/</link>
		<comments>http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/#comments</comments>
		<pubDate>Fri, 18 Dec 2009 20:29:51 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Language]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[chaos]]></category>

		<guid isPermaLink="false">http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/</guid>
		<description><![CDATA[There are a lot of web apps around which report your twitter stats. But at times, it&#8217;s better to do things yourself. I haven’t done any fun coding for ages now so last night, I finally got around to making a small program to gather twitter word statistics. The fun part was to do everything [...]]]></description>
			<content:encoded><![CDATA[<p>There are a lot of web apps around which report your twitter stats. But at times, it&#8217;s better to do things yourself. I haven’t done any fun coding for ages now so last night, I finally got around to making a small program to gather twitter word statistics. The fun part was to do everything using&#160; unix tools.&#160; <a href="http://chaoticity.com/software/tword.zip" target="_blank">Here’s a small script file</a> which displays the 10 most used words in the tweets for any twitter id.&#160; I have only tested it under cygwin so this is probably the best place to say “USE AT YOUR OWN RISK”. </p>
<p>Here’s how it works.</p>
<ol>
<li>downloads all status information in a directory </li>
<li>extracts the status message lines</li>
<li>does some regex magic and filters <a href="http://en.wikipedia.org/wiki/Stop_words" target="_blank">stop words</a> like the, a, an etc. ( haven’t&#160; seen this done earlier anywhere but the <strong>join </strong>command comes in handy for processing stopwords)</li>
<li>displays the top 10 most frequent words (and emoticons) </li>
</ol>
<p>Twitter assigns a limit to the number of messages that you can download (3200). Also, the twitter id timeline has to be public for this script to work. All you need to do is <a href="http://chaoticity.com/software/tword.zip" target="_blank">download the script file and stop word list</a>, keep them in the same directory, run it with the twitter id in the command line and you’ll get the list of words with the frequency at the start of each line. For example,</p>
<blockquote><p>$ ./tword.sh barackobama      <br />161 watch       <br />119 live       <br />92 http://mybarackobamacom/livestream       <br />81 health       <br />63 reform       <br />55 today       <br />52 rally       <br />48 #hc09       <br />47 &amp;amp;       <br />38 vote</p>
</blockquote>
<p>The script takes time to complete so be patient. As you may have noticed, there are still html tags inside. You can remove them by piping in any html2text program. There’s a small perl script in the zipfile which does this processing. The output now brings in a new word “change”. You will, however, need to pipe this in the script after installing <a href="http://search.cpan.org/~gaas/HTML-Parser-3.64/lib/HTML/Entities.pm" target="_blank">HTML::Entities</a> though CPAN. </p>
<blockquote><p>$ ./tword.sh barackobama      <br />161 watch       <br />119 live       <br />92 http://mybarackobamacom/livestream       <br />83 health       <br />68 change       <br />63 reform       <br />55 today       <br />55 rally       <br />48 #hc09       <br />39 vote</p>
</blockquote>
<p>My list toppers as <strong>good, <img src='http://chaoticity.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> , time, day, twitter, read, hope, back, :p</strong> and <strong>make. </strong>I wonder if this makes me a happy person <img src='http://chaoticity.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/what-do-you-tweet-about-a-shell-script-for-getting-most-frequent-words-for-twitter/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>JabRef and Google Scholar</title>
		<link>http://chaoticity.com/jabref-and-google-scholar/</link>
		<comments>http://chaoticity.com/jabref-and-google-scholar/#comments</comments>
		<pubDate>Sat, 14 Nov 2009 16:57:18 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://chaoticity.com/jabref-and-google-scholar/</guid>
		<description><![CDATA[I can’t seem to find any way to import the bib entries provided by google scholar to JabRef directly. You can enable the Import into BibTex link from the preferences but it streams the bib file as text/plain which opens up in the browser. You can save it and import it but that wastes a [...]]]></description>
			<content:encoded><![CDATA[<p>I can’t seem to find any way to import the bib entries provided by google scholar to JabRef directly. You can enable the <strong>Import into BibTex</strong> link from the <a href="http://scholar.google.co.uk/scholar_preferences?hl=en" target="_blank">preferences</a> but it streams the bib file as text/plain which opens up in the browser. You can save it and import it but that wastes a lot of clicks. The easiest option is to copy-paste all the text into a new JabRef entry (Ctrl+N). The default settings leave the double curly braces in the title (to preserve case) which can be removed by enabling the <strong>Remove double braces… </strong>checkbox in the <strong>File</strong> tab of Options/Preferences. This works for JabRef 2.5.</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/jabref-and-google-scholar/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Are you interested in using computers in the classrooms?</title>
		<link>http://chaoticity.com/are-you-interested-in-using-computers-in-the-classrooms/</link>
		<comments>http://chaoticity.com/are-you-interested-in-using-computers-in-the-classrooms/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 16:02:44 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Education]]></category>
		<category><![CDATA[OpenWare]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Classroom]]></category>
		<category><![CDATA[Computer]]></category>
		<category><![CDATA[Moodle]]></category>
		<category><![CDATA[OpenMark]]></category>

		<guid isPermaLink="false">http://chaoticity.com/are-you-interested-in-using-computers-in-the-classrooms/</guid>
		<description><![CDATA[&#160; A friend of mine is carrying out research in classroom based e-assessment in developing countries such as Pakistan. The aim of the research is to assist primary school teachers with computer software that: · Is aligned with the particular subject curriculum they follow in their schools. · Provides pupils with challenges and interactive short [...]]]></description>
			<content:encoded><![CDATA[<p>&#160;</p>
<p><a href="http://chaoticity.com/images/MPj043935900001.jpg"><img title="MPj04393590000[1]" style="border-right: 0px; border-top: 0px; display: inline; margin: 0px 10px 0px 0px; border-left: 0px; border-bottom: 0px" height="244" alt="MPj04393590000[1]" src="http://chaoticity.com/images/MPj043935900001_thumb.jpg" width="244" align="left" border="0" /></a> A friend of mine is carrying out research in classroom based e-assessment in developing countries such as Pakistan. The aim of the research is to assist primary school teachers with computer software that:</p>
<p>· Is aligned with the particular subject curriculum they follow in their schools.</p>
<p>· Provides pupils with challenges and interactive short quizzes and tests to take after completing a topic taught by the teacher in the classroom. </p>
<p>· Provides students with immediate and diagnostic feedback on their performance on each challenge or test they attempt. </p>
<p>· Helps teachers in identifying the individual pupils needing help in certain conceptual areas of the curriculum, in managing the overall classroom portfolio, and in assuring better teaching and learning within the socio-cultural context of their educational system.</p>
<blockquote><p align="center"><strong>You can help in two ways!</strong></p>
</blockquote>
<p>1. The project is still in the stage where available literature on formative e-assessment is being critically reviewed and any bright ideas relevant to the topic are welcome. </p>
<p>2. One very important aspect of the research is related to the kind of tools and technologies that should be utilized in developing the software product. This is done keeping into consideration the fact that the use of <i>expensive</i> tools, technologies and infrastructure does not help much in sustaining any change in the educational systems of developing countries. Therefore, Moodle (<a href="http://www.moodle.org">www.moodle.org</a>) and other such open source learning management systems are being considered for the initial version of this project<b>. </b></p>
<p><b></b></p>
<blockquote><p><b>We need someone who can evaluate both Moodle and OpenMark, tell whether the functionality of diagnostic assessment used in OpenMark can also be integrated in Moodle, and how.</b> Monetary remuneration is available for this activity.</p>
</blockquote>
<p>The Open University (<a href="http://www.open.ac.uk">www.open.ac.uk</a>) in the UK is one of the pioneers in establishing distance learning (also e-learning) programs for higher education. They are currently using OpenMark (<a href="https://openmark.dev.java.net/">https://openmark.dev.java.net/</a>), their own open source Computer Assisted Assessment (CAA) system, as well as Moodle to develop formative assessment tests for their students enrolled in distance learning programs (<a href="http://labspace.open.ac.uk/course/view.php?id=3484&amp;topic=all">http://labspace.open.ac.uk/course/view.php?id=3484&amp;topic=all</a>). You will find some documentation on this at <a href="http://labspace.open.ac.uk/course/view.php?id=3484&amp;topic=all">http://labspace.open.ac.uk/course/view.php?id=3484&amp;topic=all</a>, and at <a href="http://labspace.open.ac.uk/mod/resource/view.php?id=381989&amp;direct=1">http://labspace.open.ac.uk/mod/resource/view.php?id=381989&amp;direct=1</a>. After that we would need help in taking on the development of our own software from there.</p>
<p>If you know anyone who is interested, please leave a comment or drop an email to awais {at} chaoticity.com</p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/are-you-interested-in-using-computers-in-the-classrooms/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to get formants of phones from sound files</title>
		<link>http://chaoticity.com/how-to-get-formants-of-phones-from-sound-files/</link>
		<comments>http://chaoticity.com/how-to-get-formants-of-phones-from-sound-files/#comments</comments>
		<pubDate>Wed, 22 Jul 2009 16:01:08 +0000</pubDate>
		<dc:creator>awais</dc:creator>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Speech]]></category>
		<category><![CDATA[Formant]]></category>
		<category><![CDATA[Praat]]></category>
		<category><![CDATA[PraatCon]]></category>
		<category><![CDATA[Script]]></category>

		<guid isPermaLink="false">http://chaoticity.com/how-to-get-formants-of-phones-from-sound-files/</guid>
		<description><![CDATA[This tutorial is mostly for people (esp. my AI students) who want to find formants of any sounds sample. So here are the things/software you will need. a good mic (recommended but not necessary) a quite room any software which can cut and save wav files. I use Praat. the console version of Praat, called [...]]]></description>
			<content:encoded><![CDATA[<p>This tutorial is mostly for people (esp. my AI students) who want to find <a href="http://en.wikipedia.org/wiki/Formant" target="_blank">formants</a> of any sounds sample. So here are the things/software you will need.</p>
<ol>
<li>a good mic (recommended but not necessary) </li>
<li>a quite room </li>
<li>any software which can cut and save wav files. I use <a href="http://www.fon.hum.uva.nl/praat/praat5111_winsit.exe" target="_blank">Praat</a>. </li>
<li>the console version of Praat, called <a href="http://www.fon.hum.uva.nl/praat/praatcon5111_winsit.exe" target="_blank">PraatCon</a>. This will be used to extract the formants from wav files. </li>
<li>about 30 mins. </li>
</ol>
<p>Once you have all these things, here is what you do for recording the phone sounds</p>
<ol>
<li>Start Praat. You will see two windows. You can close the other window but keep the one titled <strong>Praat Objects</strong> open. </li>
<li>In the menu, select <strong>New/Record mono sound</strong>. Select sampling frequency to 16000 Hz, press <strong>Record</strong> and start speaking in the mic. When you are through, press <strong>Stop</strong>. Speak all the phones you want in a single go but with silence in between. In the screenshot below, AA (as in f<u>a</u>ther) is spoken multiple times. </li>
<li>Press <strong>Save to list</strong> and then <strong>Close. </strong>You should now an entry in the Objects listbox. Make sure the entry is select and then press <strong>Edit</strong>. </li>
<li>Select the phone for which you want to extract the formants by pressing the mouse button on the start of the phone and dragging it to the point <em>right before</em> it finishes. Please note that we will be using the mean formant value for a phone so it is essential that only the middle part of the phone is selected to ensure accuracy and no other noise or silence is present. The selected area will be highlighted. </li>
<li>From the <strong>File </strong>menu, select <strong>Write selected sound to WAV file</strong>, enter the name of the wav file and save. </li>
<li>Repeat step 4 and 5 for all the phones you want to save. </li>
</ol>
<p><a href="http://chaoticity.com/images/image2.png"><img title="Saving WAV in Praat" style="border-top-width: 0px; display: inline; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="484" alt="Saving WAV in Praat" src="http://chaoticity.com/images/image_thumb2.png" width="586" border="0" /></a> </p>
<p>That’s it for the recording part. Alternately, if you already have a collection of sound files with a single phone, you can use that. You can download one such collection here. The zip file has 40 files where 2 speakers have spoken AA (as in f<u>a</u>ther) and II (as in b<u>ee</u>n) 10 times each. </p>
<p align="center"><a href="/software/formanter.zip">Click here to download script and sound files.</a></p>
<p>Although you can use the GUI based Praat to find out the formant values for each phone, we want to automate the task. To do so, we will use the console version of Praat called PraatCon, and run a script to extract the formants. The small Praat script written below takes the name of a phone file as argument and outputs the first two formats F1 and F2 to the console.</p>
<pre>form Display mean F1 and F2
   sentence filename
endform
filename_noext$ = replace$ (filename$, &quot;.wav&quot;, &quot;&quot;, 0)

Read from file... 'filename$'
select Sound 'filename_noext$'
To Formant (burg)... 0 5 5500 0.025 50
f1 = Get mean... 1 0 0 Hertz
f2 = Get mean... 2 0 0 Hertz
clearinfo
print 'filename$' 'f1''tab$''f2''newline$'</pre>
<p>&#160;</p>
<p>This script along with a small batch file (run.bat) which runs this script for all wav files present in the current directory, can be downloaded with all the sound files by clicking on the link below. To get this running, extract all files in a directory and place your downloaded copy of PraatCon in the same directory. The just click on the run.bat icon and watch the magic!</p>
<p align="center"><a href="/software/formanter.zip">Click here to download script and sound files.</a></p>
<p><a href="http://chaoticity.com/images/image3.png"><img title="Output of the formanter script" style="border-top-width: 0px; display: inline; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="327" alt="Output of the formanter script" src="http://chaoticity.com/images/image_thumb3.png" width="644" border="0" /></a> </p>
<p>The sound files in the archive give this nice little graph when their F1 and F2 values are plotted as an XY scatter graph shown below. All this was done to construct a linearly separable dataset which can be given to students for training a perceptron to distinguish between two phones. Any suggestion/comments are welcome. </p>
<p><a href="http://chaoticity.com/images/image4.png"><img title="F1 and F2 scatter plot of AA and II " style="border-top-width: 0px; display: inline; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="295" alt="F1 and F2 scatter plot of AA and II " src="http://chaoticity.com/images/image_thumb4.png" width="487" border="0" /></a> </p>
</p>
<p align="center"><a href="/software/formanter.zip">Click here to download script and sound files.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://chaoticity.com/how-to-get-formants-of-phones-from-sound-files/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
