a state of things in which chance is supreme

UQA - Corpus for Urdu Question Answering

I think it was around 1999 when I first heard that Urdu is a low-resource language. 25 years later, Urdu is still considered low-resource despite having over 70 million native speakers. This is because the large, manually curated linguistic resources required for training are still not available for Urdu. One way around this barrier is to translate existing corpora from a resource-rich language (cough English cough) to Urdu. This seems like a chicken-and-egg problem, though, since good automatic translation systems require training resources....

May 24, 2024

Concordance of Iqbal's Urdu Poetry | کشف الالفاظ - اردو کلامِ اقبال

tl;dr: I built a KWIC concordance of Iqbal’s Urdu poetry. Check out iqbal.chaoticity.com! In distributional semantics, just like in the interpretation of law, a word is known by the company it keeps. Observing how a word appears in the context of other words allows us to identify usage nuances and gain deeper insights into the meaning and connotations of that term. Traditional word indexes do not show this context and thus miss out on the most remarkable ability of the human brain – recognising patterns....

January 1, 2024
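
The keyword-in-context (KWIC) idea behind the concordance post can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code behind iqbal.chaoticity.com: the function name `kwic`, the `width` parameter, and the English sample lines are all hypothetical, and tokens are split on whitespace so that diacritics in Urdu text stay attached to their words.

```python
def kwic(lines, keyword, width=3):
    """Collect (left context, keyword, right context) for each match.

    Hypothetical helper for illustration; `width` is the number of
    tokens kept on each side of the keyword.
    """
    rows = []
    for line in lines:
        # Whitespace tokenisation: crude, but script-agnostic.
        tokens = line.split()
        for i, tok in enumerate(tokens):
            if tok.lower() == keyword.lower():
                left = " ".join(tokens[max(0, i - width):i])
                right = " ".join(tokens[i + 1:i + 1 + width])
                rows.append((left, tok, right))
    return rows


# Made-up sample lines, used only to show the output layout.
sample = [
    "a word is known by the company it keeps",
    "every word appears in the context of other words",
]

for left, kw, right in kwic(sample, "word"):
    # Right-align the left context so the keywords line up in a column.
    print(f"{left:>20} | {kw} | {right}")
```

Lining the keyword up in a single column is what lets the eye pick out recurring neighbours, which a plain word index cannot show.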

#AcademicValentines

Roses are red
Violets are blue
Everyone’s equal
When σ is μ

February 25, 2020

Almost eleven

This blog is almost 11 now. I’ll be trying out an Android client this year, just to see if it makes any difference to my posting frequency. Meanwhile, here’s a ‘test swan’.

January 2, 2020