Category Archives: hindi

Theory behind Multilingual domains and security issues

Well the topic of multilingual domains is really looking very interesting to me , so here am i in my 2nd night reading about multilingual domains.

Now as i have studied currently Every Web addresses are typically expressed using Uniform Resource Identifiers or URIs. The URI syntax defined in RFC 3986 STD 66 (Uniform Resource Identifier (URI): Generic Syntax) limits web address or uri`s to limit to english characters and numerals. Now if we want to start गाैरव.com this is not possible with the RFC 3986 standard.

So for enabling the registration of domain names as गाैरव.com a new concept of IDN (internationalized domain names) was agreed by IETF in March 2003 and was defined in RFC 3490,3491, 3492 and 3454.

Now as the implementation part goes when a user request a domain name in it`s native format (Unicode for hindi) it is changed/Encoded in punycode before going to the DNS. The Encoding is done at the Application part (Usually by the browsers) and the punycode is sent to to the browsers.Remember this that the current DNS and name resolver infrastructure cannot handle unicode based(non-ascii) domain names, that`s why they are encoded into punycode.

For converting hindi domains into punycode and vice versa use

The conversion between ASCII and Non-ASCII formats is done by algorithms ToASCII and ToUnicode. I will be giving a separate blog entry for these algo`s.

Now in last blog entry we discussed about the inconsistencies developed due to the unicode representation of IDN. Now i realized something far dangerous spoofing .

So how does concept of spoofing is applied here, let`s take the example of only mld i can found on net , the origial रफ़्तार.com (Punycode ) is currently spoofed with a different website रफ्तार.com(Punycode which is a parked website and You can see that both of the domain names are so similar in unicode representation but produces a very different punycode string. Which could result in spoofing any user to a alternate website, it`s dangerous.

In general this kind of attack is known as a homograph spoofing attack. On February 7, 2005, Slashdot reported that this exploit was disclosed at the hacker conference Shmoocon with an example available at

Since it is such a obvious way of spoofing people this has been taken very seriously by the IDNA , since they were responsible for changing the unicode to the punycode strings. Among which IE7 has implemented a anti-phishing filter to avoid this kind of spoofing and Mozilla Foundation (firefox) shows the punycode URLs instead of the unicode, thus thwarting any attacks while still allowing people to access websites on an IDN domain.

Still this remains a very promising and necessary technology will talk about the algos and methodology used while encoding and decoding punycode and how DNS handles the requests.

Notice: All the work is under GFDL licence and copyright of Gaurav Mishra

Typing Hindi through SCIM

SCIM is the name of the program that will allow you to input Hindi in Ubuntu 6.06 Dapper Drake.

1. Open system System>Administration>Language Support

2. and check Hindi to install required locale files

3. then log out (System>Quit>Log Out), and login again.

Note : You should already be able to use SCIM Hindi input in a few applications, like gedit (Application>Accessories>Text Editor), by right clicking on the document, then selecting Input Methods>SCIM Input Method. However, it won’t work in the others, like Open Office.

However you can setup SCIM to work with Other Applications with given method

The recommended method to set up SCIM input is to create 75scim file under /etc/X11/Xsession.d/ with following lines (if you have 90im-switch, better to remove it for this setup)

export XMODIFIERS=”@im=SCIM”
export XIM_PROGRAM=”/usr/bin/scim -d”
export GTK_IM_MODULE=scim
export QT_IM_MODULE=scim

* After your setup, restart at least your X session. (ctrl-alt-backspace!)

Note :- you need to install scim-qtimm and skim to get it working on Kde

sudo apt-get install scim-qtimm skim

Notice: All the work is under GFDL licence and copyright of Gaurav Mishra