aosshome

Indian Languages


Tools

  1. About Unicode and Character table for all Scripts
  2. Unicode Table with equivalents for Indian Scripts
  3. Tamil Editor
  4. Kannada Editor
  5. Hindi (Devanagari Script) Editor

    Script Converters

  6. Kannada from/to Tamil
  7. Kannada (Devanagari Script) from/to Hindi
  8. Hindi (Devanagari Script) from/to Tamil
Note: Transliteration will not be perfect, as each language or script is unique. There will be equivalent substitution in some cases.

LANGUAGES Pages

  1. BENGALI
  2. GURMUKHI
  3. GUJARATHI
  4. HINDI
  5. KANNADA
  6. MALAYALAM
  7. MARATHI
  8. ORIYA
  9. TAMIL
  10. TELUGU
Extras: Japanese scripts
India has at least 33 Principal languages in common use out of 1600+ different Indian languages which are spoken.

The language diversity of its one billion people makes the communications problems of most other countries seem trivial by comparison.

The number of languages in India as being 1652 (1961 census). More recent censuses show a slightly different number, 1,576 to 1,721 “mother tongues” with separate grammatical structures. The exact figure is not available due to the richness of native Indian languages and their variants, contradictions in language surveys and human’s inability to identify the difference between Indian languages and Indian dialects.

It is hard to accurately determine exact number of languages and dialects in India. It is like counting number of birds in a region, as in Birbal story. The taste of water changes every mile, language or dialect changes every four miles (कोस कोस पर बदले पानी , चार कोस पर वाणी।). There are different theories about how many of these mother-tongues qualify to be described as independent languages.

The encyclopaedic People of India series of the Anthropological Survey of India, identified 75 "major languages" out of a total of 325 languages used in Indian households. Ethnologue, too reports India as a home for 398 languages, including 387 living and 11 extinct languages.

As per the 2011 Census, there are about 122 languages, out of which 23 (including English) are listed as the official languages of the Republic of India. 800 million Indians can speak this 23 languages. The Constitution stresses the primacy of Hindi which, written in Devanagari script. But English remains in widespread official use.

Collections and Essays

  1. Declaration of Human Rights: Tamil
  2. Declaration of Human Rights: MALAYALAM
  3. Languages: Concepts and Myths

General Note

  1. As per the People's Linguistic Survey of India (PLSI), as many as 780 different languages are spoken and 86 different scripts are used in the country. Nearly 250 languages have been lost in the last 50 years. 22 of the 780 languages are scheduled Indian languages. 122 languages have been declared by the census as spoken by a population exceeding 10,000. [hindustantimes 17 Jul 2013]
  2. For many educated Indians, English is their important language for communication.
  3. 23 national languages, plus English, associate official. They are: Assamese; Bengali; Bodo; Dogri; English; Gujarati; Hindi; Kannada; Kashmiri; Konkani; Maithili; Malayalam; Marathi; Meitei (Manipuri); Nepali; Odia; Punjabi; Sanskrit; Santali; Sindhi; Tamil; Telugu; and Urdu.
    These languages have over 720 dialects.
  4. Formation of lingustic provinces. The acceptance of this policy involved the statutory recognition of all the major regional languages.
  5. The general script of the Aryan languages is different from the general script of the Dravidian languages. Along with these two main language families, there are others from Sino-Mongoloid family spoken in the East India.

Number of Languages

India LanguagesNumber
Official23
Principal50 plus
Radio programs 100 plus
Newspapers 90 plus
Films20 plus
Schools teach60 plus
Actual1600 plus
Language FamilyNumberPercentage of population
Indo-European or Indo-Aryan5470.0
Dravidian2623.8
Mon-Khmer202.0
Sino-Tibeto-Burman982.0
Austro- Asiatic languages251.2
Others1400 plus1.0

325 recognized/documented Indian languages

Agaria, Ahirani, Aimol, Aiton, Anal, Andamanese, Angani, Angika, Ao, Apatani, Arabic, Armenian, Ashing, Assamese, Asuri, Awadhi
Badaga, Baghelkhandi, Bagri, Baigani, Bajania, Balti, Bangni, Banjari, Basturia, Bauria, Bawm, Bazigar Boli, Bengali, Bhanja- bhumia, Bantu, Bharmauri, Bhairi, Bhili, Bhojpuri, Bhotia, Bhuiya, Bhumij, Bhunjia, Biate, Bilaspuri, Birhor, Birjia, Bishnupriya, Bodo, Bokar, Bondo, bori, Braj Bhasha, Brijlal, Bugun, Bundelkhandi, Burmese, Bushari
Chakhesang, Chakma, Chambilai, Chameali, Chang, Changpa, Chattisgarhi, Chikari, Chinali, Chiru, Chote, Churasi
Dalu, Deori, Dhanki, Dhimal, Dhodia, Dhundhari, Didayi, Dimasa, Dingal, Dogri, Dommari, Droskhat/Dokpa, Duhlian-Twang
English
French,
Gadaba, Gadiali, Gallong, Gameti, Gamit, Gangte, Garasia, Garhwali, Garo, Giarahi, Gondi, Gujarati, Gujjari, Gurung, Gutob
Hajong, Halam, Halbi, Harauti, Haryanavi, Hebrew, Himachali, Hindi, Hinduri, Hindusthani, Hmar, Ho, Hrusso, Hualngo
Irula
Jabalpuri, Jangali, Jarawa, Jaunsari, Juang
Kabui, Kachanga, Kachari, Kachchi, Kadar, Kagati, Kakbarak, Kanashi, Kangri, Kannada, Karbi, Karen, Karko, Kashmiri, Kathiawari, Khadiboli, Khaka, Khamba, Khampa, Khampti, Khampti-shan, Kharia, Khasi, Khaskura, Khatri, Kherwari, Khiangan, Khorusti, Khotta, Kinnauri, Kiradi, Kisan, Koch, Kodagu, Koi, Koireng, Kokni, Kolami, Kom, Komkar, Konda, Konicha, Konkani, Konyak, Koracha, Koraga, Korava, Korku, Korwa, Kota, Kotwalia, Kudmali, Kui, Kuki, Kulvi, Kumaoni, Kunbi, Kurukh, Kuvi
Ladakhi, Lahauli, Laihawlh, Lakher (Mara), Lalung,Lambani, Lamgang, Laotian, Laria, Lepcha, Limbu, Lisu, Lodha, Lotha, Lushai
Mag, Magahi, Magarkura, Mahal, Maithili, Majhi, Makrani, Malankudi, Malayalam, Malhar, Malto, Malvi, Manchat, Mandiali, Mangari, Mao, Maram, Marathi, Maria, Maring, Marwari, Mavchi, Meitei, Memba, Mewari, Mewati, Milang, Minyong, Miri, Mishing, Mishmi, Mizo, Monpa, Monsang, Moyon, Muduga, Multani, Mundari
Na, Nagari, Nagpuri, Naikadi, Naiki, Nati, Nepali, Nicobarese, Nimari, Nishi, Nocte,
Odki, Onge, Oriya
Padam, Pahari, Paharia, Palilibo, Paite, Panchpargania, Pang, Pangi, Pangwali, Parimu, Parji, Paschima, Pasi, Pashto, Pawri, Pengo, Persian, Phom, Pochury, Punchi, Punjabi,
Rai (Raikhura), Rajasthani, Ralte, Ramo, Rathi, Rengma, Riang,
Sadri, Sajalong, Sambalpuri, Sangtam, Sansi, Santali, Sadra, Saraji, Sarhodi, Saurashtri, Sema, Sentinelese, Shekhawati, Sherdukpen, Sherpa, Shimong, Shina, Shompen, Sikligar, Sindhi, Singpo, Siraji, Sirmauri, Soliga, Sulung, Surajpuri
Tagin, Tai, Tamang, Tamil,Tangam, Tangkhul, Tangsa, Tataotrong, Telugu, Thado, Thar, Tharu, Tibetan, Toda, Toto, Tulu
Urdu
Vaiphei, Varli
Wagri, Wancho
Yereva, Yerukula, Yimchungre
Zakring (Meyer), Zeliang, Zemi, Zou.

Endangered languages and scripts

An endangered language is a language that is at a risk of falling out of use, generally because it has few surviving speakers. If it loses all of its native speakers, it becomes an extinct language. UNESCO defines four levels of language endangerment between "safe" (not endangered) and "extinct": Vulnerable; Definitely endangered; Severely endangered; and Critically endangered. 191 languages of India are classified as vulnerable or endangered.

Some notes on North Indian languages

Some notes on South Indian Languages

South India, surrounded by three oceans and is separated from north India by the Vindhya mountain range. This this triangular volcanic land, is insulated by the Arabian Sea and Eastern Ghats on the east and the Bay of Bengal and Western Ghats on the west. This was once part of the geologically primeval Gondwanaland, remained culturally undisturbed for millennia, evolving an aura of poised tranquillity.

The dominant features of south India are the tropical climate, lush green tropical vegetation in the coastal areas and the architecture, culture, languages and lifestyle which had remained essentially Dravidian at the core.

The major languages in South India today are: Tamil, Telugu, Kannada and Malayalam. The minor languages are several and these are: Brahui, Gondi, Kui, Malto, Oraon (Kurukh), Toda, Tulu and Konkani (in Kannada script).

Dravidian languages

Around 18 percent of the Indian populace speak Dravidian languages. Only a few isolated groups of Dravidian speakers, such as the Gonds in Madhya Pradesh and Orissa, and the Kurukhs in Madhya Pradesh and Bihar, reside in the north India. Dravidian speakers are also the Brahuis in Pakistan)

The Dravidian family of languages includes approximately 26 languages, appear to be unrelated to languages of other known families. Some scholars include the Dravidian languages in a larger Elamo-Dravidian language family, which includes the ancient Elamite language.

Scripts

Major languages in India, with over 720 dialects are written in over 13 different scripts.
The Brahmi script is the earliest writing system after the Indus script. Most of the Indian scripts and several hundred scripts found in Southeast and East Asia are derived from Brahmi. Brahmi is an abugida that thrived in the Indian subcontinent and uses a system of diacritical marks to associate vowels with consonant symbols. ‎It has numerous descendents like Gupta‎ script. A southern form of Brahmi developed into the Grantha script
The Kharosthi script (also known as 'Indo-Bactrian' script) was more or less contemporary to Brahmi script and was employed to represent a form of Prakrit.
Origin of Brahmi Script is debatable with possible candidates: Unknown so far, Indus, Hieratic, cuneiform, Phoenician, Aramaic etc.
Prinsep deciphered Brahmi and Kharosthi from the bilingual Indo-Greek coins. Tamil-Brahmi script was found in Palani in Southern India, scientifically dated to 540 BCE.

Unicode Consortium

The Unicode Consortium is a non-profit organization devoted to developing, maintaining, and promoting software internationalization standards and data, particularly the Unicode Standard, which specifies the representation of text in all modern software products and standards. The Unicode Consortium actively develops standards in the area of internationalization including defining the behaviour and relationships between Unicode characters. The Consortium works closely with W3C and ISO—in particular with ISO/IEC/JTC 1/SC2/WG2, which is responsible for maintaining ISO/IEC 10646, the International Standard synchronized with the Unicode Standard.

The latest electronic version of the Unicode Standard can be found at Unicode site. The publications of the Unicode Consortium include Unicode Standard, with its Annexes and Character http://www.unicode.org/ucd/, Unicode Technical Standards and Reports http://unicode.org/reports/, Unicode Technical Notes and the Unicode Locales project, the Common Locale Data Repository.

The Unicode Character Standard primarily encodes scripts rather than languages. That is, where more than one language shares a set of symbols that have a historically related derivation, the union of the set of symbols of each such language is unified into a single collection identified as a single script. These collections of symbols (i.e., scripts) then serve as inventories of symbols which are drawn upon to write particular languages. In many cases, a single script may serve to write tens or even hundreds of languages (e.g., the Latin script).

Bilingualism

A bilingual person is, in its broadest definition, anyone with communicative skills in two languages, be it active or passive. In a narrow definition, the term bilingual is often reserved for those speakers with native or native-like proficiency in two languages. Similarly, the terms trilingual and multilingual are used to describe comparable situation in which three or more languages are involved. Many bilingual speakers are able to switch from language to language with ease, sometimes in mid-sentence.

In India, 3-language formula is some what popular. Many can speak more than 2 languages. Speakers of Indian languages tend to maintain their languages over generations and centuries, even when they live away from the region where it is dominant.

By necessity, a substantial minority are able to speak two Indian languages; even in the so-called linguistic states, there are minorities who do not speak the official language as their native tongue and must therefore learn it as a second language. Many tribal people are bilingual. Rural-urban migrants are frequently bilingual in the regional standard language as well as in their village dialect.

There are 26 major languages (no majority language) with more than one million speakers each!

Popularity of English Language

Around 10 percent Indians, speak English. Even 5%, which is 50 million people, the largest population of English-speakers in the world. English is the main vehicle for certain kinds of knowledge, a library language. It is the best source of scientific knowledge today.
English is the lingua franca, or more precisely, the link language of India. It remains the language of the Lok Sabha (the parliament), of the higher courts, of the highest levels of the Indian Civil Service, of the major universities, of multi-nationals and of most large sale Indian businesses.
Indian software developers and programmers, invariable use English in their work.

Digital Divide or Digital Opportunity

But what of the other 95 or 90 percent, who can not use English?
So, is there a market for local (vernacular) computing in South Asia? The story of the two shoe salesmen who went to a rural village will provide the answer. The first came back to his headquarters and said, "The situation is absolutely hopeless. Of a thousand people, not a single one wears shoes." The second returned to his home company and said, 'This is an incredible opportunity: no one owns shoes and we can sell a thousand pairs!"

Points to note:

Some Interesting Facts


Trivia


Old language map of India India Language Map


References (including sources) and Language Studies WEB sites

  1. https://en.wikipedia.org/wiki/Languages_of_India
  2. https://en.wikipedia.org/wiki/Languages_with_official_status_in_India
  3. https://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers_in_India
  4. https://en.wikipedia.org/wiki/List_of_endangered_languages_in_India
  5. AncientScripts.com URL:http://www.ancientscripts.com/
  6. India: Languages and Scripts URL:http://www.cs.colostate.edu/~malaiya/scripts.html
  7. Wikipedia URL:http://en.wikipedia.org/wiki/Indian_languages
  8. Introduction to indic scripts URL:http://people.w3.org/rishida/scripts/indic-overview/

Email Contact