Chapters
There are many languages in India - depending on the definition, the number is thought to exceed a thousand. In fact, India is one of the few countries that does not have a national language, though both English and Hindi are the official language of India used by the government - and the language spoken or at least understood by most of its inhabitants, is Hindi.
So where does Hindi come from and what is its place among the languages of the world?
Language Families in India
India is a land rich in languages. Hundreds of languages and dialects co-exist, some of the spoken by only a few hundred people. While it’s not unusual for several different languages to be spoken in one country, India is fairly uncommon not only through the number of languages but also their diversity, as not all the languages belong to the same language group.
Indo-European Languages of India
About 78% of Indians speak a language belonging to the Indo-European language family, to which English also belongs. However, they do not belong to the same sub-group. The Indian languages are part of the Indo-Iranian languages, forming a subgroup called Indo-Aryan.
The oldest Indo-Aryan language attested in India is Vedic Sanskrit, with texts dating to the 2nd-1st millennium BC transmitted orally before being set down in writing - for example, some of the sutras of the Rigveda, an important collection of Hindu hymns.
The Indo-Aryan languages of India include:
- Hindi (with Urdu, over 590 million speakers)
- Bengali or Bangla (205 million) mostly spoken in the state of West Bengal and part of Assam
- Punjabi (100 million), spoken in the states of Punja, Haryana and Delhi, as well as much of Pakistan
- Marathi (75 million) spoken in Maharashtra and Goa
- Gujarati (50 Million) spoken in Gujarat and the federal (union) territories of Daman and Diu, Dadra and Nagar Haveli. It was Mahatma Gandhi’s native tongue.
- Bhojpuri (40 million) spoken in Uttar Pradesh, Bihar and Jharkhand
- Maithili (35 million) spoken in Bihar and Jharkhand
- Odia (35 million) spoken in Odisha, West Bengal, Jharkhand, Chhattisgarh and Andhra Pradesh
- Marwari (30 million) spoken in Rajasthan, Gujarat and Haryana
- Sindhi (25 million) spoken in Gujarat and Maharashtra and the Sindh province of Pakistan
- Assamese (24 million) spoken in Assam (of tea fame) and Arunachal Pradesh
- Rajasthani (20 million) spoken in Rajasthan, Madhya Pradesh, Gujarat, Punjab and Haryana
- Rangpuri (15 million) spoken in Assam and West Bengal
- Bishnupriya (12 million) mostly spoken in Assam and Tripura

Dravidian languages
Dravidian languages seem to be native to India (though some scholars disagree); they are not related to any other language family. Only two Dravidian languages are spoken outside of India (Brahui in Pakistan and Afghanistan and Dhangar in Nepal and Bhutan).
The oldest written evidence of a Dravidian language in India is the Tamil-Brahmi script found on cave walls in Tamil Nadu dating to the 2nd century BC.
Some Dravidian languages in India include:
- Telugu (the most common Dravidian language, numbering about 74 million native speakers)
- Tamil (also spoken in Sri Lanka; over 66 million worldwide)
- Kannada (over 37 million worldwide)
- Malayalam (over 33 million worldwide)
- Tulu (about 1.9 million)
- Beary (about 1.5 million)
- Gondi (over 2 million)
- Brahui (over 2 million)
- Kurukh (over 2 million)
- Muria (about 1 million)
- Kui (about 900,000)
- Koya (360,000)
- Madiya (360,000)
- Kuvi (155,000)
- Malto (over 200,000)
- Kurambhag Paharia (about 12,500)
- Kolami (about 120,000)
- Duruwa (about 51,000)
- Ollari (about 15,000)
- Naiki (about 10,000)
- Manda (spoken in Odisha, about 4,000)
- Kota (spoken in Tamil Nadu, about 930)
- Kalanadi (spoken in Kerala, about 750)
- Holiya (Madhya Pradesh, Maharashtra, Karnataka, about 500 speakers)
- Aranadan (Kerala, about 200)
Other language groups in India
India also hosts languages belonging to the Austroasiatic (Munda), Sino-Tibetan, Tai-Kadai and a few other linguistic groups and isolates.
Origin of the Hindi language: Sanskrit and Prakrit
The Hindi language evolved from Sanskrit, an ancient Indo-European language.
Sanskrit
Based on linguistic grounds, Vedic Sanskrit could date as far back as 1500 BC. Some of the oldest pieces of Hindi literature, such as some of the hymns of the Hindu compilation called the Rigveda, were written in Vedic Sanskrit.

Around 800 BC it morphed into Classical Sanskrit, a language mostly spoken by the upper class, which remained the classical literary language in India for a long time. Though few still speak it, it is still taught in schools the same way that Latin is taught as the classical literary language of Europe.
Find a Hindi class anywhere in the UK.
Prakrit
Prakrit languages evolved from Vedic and Classical Sanskrit. The earliest are attested around 500 BC; the latest around 800 AD. Linguists differ as to whether all Middle Indo-Aryan languages should be grouped under the term Prakrit; as it is, most of the Indo-European languages of India evolved from one or more of them.
Some were Dramatic Prakrits, that is to say, languages used almost exclusively for literature and plays. None of these was used in everyday speech and very often Sanskrit translations were provided so the reader could understand the dialogue. However, as Sanskrit lost ground in certain areas, some dramatic Prakrits devolved into vernacular languages, such as Maharashtra Prakrit, the ancestor of the Marathi language.
The most important Prakrit language was Ardhmagadhi Prakrit, and its grammar is usually used as the standard to teach other Prakrits. In regions where Hindi would later be spoken, however, Sanskrit remained very popular, so that the etymology of many Hindi words comes directly from Sanskrit rather than through a Prakrit language.
Check out Hindi courses London on Superprof.
The way to Hindustani
In northern India around 500 AD, the Apabhramsha dialects evolved from Prakrit. They served as a kind of lingua franca in use until the 13th century AD and were referred to as Hindavi by the Persian rulers of the Delhi Sultanate who ruled large swathes of India from 1206 to 1526. The Hindi languages started branching off from Apabramsha around the 11th century AD, most of them being entirely distinct by the 12th, though in many places the Apabhramsha languages were still spoken in parallel.
It was under the Delhi Sultanate that the Persian language first started mixing with the local Apabhramsha dialects to form what would later become the Hindi and Urdu language.

In 1526, the Moghul Empire, a persianised empire of Turko-Mongol descent, supplanted the Delhi Sultanate and ruled over much of India, allowing even more Persian loanwords to enter the language.
By the time the Moghul Empire slowly dissolved in the 18th century, Khari Boli or Khariboli vernacular, successor dialects to the Apabhramsha languages, had replaced Persian as the common language.
The variant of Khariboli used by the upper class in northern India became known as Hindustani.
Find a course in Hindi thanks to Superprof.
The Hindi Language
Throughout the Moghul Empire and for many following and rival dynasties, Persian was the court language. However, when the British colonised India in the 18th through the 19th century, they were on the lookout for a widely-spoken language they could use for administration. Hindustani was widespread enough that it became the official language of the British Indian Empire, under the name of Urdu.
Hindustani is still used as a vernacular and lingua franca is the northern and western regions of the Indian subcontinent.
Today, we speak of Hindi if it is spoken in India and Urdu as the language of Pakistan - though, as we will see, there are some differences between the two.
Influences on the Hindi Language
Throughout its history, Hindi absorbed loan words from many different languages.
The main outside influence on the Khariboli that later became Hindustani was Persian, through the administrators and soldiers of the Delhi Sultanate and later the Moghul Empire.
Most Arabic words in Hindustani come from Persian, which has a lot of Arabic loan words.
Additionally, since Portugal had territories in India until the 1960s, Hindi has a fair amount of Portuguese loan words such as mez for "table" (from Portuguese mesa) or kamiz, "shirt", from camisa.
Of course, through English colonisation and modern globalisation, Hindi also has a good number of English loan words such as botal from “bottle” or prafessar from “professor”.
Obviously, other Indian languages have also provided Hindi with new words, just as Hindi words have seeped into other languages such as Tamil or Marathi.
Hindi and Urdu: A Language Split or a Reunification?
Urdu is first attested in the late Moghul Empire as a version of Khariboli with heavy Persian influences called Zaban-e Urdu-e Mualla, the “language of the court (or camp)”. It existed in parallel to Hindi and eventually became the official language of Pakistan when the country was founded.
Both Urdu and Hindi are considered registers of Hindustani - two versions of the same language, much like British English and American English are both registers of English. They remain mutually understandable.

They mostly have the same grammar, but differ greatly in vocabulary, with Hindi borrowing much more heavily from Classical Sanskrit while Urdu borrowed more from Persian. The differences are more evident in the literary register than in everyday speech. There are some differences in pronunciation as well.
Language | Script | Main Influences |
---|---|---|
Hindi | Devanagari | Sanskrit |
Urdu | Persian (Nastaliq) | Persian Arabic Turkish |
The further evolution of Hindi and Urdu is rather exciting. In some ways, they are drifting farther apart as India and Pakistan have their own unique cultures, both in terms of religion (India is primarily Hindu and Pakistan primarily Muslim) and other aspects. On the other hand, Bollywood movies, with their own brand of Hindi (with many Urdu phrases), are immensely popular in both countries and influence the speech of the younger generation, while young Hindi speakers are more comfortable using Persian or Arabic loan words than before. This is an evolution that could go either way:
- Hindi and Urdu can continue to drift apart and become separate languages in truth.
- Hindi and Urdu can continue to influence each other so that their differences slowly erode.
Either way, the future of Hindi and Urdu will be an interesting one!
So as you see, learning to speak Hindi will mean getting to know a language with a long and literary past and an exciting, dynamic future.
The platform that connects tutors and students
The article is very fine. but i need references
I’m glad you liked it – most of the information is taken from the links provided. We don’t offer scientific research articles on this site, but introductory pieces to provide some basic info. If your research should turn up anything interesting or contradictory, you’re welcome to let us know!
Latest Research (2019) on Linguistics have provided More Clarification on the Fact that Sanskrit is a language of Brahmins who Migrated to India From Europe and that slavic Brahmins did not* travel to Iran or Persia so Genetically were not related to Aryans who were Persians. So Brahmins or their language cannot be classified as Aryan as Brahmins were Purely European* and there fore their language too landed as pure European. Due to these Europeans Copying* All* Indian Linguistic not just one just to make all Indians feel that they have a upper hand just to make Sanskrit look primitive… misusing Indians Innocence used spiritual loop holes and India being Multi lingual …it was easy as Sanskrit was never a spoken language but only in script to act as if Sanskrit is mother of Any Indian lanuage….How can a non*spoken* foreign language ever be a mother or Origin point of any Indian language.? To have power to influence first it should have been in use/spoken language. Prakrit alone influenced Hindi.
Hindi has been clubbed with Urdu to “claim” its majority, mainly for political purposes. The bare fact is that Hindi is the sole language of the Hindi heartland ONLY, which happens to be central Uttar Pradesh. Madhya Pradesh is dominated by several tribal languages, which are indeginous to itself. Rajasthani is the native tongue of Rajasthan. So this RJ & MP may be excluded from the region shaded in the map under “Hindi-land”. Going by recent statistics, it will be found that Bengali & Telugu are good contenders for the most popular language of Indian origin, however, since they don’t enjoy political patronage, right from the British period, this fact will be shrouded in political intention. The literature of these two are richer than Hindi, whose only protagonist is the Bollywood lobby,only for business purposes and literary & creative contribution is negligible, since Bollywood thrives in shameless plagiarism of Hollywood, without attempts at genuine creativity. In order to really catapult Hindi to its intended popularity, the Hindi protagonists may be humbly urged to nurture their creativity in ALL fields, e.g. technical also, rather than cracking the political ringmaster’s whip while forcibly imposing it on all Indians (esp south) with much non-value added expenditure in its popularising through bribery in the form of prizes & incentives or utter persuation, solely for political hegemony. A little creativity is humbly & respectfully urged in popularising it without opposition, which could then be pleasurably enjoyed rather than painfully endured.
Relatively true narrative on Hindi and Urdu. The predominant languages with historical literature evidence is Sanskrit and Tamil as contemporary languages. Sanskrit the Indo-Aryan from where the European languages originated later on with mix of later persian.
Tamil the predominant Dravidian language, has lierature evidence as old as Sanskrit with current evidence still available from 500BC on words but may older as a coin from Indus valley was found to have a Tamil letters indicating migration of Dravidians from Indus gradually. Telgu comes soon after Tamil on lierature evidence but late because of influence of both Sanskrit and Tamil was strong.
The intersting fact is Sanskrit has faded but Tamil and Telgu as ister languages survive!!
Agree Totally. But what worried is these languages Hindi or Sanskrit(European) which even if pleasurably enjoyed is Turing to become painful after a decade when it’s been made compulsary for a Central Govt Job. Or KV education – These are Reasons why South Indians Asked for a Dravidian language too to be included as official language of India.But in vain till Date. Remember there are n number of official languages in Many Countries in the world.Why not India.
Very informative..keep up
It would be appreciated if Hindi is not thrust on South Indians as we speak good English which is international. Since Indians speak good English why are we killing it. Good jobs Indian and International require English. All Banks start of with Hindi and sadly the same person cannot speak English for nuts. Please don’t thrust the language on us.
It’s good to know the English language as it’s international but don’t you think hindi would extinct in no time which is our language. Our minds are already westernised and we’re totally shifting towards them , their thinking, everything theirs and in this we’re going to loose the Indian identity. Hindi keeps us together and we’re proud of it.
How I know the historical and scientific pali language?
In time of Tathagat Gautam buddha.
Very nice good luck
I landed here after reading a page (link posted on FB by one of India’s leading playback singer). The article is informative and loaded with facts, would just like to have the references for the same. Speaking my mind: To start with, nothing is absolute true, and history is always twisted and crafted. We read just the history what the people sitting on the apex of the pyramid wants us to read. We Indians have learnt and the same we have inherently in our blood, to respect every culture alike our own. But the politics being played even in the field of academics instigate our minds to pose differences and anti-feelings towards other cultures and beliefs. That is a different thing that we still live as one helping each other and crossing our paths. To be true enough, if we have to reach the root of any language then we have to work together as one unit, the team being driven by like minded people void of conflicts and differences. An alien unit cannot determine something which has got no stance, and sad thing is even some front-line citizens, so called rationals are following the articles and getting them posted. The irony is at a time, when everyone is living in an age of (technology) revolution, we are not evolving and still revolving around some baseless conflicts of our horrendous past. But at this point the catch is we should never forget the past but of course let it go. To speak aloud here, just like Latin called the mother of English, similarly Sanskrit and Prakrit are two languages which have determine the form of Hindi and given a touch to many more languages too. People are just a tough nut not to crack and just living with subsumed and already consumed brain.
Mind blowing account of history and development of Hindi language. I read whole article and hold my breath til the end! Great article!
Excellent article, i live in Trinidad. It cleared up a lot of misconception. Qu, where did Sánskcrit originate from. Is it a spoken language?
Good information about Hindi language.
The beauty & capacity to explain
by Hindi language should have been included.
It binds people of different regional areas.
However, good description.
Wish you all the best.
Good Information. Thank You.
Kannada ♥️
Hindi is not India’s official language.
No where mentioned in constituent as Hindi or any language as official language.
Hindi and English are the official languages of the Republic of India
Kannada speaker number is wrong. According to 2011 census there are 40.37 million (4.37 crore) kannada mother tongue speakers in india.
Quite informative.
Very informative..keep up
Place in me
Very good article. It’s so interesting.