Romanisation of Bengali

Romanisation of Bengali is the representation of written Bengali in the Latin script. Various romanisation systems for Bengali are in use, most of which do not perfectly represent Bengali pronunciation. Although several standards for romanisation have been proposed, none has been adopted with the degree of uniformity seen for Japanese or Sanskrit.^{[note 1]}

The Bengali script has often been grouped with other Indic scripts for romanisation, under schemes that do not represent the phonetic value of Bengali. These include the "International Alphabet of Sanskrit Transliteration" or IAST (based on diacritics),^[1] "Indian languages Transliteration" or ITRANS (which uses upper-case letters suited to ASCII keyboards),^[2] and the National Library at Calcutta romanisation.^[3]

In the context of Bengali romanisation, it is important to distinguish transliteration from transcription. Transliteration is orthographically accurate (the original spelling can be recovered), whereas transcription is phonetically accurate (the pronunciation can be reproduced). English does not have all the sounds of Bengali, and Bengali pronunciation does not completely reflect its orthography. The aim of romanisation is not the same as that of phonetic transcription; rather, romanisation represents one writing system in another. If Bengali script has "ত" and Bengalis pronounce it /to/, there is nevertheless an argument, based on writing-system consistency, for transliterating it as "त" or "ta". The writing systems of most languages do not faithfully represent the spoken sounds of the language, as is famously the case with English words like "enough", "women", or "nation" (see "ghoti").

History

Portuguese missionaries stationed in Bengal in the 16th century were the first people to employ the Latin alphabet in writing Bengali books. The most famous are the Crepar Xaxtrer Orth, Bhed and the Vocabolario em idioma Bengalla, e Portuguez dividido em duas partes, both written by Manuel da Assumpção. However, the Portuguese-based romanisation did not take root. In the late 18th century, Augustin Aussant used a romanisation scheme based on the French alphabet. At the same time, Nathaniel Brassey Halhed used a romanisation scheme based on English for his Bengali grammar book. After Halhed, the renowned English philologist and oriental scholar Sir William Jones devised a romanisation scheme for Bengali and other Indian languages in general; he published it in the Asiatick Researches journal in 1801.^[4] His scheme came to be known as the "Jonesian system" of romanisation and served as a model for the next century and a half.

Transliteration and transcription

Romanisation of a language written in a non-Roman script can be based on either transliteration (orthographically accurate, so the original spelling can be recovered) or transcription (phonetically accurate, so the pronunciation can be reproduced). The distinction is important in Bengali, as its orthography was adopted from Sanskrit and ignores several millennia of sound change. All writing systems differ at least slightly from the way the language is pronounced, but the gap is wider for languages like Bengali. For example, the three letters শ, ষ, and স had distinct pronunciations in Sanskrit, but over several centuries the standard pronunciation of Bengali (usually modelled on the Nadia dialect) has lost the phonetic distinctions, and all three are usually pronounced as IPA [ʃɔ]. The spelling distinction persists in orthography.

In written texts, distinguishing between homophones, such as শাপ shap "curse" and সাপ shap "snake", is easy. Such a distinction could be particularly relevant in searching for the term in an encyclopaedia, for example. However, the fact that the words sound identical means that they would be transcribed identically, so some important distinctions of meaning cannot be rendered by transcription. Another issue with transcription systems is that cross-dialectal and cross-register differences are widespread, so the same word or lexeme may have many different transcriptions. Even simple words like মন "mind" may be pronounced "mon", "môn", or (in poetry) "mônô" (as in the Indian national anthem, "Jana Gana Mana").

Often, different phonemes are represented by the same symbol or grapheme. Thus, the vowel এ can represent either [e] (এল elo [elɔ] "came") or [ɛ] (এক êk [ɛk] "one"). Occasionally, words written in the same way (homographs) may have different pronunciations for differing meanings: মত can mean "opinion" (pronounced môt) or "similar to" (môtô). Therefore, some important phonemic distinctions cannot be rendered in a transliteration model. In addition, if the goal is to let speakers of other languages pronounce a Bengali word easily, it may be better to use a transcription, which leaves out the silent letters and other idiosyncrasies (স্বাস্থ্য sbasthyô, spelled <swāsthya>, or অজ্ঞান ôggên, spelled <ajñāna>) that make Bengali romanisation so complicated. Such letters are misleading in a phonetic romanisation of Bengali. They result from the frequent grouping of the Bengali script with other Indic scripts for romanisation, even though the other Indic scripts lack the inherent vowel ô, which causes considerable confusion in Bengali romanisation.
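The two failure modes described above can be checked mechanically. The following Python sketch is only an illustration: it takes the NLK (transliteration) and Wiki (transcription) forms of four words as they appear in this article's example table, and groups them to show which mapping collapses distinct items. The names ROWS and group are invented for this sketch.

```python
from collections import defaultdict

# (Bengali spelling, meaning, NLK transliteration, Wiki transcription),
# copied from the example table in this article.
ROWS = [
    ("সাপ", "snake", "sāpa", "shap"),
    ("শাপ", "curse", "śāpa", "shap"),
    ("মত", "opinion", "mata", "môt"),
    ("মত", "like", "mata", "moto"),
]

def group(rows, key_index, value_index):
    """Collect, for every value in column key_index, the set of values in column value_index."""
    groups = defaultdict(set)
    for row in rows:
        groups[row[key_index]].add(row[value_index])
    return dict(groups)

# Transcription merges homophones: one romanised form, two spellings.
spellings_by_transcription = group(ROWS, 3, 0)

# Transliteration keeps spellings apart: every NLK form has exactly one source spelling.
spellings_by_transliteration = group(ROWS, 2, 0)

# Transliteration merges homographs: one spelling, two pronunciations.
transcriptions_by_spelling = group(ROWS, 0, 3)

print(spellings_by_transcription["shap"])
print(transcriptions_by_spelling["মত"])
```

In this small sample the transliteration is recoverable but hides the pronunciation difference in মত, while the transcription shows that difference but can no longer tell সাপ from শাপ.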

Comparison of romanisations

Comparisons of the standard romanisation schemes for Bengali are given in the table below. Two kinds of standard are commonly used for transliterating Indic languages, including Bengali. Many standards, like NLK/ISO, use diacritic marks and permit case marking for proper nouns. Schemes such as Harvard-Kyoto are better suited to ASCII-derived keyboards and use upper- and lower-case letters contrastively, so they forgo the normal conventions of English capitalisation.

"NLK" stands for the diacritic-based letter-to-letter transliteration schemes, best represented by the National Library at Kolkata romanisation, ISO 15919, or IAST. It is the ISO standard, and it uses diacritic marks like ā to reflect the additional characters and sounds of Bengali letters.
"ITRANS" is an ASCII representation for Sanskrit. It is one-to-many: a character may be transliterated in more than one way, which can make internet searches more complicated. ITRANS ignores English capitalisation norms so that all characters can be typed on a normal ASCII keyboard.
"HK" stands for two other case-sensitive letter-to-letter transliteration schemes: Harvard-Kyoto and XIAST. Both are similar to ITRANS but use only one form for each character.
"XHK" stands for the case-sensitive letter-to-letter Extended Harvard-Kyoto transliteration. It adds to IAST some characters specific to handling Bengali text.
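Because NLK and HK are both letter-to-letter schemes, converting between them amounts to a table lookup. The sketch below is a minimal illustration, not a complete converter: the letter pairs are read off the NLK and HK columns of the tables in this article, the table is deliberately partial (letters whose NLK form is shared with another glyph in those tables, such as ড়, are left out), and the function name nlk_to_hk is hypothetical.

```python
import unicodedata

# Partial NLK -> HK letter map, read off the tables in this article.
# "ṃ" is mapped to "M" as in the article's "Bangladesh" example row.
NLK_TO_HK = {
    "ā": "A", "ī": "I", "ū": "U", "ē": "e", "ō": "o",
    "ṛ": "R", "ṃ": "M", "ḥ": "H",
    "ṅ": "G", "ñ": "J", "ṭ": "T", "ḍ": "D", "ṇ": "N",
    "ś": "z", "ṣ": "S",
}

def nlk_to_hk(text):
    """Replace each diacritic letter with its ASCII counterpart; leave other letters unchanged."""
    # Normalise to NFC so that every accented letter is a single code point.
    text = unicodedata.normalize("NFC", text)
    table = {unicodedata.normalize("NFC", k): v for k, v in NLK_TO_HK.items()}
    return "".join(table.get(ch, ch) for ch in text)

print(nlk_to_hk("bāṃlādēśa"))
print(nlk_to_hk("byañjanadhvani"))
```

The conversion only works in this direction without extra care because HK relies on case: lower-casing an HK string would merge, for example, "a" and "A".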

Examples

The following table includes examples of Bengali words romanised by using the various systems mentioned above.

Example words
In orthography	Meaning	NLK	XHK	ITRANS	HK	Wiki	IPA
মন	mind	mana	mana	mana	mana	mon	[mon]
সাপ	snake	sāpa	sApa	saapa	sApa	shap	[ʃap]
শাপ	curse	śāpa	zApa	shaapa	zApa	shap	[ʃap]
মত	opinion	mata	mata	mata	mata	môt	[mɔt̪]
মত	like	mata	mata	mata	mata	moto	[mɔt̪o]
তেল	oil	tēla	tela	tela	tela	tel	[t̪el]
গেল	went	gēla	gela	gela	gela	gêlô	[ɡɛlɔ]
জ্বর	fever	jvara	jvara	jvara	jvara	jôr	[dʒɔr]
স্বাস্থ্য	health	svāsthya	svAsthya	svaasthya	svAsthya	shasththo	[ʃast̪ʰːo]
বাংলাদেশ	Bangladesh	bāṃlādēśa	bAMlAdeza	baa.mlaadesha	bAMlAdeza	Bangladesh	[baŋlad̪eʃ]
ব্যঞ্জনধ্বনি	consonant	byañjanadhvani	byaJjanadhvani	bya~njanadhvani	byaJjanadhvani	bênjondhoni	[bɛndʒɔnd̪ʱoni]
আত্মহত্যা	suicide	ātmahatyā	AtmahatyA	aatmahatyaa	AtmahatyA	attohotta	[at̪ːohɔt̪ːa]

Romanisation reference

The tables below list each glyph alongside the romanisations described above. The rightmost column gives an IPA transcription of the glyph's most common pronunciation in Standard Colloquial Bengali.

Vowels and miscellaneous
Symbol	BA^[5]	NLK	XHK	ITRANS	HK	Wiki	IPA
অ	a	a	a	a	a	ô/o	[ɔ]/[o]
আ	ā	ā	ā	A~aa	A	a	[a]
ই	i	i	i	i	i	i	[i]
ঈ	ī	ī	ī	I~ii	I	i	[i]
উ	u	u	u	u	u	u	[u]
ঊ	ū	ū	ū	U~uu	U	u	[u]
ঋ	r	ṛ	ṛ	RRi~R^i	R	ri	[ri]
এ	e	ē	e	e	e	e/æ	[e]/[ɛ]
ঐ	ai	ai	ai	ai	ai	oi	[oi]
ও	o	ō	o	o	o	o	[o]
ঔ	au	au	au	au	au	ou	[ou]
ঃ	ḥ	ḥ	ḥ	H	H	varies	varies
ং	ng	ṃ	ṁ	.m	M	ng	[ŋ]
ঁ	◌̃	ṃ	ɱ	.N	~	~	[~] (nasalization)
্য	y	y	y	y	y	varies	varies
্ব	w/v	v	v	v	v	varies	varies
ক্ষ	kṣ	kṣ	kṣ	x	kS	kkhô	[kʰːɔ]
জ্ঞ	jñ	jñ	jñ	GY	jJ	ggô	[ɡːɔ]
শ্র	śr	śr	śr	shr	zr	shrô	[ʃɾɔ]
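The ITRANS column above lists alternative spellings for some vowels (A~aa, I~ii, U~uu, RRi~R^i), which is why the same word can be typed in more than one way. One way to make such variants comparable, for example before a text search, is to rewrite them to a single form first. The Python sketch below assumes the first alternative in each pair as the canonical one; that choice, and the name canonical_itrans, are illustrative and not part of ITRANS itself.

```python
# Alternative ITRANS spellings taken from the ITRANS column of the vowel table.
# Each pair is (variant, chosen canonical form); the canonical choice is an assumption.
VARIANTS = [
    ("R^i", "RRi"),
    ("aa", "A"),
    ("ii", "I"),
    ("uu", "U"),
]

def canonical_itrans(text):
    """Rewrite the variant vowel spellings so that equivalent ITRANS strings compare equal."""
    for variant, canonical in VARIANTS:
        text = text.replace(variant, canonical)
    return text

# Two spellings of the same word now normalise to the same string.
print(canonical_itrans("saapa") == canonical_itrans("sApa"))
```

This plain string replacement treats every "aa", "ii", or "uu" as a long vowel, so it would need refinement for text in which two separate short vowels can stand side by side.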

Consonants
Symbol	BA^[5]	NLK	XHK	ITRANS	HK	Wiki	IPA
ক	k	k	k	k	k	kô	[kɔ]
খ	kh	kh	kh	kh	kh	khô	[kʰɔ]
গ	g	g	g	g	g	gô	[ɡɔ]
ঘ	gh	gh	gh	gh	gh	ghô	[ɡʱɔ]
ঙ	ng	ṅ	ṅ	~N	G	ngô	[ŋɔ]/[uõ]
চ	c	c	c	ch	c	chô	[tʃɔ]
ছ	ch	ch	ch	Ch	ch	chhô	[tʃʰɔ]
জ	j	j	j	j	j	jô	[dʒɔ]
ঝ	jh	jh	jh	jh	jh	jhô	[dʒʱɔ]
ঞ	ñ	ñ	ñ	~n	J	niô	[nɔ]
ট	ṭ	ṭ	ṭ	T	T	ţô	[ʈɔ]
ঠ	ṭh	ṭh	ṭh	Th	Th	ţhô	[ʈʰɔ]
ড	ḍ	ḍ	ḍ	D	D	đô	[ɖɔ]
ড়	ṛ	ḍ	ḏ	.D	P	ŗô	[ɽɔ]
ঢ	ḍh	ḍh	ḍh	Dh	Dh	đhô	[ɖʱɔ]
ঢ়	ṛh	ḍh	ḏh	.Dh	Ph	ŗhô	[ɽɔ]
ণ	ṇ	ṇ	ṇ	N	N	nô	[nɔ]
ত	t	t	t	t	t	tô	[tɔ]
থ	th	th	th	th	th	thô	[tʰɔ]
দ	d	d	d	d	d	dô	[dɔ]
ধ	dh	dh	dh	dh	dh	dhô	[dʱɔ]
ন	n	n	n	n	n	nô	[nɔ]
প	p	p	p	p	p	pô	[pɔ]
ফ	ph	ph	ph	ph	ph	fô/phô	[ɸɔ~pʰɔ]
ব	b	b	b	b	b	bô	[bɔ]
ভ	bh	bh	bh	bh	bh	bhô	[bʱɔ]
ম	m	m	m	m	m	mô	[mɔ]
য	y/j	ẏ	y	y	y	jô	[dʒɔ]
য়	ẏ	y	ẏ	Y	Y	yô/e	[e̯ɔ]/–
র	r	r	r	r	r	rô	[rɔ]
ল	l	l	l	l	l	lô	[lɔ]
শ	ś/sh	ś	ś	sh	z	shô	[ʃɔ]
ষ	ṣ/sh	ṣ	ṣ	Sh	S	shô	[ʃɔ]
স	s	s	s	s	s	sô	[sɔ]
হ	h	h	h	h	h	hô	[ɦɔ]
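As a rough illustration of how a letter-to-letter scheme such as NLK can be applied mechanically, the following Python sketch transliterates a few simple words. The consonant values are taken from the table above. The dependent vowel signs are not listed in this article's tables, so the sketch assumes that each sign takes the NLK value of the corresponding independent vowel and that a consonant with no sign carries the inherent vowel a. Conjuncts (anything involving the hasanta, including ্য and ্ব) are out of scope, and the name to_nlk is hypothetical.

```python
import unicodedata

# Subset of the consonant table above (NLK column).
CONSONANTS = {
    "ক": "k", "গ": "g", "ত": "t", "দ": "d", "ন": "n", "প": "p",
    "ব": "b", "ম": "m", "র": "r", "ল": "l", "শ": "ś", "ষ": "ṣ", "স": "s",
}

# Dependent vowel signs (assumption: same NLK value as the independent vowel).
VOWEL_SIGNS = {
    "\u09be": "ā",  # aa sign
    "\u09bf": "i",  # i sign
    "\u09c0": "ī",  # ii sign
    "\u09c1": "u",  # u sign
    "\u09c2": "ū",  # uu sign
    "\u09c7": "ē",  # e sign
}

# Marks that follow a vowel (NLK column of the vowel table above).
MARKS = {
    "\u0982": "ṃ",  # anusvara
    "\u0983": "ḥ",  # visarga
}

def to_nlk(word):
    """Transliterate a simple Bengali word (no conjuncts) letter by letter."""
    out = []
    needs_inherent_a = False  # True right after a consonant with no vowel sign yet
    for ch in word:
        if ch in CONSONANTS:
            if needs_inherent_a:
                out.append("a")
            out.append(CONSONANTS[ch])
            needs_inherent_a = True
        elif ch in VOWEL_SIGNS:
            out.append(VOWEL_SIGNS[ch])
            needs_inherent_a = False
        elif ch in MARKS:
            if needs_inherent_a:
                out.append("a")
            out.append(MARKS[ch])
            needs_inherent_a = False
        else:
            raise ValueError(f"character not covered by this sketch: {ch!r}")
    if needs_inherent_a:
        out.append("a")
    return unicodedata.normalize("NFC", "".join(out))

print(to_nlk("মন"))
print(to_nlk("বাংলাদেশ"))
```

The output keeps the final inherent a (mana rather than mon), which is the behaviour of the NLK column in the example table and shows again how far a pure transliteration can sit from the spoken form.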

Notes

[note 1] In Japanese, some debate exists as to whether to accent certain distinctions, such as Tōhoku vs Tohoku. Sanskrit is well standardized, as it has few speakers, and sound change is not a large concern.

References

[1] "Learning International Alphabet of Sanskrit Transliteration". Sanskrit 3 – Learning transliteration. Gabriel Pradiipaka & Andrés Muni. Archived from the original on 12 February 2007. Retrieved 20 November 2006.
[2] "ITRANS – Indian Language Transliteration Package". Avinash Chopde. Retrieved 20 November 2006.
[3] "Annex-F: Roman Script Transliteration" (PDF). Indian Standard: Indian Script Code for Information Interchange — ISCII. Bureau of Indian Standards. 1 April 1999. p. 32. Retrieved 20 November 2006.
[4] Jones 1801
[5] বাংলা একাডেমী ব্যবহারিক বাংলা অভিধান Bangla Academy Byaboharik Bangla Abhidhan (Bangla Academy Functional Bengali Dictionary) (16th reprint ed.). Dhaka 1000, Bangladesh: Bangla Academy. Nov 2012. p. আট্রিশ (তালিকা -৪). ISBN 984-07-5071-2.

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.
