This article describes why and how Dunian was created to be an equally global auxiliary language.

Languages of the world and world languages

The 15 most widely spoken languages in the world are listed in the table below. The table is ordered by the number of native speakers. The numbers are educated estimates based on data in Ethnologue and in Wikipedia.

Ranking Language Native speakers Non-native speakers
1 Mandarin Chinese 899 million 178 million
2 English 500 million 510 million
3 Spanish 500 million 70 million
4 Hindi-Urdu 438 million 214 million
5 Arabic 290 million 132 million
6 Portuguese 230 million 32 million
7 Bengali 226 million 19 million
8 Russian 160 million 115 million
9 Punjabi 146 million 1 million
10 Japanese 130 million 1 million
11 German 95 million 15 million
12 Hausa 85 million 65 million
13 French 80 million 192 million
14 Telugu 80 million 12 million
15 Malay-Indonesian 77 million 204 million

The numbers make it clear that Mandarin Chinese is by far the largest language in the world counted by native speakers, while English is the language with the greatest number of second language speakers. Both of their number of speakers reaches one billion. However they are followed by languages, whose speakers are also counted in hundreds of millions.

It is estimated that over 6000 different languages are spoken in the world.

However, native and non-native speakers of the 5 most widely spoken languages together add up to more than half of the total population of the world. Also these five languages represent different language typologies well and they also represent the five main divisions of international vocabularies.

Spelling and pronunciation

The most widely used alphabet

There are many writing systems in the world and a handful of them have become international. Chinese "hanzi" symbols are used in Japanese, where they are known as "kanji". The Arabic script is used for writing Urdu in Pakistan and India. However only the Roman alphabet has become truly global. It is used by numerous European and American languages but also by African language of Africa (including Hausa and Swahili) and by several notable languages in Asia, namely Turkish, Malay-Indonesian and Vietnamese.

Standard Chinese is written in Chinese characters, which form a system so complex that school children are taught to read the Roman alphabet first. This system of romanization is known as Pīnyīn.

Therefore it is clear why also Dunian is written in the Roman alphabet.

Only five vowels

English and Chinese have rich vowel inventories. English has 14-20 vowels, depending on the dialect. Standard Chinese has 9 vowels (and many more vowel qualities). These are comparatively high numbers considering that globally the average vowel inventory size is only 5-6 vowels.

Dunian has only 5 pure vowels: a, e, i, o, u. In this respect, it is close to languages such as Spanish, Swahili, Japanese and Indonesian, which have simple vowel inventories.

Common consonants

Dunian's consonant inventory is smaller than that of English and Mandarin. Majority of the consonant letters are pronounced in the same way in all three languages. The table below shows how consonant sounds are mapped from Dunian to English and Mandarin. Sounds that are in English or Mandarin but not in Dunian are enclosed in parenthesis.

Dunian English Mandarin Pinyin
Nasals m n ng m n ng m n ng
Stops p b t d k g p b t d k g p b t d k g
Liquids l r y w l r y w l r y w (yü)
Sibilants s z x s z sh (zh) s z* sh (x)
Fricatives f h f h (v th dh) f h
Affricates c j ch j ch zh (q j c)

Simple syllable patterns

Syllable patterns vary across languages. For example in Japanese the heaviest syllable type is CVN, where C is a consonant, V is a vowel and N is a nasal consonant. This gives Japanese a very vocalic sound. English, on the other hand, can have very heavy syllables, for example "sprints" (CCCVNCC).

Dunian is somewhere in the middle. Most of the words consist of simple CV syllables but also more complex syllables are allowed, especially for internationally known technical terms. For example kristal (crystal) is considered a complex word in Dunian. More complex loan words can be dealt with in two ways.

  1. Select a simpler variant from another language. For example the word for "project" is taken from the French version: proje.
  2. Break the consonant clusters by adding vowels. For example "sport" would become esporte. Many languages do something similar. See Portuguese "esporte" and Japanese "supōtsu" for comparison.

Regular spelling

English spelling is notoriously irregular. Pīnyīn was created more recently, in the 1950s, but unfortunately it also has some irregularities, simply because there are more sounds in spoken Chinese than there are letters in the Roman alphabet. Still, in comparison to English, Pinyin is very regular. For example the English rhymes my, sigh, lie, and rye would be written in Pīnyīn mai, sai, lai, rai. It is as simple as that!

Dunian can be spelled regularly because it has fewer speech sounds (24) than there are letters in the basic Roman alphabet (26). The alphabet of Dunian is:
a b c d e f g h i j k l m n o p r s t u w x y z

Dunian has perfect letter-to-sound correspondence. One letter stands for one sound only. One sound is represented by exactly one letter. Every word is pronounced as it is written.

Easy accent

Texts in Pīnyīn are loaded with accent marks, as in "Wǒmen yě huì shuō zhōngguòhuá." They mark tones. In Standard Chinese each syllable is pronounced in one of the four tones or in the unmarked neutral tone.

English doesn't have word tones but it has word stress. Word stress is variable in English, so the position of stress is unpredictable. In a written expression like "totally fantastic personnel", nothing shows that each word has the stress on a different syllable. If the stress was marked with an accent, it might look something like this: "tótally fantástic personnél".

Tones are hard to learn for people who are not used to them. Variable stress is hard to learn for people who are used to fixed stress. Neither word tone nor variable word stress are necessary in the world language.

Dunian has fixed stress. The stress falls on the syllable that is before the last consonant. Like this: mi wól lóga supér dúnia báxe.

Dunian doesn't have lexical tone.

Relatively simple structure

Languages can be categorized by two parameters:

  • Is a single word made of few or many parts?
  • Are those parts easy to separate or fused together?

The widely spoken languages can be divided into four types according to these parameters.

  1. Analytic languages – Words are made of few, distinct parts.
    • Mandarin Chinese
  2. Analytic fusional languages – Words are made of few, fused parts.
    • English
  3. Agglutinative languages – Words are made of many, distinct parts.
    • Japanese, Malay-Indonesian, Telugu
  4. Synthetic fusional languages – Words are made of many, fused parts.
    • Spanish, Portuguese, French
    • German, Russian
    • Hindi-Urdu, Bengali, Punjabi
    • Arabic, Hausa

Usually languages are a mixture of different types. For example, in English the plural can be formed in several different ways. Many a cat is an analytic phrase that consists of three separate words. Cats is an agglutinative word that consists of two distincts parts (cat and -s). Leaves is a fused word that consists of two parts (leaf and -s).

Dunian belongs to the first type. It is an analytic language. Its words consist of few parts and they are clearly separable. This is a good thing because it makes the language easy to learn and use in comparison to languages where words in average are long and consist of many parts.

Parts of word

The word is made of a root and optional affixes, which are attached to the root. Prefixing languages put affixes before the root and suffixing languages put affixes after the root. Some languages put affixes on both sides or even inside the root. Usually languages use several different ways. For example English uses both prefixes (ex. un-kind) and suffixes (kind-ly)

Suffixing languages are the most common type. Indo-European languages, Telugu, Chinese and Japanese are mostly suffixing.

Chinese has no inflection. Words are only combined into larger words. Some words have a special meaning when they appear as a part of a larger word. These so called bound morphemes are much like suffixes.

English, Spanish and Hindustani use mainly root and affix system. The meaning is changed by adding dependent parts before and after the root. For example "booklets" consists of root book and affixes -let (which adds meaning of smallness) and -s (which adds plural meaning). Most affixes can't appear alone, but they need a root.

Arabic uses transfixes (also known as the root and pattern system). The root consists of (usually three) consonants and the root is changed by inserting a pattern of vowels between them. Arabic also has many prefixes and suffixes for creating additional words.

Dunian uses root and affix system. The principles of this system are known to most people. New words can be created easily.

Word orders

Different word orders are used in the languages of the world. Some of the most important areas of word order are:

  • Sentence structure. Order of subject (S), verb (V) and object (O) in a transitive clause. The most common sentence structures are subject-verb-object (SVO) and object-verb-subject (SOV).
  • Order of numeral and noun. Cardinal numeral can either precede (NumN) or follow (NNum) the noun.
  • Order of adjective and noun. There are two possible orders
    1. Adjective precedes the noun (AdjN)
    2. Adjective follows the noun (NAdj)
  • Order of adposition and noun.
    1. Prepositions precede the noun.
    2. Postpositions follow the noun.
  • Order of relative clause and noun. The relative clause can either precede (RelN) or follow (NRel) the noun.

The table below shows what are the typical, unmarked word orders in several important world languages.

Language Sentence Numeral Adjective Relative Adposition
English SVO NumN AdjN NRel pre
Chinese SVO NumN AdjN RelN post
Spanish SVO NumN NAdj NRel pre
Hindi-Urdu SOV NumN AdjN RelN post
Arabic VSO both NAdj NRel pre
Russian SVO NumN AdjN NRel pre
French SVO NumN NAdj NRel pre
Indonesian SVO NumN NAdj NRel pre
Japanese SOV NumN AdjN RelN post
Swahili SVO NNum NAdj NRel pre

Also other word orders can be possible. For example in English, which normally uses the SVO order in declarative sentences, the object can be fronted in interrogative and relative clauses, like in "What did you say?"

We can see from the previous table that languages do not agree about word orders. There are three main types of languages:

  1. Languages where the main word comes first have typically SVO, NNum, NAdj, NRel and prepositions.
  2. Languages where the main word comes last have typically SOV, NumN, AdjN, RelN and postpositions.
  3. Languages with mixed or free word order.

The third type is the most attractive for the world auxiliary language, which has to welcome people with many different speaking habits. It doesn't make the language only versatile but also more interesting!

Dunian allows different word orders. This is achieved mainly by the verb endings -a and -u, which signal reverse word orders. In Dunian verbs can function also as adpositions (so called coverbs).

mi ama tu. (SVO)
mi tu amu. (SOV)
I love you.

mi salta supra meza gawi. (I jump, surpass the table high.)
mi salta gawi meza supru. (I jump, the high table is-surpassed.)
I jump over the high table.

World words

World's word heritage

English and Spanish carry the heritage of Latin and Greek, which have greatly influenced all languages of Europe, America and beyond, including French, Portuguese, Italian, German, Polish and Russian.

Mandarin Chinese carries the Sinitic heritage of Old Chinese, which has greatly influenced all other East Asian languages, including Japanese, Korean, Vietnamese and other varieties of Chinese.

Hindi-Urdu carries the heritage of Sanskrit, which has influenced also all other languages of South and South-East Asia, including Bengali, Punjabi, Telugu, Tamil, Burmese, Khmer, Thai, Malay and Indonesian.

Arabic and Hindi-Urdu (Urdu more than Hindi) carry the Perso-Arabic heritage, which has greatly influenced languages in Central, South-West, South and South-East Asia and North, West and East Africa, including Turkish, Persian, Bengali, Punjabi, Telugu, Indonesian, Hausa, Wolof, Amharic, Oromo, Somali and Swahili.

Western words

Typically Western words have this structure: prefix + root + suffixes. Usually the root ends in a consonant.

For example in Spanish, the root cort- (short) can be combined with affixes to produce different kinds of words.

  • Adjectives: corto (masc.), corta (fem.)
  • Noun: cortedad
  • Verb: acortar

Also English uses comparable affixes.

  • Adjectives: short, shorter, shortest
  • Nouns: shortness, shorty
  • Verb: shorten

Dunian borrows the roots of Western words. The goal is to select a form that sounds familiar to speakers of as many languages as possible.

Dunian root English Portuguese Spanish French German Russian
kort short curto corto court kurz korotkiy
now new novo nuevo nouveau neu novıy
marc march marcha marcha marche Marsch marš
post post (mail)(postal) posta poste Post počta

The suffixes of Dunian are applied on the roots. Here are some resulting Dunian words: korti (short), korte (shorty), korta (shorten), nowi (new), nowe (news), nowa (renovate).

Sinitic words

Sinitic words are words from Middle Chinese that are used today in languages of East Asia, including Chinese languages, Japanese, Korean and Vietnamese. Sinitic words are single-syllable words or compounds of syllabic elements.

Middle Chinese had lexical tone. Today Chinese languages and Vietnamese have tones but they are not the same as in Middle Chinese. Japanese and Korean are not tonal languages so they have ignored the tones. Also Dunian ignores the tones. (To ignore the tones is about the same as to ignore the stress accent or pitch accent of other source languages.)

Middle Chinese had unreleased stop consonants, which are usually written -p, -t and -k. Cantonese, Vietnamese and Korean keep them mostly as they were. Mandarin has deleted them. Japanese has added a vowel to ease pronunciation. Dunian keeps the final stops and adds a normal PoS suffix.

Dunian root Cantonese Mandarin Japanese Korean Vietnamese
suy sui shuǐ sui su thuỷ
jung zung zhōng chū jung trung
xim sam xīn shin sim tâm
mun mun mén mon mun (môn)
wen man wèn mon mun vấn
dug duk doku dog đọc
cut cœt chū shutsu chul xuất
sog sok suǒ saku sæg
ag ok è aku ag ác

Applying the suffixes of Dunian to Sinitic roots may seem unusual at first, but it is nothing new – Sinitic words are already inflected in Korean!

Easy sentences

In this section we will compare the sentence structures of Dunian with English and Chinese, the two most widely spoken languages of the world.

The normal sentence word order is subject-verb-object – just like in English and Chinese.

English: I love you, and you love me.
Dunian:  mi ama tu, tu ama mi.
Chinese: Wǒ ài nǐ, nǐ ài wǒ. (我爱你,你爱我。)

The verb bey is used when the object of an action comes first in the sentence. (This is the so called passive sentence.)

English: The apples were eaten.
Dunian:  pingo bey nyama.
Chinese: Píngguǒ bèi chī le. (苹果被吃了。)

Bey is a loan word from Standard Chinese, but it is also close to some uses of English "to be".

English: It can not be eaten.
Dunian:  ye no ken bey nyama.
Chinese: Tā bù néng bèi chī. (它不能被吃。)

Like Chinese, Dunian doesn't mark verbs with a word like "to".

English: I invite him to drink coffee.
Dunian:  mi cing ye nyama kafe.
Chinese: Wǒ qǐng tā hē kāfēi. (我请他喝咖啡。)

In Dunian and Chinese, nouns can be singular or plural depending on surrounding words. There's no plural ending like -s in English. Also verbs are not conjugated. One word, si, is used instead of am, is, are, was, were...

English: It is an apple.
Dunian:  ye si pingo.
Chinese: Tā shì píngguǒ. (他是苹果。)

English: They are apples.
Dunian:  yemen si pingo.
Chinese: Tāmen shì píngguǒ. (他们是苹果。)

Dunian - a global language