Language Families Catalog

A reference catalog of the world’s language families, their branches, geographic spread, approximate language counts, native speaker totals, and a representative sample of member languages with speaker counts. Living language inventories follow Ethnologue / Glottolog as of the mid-2020s. Isolates and major controversial macro-groupings included at the end.


I. Indo-European

Approximate scale: ~445 living languages; ~3.2 billion native speakers; spread across Europe, the Iranian plateau, the Indian subcontinent, and (via colonial expansion) the Americas, southern Africa, and Oceania.

BranchNotable LanguagesSpeakers (approx.)Notes
Anatolian (extinct)Hittite, Luwian, Lydian, PalaicEarliest attested IE branch (2nd millennium BCE)
Tocharian (extinct)Tocharian A (Agnean), Tocharian B (Kuchean)Tarim Basin; extinct by ~1000 CE
Indo-Iranian — Indo-AryanHindi-Urdu (590M), Bengali (270M), Marathi (95M), Gujarati (60M), Punjabi (130M), Sanskrit (liturgical)~1.5BDevanagari and related Brahmic scripts
Indo-Iranian — IranianPersian/Farsi (80M), Pashto (60M), Kurdish (30M), Tajik (8M), Balochi~200MPersian = Farsi/Dari/Tajik continuum
Indo-Iranian — NuristaniKati, Waigali<50kHindu Kush
GreekModern Greek (~13M), Ancient and Koine Greek (classical/liturgical)~13MContinuous attestation since c.1450 BCE Linear B
Italic — Latin/RomanceSpanish (490M), French (320M), Portuguese (260M), Italian (65M), Romanian (24M), Catalan (10M), Galician, Occitan, Sardinian, Romansh~1.1BAll descend from Vulgar Latin
CelticIrish (~170k L1), Welsh (~560k), Scottish Gaelic (~57k), Breton (~210k), Cornish (revived), Manx (revived)~1MTwo surviving subgroups: Goidelic and Brythonic
Germanic — WestEnglish (380M L1, 1.5B total), German (95M), Dutch (24M), Afrikaans (7M L1), Frisian, Yiddish (~600k), Luxembourgish~530M L1English dominant world lingua franca
Germanic — NorthSwedish (10M), Danish (6M), Norwegian (5M), Icelandic (330k), Faroese (70k)~22MMutually intelligible mainland Scandinavian
Balto-Slavic — SlavicRussian (150M), Polish (40M), Ukrainian (30M), Czech (10M), Belarusian, Bulgarian, Serbo-Croatian, Slovenian, Slovak, Macedonian~290MEast / West / South Slavic
Balto-Slavic — BalticLithuanian (3M), Latvian (1.5M)~4.5MMost archaic IE phonology preserved
AlbanianAlbanian (Tosk and Gheg, ~6M)~6MOwn branch since at least 2nd millennium BCE
ArmenianEastern Armenian (3M), Western Armenian (1M)~4MDistinct branch; own script (Mesropian, 405 CE)

II. Sino-Tibetan

Approximate scale: ~510 languages; ~1.4 billion speakers; China, Tibet, Himalaya, Myanmar, NE India.

BranchNotable LanguagesSpeakers (approx.)Notes
SiniticMandarin (920M L1), Cantonese / Yue (85M), Wu / Shanghainese (80M), Min Hokkien / Taiwanese (50M), Hakka (40M), Xiang (40M), Gan (20M)~1.3BTonal; written in Chinese characters
TibeticStandard Tibetan / Ü-Tsang (1.2M), Khams, Amdo, Dzongkha (640k, Bhutan official), Ladakhi, Sherpa~6MPre-1959 literary standard for Mahayana
BurmicBurmese (33M), Karen languages (~7M), Arakanese~40MBurmese script Pali-derived
Lolo-BurmeseYi / Lolo (~7M), Lisu, Lahu, Naxi~10MSW China; Naxi has unique pictographic Dongba script
Naga & Kuki-ChinNaga languages (40+), Mizo (700k), Meitei (1.8M)~5MNE India / Myanmar borderlands

III. Niger-Congo

Approximate scale: ~1,500 languages; ~700 million speakers; sub-Saharan Africa. Largest family by language count.

BranchNotable LanguagesSpeakers (approx.)Notes
BantuSwahili (~200M L2; 18M L1), Zulu (28M), Xhosa (19M), Shona (15M), Lingala (40M L2), Kikuyu (8M), Luganda (10M), Kinyarwanda (12M), Kirundi (12M), Sotho, Tswana~350MClass-noun system; spread south/east c.500 BCE–500 CE
MandeBambara (15M), Mandinka (1.3M), Soninke, Dyula~70MWest Africa; N’Ko script
AtlanticFula / Fulfulde (40M), Wolof (12M), Serer~70MSahel and West African coast
Volta-NigerYoruba (50M), Igbo (45M), Edo, Igala~110MNigeria-Benin
GurMooré (8M), Dagbani, Senufo~25MBurkina Faso, Mali, Ghana
KwaAkan / Twi (11M), Ewe (8M), Ga, Fon~30MGhana, Togo, Benin
Adamawa-UbangiSango (5M L2, lingua franca CAR), Mumuye, Mbum~10MCentral Africa
KruBété, Krahn, Klao~5MCôte d’Ivoire, Liberia

IV. Afro-Asiatic

Approximate scale: ~370 languages; ~500 million speakers; North Africa, Horn, Middle East.

BranchNotable LanguagesSpeakers (approx.)Notes
SemiticArabic (420M total across varieties), Hebrew (9M), Maltese (520k), Amharic (32M), Tigrinya (9M), Aramaic varieties (~750k), Syriac (liturgical)~470MTriconsonantal root system
Berber (Tamazight)Tashelhit (~8M), Kabyle (6M), Tarifit, Tuareg~30MTifinagh script revived in Morocco/Algeria
ChadicHausa (50M L1, 80M total), Bole, Margi~60MNorthern Nigeria, Niger; lingua franca
CushiticSomali (22M), Oromo (37M), Afar, Sidamo, Beja~75MHorn of Africa
Egyptian (extinct)Ancient Egyptian, Coptic (liturgical)3000 BCE – 17th c CE
OmoticWolaitta, Gamo, Aari~5MSW Ethiopia; status as Afro-Asiatic debated

V. Austronesian

Approximate scale: ~1,250 languages; ~390 million speakers; Madagascar to Easter Island.

BranchNotable LanguagesSpeakers (approx.)Notes
Formosan (~14 subfamilies)Atayal, Paiwan, Amis, Bunun~200kAboriginal Taiwan — homeland of family
Malayo-Polynesian — WesternMalay/Indonesian (280M total), Filipino/Tagalog (28M L1, 80M total), Cebuano (20M), Javanese (80M), Sundanese (40M), Madurese (15M), Malagasy (25M)~350MInsular SE Asia + Madagascar
Malayo-Polynesian — CentralBali Sasak (4M), Manggarai, Tetum (1.2M; East Timor co-official)~10MLesser Sundas
Malayo-Polynesian — Eastern (Oceanic)Fijian (650k), Samoan (510k), Tongan (190k), Maori (185k), Hawaiian (24k), Tahitian (68k), Tok Pisin (4M L2; Papua New Guinea)~6MPacific from Fiji to Hawaii / Easter Island

VI. Dravidian

Approximate scale: ~85 languages; ~250 million speakers; southern India, Sri Lanka, pockets in Pakistan.

BranchNotable LanguagesSpeakers (approx.)Notes
South DravidianTamil (80M), Kannada (45M), Malayalam (38M), Tulu (2M)~165MTamil Sangam literature from 3rd c BCE
South-CentralTelugu (95M), Gondi, Kui, Kuvi~100MTelugu = largest Dravidian by population
CentralKolami, Naiki, Parji<500kMaharashtra-AP borderlands
NorthernBrahui (2.4M; Balochistan), Kurukh (2M), Malto~5MBrahui = isolated Pakistani enclave

VII. Trans-New Guinea

Approximate scale: ~480 languages; ~3 million speakers; PNG highlands and adjacent.

BranchNotable LanguagesSpeakers (approx.)
EnganEnga (230k), Huli (150k)~400k
Chimbu-WahgiMelpa, Kuman, Wahgi~300k
MadangAmele, Gum~100k
Finisterre-HuonSelepet, Kâte~100k

VIII. Altaic (controversial macrofamily) — usually treated as four separate families

FamilyNotable LanguagesSpeakers (approx.)Notes
TurkicTurkish (88M), Azerbaijani (24M), Uzbek (35M), Kazakh (13M), Uyghur (11M), Tatar (5M), Turkmen (7M), Kyrgyz (5M)~200MVowel harmony; agglutinative
MongolicMongolian / Khalkha (5M), Buryat (270k), Kalmyk (155k)~6MTraditional Mongolian script + Cyrillic
TungusicManchu (near-extinct; 20 fluent speakers), Evenki (4k), Nanai<20kSeverely endangered
KoreanicKorean (77M)~77MTreated as isolate or sole member of Koreanic
JaponicJapanese (125M), Ryukyuan varieties (Okinawan, Amami, Miyako, Yaeyama)~127MRyukyuan UNESCO-endangered

IX. Uralic

Approximate scale: ~38 languages; ~25 million speakers; Hungary, Finland, Estonia, NW Russia, Siberia.

BranchNotable LanguagesSpeakers (approx.)
FinnicFinnish (5.4M), Estonian (1.1M), Karelian (40k), Veps, Livonian~7M
SámiNorthern Sámi (25k), Lule, Inari, Skolt~30k total
UgricHungarian (13M), Khanty (10k), Mansi (1k)~13M
SamoyedicNenets (22k), Selkup, Nganasan~30k

X. Tai-Kadai (Kra-Dai)

Approximate scale: ~95 languages; ~80 million speakers; SE Asia and southern China.

BranchNotable LanguagesSpeakers (approx.)
TaiThai / Siamese (70M total), Lao (30M total), Shan (5M), Zhuang (16M)~95M
Kam-SuiKam (Dong, 2.6M)~3M
HlaiHlai (Li, 760k; Hainan)~800k

XI. Austroasiatic

Approximate scale: ~170 languages; ~120 million speakers; mainland SE Asia, scattered India.

BranchNotable LanguagesSpeakers (approx.)
VieticVietnamese (86M), Mường (1.5M)~88M
KhmerKhmer (17M), Northern Khmer~18M
MonMon (1M), Nyah Kur~1M
MundaSantali (7.6M), Mundari, Ho, Korku~13M

XII. Hmong-Mien (Miao-Yao)

Approximate scale: ~38 languages; ~10 million speakers; southern China, SE Asia, US diaspora.

BranchNotable LanguagesSpeakers (approx.)
HmongicHmong / Miao (5M; includes White, Green)~5M
MienicIu Mien / Yao (1.2M)~1M

XIII. North American Indigenous Families

FamilyGeographic SpreadNotable LanguagesSpeakers
Eskimo-AleutArctic from Siberia to GreenlandInuktitut (39k), Kalaallisut / Greenlandic (57k), Central Yupik (10k), Aleut (150)~110k
Na-DeneNW North America + SW USNavajo (170k), Apache (~3k), Tlingit (100), Carrier, Slavey, Chipewyan~175k
Dene-Yeniseian(proposed by Vajda 2010) linking Na-Dene to Yeniseian (Siberia)Ket (210), Yug (extinct)
AlgicAlgonquian + Wiyot, YurokCree (96k), Ojibwe (50k), Blackfoot (5k), Cheyenne (1.7k), Lenape, Mi’kmaq (8k)~165k
IroquoianNE North AmericaCherokee (2k), Mohawk (2.4k), Onondaga, Seneca, Oneida, Cayuga, Tuscarora~5k
Siouan-CatawbanGreat Plains, SELakota (2k), Dakota (290), Crow (3.5k), Hidatsa, Mandan (5)~6k
SalishanPacific NWHalkomelem, Lushootseed, Lillooet~2k
SahaptianPlateauSahaptin (100), Nez Perce (200)~300
WakashanVancouver Island, BC coastKwak’wala, Nuu-chah-nulth, Haisla~1k
ChinookanColumbia RiverWasco-Wishram (10)<50
Uto-AztecanSW US, Mexico, Central AmericaNahuatl (1.7M), Hopi (6k), Comanche (100), Pima/Tohono O’odham (16k), Shoshone (1k), Tarahumara (70k)~2M
MayanMesoamericaYucatec (770k), K’iche’ (1.7M), Cakchiquel (450k), Mam (600k), Q’eqchi’ (800k), Tzeltal (560k), Tzotzil (600k)~6M
Oto-MangueanMexicoZapotec (490k), Mixtec (520k), Otomi (290k), Mazahua, Mazatec, Chinantec~2M
MisumalpanNicaragua, HondurasMiskito (180k), Sumo~200k

XIV. South American Indigenous Families

FamilyGeographic SpreadNotable LanguagesSpeakers
QuechuanAndes from Ecuador to ArgentinaSouthern Quechua (5M), Cuzco, Ayacucho, Ancash~7M
AymaranBolivia, Peru, ChileAymara (1.7M), Jaqaru~2M
ArawakanAmazon, CaribbeanGarifuna (200k), Wayuu (400k), Achagua, Asháninka~700k
Tupi-GuaraniBrazil, ParaguayGuaraní (6.5M; Paraguay co-official), Tupinambá (extinct), Nheengatu~7M
CaribNorthern S AmericaCarib, Pemon, Macushi~80k
Macro-JêBrazilian CerradoKayapó, Xavante, Krenak~30k
Pano-TacananW AmazonShipibo-Conibo (35k), Cashinahua~50k

XV. Khoisan (defunct macrofamily; now Tuu + Kx’a + Khoe-Kwadi + isolates)

Family / LanguageGeographic SpreadSpeakersNotes
Khoe-KwadiBotswana, NamibiaKhoekhoe / Nama (250k), Naro~300k
Kx’aBotswana, NamibiaJu|‘hoan (15k), |‘Hoan~20k
TuuSouthern Africa!Xóõ (4k), ǂHõã~5k
Sandawe (isolate)TanzaniaSandawe (60k)Click consonants
Hadza (isolate)TanzaniaHadza (1k)Click consonants; no demonstrated relatives

XVI. Other Smaller Families

FamilyGeographic SpreadNotable LanguagesSpeakers
Northeast CaucasianDagestan, ChechnyaChechen (1.4M), Avar (800k), Lezgian (650k), Dargwa~3.5M
Northwest CaucasianNW CaucasusKabardian (1.6M), Abkhaz (130k), Adyghe (590k)~2.5M
KartvelianGeorgiaGeorgian (4M), Mingrelian (350k), Svan (15k), Laz (30k)~4.5M
Dene-Yeniseian (proposed)Siberia + N AmericaKet (210), Yug (extinct) — Yeniseian side only<500
Nilo-Saharan (debated grouping)Sub-Saharan AfricaLuo (5M), Dinka (4.5M), Nuer (1.5M), Maasai (1.5M), Songhay, Kanuri~50M
Kresh-Aja, Kuliak, Saharan, MabanSahel(sub-groupings of Nilo-Saharan)

XVII. Language Isolates

Languages with no demonstrated genetic relatives:

IsolateRegionSpeakersNotes
Basque (Euskara)W Pyrenees750kPre-IE Europe survivor
BurushaskiN Pakistan110kHunza, Nagar valleys
KoreanKorean peninsula77MSometimes grouped as Koreanic
JapaneseJapan125MSometimes grouped as Japonic with Ryukyuan
AinuHokkaido~10Severely moribund
Nivkh (Gilyak)Sakhalin, Amur200
KusundaNepal1–2Critically endangered
PirahãBrazilian Amazon250–380Recursion debate (Everett 2005)
ZuniNew Mexico9k
Purépecha (Tarascan)Michoacán130k
Sumerian (extinct)Mesopotamiac.3100 – 2nd c BCE
Elamite (extinct)SW Iranc.2800 – 4th c BCE
Etruscan (extinct)Etruriac.700 BCE – 1st c CE
Hurrian-Urartian (extinct)Anatolia, S Caucasusc.2nd millennium BCE
Hattic (extinct)AnatoliaPre-Hittite Anatolia
Iberian (extinct)E Iberian peninsulaPre-Roman Spain
Tartessian (extinct)SW Iberia
Meroitic (extinct)Nubia3rd c BCE – 5th c CE

XVIII. Macro-Groupings (controversial / rejected)

ProposalProposer / DateStatus
Nostratic (IE + Uralic + Altaic + Kartvelian + Dravidian + Afro-Asiatic)Pedersen 1903; Illich-Svitych 1960s; BomhardHeld by a minority; standard practice treats families as separate
Eurasiatic (IE + Uralic + Altaic + Eskimo-Aleut + others)Greenberg, Indo-European and Its Closest Relatives (2000–02); RuhlenMarginal acceptance
Amerind (almost all Native American languages outside Na-Dene and Eskimo-Aleut)Greenberg 1987Widely rejected; methodology criticized by Campbell, Goddard, Kaufman
Dene-Yeniseian (Na-Dene + Yeniseian)Vajda 2010Cautiously accepted by many; first widely recognized New World–Old World genetic link
Sino-Caucasian / Dene-CaucasianStarostin; BengtsonSpeculative
Austric (Austroasiatic + Austronesian + Tai-Kadai + Hmong-Mien)Schmidt 1906Largely abandoned

XIX. Sign Languages

Sign languages are full natural languages with their own genealogies, not derivatives of spoken languages.

FamilyNotable LanguagesNotes
French Sign Language familyLSF, ASL (American), LIBRAS (Brazilian), Irish SL, Russian SLSpread from de l’Épée’s Paris school (1760s)
British, Australian, New Zealand SL (BANZSL)BSL, Auslan, NZSLMutually intelligible
Japanese SL familyJSL, Korean SL, Taiwanese SL
German SL familyDGS, Polish SL, Israeli SL (partly)
Village sign languagesKata Kolok (Bali), Adamorobe SL (Ghana), Ban Khor SL (Thailand)Emerged in communities with high rates of hereditary deafness
Home signIdiosyncratic, family-internal

XX. Pidgins, Creoles, and Mixed Languages

LanguageBase / LexifierRegionSpeakers
Haitian CreoleFrenchHaiti12M
Tok PisinEnglishPNG4M (L2)
BislamaEnglishVanuatu200k
Jamaican PatoisEnglishJamaica3M
KrioEnglishSierra Leone7M (L2)
Sranan TongoEnglishSuriname500k
PapiamentoPortuguese/SpanishAruba, Curaçao, Bonaire320k
ChavacanoSpanishZamboanga, Philippines700k
MichifFrench + CreeMétis, North America1k
Media LenguaSpanish + QuechuaEcuador<1k

Adjacent