Sociolinguistics and Applied Linguistics
Sociolinguistics studies language in its social setting — how variation correlates with class, region, gender, age, ethnicity, and identity; how attitudes and ideologies shape language use; how communities deploy multiple varieties in bilingual and multilingual repertoires. Applied linguistics turns linguistic knowledge to practical ends — language teaching, testing, policy, lexicography, language disorders, forensic identification. The two fields overlap heavily in domains such as language policy, minority languages, and education.
The Variationist Tradition — Labov
William Labov (1927–2024) founded modern variationist sociolinguistics with a sequence of empirical studies that established its methods.
Martha’s Vineyard (1963)
Labov’s 1961 fieldwork on Martha’s Vineyard examined the centralization of the diphthongs /ay/ and /aw/ ([əɪ], [əʊ] for canonical [aɪ], [aʊ]). He showed that the centralized variants were strongest among speakers — particularly young fishermen — who identified strongly with the island’s traditional community against the influx of summer tourists. Variation was not random but socially patterned and ideologically charged.
The Social Stratification of English in New York City (1966)
Labov’s PhD turned to the postvocalic /r/ in New York City — a variable whose presence or absence had become socially diagnostic by mid-century. His famous department-store study (1962, published 1966) compared three Manhattan department stores — Saks Fifth Avenue (high-status), Macy’s (middle), and S. Klein (working-class). Researchers asked clerks “Where are the women’s shoes?” (rehearsed to be on the fourth floor), then a confused “Excuse me?” to elicit a careful repetition. Postvocalic /r/ rates rose steeply with store status; the careful (emphatic) repetition raised /r/ across all stores, but most dramatically in Macy’s — consistent with hypercorrection by the lower middle class. Labov’s data established that:
- Variation is systematic, not free.
- Different social groups have different rates of variants.
- Style shifting tracks attention to speech.
- The lower middle class is the locus of overt prestige norms (and hypercorrection).
Philadelphia and the Atlas of North American English
Labov’s Philadelphia neighborhood study (1970s–80s, Principles of Linguistic Change vols I 1994, II 2001, III 2010) documented change in progress with longitudinal recordings. The Atlas of North American English (Labov, Sharon Ash, Charles Boberg, Mouton 2006) mapped dialect regions and characterized chain shifts including the Northern Cities Shift (urban Great Lakes — /æ/ raises, /ɑ/ fronts, /ɔ/ lowers, /ɛ/ retracts, /ʌ/ backs, /ɪ/ centralizes) and the Southern Shift.
Sociolinguistic Variables, Variants, and Methods
A sociolinguistic variable is a linguistic element with two or more variants alternating in conditioned distribution. Variables can be phonetic (postvocalic /r/, /th/-stopping think → tink, /ng/-fronting walking → walkin’), morphosyntactic (third-person /s/ deletion, copula deletion, double negation, ain’t), or lexical.
Methods:
- Sociolinguistic interview — one-on-one recording, structured for elicitation of different styles: reading word lists (most careful), reading passages, interview speech, free conversation, narratives of personal experience (“Have you ever been in a situation where you thought you might be killed?”), all designed to reduce attention to speech in stages.
- The observer’s paradox (Labov): “to observe how people speak when they are not being observed.” Researchers attempt to mitigate via group recordings, long fieldwork, in-group fieldworkers, and naturalistic settings.
- Apparent-time analysis — comparing different age cohorts in a single synchronic sample as a proxy for change over time.
- Real-time analysis — re-interviewing the same individuals or community over years/decades (Panama, Tagliamonte’s Toronto, Sankoff & Wagner’s Montreal).
- Quantitative analysis using Varbrul / Goldvarb / Rbrul to assess significance and constraint hierarchies; recently mixed-effects logistic regression (
lme4in R).
Major Sociolinguists After Labov
- Peter Trudgill Sociolinguistics: An Introduction (1974, multiple editions) and the Norwich studies; later Sociolinguistic Typology (2011) on contact-driven simplification.
- Walt Wolfram — Detroit African American English (Wolfram 1969 A Sociolinguistic Description of Detroit Negro Speech); founding director of the North Carolina Language and Life Project.
- Penelope Eckert — Detroit-area high school study Jocks and Burnouts (1989); Linguistic Variation as Social Practice (2000); communities of practice; the third-wave sociolinguistics that treats variants as semiotic resources for identity construction.
- Lesley and James Milroy — Belfast working-class study (Language and Social Networks 1980), introducing social network analysis (density, multiplexity).
- Sali Tagliamonte — Toronto research, Variationist Sociolinguistics (2012), Teen Talk (2016); innovation in adolescent and quotative variants (be like).
- Natalie Schilling-Estes — Smith Island, OK, and style as design.
African American Vernacular English (AAVE)
AAVE (also African American English, AAE; Black English Vernacular, BEV; Ebonics — though the latter term, coined Robert Williams 1973, became politicized) is a distinct, rule-governed variety of English spoken by many but not all African Americans, especially in informal contexts. Documentation: Labov Language in the Inner City (1972); John Rickford African American Vernacular English (1999); Salikoko Mufwene, John Baugh, Geneva Smitherman, Arthur Spears (e.g., on come of indignation, be done).
Key AAVE features:
- Habitual be (invariant be): He be working “he habitually works” — distinct from non-habitual He working “he is working now.”
- Copula deletion in present where Standard English contracts: She nice (cf. She’s nice).
- Resultative been: I been told you “I told you long ago and it’s still relevant.”
- Multiple negation / negative concord: Ain’t nobody got no time for that.
- Ain’t as multipurpose negator (= isn’t, haven’t, doesn’t, didn’t).
- Third-person -s variable absence: She walk to school.
- Consonant cluster simplification: test → tes’.
- /r/-lessness (shared with non-rhotic varieties).
The 1996 Oakland Ebonics resolution sparked national debate. Linguists overwhelmingly affirmed AAVE’s status as a rule-governed dialect of English; the Linguistic Society of America issued an official statement. AAVE’s relation to other English varieties (creole substrate hypothesis vs Anglicist hypothesis vs ongoing divergence) remains debated (Mufwene, Bailey, Thomas).
Codeswitching and Bilingualism
Codeswitching is the alternation between two or more languages or varieties within a single conversation or even sentence. Shana Poplack Sometimes I’ll start a sentence in Spanish y termino en español (1980) demonstrated that codeswitching is structurally regular — constrained by the Equivalence Constraint (switches occur at points where the surface order of both languages aligns) and the Free Morpheme Constraint (no switching within a word boundary). Subsequent frameworks: Carol Myers-Scotton Duelling Languages (1993, Matrix Language Frame model), Pieter Muysken Bilingual Speech (2000, distinguishing insertion, alternation, congruent lexicalization).
Language Attitudes and Ideology
Michael Silverstein Language Structure and Linguistic Ideology (1979) and later work; Judith Irvine and Susan Gal Language Ideology and Linguistic Differentiation (2000); Kathryn Woolard Codeswitching, Identity, and Catalan in Catalonia (1989). The field analyzes how speakers’ beliefs about language — including beliefs about “standard,” “correct,” “broken,” “polite” forms — interact with and shape variation. Matched-guise experiments (Wallace Lambert, McGill 1960s) ask listeners to rate the same speaker producing different varieties; ratings expose attitudes toward varieties separable from the speakers themselves.
Sound Change in Progress
Major ongoing changes documented in detail:
- Northern Cities Shift — see above.
- California / West Coast Shift (Penelope Eckert, Robert Kennedy): /æ/ retraction before nasals; /uː/ fronting; the cot-caught merger consolidating across Western US.
- Cot-caught merger — /ɔ/ ↔ /ɑ/ merging across much of US and Canada; complete in Canada and Western US; advancing in the East.
- Pin-pen merger — /ɪ/ and /ɛ/ before nasals merge in much of Southern US.
- Mary-marry-merry merger — three-way merger in most of US; preserved as distinct in NYC and parts of Philadelphia.
Chain shifts are coordinated movements where one phoneme’s shift creates a hole filled by another’s shift, etc. (Labov 1994 Principles of Linguistic Change: Internal Factors). Convergence brings two varieties closer (dialect leveling); divergence separates them (some urban AAVE features have diverged from white vernaculars since the mid-20th c, per Labov; the claim is contested).
English Worldwide
Native varieties (ENL) — UK, US, Canada, Australia, NZ, South Africa, Ireland. Second-language (ESL) — India, Nigeria, Singapore, Philippines, where English has institutional status. Foreign language (EFL) — most of continental Europe, East Asia, Latin America. Lingua franca — international communication, including English as a Lingua Franca (ELF) research (Barbara Seidlhofer, Jennifer Jenkins).
World Englishes (Braj Kachru The Other Tongue 1982, The Alchemy of English 1986) divides into Inner Circle (ENL countries), Outer Circle (ESL, postcolonial English-using countries), Expanding Circle (EFL). Each Outer Circle variety — Indian English, Singapore English, Nigerian English, Philippine English — has developed distinctive phonological, lexical, and syntactic features.
Multiethnolects and Urban Vernaculars
Multicultural London English (MLE) — Jenny Cheshire, Paul Kerswill, and collaborators 2008+ — a new variety emerging in inner London since the 1980s, drawing on Cockney, Caribbean, South Asian, and West African Englishes plus features of Estuary English. Similar developments across Europe: Kiezdeutsch (Germany, Heike Wiese), Rinkebysvenska (Sweden), Multicultural Paris French, Verlan. These are sometimes called multiethnolects to distinguish from ethnically homogeneous varieties.
Gender and Language
Robin Lakoff Language and Woman’s Place (1975) was foundational: women’s speech, she argued, is marked by tag questions, hedges, intensifiers, indirectness, and lexical hyper-femininity, reflecting a position of relative powerlessness. Deborah Tannen You Just Don’t Understand (1990) popularized “different cultures” frameworks. Deborah Cameron The Myth of Mars and Venus (2007) critiqued popular gender-difference claims as overstated and ideologically loaded. Penelope Eckert and Sally McConnell-Ginet Language and Gender (2003, 2nd ed 2013) developed the communities of practice framework, showing gender is enacted through participation in specific local practices rather than a global binary.
Singular they — increasingly accepted in published style guides (APA 7th ed 2019; Chicago Manual of Style 17th ed 2017 — for known nonbinary individuals; widely accepted for unknown referent across all major guides). Nonbinary pronouns beyond English: see Greville Corbett Gender (1991) for gender systems; Spanish elle and the morpheme -e for gender-neutral inflection in progressive usage; Swedish hen (proposed 1966, mainstream 2010s) added to the Swedish Academy’s wordlist 2015.
LGBTQ and Queer Linguistics
- Lavender Linguistics (William Leap, 1996+) — linguistic features of LGBTQ communities.
- Queer linguistics (Don Kulick, Deborah Cameron, Robin Queen) — language and sexuality more broadly.
- Identifying “gay speech” phonetically (Benjamin Munson and collaborators) — listeners reliably identify perceived sexual orientation from voice; the perceptual cues partially overlap with the GAY /s/ (longer, higher peak frequency) and other features.
Language Policy and Planning
Robert Cooper Language Planning and Social Change (1989) and Bernard Spolsky Language Policy (2004) distinguish:
- Status planning — official-language designation, domains of use.
- Corpus planning — codification (dictionaries, grammars, orthography).
- Acquisition planning — teaching, education.
- Prestige planning — image and ideology.
Major historical and contemporary cases:
- English-Only US — recurring legislative pushes (1980s+); now adopted as official by ~30 US states. The US has no federally official language.
- Quebec’s Bill 101 / Charter of the French Language (1977) — Quebec French status, education, signage; strengthened repeatedly.
- Welsh Language Acts (1967, 1993, 2011) — Welsh equal status to English in Wales; Welsh-medium education; Welsh Language Commissioner.
- Catalan revival post-Franco — Catalan as co-official in Catalonia; Catalan-medium schooling.
- Brazilian Portuguese vs European Portuguese orthographic agreements (1990, 2009).
- Singapore’s bilingual policy — English + mother tongue (Mandarin, Malay, Tamil).
- South Africa — 11 official languages (1996 Constitution); language equity in practice unequal.
- India — Hindi and English as Official Languages of the Union; 22 scheduled languages; ongoing tensions over Hindi imposition.
Sign Languages
William Stokoe Sign Language Structure (1960) established that American Sign Language (ASL) is a full natural language with its own grammar — not a manual version of English. Sign languages have all the design features of spoken language: arbitrariness, productivity, displacement; they exhibit phonology (parameters of handshape, location, movement, palm orientation, non-manual features); morphology (classifier predicates, agreement verbs); and syntax (topic-comment structure, role shift).
Major sign languages — none mutually intelligible across communities despite shared modality:
- ASL — American Sign Language (US, English-speaking Canada, parts of Africa, the Philippines); ~500k–1M signers
- BSL — British Sign Language; ~100k signers
- LSF — Langue des Signes Française; ~100k signers; historically influential — American Thomas Hopkins Gallaudet brought Laurent Clerc (deaf teacher of LSF) to the US in 1817, making ASL historically related to LSF, not BSL
- Auslan — Australian Sign Language (related to BSL)
- JSL — Japanese Sign Language
- ISL — Israeli Sign Language; Indian Sign Language (the abbreviation is overloaded)
- NGT — Sign Language of the Netherlands
- DGS — German Sign Language
Nicaraguan Sign Language (Idioma de Señas de Nicaragua, ISN) — emerging from the establishment of schools for the deaf in Managua from 1977 onward — has been documented as a sign language being created in real time across generations (Judy Kegl, Ann Senghas, Marie Coppola).
Deaf community and Deafhood (Paddy Ladd 2003) — sociocultural identity of signing Deaf communities, contesting medicalized views of deafness. The cochlear implant debate divides communities — many oralist parents and audiologists view implants as remediation; many Deaf community members view them as a threat to a vibrant linguistic minority.
First Language Acquisition
Roger Brown A First Language (1973) established the canonical stages of English child language: one-word stage (~12 months), two-word stage (~18 months), the use of grammatical morphemes in order (-ing, in, on, plural -s, irregular past, possessive -s, copula, articles, regular past -ed, third-person -s) tracking Brown’s mean length of utterance (MLU). Cross-linguistic stages: babbling → first words → vocabulary spurt ~18 months → grammar emerges ~24 months → adult-like syntax by ~5 years.
Noam Chomsky’s innateness argument — children acquire complex grammars from impoverished and noisy input (“poverty of the stimulus”), making innate universal grammar a necessary precondition. Steven Pinker The Language Instinct (1994) popularized the nativist view. Michael Tomasello Constructing a Language (2003) and others develop a usage-based alternative: children learn grammar via item-based learning, abstraction over patterns, and cognitive-pragmatic skills shared with non-linguistic cognition. The debate is ongoing.
Critical Period Hypothesis
Eric Lenneberg Biological Foundations of Language (1967) proposed that language acquisition has a critical period ending around puberty; after which native-like attainment becomes very difficult. Evidence: feral and deprived children (Genie, Victor of Aveyron) failed to acquire full grammar after late onset; second-language acquisition typically shows persistent foreign accent and grammatical traces in late starters; sensitive-period effects in sign language acquisition (Mayberry & Eichen 1991 on age of first sign-language exposure and adult ASL proficiency). Modern research finds the picture more nuanced — gradual decline rather than sharp cutoff; different sub-systems (phonology, morphosyntax, lexicon) decline at different rates.
Second Language Acquisition
- Stephen Krashen — Monitor Theory, Input Hypothesis (comprehensible input + 1), distinction between acquisition and learning.
- Michael Long — Interaction Hypothesis; negotiated interaction supports SLA.
- Manfred Pienemann — Processability Theory; SLA proceeds through staged acquisition of processing capacities.
- Susan Gass and Larry Selinker Second Language Acquisition (textbook, multiple editions); Selinker introduced interlanguage (1972) — the systematic, evolving language system of the L2 learner.
- Lourdes Ortega Understanding Second Language Acquisition (2009).
Bilingual education — dual immersion programs, transitional bilingual education, content-and-language-integrated learning (CLIL). Evidence (Wayne Thomas and Virginia Collier, A National Study of School Effectiveness for Language Minority Students’ Long-Term Academic Achievement 2002) supports robust dual-language education over English-only.
Language Disorders
- Specific Language Impairment (SLI) — language deficits despite typical hearing, intelligence, and environmental opportunity. The KE family (Hurst et al. 1990, Lai et al. 2001) traced an SLI phenotype to a mutation in FOXP2, sparking the “language gene” headlines (mostly overstated; FOXP2 is necessary for vocal learning across species, not a “grammar gene”).
- Dyslexia — developmental reading disorder; ~5–10% of populations; strong phonological component.
- Aphasia — language loss from brain injury or stroke. The classical Broca-Wernicke-Geschwind model (Paul Broca 1861 Tan patient with left frontal lobe damage and non-fluent aphasia; Carl Wernicke 1874 fluent posterior aphasia with comprehension deficit; Norman Geschwind 1965 connectionist synthesis) was hugely influential. Modern neuroimaging (Hagoort, Friederici, Hickok-Poeppel) reveals a more complex, distributed dual-stream architecture — dorsal stream for articulation/syntax, ventral stream for comprehension/semantics.
Applied Linguistics — Teaching and Testing
TESOL — Teaching English to Speakers of Other Languages (the global field). TEFL / TESL — Teaching English as a Foreign / Second Language. Major teaching frameworks: grammar-translation (traditional), audiolingual (1940s–50s, behaviorist), communicative language teaching (Hymes’ communicative competence 1972; Canale & Swain 1980), task-based learning, content-based instruction.
Major language tests:
- TOEFL — Test of English as a Foreign Language (ETS, 1964–; iBT format since 2005)
- IELTS — International English Language Testing System (Cambridge, British Council, IDP)
- Cambridge ESOL exams — KET, PET, FCE, CAE, CPE
- TOEIC — workplace English
- CEFR — Common European Framework of Reference for Languages (Council of Europe 2001): A1 (Beginner) → A2 → B1 → B2 → C1 → C2 (Mastery); now used worldwide
Corpus Linguistics
A corpus is a large, principled collection of texts for systematic study. Foundational corpora:
- Brown Corpus (Kučera and Francis 1964, Brown University) — 1M words of American English; first balanced electronic corpus.
- LOB Corpus — British counterpart (Lancaster-Oslo/Bergen Corpus 1978).
- BNC — British National Corpus (1991–1994; 100M words; British English written and spoken).
- COCA — Corpus of Contemporary American English (Mark Davies, BYU; ~1B words; updated continuously since 2008).
- OEC — Oxford English Corpus (~2B+ words; Oxford University Press internal).
- Sketch Engine (Kilgarriff et al.) — corpus query tool with word sketches, thesaurus, multiple corpora across languages.
- AntConc (Laurence Anthony, Waseda University) — free concordance program for student/researcher use.
Lexicography
- OED — Oxford English Dictionary, first published in fascicles 1884–1928 (originally A New English Dictionary on Historical Principles), 2nd edition 1989, ongoing OED Online revision launched 2000.
- Merriam-Webster — Noah Webster’s An American Dictionary of the English Language (1828); current line of unabridged and learner dictionaries.
- Longman Dictionary of Contemporary English (LDOCE), Cambridge Advanced Learner’s Dictionary, Macmillan English Dictionary for Advanced Learners — major learner dictionaries based on the controlled defining vocabulary tradition (Michael West).
- Collins COBUILD — corpus-based dictionary (John Sinclair, Birmingham; first ed 1987) — pioneered full-sentence definitions and corpus-driven lexicography.
Forensic Linguistics
- Authorship attribution — stylometric and computational analysis. Malcolm Coulthard (UK) on the Bentley case and the Unabomber (Theodore Kaczynski identified partly via stylistic analysis of his manifesto, 1996). Patrick Juola identified J. K. Rowling as Robert Galbraith via stylometry (2013).
- John Olsson Forensic Linguistics (textbook, 2004, 3rd ed 2018).
- Speaker identification — voice profiling, accent identification (sometimes used in asylum cases, ethically fraught — see Language Analysis for the Determination of Origin LADO).
- Plagiarism detection — Turnitin and similar tools.
- Discourse analysis of police interviews and courtroom interaction — Roger Shuy The Language of Confession, Interrogation, and Deception.
Clinical Linguistics
Field overlapping speech-language pathology and applied linguistics: assessment and treatment of language disorders across the lifespan. Standardized assessments (PLS-5, CELF-5, BDAE for adults), targeted intervention for SLI, autism-spectrum communication, aphasia, traumatic brain injury, dementia (semantic dementia, primary progressive aphasia).
Adjacent
- phonetics-and-phonology — phonetic detail as sociolinguistic variable
- historical-linguistics-and-typology — change in progress and endangered languages
- semantics-and-pragmatics — discourse markers and pragmatic variation
- syntax-and-grammar — variationist work on syntactic variables
- cultural-anthropology — linguistic anthropology and ethnography of speaking
- neuroscience-foundations — aphasia and the neural basis of language