This site may store cookies on your machine.
DiACL is an open access database with lexical and typological/morphosyntactic data for historical, comparative and phylogenetic linguistics. It contains data from 500 languages of 18 families, divided into three macro-areas: Eurasia, Pacific, and the Amazon. The database has the following content:
DiACL contains data from contemporary and historical languages, and, if possible, reconstructed languages. Data is derived from dictionaries, grammars, or by new fieldwork (in particular data from Caucasus and the Amazon). All data is sourced in scientifically reliable literature.
Language metadata includes geographic position, alternative names, reliability, and family tree topology.
Culture vocabulary data is organized into semantic taxonomies of lexical meanings, which are adapted to macro-areas. Culture vocabulary meanings are selected according to geography and environment (by identifying culture-relevant flora and fauna of macro-areas), relevance to subsistence system of language families, cultural function or affordance, and occurrence in reconstructed vocabularies of targeted language families.
Lexemes are organized under etymologies (cognates), which are graphically reproduced as trees and maps on the database frontend. Lexical etymologies account for borrowing, morphological derivation, and semantic change.
Typological/morphosyntactic data is organized into a four-level hierarchy, which enables coding of polymorphic behaviour (e.g., several word orders) in individual languages.
Typological/morphosyntactic features are selected to match known prototypical features of linguistic areas of included macro-areas (Eurasia, Pacific, Amazon), targeting properties which ensure a typological variation and which are known to correlate typologically to each other. Besides, typological/morphosyntactic features are selected to whether they can be identified in historical languages.
From the frontend, data sets with basic vocabulary, culture vocabulary (by area), and typology/morphosyntax (by area) are downloadable in JSON format for computational analysis (requires registration/permission).
Data and results will be published in monograph form in collaboration with De Gruyter under the title Mouton Atlas of Languages and Cultures (2019).
Fieldwork data (ELAN, audio/video files) from several of the targeted language areas is available via the Lund Corpus Server on the Lund Humanities Lab.
Visit the Lund Language Typology and Evolution research group, for publications, news and guidelines to using data.
DiACL is a SWE-CLARIN resource, hosted by Lund University.
Carling, Gerd (ed.) 2017. Diachronic Atlas of Comparative Linguistics Online. Lund: Lund University. (DOI/URL: https://diacl.ht.lu.se/. Accessed on: x.).
Data in the DiACL database consist of individual data for specific languages, sometimes retrieved by fieldwork. For fieldwork data from individual languages, not available elsewhere, quote data by the source (see the Source menu: Literary Sources and Informants).
SOURCE. In: Carling, Gerd (ed.) 2017. Diachronic Atlas of Comparative Linguistics Online. Lund: Lund University. (DOI/URL https://diacl.ht.lu.se/. Accessed on: x.).
Where SOURCE
is for example: da Silva Sinha, V. & Cronhamn, S. (2013). Fieldwork data from Amazonian languages. Native speaker: Wary.
DiACL was previously also known as LUNDIC.