This site may store cookies on your machine.
DiACL is an open database with grammatical and lexical data for comparative and phylogenetic linguistics. It harbours data from 500 languages of 18 families, focusing on the areas of Eurasia, Pacific, and South America. The database has the following content:
DiACL contains data from contemporary and historical languages, and, if possible, reconstructed languages. Data is derived from dictionaries, grammars, or by new fieldwork (in particular data from Caucasus and the Amazon). All data is sourced in scientifically reliable literature.
Language metadata includes geographic position, alternative names, reliability, and family tree topology.
Culture vocabulary data is organized into semantic taxonomies of lexical meanings, which are adapted to macro-areas. Culture vocabulary meanings are selected according to geography and environment (by identifying culture-relevant flora and fauna of macro-areas), relevance to subsistence system of language families, cultural function or affordance, and occurrence in reconstructed vocabularies of targeted language families.
Lexemes are organized under etymologies (cognates), which are graphically reproduced as trees and maps on the database frontend. Lexical etymologies account for borrowing, morphological derivation, and semantic change.
Typological/morphosyntactic data is organized into a four-level hierarchy, which enables coding of polymorphic behaviour (e.g., several word orders) in individual languages.
Typological/morphosyntactic features are selected to match known prototypical features of linguistic areas of included macro-areas (Eurasia, Pacific, Amazon), targeting properties which ensure a typological variation and which are known to correlate typologically to each other. Besides, typological/morphosyntactic features are selected to whether they can be identified in historical languages.
From the frontend, data sets with basic vocabulary, culture vocabulary (by area), and typology/morphosyntax (by area) are downloadable in JSON format for computational analysis (requires registration/permission).
Visit gerdcarling.se and the Lund Language Typology and Evolution research group, for publications, news and guidelines to using data.
Data and results from the database are published in a new series of atlas monographs, as well as in journals (see the lists of publications on gerdcarling.se and the Lund Language Typology and Evolution research group).
Carling, Gerd (ed.) 2017. Diachronic Atlas of Comparative Linguistics Online. Lund: Lund University. (DOI/URL: https://diacl.ht.lu.se/. Accessed on: x.).
Data in the DiACL database consist of individual data for specific languages, sometimes retrieved by fieldwork. For fieldwork data from individual languages, not available elsewhere, quote data by the source (see the Source menu: Literary Sources and Informants).
SOURCE. In: Carling, Gerd (ed.) 2017. Diachronic Atlas of Comparative Linguistics Online. Lund: Lund University. (DOI/URL https://diacl.ht.lu.se/. Accessed on: x.).
SOURCE is for example:
da Silva Sinha, V. & Cronhamn, S. (2013). Fieldwork data from Amazonian languages. Native speaker: Wary.
DiACL was previously also known as LUNDIC.