Language
Validate against: http://json-schema.org/draft-07/schema#
Schema ID: http://schemas.digitallinguistics.io/Language-8.4.0.json
Type: object
Description
DLx uses the term language broadly to refer to any speech variety - whether a language, dialect, idiolect, or even a historical variety. The DLx Language object groups together all the information about the language, including writing systems, its phonological inventory, and metadata (e.g. names and language codes).
Required Properties
name
Properties
The following properties are defined for this object:
Type:
typeType:
stringRead-only:
trueDescription
The type of object. Must be set to
Language.This item must have the following value:
"Language"ID:
idDescription
A unique database identifier for this Language
Abbreviation:
abbreviationType:
stringDescription
A short, human-readable abbreviation for this Language
Referenced Schema
This item must validate against the following schema:
Access Rights:
accessType:
objectDescription
An object describing who may have access to materials on this language.
Referenced Schema
This item must validate against the following schema:
Autonyms:
autonymsType:
arrayDescription
A list of objects describing autonyms for this language (names for this language in the language itself). For the canonical scientific name, use the
namefield instead. For other ways of referring to this language by outsiders, use theexonymsfield.Items
Each item in this array must adhere to the following schema:
Autonym:
autonymsType:
objectRequired Properties
transcription
Properties
The following properties are defined for this object:
Autonym Transcription:
transcriptionDescription
A transcription of this autonym, optionally in multiple orthographies.
Referenced Schema
This item must validate against the following schema:
Note:
noteDescription
A note about this autonym, such as who uses this name and where, or perhaps its etymology.
Referenced Schema
This item must validate against the following schema:
Bibliography:
bibliographyType:
arrayDescription
A collection of citations to bibliographic references about this language.
Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Citation:
bibliographyDescription
A citation to a bibliographic source about this language
Referenced Schema
This item must validate against the following schema:
Genetic Classification:
classificationType:
arrayDescription
The genetic classification of this language. This property must be an array of language families (in English), with larger family groupings listed before smaller ones. For example, Spanish might have an array of the following strings:
Indo-European,Romance,Ibero-Romance. For language isolates, this property should be an array containing the single string,isolate.Items
Each item in this array must adhere to the following schema:
Language Family:
classificationType:
stringDescription
The name of a language family, as a string, in English.
Contributors:
contributorsType:
arrayDescription
A collection of people who contributed to the documentation, analysis, or curation of this language data in some way. It is strongly recommended that each Person object in this collection have its
rolesproperty specified, so that the contribution of each person is known.Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Contributor (Person - Database Reference):
contributorsType:
objectDescription
A database reference to a Person object describing a contributor
Referenced Schema
This item must validate against the following schema:
Date Created:
dateCreatedType:
stringDescription
The date and optionally time that this Language object was created
This item must also validate against exactly one of the following schemas:
Format:
dateFormat:
date-time
Date Modified:
dateModifiedType:
stringDescription
The date and optionally time that this Language object was last modified
This item must also validate against exactly one of the following schemas:
Format:
dateFormat:
date-time
Demographics:
demographicsType:
arrayDescription
An array of Demographic objects, each of which contain demographic information about the Language at a specific point in time, and/or at a specific location
Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Demographic Data:
demographicsType:
objectDescription
The demographic data of this Language, at a specific point in time, and/or at a specific location
Properties
The following properties are defined for this object:
Date:
dateType:
stringDescription
The date and optionally time that this Demographic data was created
This item must also validate against exactly one of the following schemas:
Format:
dateFormat:
date-time
Location:
locationType:
objectDescription
The geographic Location of this Demographic data
Referenced Schema
This item must validate against the following schema:
Notes:
notesType:
arrayDescription
A collection of notes about this Demographic data
Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Note:
notesType:
objectDescription
A Note about this Demographic data
Referenced Schema
This item must validate against the following schema:
Population:
populationType:
integerDescription
The Population of speakers included in this Demographic data
Minimum:
0Status:
statusType:
objectDescription
The Status of this Demographic data on one or more of the following scales: EGIDS, ELCat (using the LEI), GIDS, and/or UNESCO. The official status scales can be found here.
Properties
The following properties are defined for this object:
EGIDS:
EGIDSType:
stringDescription
The Expanded Graded Intergenerational Disruption Scale, or EGIDS, status of this Demographic data. Either the official level or the official label representing the EGIDS status of the data, as a string.
This item must also validate against exactly one of the following schemas:
Regular expression to match:
^[0-9|10|6a|6b|8a|8b]$Allowed Values
"International""National""Regional""Trade""Educational""Written""Vigorous""Threatened""Shifting""Moribund""Nearly Extinct""Dormant""Extinct"
ELCat Level of Endangerment:
ELCatType:
stringDescription
The ELCat status of this Demographic data. ELCat uses the Language Endangerment Index, or LEI, scale. This status should be the LEI level of endangerment of this Demographic data, as a string.
Allowed Values
"Safe""Vulnerable""Threatened""Endangered""Severely Endangered""Critically Endangered"
GIDS:
GIDSType:
integerDescription
The Graded Intergenerational Disruption Scale, or GIDS, status of this Demographic data. This is the scale created by Thomas Fishman in his 1991 work Reversing Language Shift. The GIDS status should be represented by the integer, 1-8, that corresponds with the GIDS Stage of the data.
Minimum:
1Maximum:
8UNESCO:
UNESCOType:
stringDescription
The UNESCO Language Degree of Endangerment for this Demographic data. This status should be the string representation of the Degree of endangerment level.
Allowed Values
"Safe""Vulnerable""Definitely Endangered""Severely Endangered""Critically Endangered""Extinct"
Description:
descriptionDescription
A high-level overview of the Language and the sociohistorical and documentary context for the accompanying data
Referenced Schema
This item must validate against the following schema:
Exonyms:
exonymsType:
arrayDescription
A list of exonyms for this language (names of the language in other languages). For the canonical scientific name of the language (usually in English), use the
namefield. This field is for any additional exonyms beyond the canonical scientific name. For autonyms (names of the language in the language), use theautonymsfield.Items
Each item in this array must adhere to the following schema:
Exonym:
exonymsType:
objectRequired Properties
transcription
Properties
The following properties are defined for this object:
Exonym Transcription:
transcriptionDescription
A transcription of this exonym, optionally in multiple languages.
Referenced Schema
This item must validate against the following schema:
Note:
noteDescription
A note about this exonym, such as who uses this name and where, or perhaps its etymology.
Referenced Schema
This item must validate against the following schema:
Glottolog Code:
glottologType:
stringDescription
The Glottolog code for this language variety, as a String in the format abcd1234
Regular expression to match:
^[a-z]{4}[0-9]{4}$ISO 639-3 Code:
isoType:
stringDescription
The ISO 639-3 code for this language variety, as a 3-letter String
Regular expression to match:
^[a-z]{3}$Link:
linkType:
stringDescription
A URL where a presentational format for this resource may be viewed
Format:
uriLocations:
locationsType:
arrayDescription
Geographic locations where this language variety is spoken, optionally with geographic coordinate data
Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Location (Database Reference):
locationsType:
objectDescription
A database reference to a Location object describing a location where this language variety is spoken
Referenced Schema
This item must validate against the following schema:
Language Name:
nameDescription
The canonical scientific name of this language, usually in English, and also optionally in multiple languages. There must be a name provided in at least one language. Use the
exonymsfield to list other ways this language is referred to, and theautonymsfield to list names for the language in the language itself (a.k.a. endonyms).Referenced Schema
This item must validate against the following schema:
Minimum number of properties:
1Notes:
notesType:
arrayDescription
A collection of notes about this language
Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Note:
notesType:
objectDescription
A Note about this Language
Referenced Schema
This item must validate against the following schema:
Orthographies:
orthographiesType:
arrayDescription
A collection of orthographies used to represent this Language
Items must be unique:
trueItems
Each item in this array must adhere to the following schema:
Orthography:
orthographiesType:
objectDescription
An orthography used to represent this language
Referenced Schema
This item must validate against the following schema:
Phonemes:
phonemesType:
arrayDescription
The phonological inventory for this language, i.e. a collection of phonemes
Items
Each item in this array must adhere to the following schema:
Phoneme:
phonemesType:
objectDescription
A Phoneme object
Referenced Schema
This item must validate against the following schema:
Tags:
tagsType:
objectDescription
A set of tags for this Language
Referenced Schema
This item must validate against the following schema:
URL:
urlType:
stringDescription
A URL where the JSON data for this language can be found
Format:
uri
Additional Properties
Any additional properties must adhere to the following schema:
This schema imposes no restrictions. All values are valid.
Examples
The following are example values for this schema:
-
{ "abbreviation": "chiti", "access": { "AILLA": "password", "note": { "eng": "Materials on this language should not be made available to non-tribal members without permission." } }, "autonyms": [ { "transcription": { "APA": "sitimaša", "IPA": "sitimaʃa", "Mod": "sitimaxa", "Swad": "sitimaša" }, "note": { "text": "This is the traditional name used for the Chitimacha language by the Chitimacha people themselves. It most likely derives from _siit‑_ 'body of water' + _‑ma_ pluractional + _‑x_ topic marker, meaning 'people of the waters'. This is most likely a reference to the fact that the Chitimacha people live deep in the bayou waterways of Louisiana." } } ], "bibliography": [ { "citationKey": "Swadesh1946" } ], "contributors": [ { "abbreviation": "DWH", "roles": [ "researcher" ] }, { "abbreviation": "BP", "roles": [ "speaker" ] }, { "abbreviation": "Swad", "roles": [ "researcher" ] } ], "dateCreated": "2018-10-13T18:16:57.497Z", "dateModified": "2018-10-13T18:17:06.515Z", "defaultOrthography": "modern", "exonyms": [ { "transcription": "Shetimashas", "note": { "text": "This is the name commonly used for the language by French colonials, borrowed from the Chitimacha name for the language, _Sitimaxa_." } }, { "transcription": "Chetimachas" } ], "glottolog": "chit1248", "id": "4d633ee3-25ba-4add-83cf-b588ba51f758", "iso": "ctm", "link": "https://data.digitallinguistics.io/languages/Chitimacha", "locations": [ { "name": "Charenton" } ], "name": { "eng": "Chitimacha", "fra": "Chitimachas", "spa": "chitimacha" }, "notes": [ { "text": "The best documentary materials on this language are those compiled by Morris Swadesh in 1930–1934." }, { "text": "Chitimacha is a language isolate." } ], "orthographies": [ { "abbreviation": "Mod", "graphemes": [ { "form": "a", "pronunciation": [ "a" ] }, { "form": "aa", "pronunciation": [ "aː" ] }, { "form": "b", "pronunciation": [ "pˀ" ] }, { "form": "c", "pronunciation": [ "t͡ʃ" ] }, { "form": "d", "pronunciation": [ "tˀ" ] }, { "form": "dz", "pronunciation": [ "t͡sˀ" ] }, { "form": "e", "pronunciation": [ "e" ] }, { "form": "ee", "pronunciation": [ "eː" ] }, { "form": "g", "pronunciation": [ "kˀ" ] }, { "form": "h", "pronunciation": [ "h" ] }, { "form": "j", "pronunciation": [ "t͡ʃˀ" ] }, { "form": "k", "pronunciation": [ "k" ] }, { "form": "m", "pronunciation": [ "m" ] }, { "form": "mq", "pronunciation": [ "ˀm" ] }, { "form": "n", "pronunciation": [ "n" ] }, { "form": "nq", "pronunciation": [ "ˀn" ] }, { "form": "o", "pronunciation": [ "o" ] }, { "form": "oo", "pronunciation": [ "oː" ] }, { "form": "p", "pronunciation": [ "p" ] }, { "form": "q", "pronunciation": [ "ʔ" ] }, { "form": "s", "pronunciation": [ "s" ] }, { "form": "t", "pronunciation": [ "t" ] }, { "form": "ts", "pronunciation": [ "t͡s" ] }, { "form": "u", "pronunciation": [ "u" ] }, { "form": "uu", "pronunciation": [ "uː" ] }, { "form": "w", "pronunciation": [ "w" ] }, { "form": "x", "pronunciation": [ "ʃ" ] }, { "form": "y", "pronunciation": [ "j" ] } ], "link": "https://data.digitallinguistics.io/languages/Chitimacha/orthographies/Modern", "name": "Modern", "notes": [ { "text": "This is the official orthography used by the Chitimacha Tribe." } ], "punctuation": [ ".", "?", "!" ] }, { "abbreviation": "APA", "graphemes": [], "link": "https://data.digitallinguistics.io/languages/Chitimacha/orthographies/APA", "name": "Americanist" } ], "phonemes": [ { "airstream": "pulmonic", "ipa": "t", "manner": "plosive", "place": "alveolar", "voicing": "voiceless" }, { "airstream": "pulmonic", "ipa": "t͡ʃ", "manner": "affricate", "place": "post-alveolar", "voicing": "voiceless" }, { "airstream": "ejective", "ipa": "t͡ʃˀ", "manner": "affricate", "place": "post-alveolar", "voicing": "voiceless" } ], "tags": { "morphologicalType": "synthetic" }, "type": "Language", "url": "https://data.digitallinguistics.io/languages/Chitimacha" } -
{ "name": "Shangaji", "description": "Shangaji is spoken in three small villages in the Nampula province of Mozambique. The deposit focuses on the variant spoken in Naatthembo village, just west of the Sangage peninsula which begins north of Angoche town. Naatthembo has more or less 4000 inhabitants and represents the major Shangaji speaking centre with the two other locations; Nakonya village and the Likookha area in Mogincual adding more or less 500 and 700 speakers to this number of speakers which is in decline due to a lack of intergenerational language transfer. Nthamala village, some 10 km away from Naatthembo, no longer has Shangaji speakers because of a complete shift to the regional lingua franca Makhuwa (Enlai). The Shangaji deposit thus responds to an urgent documentation need, also because the language is one of four coastal languages that are believed to bear traces of the Swahili world which once stretched from Somalia to the south of Mozambique." } -
{ "name": "Muskogee", "demographics": [ { "date": "2015-01-01", "location": { "name": "Oklahoma" }, "notes": [ { "text": "Muskogee is also spoken in Florida" }, { "text": "This population data is based on a 2015 census" } ], "population": 4470, "status": { "EGIDS": "Shifting", "ELCat": "Severely Endangered", "GIDS": 7, "UNESCO": "Definitely Endangered" } } ] } -
{ "abbreviation": "gusii", "access": { "note": { "eng": "Unless otherwise noted, materials for this language may be made publicly accessible, and the speakers in the content identified." } }, "autonyms": [ { "transcription": { "IPA": "ékeɣusií", "Gus": "ékegusií", "Swa": "ekegusii" } } ], "classification": [ "Niger-Congo", "Atlantic-Congo", "Benue-Congo", "Bantoid", "Bantu", "Northeast Bantu", "Great Lakes Bantu", "Logooli-Kuria (E40)" ], "contributors": [ { "abbreviation": "KB", "roles": [ "lexicographer", "speaker" ] }, { "abbreviation": "GM", "roles": [ "lexicographer", "reviewer", "speaker" ] }, { "abbreviation": "DWH", "roles": [ "researcher" ] } ], "dateCreated": "2018-10-13T18:17:24.477Z", "dateModified": "2018-10-13T18:17:34.512Z", "exonyms": [ { "transcription": { "swa": "Kisii" }, "note": { "text": "This is the most typical way that Swahili speakers refer to the language." } }, { "transcription": { "swa": "Kikisii" }, "note": { "text": "This is a Swahili version of the name of this language. It is an alternate for simply `Kisii`." } } ], "glottolog": "gusi1247", "id": "708d1bd4-84f6-42e5-8b09-5c64493441b3", "iso": "guz", "link": "https://data.digitallinguistics.io/languages/Gusii", "locations": [ { "key": "Kisii" } ], "name": { "eng": "Gusii", "spa": "gusii", "swa": "Kisii" }, "notes": [ { "text": "In older versions of your notes, you referred to this language as Kisii rather than Gusii." } ], "orthographies": [ { "abbreviation": "Practical", "direction": "ltr-ttb", "graphemes": [], "link": "https://data.digitallinguistics.io/languages/Gusii/orthographies/Practical", "name": "Practical", "punctuation": [ ".", "?", "!" ] }, { "abbreviation": "Swahili", "direction": "ltr-ttb", "graphemes": [], "link": "https://data.digitallinguistics.io/languages/Gusii/orthographies/Swahili", "name": "Swahili", "notes": [ { "text": "This orthography represents the way that the language is written when using the Swahili orthography. Tone is omitted from this orthography." } ], "punctuation": [ ".", "?", "!" ] } ], "phonemes": [], "type": "Language" }
Developer Notes
This is a top-level database object