Word

Validate against: http://json-schema.org/draft-07/schema#

Schema ID: http://schemas.digitallinguistics.io/Word-4.3.1.json

Type: object

Description

A Word object represents a specific token in a text, rather than an abstract lexeme (see the Lexeme object for that).

Required Properties

transcription

Properties

The following properties are defined for this object:

Analysis: analysis

Description

The morpheme breakdown for the word

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Transcription.json
Type: type

Type: string

Read-only: true

Description

The type of object. Must be set to Word.

This item must have the following value:
```
"Word"
```
End Time: endTime

Type: number

Description

The point in time in the text at which the word token stops being spoken, formatted as seconds and milliseconds (SS.MMM). This property is primarily useful for phonetic data analysis.

Minimum: 0.001
Gloss: gloss

Description

A Leipzig-style gloss for this Word. This does not need to be supplied if it can be generated automatically from the glosses at the morpheme level. May be in multiple languages.

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/MultiLangString.json
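
The fallback described above, where the word-level gloss is derived from the morpheme-level glosses, could look roughly like the sketch below. It assumes each gloss is a plain string (as in the example at the end of this page) and joins morpheme glosses with hyphens, following the Leipzig convention for segmentable morphemes. The helper name `word_gloss` is hypothetical and not part of the schema.

```python
def word_gloss(word: dict) -> str:
    """Return the Word's own gloss if present, else build one from its morphemes.

    Assumption: glosses are plain strings, not multi-language objects.
    """
    if "gloss" in word:
        return word["gloss"]
    # Leipzig-style glosses separate segmentable morphemes with hyphens.
    return "-".join(m["gloss"] for m in word.get("morphemes", []))
```

For a word with morpheme glosses `man` and `PL` and no gloss of its own, this returns `man-PL`. Glosses supplied in multiple languages would need the same join applied per language.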
Key: key

Type: string

Description

A key that uniquely identifies this word token within the Text. The key for a Word consists of the abbreviation of the text; a period, dash, or underscore; the number of the Utterance within the text; another period, dash, or underscore; and then the number of this word within the Utterance (indexing starts at 1). For example, for the fourth word of the third Utterance of a text with the abbreviation A, the key would be A.3.4. Keys should be unique within a corpus.

Regular expression to match: ^[(a-z)|(A-Z)|(0-9)]+[-_\.][0-9]{1,3}[-_\.][0-9]{1,2}$
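
As an illustration of the key format, the following sketch builds a key and tests it against the pattern above. The pattern is copied verbatim from this schema. The helpers `make_key` and `is_valid_key` are hypothetical names introduced only for this example.

```python
import re

# Pattern copied verbatim from the "Regular expression to match" line above.
KEY_PATTERN = re.compile(r"^[(a-z)|(A-Z)|(0-9)]+[-_\.][0-9]{1,3}[-_\.][0-9]{1,2}$")


def make_key(abbreviation: str, utterance: int, word: int, sep: str = ".") -> str:
    """Build a Word key such as 'A.3.4' (hypothetical helper; numbering starts at 1)."""
    return f"{abbreviation}{sep}{utterance}{sep}{word}"


def is_valid_key(key: str) -> bool:
    """Return True if the key matches the schema's pattern."""
    return KEY_PATTERN.match(key) is not None
```

With these helpers, `make_key("A", 3, 4)` gives `A.3.4`, which passes `is_valid_key`, as does `A1_1_2`. A key with a four-digit Utterance number or a three-digit word number is rejected. Because the leading bracket expression is a single character class, the pattern as written also accepts literal parentheses and `|` in the abbreviation.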
Literal Translation: literal

Description

A literal translation of the word at the level of the morpheme token

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Translation.json
Morphemes / Components: morphemes

Type: array

Description

A collection of the components (morphemes, lexemes) in this Word. Components may be repeated if the same morpheme appears twice in the Word.

Items must be unique: false

Items

Each item in this array must adhere to the following schema:

Morpheme: morphemes

Type: object

Description

An object representing this morpheme. This is not an object representing a lexeme (the abstract type). It represents the specific morpheme token (the particular instance in speech) instead.

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Morpheme.json
Notes: notes

Type: array

Description

A collection of notes about this token

Items must be unique: true

Items

Each item in this array must adhere to the following schema:

Note: notes

Type: object

Description

A Note about this token

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Note.json
Phonemes: phonemes

Type: array

Description

An array of the phones (phoneme tokens) in this word, in IPA. Each phone should appear in order, and does not have to be unique.

Items must be unique: false

Items

Each item in this array must adhere to the following schema:

Phone (Phoneme Token): phonemes

Type: object

Description

An IPA representation of this phoneme. The phoneme field is required, but the allophone field is optional, for use only when you need to specify that this phoneme is being realized as a particular allophone. Both phoneme and allophone must be in IPA rather than a practical orthography.

Required Properties

phoneme

Properties

The following properties are defined for this object:

Allophone: allophone

Type: string

Description

This property may be used to specify which allophone this particular phoneme token is. Must be written in IPA.

End Time: endTime

Type: number

Description

The point in time in the text at which this phone stops being pronounced. This property is primarily useful for phonetic data analysis.

Minimum: 0.001

Notes: notes

Type: array

Description

A collection of notes about this phone

Items must be unique: true

Items

Each item in this array must adhere to the following schema:

Note: notes

Type: object

Description

A note about this phone. Useful for one-time observations of interesting allophonic variation in the data.

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Note.json

Phoneme: phoneme

Type: string

Description

An IPA representation of this phoneme token. Must be a representation of the abstract phoneme rather than the particular allophone (use the allophone property for that instead).

Start Time: startTime

Type: number

Description

The point in time in the text at which this phone starts being pronounced. This property is primarily useful for phonetic data analysis. The endTime property is not necessarily required if the startTime property is present (this is useful when the endTime of one phone is the same as the startTime of the next).

Minimum: 0

Tags: tags

Type: object

Description

A set of tags for this phone

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Tags.json

Additional Properties

Any additional properties must adhere to the following schema:

This schema imposes no restrictions. All values are valid.
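
To show how the required phoneme property and the optional allophone property relate, here is a minimal sketch that assembles a phonemic and a phonetic string from a phonemes array. The function names are hypothetical. The only assumption is the one stated above: every item has a phoneme, and allophone appears only where the realization needs to be recorded.

```python
def phonemic_form(phonemes: list) -> str:
    """Concatenate the abstract phonemes in order."""
    return "".join(p["phoneme"] for p in phonemes)


def phonetic_form(phonemes: list) -> str:
    """Concatenate the allophones, falling back to the phoneme where none is given."""
    return "".join(p.get("allophone", p["phoneme"]) for p in phonemes)
```

For the four phones ʔ, a, s, i with no allophones specified, both functions return `ʔasi`.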
Start Time: startTime

Type: number

Description

The point in time in the text at which this word token starts being uttered, formatted as seconds and milliseconds (SS.MMM). This property is primarily useful for phonetic data analysis. The endTime property is not necessarily required if the startTime property is present (this is useful when the endTime of one word token is the same as the startTime of the next).

Minimum: 0
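
Since endTime may be omitted when it coincides with the startTime of the next token, a consumer may want to fill it in before doing timing analysis. The sketch below is one possible way to do that, assuming the tokens are given in spoken order. `fill_end_times` is a hypothetical helper, not something the schema defines.

```python
def fill_end_times(words: list) -> list:
    """Return copies of the tokens, taking a missing endTime from the next token's startTime.

    Assumption: `words` is ordered as spoken. The input list is not modified.
    """
    filled = []
    for current, following in zip(words, words[1:] + [None]):
        token = dict(current)  # shallow copy so the caller's data is untouched
        if "endTime" not in token and following is not None and "startTime" in following:
            token["endTime"] = following["startTime"]
        filled.append(token)
    return filled
```

The last token has no successor, so its endTime stays absent unless it was supplied explicitly.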
Tags: tags

Type: object

Description

A set of tags for this word token

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Tags.json
Transcription: transcription

Type: object

Description

A transcription of this word token, optionally in multiple orthographies. A transcription must be provided in at least one orthography.

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Transcription.json

Minimum number of properties: 1
Translation: translation

Type: object

Description

A translation of this word, optionally in multiple languages. Note that the translation is not the same as a gloss. Use the translation field for free, natural-language translations of a word. There is a separate field for literal translations; use the gloss field for Leipzig-style glosses only.

Referenced Schema

This item must validate against the following schema:

http://schemas.digitallinguistics.io/Translation.json

Additional Properties

Any additional properties must adhere to the following schema:

This schema imposes no restrictions. All values are valid.

Examples

The following are example values for this schema:

{
  "type": "Word",
  "endTime": 1.01,
  "gloss": "man",
  "key": "A1_1_2",
  "analysis": {
    "transcription": "man"
  },
  "literal": {
    "eng": "man"
  },
  "morphemes": [
    {
      "gloss": "man",
      "lexeme": {
        "id": "06ed7bda-9923-4295-97da-8c9a24b98fd4",
        "key": "qasi"
      },
      "transcription": {
        "APA": "ʔasi",
        "IPA": "ʔasi",
        "Mod": "qasi"
      }
    }
  ],
  "phonemes": [
    {
      "phoneme": "ʔ"
    },
    {
      "phoneme": "a"
    },
    {
      "phoneme": "s"
    },
    {
      "phoneme": "i"
    }
  ],
  "startTime": 1,
  "tags": {
    "role": "subject"
  },
  "transcription": {
    "APA": "ʔasi",
    "IPA": "ʔasi",
    "Mod": "qasi"
  },
  "translation": {
    "eng": "a man"
  }
}
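
A few of the constraints stated on this page can be checked by hand, as in the sketch below. It covers only the required transcription (with at least one orthography), the fixed type value, the time minimums, and the required phoneme on each phone. It is an illustration under those assumptions, not a substitute for validating against the schema with a draft-07 validator, and `check_word` is a hypothetical name.

```python
def check_word(word: dict) -> list:
    """Return a list of problems found; an empty list means these checks passed."""
    problems = []
    transcription = word.get("transcription")
    if not isinstance(transcription, dict) or len(transcription) < 1:
        problems.append("transcription is required and needs at least one orthography")
    if "type" in word and word["type"] != "Word":
        problems.append('type must be "Word"')
    if "startTime" in word and word["startTime"] < 0:
        problems.append("startTime must be at least 0")
    if "endTime" in word and word["endTime"] < 0.001:
        problems.append("endTime must be at least 0.001")
    for index, phone in enumerate(word.get("phonemes", [])):
        if "phoneme" not in phone:
            problems.append(f"phonemes[{index}] is missing the required phoneme property")
    return problems
```

Because additional properties are unrestricted, a misspelled property name such as `endtime` is not reported by the schema or by this sketch, so property names are worth checking separately.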
