Struct: LanguageIdentification
Key
|
Field
|
Type
|
Description
|
Requiredness
|
Default value
|
1
|
uuid
|
uuid.UUID
|
Unique identifier for this language identification.
|
required
|
|
2
|
metadata
|
metadata.AnnotationMetadata
|
Information about where this language identification came from.
|
required
|
|
3
|
languageToProbabilityMap
|
map<
string
,
double
>
|
A list mapping from a language to the probability that that
language occurs in a given communication. Each language code should
occur at most once in this list. The probabilities do
not
need to sum to one -- for example, if a single communication is known
to contain both English and French, then it would be appropriate
to assign a probability of 1 to both langauges. (Manually
annotated LanguageProb objects should always have probabilities
of either zero or one; machine-generated LanguageProbs may have
intermediate probabilities.)
Note: The string key should represent the ISO 639-3 three-letter code.
|
required
|
|
A theory about what languages are present in a given communication
or piece of communication. Note that it is possible to have more
than one language present in a given communication.