Roberta Sets 37-70.zip | Wals

Gender assignment (32A), coding of nominal plurality (33A), and the number of cases (49A).

A typical use is to treat WALS database features as labels and test whether a model's internal representations (embeddings) cluster according to known linguistic traits, such as whether a language uses definite articles.
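The probing idea described here can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the method behind any particular archive or paper: the embeddings are random toy vectors rather than real model outputs, the two label strings are hypothetical stand-ins for a binarized version of feature 37A (the real WALS feature distinguishes more than two values), and the nearest-centroid probe with leave-one-out evaluation is just one simple way to ask whether same-label languages sit close together.

```python
import random


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def loo_nearest_centroid_accuracy(embeddings, labels):
    """Leave-one-out accuracy of a nearest-centroid probe.

    embeddings: dict mapping language name -> vector (list of floats)
    labels:     dict mapping language name -> feature value (any hashable)
    """
    langs = sorted(set(embeddings) & set(labels))
    correct = 0
    for held_out in langs:
        # Group the remaining languages by their feature value.
        by_label = {}
        for lang in langs:
            if lang != held_out:
                by_label.setdefault(labels[lang], []).append(embeddings[lang])
        centroids = {lab: centroid(vecs) for lab, vecs in by_label.items()}
        # Predict the label whose centroid is closest to the held-out vector.
        predicted = min(
            centroids,
            key=lambda lab: sq_dist(embeddings[held_out], centroids[lab]),
        )
        correct += predicted == labels[held_out]
    return correct / len(langs)


def make_toy_data(n_per_class=20, dim=8, shift=3.0, seed=0):
    """Toy stand-in for real data: two Gaussian blobs, one per label.

    The label names are hypothetical placeholders, not WALS value names.
    """
    rng = random.Random(seed)
    embeddings, labels = {}, {}
    for value, offset in (("has_definite_article", shift),
                          ("no_definite_article", -shift)):
        for i in range(n_per_class):
            lang = f"{value}_{i}"
            embeddings[lang] = [rng.gauss(offset, 1.0) for _ in range(dim)]
            labels[lang] = value
    return embeddings, labels


embeddings, labels = make_toy_data()
accuracy = loo_nearest_centroid_accuracy(embeddings, labels)
print(f"leave-one-out probe accuracy: {accuracy:.2f}")
```

With real data, `embeddings` would hold one pooled vector per language taken from the model, and `labels` would hold the WALS value for the chosen feature. Accuracy well above the majority-class baseline would suggest that the feature is recoverable from the representations; it would not by itself show that the model uses that information.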

Ordinal (53A) and distributive (54A) numerals, and numeral classifiers (55A).

Nominal Syntax (Chapters 58–64)

Inclusive/exclusive distinctions (39A–40A), distance contrasts in demonstratives (41A), and third-person pronouns (43A).

For more information on the specific data points, you can explore the Official WALS Features List or the WALS-Bench dataset on Hugging Face.

This specific set is often used for the following purposes:


Definite (37A) and indefinite (38A) article systems.