KZ | RU | EN
Welcome to the Main Corpus of the Kazakh Language! The Main Corpus is an electronic collection of texts from 5 functional styles of the Kazakh language (fiction, scientific, journalistic, official/business, and colloquial), serving as an IT resource for research and education. The purpose of the Main Corpus is to be a text resource that covers all stylistic layers of the Kazakh language and represents a unified picture of the language. The total volume is 31,105,900 word usages. The Main Corpus includes a search system by word and word form (inflection). The Main Corpus operates with morphological, semantic, lexical, and phonetic-phonological annotation types. These annotations provide information about the searched word at all levels of the language: In morphological annotation, the analyzer automatically splits the word/word form into root and affixes (lemmatization) and assigns a part of speech to the root (lemma). It also provides grammatical characteristics of affixes. Lexical annotation shows all meanings of words from explanatory dictionaries. Phonetic annotation provides the orthoepy of the word, automatically divides it into syllables, and describes types of syllables. Phonological annotation provides phonemic characteristics of the sounds within the word. Each text included in the Main Corpus has a source (metadata). The metadata window opens on a separate page when the cursor is pointed at the author. Users of the corpus can search for the required word using metadata types (text author, text title, author gender, text style, audience, distribution type, time period, topic, full source).


DEVELOPERS OF THE MAIN CORPUS



Zhanabekova Ayman Abdildakyzy

Doctor of Philological Sciences, Head of the Department of Applied Linguistics, Project Leader

Kozhakhmetova Aktoty Kozhakhmetkyzy

PhD Doctoral Student, Research Scientist of the Department of Applied Linguistics

Tlegenova Gulden Bakytkazykyzy

PhD Doctoral Student, Research Scientist of the Department of Applied Linguistics

Besirov Erkin Bekzhanuly

PhD Doctoral Student, Research Scientist of the Department of Applied Linguistics

Barmenkulova Aida Serikkhankyzy

PhD Doctoral Student, Research Scientist of the Department of Applied Linguistics

Mursal Aikerim

PhD Doctoral Student, Junior Research Scientist of the Department of Language History and Turkology

Karbozova Bulbul Dauletkanyzy

PhD in Philosophy, Research Scientist of the Department of Applied Linguistics (2015–2020)

Pirmanova Kunsulu Kambarbekkyzy

PhD Doctoral Student, Junior Research Scientist of the Department of Applied Linguistics (2018–2023)

Sadyrbayeva Zhubayra Boranbekkyzy

Research Scientist of the Onomastics Department

Karshygaeva Aynur Aralbekkyzy

Candidate of Philological Sciences, Senior Research Scientist of the Phonetics Department

Soltanbekova Alfiya Abdikenkyzy

Candidate of Philological Sciences, Head of the Grammar Department

Otebayeva Elmira Abdigalikyzy

Candidate of Philological Sciences, Head of the Ethnolinguistics Department