
ISSAI KAZ-LLM: Kazakhstan’s Large Language Model

Since September 2019, the Institute of Smart Systems and Artificial Intelligence (ISSAI) has demonstrated that cutting-edge, impactful AI research can be conducted in Kazakhstan using the country's intellectual workforce.


Following President Kassym-Jomart Tokayev's instructions, ISSAI began its transformation into a full-fledged research institute in May 2024 as a private entity under Nazarbayev University (NU). Today, ISSAI employs 70 staff members, including data scientists, data analysts, computer engineers, data curators (linguists), and administrative personnel.
Its AI computation hardware, consisting of four NVIDIA DGX A100 servers, is housed in NU’s C2 data center. ISSAI occupies over 1,200 square meters in Block 1 of NU and operates a branch office in Almaty.

ISSAI actively contributes to the soft digital infrastructure of Kazakhstan by creating vital digital public goods that form the core of AI-based products and services in Kazakh. Notable research projects include automatic speech recognition (Soyle), text-to-speech, multilingual translation (Tilmash), named entity recognition, question answering, speech command recognition, and large language models (KAZ-LLM).

ISSAI's primary mission is to conduct advanced AI research under strict ethical guidelines, guided by the mottos “AI for Good” and “AI for Kazakhstan”. The institute aims to raise the global ranking of Nazarbayev University and Kazakhstan in AI research while also developing a skilled local AI workforce. By taking a hands-on approach to data preparation, model training, and deployment, ISSAI is fostering a new generation of AI researchers capable of creating state-of-the-art models and tools.

Since April 2024, ISSAI has been developing Kazakhstan's Large Language Model (KAZ-LLM) so that Kazakhstan can also benefit from advances in generative AI and use them to improve the quality of life of its people and foster economic development. KAZ-LLM will be able to create content in the languages most relevant to Kazakhstan: Kazakh, Russian, and English. It will play a crucial role in preserving national cultural heritage and will encompass historical context, specialized domains, and conversational data that represent Kazakhstan. By tailoring generative AI to local needs, KAZ-LLM exemplifies how national projects can address linguistic gaps and contribute to the global landscape of AI innovation.

ISSAI KAZ-LLM is a landmark step in Kazakhstan's generative AI development. As the first foundational model, it will be extended in the coming years to include other modalities, such as vision and speech, and to support other Turkic and regional languages, establishing a strong basis for broader AI advancements in the country.

For more information regarding research projects, publications, team members, and contacts for collaboration, please visit ISSAI's website.

 

Nazarbayev University, 2024 graduate
