Machine Learning Links Two New Genes to Ischemic Stroke

A team of scientists from HSE University and the Kurchatov Institute used machine learning methods to investigate genetic predisposition to stroke. Their analysis of the genomes of over 5,000 people identified 131 genes linked to the risk of ischemic stroke. For two of these genes, the association was found for the first time. The paper has been published in PeerJ Computer Science.
Ischemic stroke is a major cause of death and disability worldwide. This condition occurs when blood supply to a part of the brain is interrupted, causing cell death and impaired brain function. Scientists have long studied the genetic factors influencing stroke risk, but a definitive list of genes linked to stroke predisposition has yet to be established. There are hopes that artificial intelligence methods may provide answers in this regard.
A team of scientists from the HSE Faculty of Computer Science and the Kurchatov Institute proposed using machine learning algorithms to analyse genetic predisposition to stroke. They analysed genomic data from 5,500 unrelated individuals over the age of 55, including ischemic stroke survivors and their healthy counterparts. Samples for the study were collected from 11 laboratories in Europe and 13 in the United States.
The analysis was based on the concept of ranking through learning. First, the researchers developed a predictive model in which the key parameter was the presence or absence of a stroke. Single nucleotide polymorphisms (SNPs), which are variations in the genome at specific sites, were used as markers. The scientists then ranked these markers and selected the most significant ones.
SNPs were analysed and selected using various methods, enabling a new analysis of the data and the identification of genes previously not associated with ischemic stroke. The list of 'suspicious' genetic markers common to two or more methods highlights the reliability of the results.
Working with such a large dataset—nearly 900,000 SNPs per 5,500 participants—required us to move beyond purely statistical analysis methods. Machine learning made it possible to process all of this. As a result, we identified 131 genes, most of which had already been linked to ischemic stroke. However, for two of these genes, this was the first time we discovered the association,' explains Dmitry Ignatov, Head of the Laboratory for Models and Methods of Computational Pragmatics at HSE University.
In particular, the scientists found an association between stroke and ACOT11, a gene involved in fatty acid metabolism and shown in animal experiments to affect inflammatory processes and blood lipid levels. The second gene newly linked to ischemic stroke is UBQLN1, which is involved in the mechanisms that protect cells from oxidative stress. There is evidence that a mutation in this gene is associated with neurodegenerative diseases.
These discoveries could help develop multigenic risk models that predict a person's predisposition to stroke. Information about the newly associated genes could also serve as the foundation for developing drugs and therapies aimed at reducing the risk of ischemic stroke.
Gennady Khvorykh
'Identifying two new stroke-associated genes is an excellent outcome for any method. Our machine learning approach clearly holds strong potential for detecting genes linked to diseases that result from a variety of factors,' comments Gennady Khvorykh, Chief Specialist at the Kurchatov Institute.
The proposed approach to analysing genetic markers demonstrates versatility and can be effectively adapted for a wide range of studies beyond ischemic stroke. This methodology can be applied to any diseases or markers with data available in the 'sample—SNP—class' format.
'Although we initially developed this tool for a specific task, the results reveal its potential in a broader context. The ability to work with a variety of genetic data makes our method valuable to researchers across various fields of biology and medicine,' says Stefan Nikolić, graduate of the Faculty of Computer Science and the Doctoral School of Computer Science at HSE University.
See also:
HSE Scientists Uncover Mechanism Behind Placental Lipid Metabolism Disorders in Preeclampsia
Scientists at HSE University have discovered that in preeclampsia—one of the most severe complications of pregnancy—the placenta remodels its lipid metabolism, reducing its own cholesterol synthesis while increasing cholesterol transfer to the foetus. This compensatory mechanism helps sustain foetal nutrition but accelerates placental deterioration and may lead to preterm birth. The study findings have been published in Frontiers in Molecular Biosciences.
HSE Experts Reveal Low Accuracy of Technology Forecasts in Transportation
HSE researchers evaluated the accuracy of technology forecasts in the transportation sector over the past 50 years and found that the average accuracy rate does not exceed 25%, with the lowest accuracy observed in aviation and rail transport. According to the scientists, this is due to limitations of the forecasting method and the inherent complexities of the sector. The study findings have been published in Technological Forecasting and Social Change.
Wearable Device Data and Saliva Biomarkers Help Assess Stress Resilience
A team of scientists, including researchers from HSE University, has proposed a method for assessing stress resilience using physiological markers derived from wearable devices and saliva samples. The participants who adapted better to stress showed higher heart rate variability, higher zinc concentrations in saliva, and lower potassium levels. The findings were published in the Journal of Molecular Neuroscience.
When Circumstances Are Stronger Than Habits: How Financial Stress Affects Smoking Cessation
HSE researchers have found that the likelihood of quitting smoking rises with increasing financial struggles. While low levels of financial difficulties do not affect smoking behaviour, moderate financial stress can increase the probability of quitting by 13% to 21%. Responses to high financial stress differ by gender: men are almost 1.5 times more likely to give up cigarettes than under normal conditions, whereas no significant effect is observed on women’s decisions to quit smoking. These conclusions are based on data from the Russia Longitudinal Monitoring Survey (RLMS-HSE) for 2000–2023 and have been published in Monitoring of Public Opinion: Economic and Social Changes.
HSE Researchers Propose New Method of Verbal Fluency Analysis for Early Detection of Cognitive Impairment
Researchers from the HSE Center for Language and Brain and the Mental Health Research Centre have proposed a new method of linguistic analysis that enables the distinction between normal and pathological ageing. Using this approach, they showed that patterns in patients’ word choices during verbal fluency tests allow clinicians to more accurately differentiate clinically significant impairments from subjective memory complaints. Incorporating this type of analysis into clinical practice could improve the accuracy of early dementia diagnosis. The results have been published in Applied Neuropsychology: Adult.
How the Brain Processes a Word: HSE Researchers Compare Reading Routes in Adults and Children
Researchers from the HSE Center for Language and Brain used magnetoencephalography to study how the brains of adults and children respond to words during reading. They showed that in children the brain takes longer to process words that are frequently used in everyday speech, while rare words and pseudowords are processed in the same way—slowly and in parts. With age, the system is reorganised: high-frequency words shift to a fast route, whereas new letter combinations are still analysed slowly. The study was published in the journal Psychophysiology.
How Neural Networks Detect and Interpret Wordplay: New Insights from HSE Researchers
An international team including researchers from the HSE Faculty of Computer Science has presented KoWit-24, an annotated dataset of 2,700 Russian-language Kommersant news headlines containing wordplay. The dataset enables an assessment of how artificial intelligence detects and interprets wordplay. Experiments with five large language models show that even advanced systems still make mistakes, and that interpreting wordplay is more challenging for them than detecting it. The results were presented at the RANLP conference; the paper is available on Arxiv.org, and the dataset and the code for reproducing the experiments are available on GitHub.
HSE Economists Find That Auction Prices Depend on Artist’s Life Story
Researchers from the Centre for Big Data in Economics and Finance at the HSE Faculty of Economic Sciences have found that facts from an artist’s life are statistically significant in pricing a painting, alongside such traditional characteristics as the material, the size of the canvas, or the presence of the artist’s signature. This conclusion is based on an analysis of prices for 15,000 works by 158 artists sold since 1999 by the major auction houses Sotheby’s and Christie’s. The article has been published in the journal Empirical Studies of the Arts.
HSE Physicists Propose Unified Theory for Describing Electric Double Layer
To develop more efficient batteries and catalysts, it is essential to understand the processes occurring at the metal–solution interface in the electric double layer (EDL). Physicists at HSE MIEM have proposed a unified theoretical model of the EDL that simultaneously accounts for selective adsorption of ions on the surface and partial charge transfer between ions and the metal—phenomena that had previously been described separately. The model’s predictions are consistent with experimental data. In the future, it may be used in the development of batteries, supercapacitors, and catalysts. The study has been published in Electrochimica Acta.
HSE Researchers Experimentally Demonstrate Positive Effects of Urban Parks on the Brain
Scientists at HSE University have investigated the effect of parks on the cognitive and emotional resources of city dwellers. The researchers compared brain electrical activity in 30 participants while they watched videos of walks through parks and along busy highways. The results showed that green urban environments with trees produce a consistent effect across individuals, helping the brain calm down and relax. By contrast, walks along busy streets were found to be distracting. The findings have been published in Scientific Reports.


