Summary: A new AI system is not only able to analyze potential new variants of COVID-19, it can also vaccine design cycles within minutes, researchers report.
USC researchers have developed a new method to counter emergent mutations of the coronavirus and hasten vaccine development to stop the pathogen responsible for killing thousands of people and ruining the economy.
Using artificial intelligence (AI), the research team at the USC Viterbi School of Engineering developed a method to speed the analysis of vaccines and zero in on the best potential preventive medical therapy.
The method is easily adaptable to analyze potential mutations of the virus, ensuring the best possible vaccines are quickly identified — solutions that give humans a big advantage over the evolving contagion. Their machine-learning model can accomplish vaccine design cycles that once took months or years in a matter of seconds and minutes, the study says.
“This AI framework, applied to the specifics of this virus, can provide vaccine candidates within seconds and move them to clinical trials quickly to achieve preventive medical therapies without compromising safety,” said Paul Bogdan, associate professor of electrical and computer engineering at USC Viterbi and corresponding author of the study. “Moreover, this can be adapted to help us stay ahead of the coronavirus as it mutates around the world.”
The findings appear today in Nature Research’s Scientific Reports
When applied to SARS-CoV-2 — the virus that causes COVID-19 — the computer model quickly eliminated 95% of the compounds that could’ve possibly treated the pathogen and pinpointed the best options, the study says.
The AI-assisted method predicted 26 potential vaccines that would work against the coronavirus. From those, the scientists identified the best 11 from which to construct a multi-epitope vaccine, which can attack the spike proteins that the coronavirus uses to bind and penetrate a host cell. Vaccines target the region — or epitope — of the contagion to disrupt the spike protein, neutralizing the ability of the virus to replicate.
Moreover, the engineers can construct a new multi-epitope vaccine for a new virus in less than a minute and validate its quality within an hour. By contrast, current processes to control the virus require growing the pathogen in the lab, deactivating it and injecting the virus that caused a disease. The process is time-consuming and takes more than one year; meanwhile, the disease spreads.
USC method could help counter COVID-19 mutations
The method is especially useful during this stage of the pandemic as the coronavirus begins to mutate in populations around the world. Some scientists are concerned that the mutations may minimize the effectiveness of vaccines by Pfizer and Moderna, which are now being distributed. Recent variants of the virus that have emerged in the United Kingdom, South Africa and Brazil seem to spread more easily, which scientists say will rapidly lead to many more cases, deaths and hospitalizations.
But Bogdan said that if SARS-CoV-2 becomes uncontrollable by current vaccines, or if new vaccines are needed to deal with other emerging viruses, then USC’s AI-assisted method can be used to design other preventive mechanisms quickly.
For example, the study explains that the USC scientists used only one B-cell epitope and one T-cell epitope, whereas applying a bigger dataset and more possible combinations can develop a more comprehensive and quicker vaccine design tool. The study estimates the method can perform accurate predictions with over 700,000 different proteins in the dataset.
“The proposed vaccine design framework can tackle the three most frequently observed mutations and be extended to deal with other potentially unknown mutations,” Bogdan said.
The raw data for the research comes from a giant bioinformatics database called the Immune Epitope Database (IEDB) in which scientists around the world have been compiling data about the coronavirus, among other diseases. IEDB contains over 600,000 known epitopes from some 3,600 different species, along with the Virus Pathogen Resource, a complementary repository of information about pathogenic viruses. The genome and spike protein sequence of SARS-CoV-2 comes from the National Center for Biotechnical Information.
COVID-19 has led to 87 million cases and more than 1.88 million deaths worldwide, including more than 400,000 fatalities in the United States. It has devastated the social, financial and political fabric of many countries.
The study authors are Bogdan, Zikun Yang and Shahin Nazarian of the Ming Hsieh Department of Electrical and Computer Engineering at USC Viterbi.
Funding: Support for the study comes from the National Science Foundation (NSF) under the Career Award (CPS/CNS-1453860) and NSF grants (CCF-1837131, MCB-1936775 and CNS-1932620); a U.S. Army Research Office grant (W911NF-17-1-0076); a Defense Advanced Research Projects Agency (DARPA) Young Faculty Award and Director Award grant (N66001-17-1-4044), and a Northrop Grumman grant.
About this artificial intelligence and coronavirus research news
Source:USC Contact: Gary Polakovic – USC Image: The image is in the public domain
An in silico deep learning approach to multi-epitope vaccine design: a SARS-CoV-2 case study
The rampant spread of COVID-19, an infectious disease caused by SARS-CoV-2, all over the world has led to over millions of deaths, and devastated the social, financial and political entities around the world. Without an existing effective medical therapy, vaccines are urgently needed to avoid the spread of this disease. In this study, we propose an in silico deep learning approach for prediction and design of a multi-epitope vaccine (DeepVacPred). By combining the in silico immunoinformatics and deep neural network strategies, the DeepVacPred computational framework directly predicts 26 potential vaccine subunits from the available SARS-CoV-2 spike protein sequence. We further use in silico methods to investigate the linear B-cell epitopes, Cytotoxic T Lymphocytes (CTL) epitopes, Helper T Lymphocytes (HTL) epitopes in the 26 subunit candidates and identify the best 11 of them to construct a multi-epitope vaccine for SARS-CoV-2 virus. The human population coverage, antigenicity, allergenicity, toxicity, physicochemical properties and secondary structure of the designed vaccine are evaluated via state-of-the-art bioinformatic approaches, showing good quality of the designed vaccine. The 3D structure of the designed vaccine is predicted, refined and validated by in silico tools. Finally, we optimize and insert the codon sequence into a plasmid to ensure the cloning and expression efficiency. In conclusion, this proposed artificial intelligence (AI) based vaccine discovery framework accelerates the vaccine design process and constructs a 694aa multi-epitope vaccine containing 16 B-cell epitopes, 82 CTL epitopes and 89 HTL epitopes, which is promising to fight the SARS-CoV-2 viral infection and can be further evaluated in clinical studies. Moreover, we trace the RNA mutations of the SARS-CoV-2 and ensure that the designed vaccine can tackle the recent RNA mutations of the virus.