Antigenicity prediction and vaccine recommendation of human influenza virus A (H3N2) using convolutional neural networks.
The rapid evolution of influenza A viruses poses a great challenge to vaccine development. Analytical and machine learning models have been applied to facilitate the process of antigenicity determination. In this study, we designed deep convolutional neural networks (CNNs) to predict Influenza antigenicity. Our model is the first that systematically analyzed 566 amino acid properties and 141 amino acid substitution matrices for their predictability. We then optimized the structure of the CNNs using particle swarm optimization. The optimal neural networks outperform other predictive models with a blind validation accuracy of 95.8%. Further, we applied our model to vaccine recommendations in the period 1997 to 2011 and contrasted the performance of previous vaccine recommendations using traditional experimental approaches. The results show that our model outperforms the WHO recommendation and other existing models and could potentially improve the vaccine recommendation process. Our results show that WHO often selects virus strains with small variation from year to year and learns slowly and recovers once coverage dips very low. In contrast, the influenza strains selected via our CNN model can differ quite drastically from year to year and exhibit consistently good coverage. In summary, we have designed a comprehensive computational pipeline for optimizing a CNN in the modeling of Influenza A antigenicity and vaccine recommendation. It is more cost and time-effective when compared to traditional hemagglutination inhibition assay analysis. The modeling framework is flexible and can be adopted to study other type of viruses.
Authors
Eva K Lee; Haozheng Tian; Helder I Nakaya
External link
Publication Year
Publication Journal
Associeted Project
Systems Vaccinology
Lista de serviços
-
StructRNAfinder: an automated pipeline and web server for RNA families prediction.StructRNAfinder: an automated pipeline and web server for RNA families prediction.
-
CEMiTool: a Bioconductor package for performing comprehensive modular co-expression analyses.CEMiTool: a Bioconductor package for performing comprehensive modular co-expression analyses.
-
webCEMiTool: Co-expression Modular Analysis Made Easy.webCEMiTool: Co-expression Modular Analysis Made Easy.
-
Assessing the Impact of Sample Heterogeneity on Transcriptome Analysis of Human Diseases Using MDP Webtool.Assessing the Impact of Sample Heterogeneity on Transcriptome Analysis of Human Diseases Using MDP Webtool.
-
Predicting RNA Families in Nucleotide Sequences Using StructRNAfinder.Predicting RNA Families in Nucleotide Sequences Using StructRNAfinder.
-
OUTBREAK: a user-friendly georeferencing online tool for disease surveillance.OUTBREAK: a user-friendly georeferencing online tool for disease surveillance.
-
Noninvasive prenatal paternity determination using microhaplotypes: a pilot study.Noninvasive prenatal paternity determination using microhaplotypes: a pilot study.
-
Editorial: User-Friendly Tools Applied to Genetics or Systems Biology.Editorial: User-Friendly Tools Applied to Genetics or Systems Biology.
-
Automatic detection of the parasite Trypanosoma cruzi in blood smears using a machine learning approach applied to mobile phone imagesAutomatic detection of the parasite Trypanosoma cruzi in blood smears using a machine learning approach applied to mobile phone images
-
Tucuxi-BLAST: Enabling fast and accurate record linkage of large-scale health-related administrative databases through a DNA-encoded approachTucuxi-BLAST: Enabling fast and accurate record linkage of large-scale health-related administrative databases through a DNA-encoded approach
-
Ten quick tips for harnessing the power of ChatGPT in computational biologyTen quick tips for harnessing the power of ChatGPT in computational biology