Jo Lab

Research

AI and deep learning approaches for Alzheimer's disease across genomics, neuroimaging, and blood-based biomarkers

NIH National Institute on Aging Alzheimer's Association ADNI AIAD ADSP
  • AARG-22974053 (PI) A Dual-Deep Learning AI Strategy to Identify Tau-associated Genetic Variants in Alzheimer's Disease
  • U01 AG068057 (Co-I) Ultra-scale Machine Learning to Empower Discovery in Alzheimer's Disease Biobanks
  • U01 AG072177 (Co-I) Korean Brain Aging Study (KBASE2)
  • P30 AG072976 (Co-I) Indiana Alzheimer's Disease Research Center (IADRC)
  • U19 AG024904 (Co-I) Alzheimer's Disease Neuroimaging Initiative (ADNI-4)
  • U01 AG061359 (Co-I) Metabolomic Signatures for Disease Sub-classification in Alzheimer's Disease
SWAT-CNN architecture for genomic variant detection
Sliding window-based variant analysis on WGS data
Genomics & AI

Deep Learning for Genetic Variant Discovery

We develop deep learning frameworks to analyze whole-genome sequencing (WGS) data from large-scale cohorts such as the Alzheimer's Disease Sequencing Project (ADSP). Our SWAT-CNN approach applies a sliding-window strategy to detect AD-associated genetic variants, achieving an AUC of 0.82. More recently, our Deep-Block framework analyzed 7,416 WGS participants using linkage disequilibrium-informed deep learning, identifying novel SNPs beyond the well-known APOE region. We also developed DuAL-Net, a dual-attention architecture that integrates local genomic patterns with global biological knowledge for improved AD risk prediction from APOE-centered regional WGS data.

Selected Publications

  1. E.H. Lee, T. Jo*, DuAL-Net: A dual-network approach for Alzheimer's disease risk prediction using APOE-centered regional WGS data. Computational and Structural Biotechnology Journal, 2026.
  2. T. Jo* et al., LD-informed deep learning for Alzheimer's gene loci detection using WGS data. Alzheimer's & Dementia: TRCI, 2025.
  3. T. Jo et al., Deep learning-based identification of genetic variants: Application to Alzheimer's disease classification. Briefings in Bioinformatics, 2022.
Tools: SWAT-web, DuAL-Net
TrUE-Net framework for uncertainty-aware classification
TrUE-Net: Uncertainty stratification of genomic predictions
Uncertainty-Aware AI

Reliable Prediction through Uncertainty Estimation

A critical challenge in applying deep learning to clinical genomics is knowing when a model's prediction can be trusted. Standard classifiers produce point estimates without indicating their own confidence, which limits their utility in high-stakes medical decisions. Our TrUE-Net (Transformer Uncertainty Estimation Network) framework addresses this by combining a transformer encoder and random forest classifier with Monte Carlo dropout to quantify prediction uncertainty.

Applied to WGS data from 1,050 ADNI participants (607 AD, 443 controls), TrUE-Net achieved an overall AUC of 0.664. More importantly, by stratifying predictions based on uncertainty scores, the model identified a high-confidence subset (24.6% of samples) with 72.9% accuracy and F1 score of 0.821. This approach enables clinicians and researchers to distinguish reliable predictions from uncertain ones, providing a practical framework for more informed decision-making in genomic-based disease classification.

Selected Publications

  1. T. Jo*, E.H. Lee et al., Uncertainty-aware genomic classification of Alzheimer's disease: A transformer-based ensemble approach with Monte Carlo dropout. Briefings in Bioinformatics, 2025.
Tools: TrUE-Net
Tau PET brain scans analyzed by deep learning
3D CNN classification on tau PET scans
Neuroimaging & AI

Brain Imaging and Early Diagnosis

Tau PET and structural MRI capture pathological changes in the brain years before clinical symptoms of Alzheimer's disease emerge. Our work applies 3D convolutional neural networks to tau PET scans, achieving 90.8% classification accuracy and revealing informative spatial features in the medial temporal and parietal cortices. We have also published a comprehensive review of deep learning applications across multimodal neuroimaging for AD diagnostic classification and prognostic prediction.

Blood-based omics biomarker analysis
Multi-omics profiling of blood-based biomarkers
Metabolomics / Proteomics & AI

Blood-Based Biomarkers for Disease Prediction

Blood-based biomarkers offer a minimally invasive pathway to early Alzheimer's detection. Our longitudinal plasma proteomics study identified a seven-protein signature achieving AUC 0.848 for predicting incident AD dementia, with proteins including ACES, C7, and IL-17C emerging as key predictors. In metabolomics, we developed circular-SWAT (c-SWAT), a deep learning method for serum-based metabolomic classification that reached AUC 0.808, demonstrating the potential of metabolites as accessible, cost-effective biomarkers for population-level screening.

Selected Publications

  1. E.H. Lee, ..., A.J. Saykin, T. Jo, K. Nho et al., Longitudinal plasma proteomics: Relation to incident Alzheimer's disease dementia and biomarkers. Alzheimer's & Dementia, 2025.
  2. T. Jo et al., Circular-SWAT for deep learning based diagnostic classification of Alzheimer's disease. eBioMedicine, 2023.
Tools: c-SWAT
Multimodal data integration for precision medicine
Integrating multi-omics and imaging for personalized prediction
Precision Medicine

Multimodal Integration for Personalized Prediction

Alzheimer's disease is driven by complex interactions across genetic, molecular, and structural domains. Our precision medicine research integrates genomic, neuroimaging, and metabolomic data into unified AI frameworks to produce individualized risk profiles. By combining the complementary strengths of each data modality, these multimodal models aim to improve prediction accuracy beyond what any single data type can achieve alone, ultimately guiding personalized intervention strategies.

Selected Publications

  1. T. Jo et al., Multimodal deep learning for Alzheimer's disease prediction. Alzheimer's Association International Conference (AAIC), 2023.

Software & Tools

Open-source software developed in our lab for genomic and biomarker analysis.

DuAL-Net

Dual-attention deep learning for AD-associated genetic variant identification from whole-genome sequencing data.

Learn more →

TrUE-Net

Transformer and random forest ensemble with Monte Carlo dropout for uncertainty-aware genomic classification.

Learn more →

SWAT-web

Web application for sliding-window association testing across genome-wide variant data.

Launch →