Statistical Connectomics

Thesis Committee Meeting


Jaewon Chung

(he/him) - NeuroData lab
Johns Hopkins University
Department of Biomedical Engineering

icon j1c@jhu.edu
icon @j1c (Github)
icon @j1c (Twitter)

Outline

  • What we've done

    • Deriving Connectomes of Human Brains
    • Statistical Modeling for Connectomes
    • Heritability of Human Connectomes
    • graspologic + hyppo + m2g
  • Graduation plan

Representing brains as networks

Networks (or graphs) are mathematical abstractions to represent relational data
  • Vertices - the set of objects (brain regions)
  • Edges - the set of connections between those objects (brain regions)
    • E.g. region 1 connects to region 2 with 100 neural bundles

center

Connectomes from diffusion MRI (dMRI)

  • in vivo imaging technique
  • Exploits direction of water diffusion
    • Anisotropic in white matter tracts
    • Isotropic in other tissues
  • Estimates of number of white matter tracts via tractography

center

MRI to graphs (m2g)

  • Easy to use end-to-end pipeline
    • Input: MRI data
    • Output: Connectomes, QA measures, derivatives
  • Reproduces biological properties
    • Stronger ipsilateral connections
  • High discriminability
    • Same subjects' connectomes are more similar than different subjects'

center

Outline

  • What we've done

    • Connectomes of Human Brains
    • Statistical Modeling for Connectomes
    • Heritability of Human Connectomes
    • graspologic + hyppo + m2g
  • Graduation plan

Different data, same statistics (Ascombe's Quartet)

  • These four datasets have same statistics!

center

Different networks, same statistics

  • These four networks have same (graph) statistics!


center

Statistical models for networks

  • Random dot product graphs (RDPGs)
    • Each vertex has a low dimensional latent position.
    • Estimate latent position matrix via adjacency spectral embedding.
    • =

center

Two sample graph testing

  • Suppose we have two networks
  • Want to test if they are "same" or not

Hypothesis:

  • Network 1Network 2
  • Network 1Network 2

More precisely:

Drosophila Left vs Right Brain

center

Outline

  • What we've done

    • Connectomes of Human Brains
    • Statistical Modeling for Connectomes
    • Heritability of Human Connectomes
    • graspologic + hyppo + m2g
  • Graduation plan

Heritability of connectomes?

  • Heritability = proportion of phenotypic variance due to genetic variance
    • Predict disease rick
    • Potential for targeted treatments
    • Genes -> structure -> function -> behavior

Heritability as causal problem

center

Do genomes affect connectomes?

  • Our hypothesis:
    C, GCG
    C, GCG

  • Known as independence testing

  • Test statistic: distance correlation (Dcorr)

  • Implication if false: there exists an associational heritability.



center

Do genomes affect connectomes given covariates?

  • Want to test:
    C, G|CoC|CoG|Co
    C, G|CoC|CoG|Co
  • Known as conditional independence test
  • Test statistic: Conditional distance correlation (CDcorr)
  • Implication if false: there exists causal dependence of connectomes on genomes.

center

Human Connectome Project

  • Brain scans from identical (monozygotic), fraternal (dizygotic), non-twin siblings.
  • Regions defined using Glasser parcellation (180 regions).

center

Van Essen, David C., et al., The WU-Minn human connectome project: an overview (2013)

Glasser, Matthew F., et al. "A multi-modal parcellation of human cerebral cortex." Nature (2016).

Methods of comparing connectomes

  • Exact : measures all differences in latent positions
    • Differences in the latent positions implying differences in the connectomes themselves
  • Global : considers the latent positions of one connectome are a scaled version of the other
    • E.g. males may have globally fewer connections than females
  • Vertex : similar to the global differences, but it allows for each vertex to be scaled differently
    • E.g regions have preferences in connections
    • regions tend to connect strongly within hemisphere

We see stochastic ordering along familial relationships

Connectome Models
center

Neuroanatomy

center

center

We detect heritability (associational)

center

Some signals disappear after conditioning

center

To sum up...

center

  • Statistical models = nuanced investigations
  • Connectomes are dependent on genome, up to some common structures.

Outline

  • What we've done

    • Connectomes of Human Brains
    • Statistical Modeling for Connectomes
    • Heritability of Human Connectomes
    • graspologic + hyppo + m2g
  • Graduation plan

How to use these tools?

Outline

  • What we've done

    • Connectomes of Human Brains
    • Statistical Modeling for Connectomes
    • Heritability of Human Connectomes
    • graspologic + hyppo
  • Graduation plan

Summary of work so far

Manuscripts

  • (Co)-First author
    • Heritability, in review at Imaging Neuro (2024)
    • m2g, in review at Nature Methods (2024)
    • Two-sample graph testing, Stat (2022)
    • Statistical Connectomics, ARISA (2021)
    • graspologic, JMLR (2019)
  • Second author
    • Indep. Testing in Time Series, TMLR (2024)
    • Causal Conditional DCorr, in review (2023)
    • Multiscale Connectomics, in review (2023)
  • Others
    • 5 others published

Conference Presentations

  • OHBM (x3)
  • SfN (x3)
  • Neuromatch (x2)

Invited Lectures & Talks

  • JSM, 2023
  • Advanced Graph Analytics Workshop (JHU), 2023
  • OHBM, 2019

Awards

  • BRAIN Initiative Trainee Highlight Award
  • AWS Research Credit Grants (x2)

Summary of work to be done

Manuscripts

  • Respond to reviews
  • Collaboration with Child Mind Institute

Conferences/Talks

  • Collaborative Research in Computational Neuroscience (CRCNS)
  • Advanced Graph Analytics Workshop (JHU), 2024

Code

  • Continue to develop graspologic and hyppo

Graduation May 2024

Acknowledgements

Team

person
Eric Bridgeford

person
Ben Pedigo

person
Derek Pisner

person
Cencheng Shen

person
Ronak Mehta

person
Vivek Gopalakrishnan

person
Mike Powell

person
Carey Priebe

person
Joshua Vogelstein

NeuroData lab, Microsoft Research

Feedback?




Jaewon Chung

icon j1c@jhu.edu
icon @j1c (Github)
icon j1c.org

Appendix

How do we compare genomes?

  • Neuroimaging twin studies do not sequence genomes.
  • Coefficient of kinship ()
    • Probabilities of finding a particular gene at a particular location.
  • d(Genome, Genome) = 1 - 2.

Relationship
Monozygotic
Dizygotic
Non-twin siblings
Unrelated

Neuroanatomy (mediator), Age (confounder)

  • Literature show:
    • neuroanatomy (e.g. brain volume) is highly heritable.
    • age affects genomes and potentially connectomes
  • d(Covariates, Covariates) = ||Covariates - Covariates||

How do we compare connectomes?

  • Random dot product graph (RDPG)

    • Each vertex (region of interest) has a low dimensional latent vector (position).
    • Estimate latent position matrix via adjacency spectral embedding.
  • d(Connectome, Connectome) =

Distance correlation

  • Measures dependence between two multivariate quantities.
    • For example: connectomes, genomes.
  • Can detect nonlinear associations.
  • Measures correlation between pairwise distances.

center

Conditional distance correlation

  • Augment distance correlation procedure with third distance matrix.

center

Associational Test for Connectomic Heritability

  • Connectome, GenomeConnectomeGenome
    Connectome, GenomeConnectomeGenome

center


Sex All Females Males
p-value

Associational Test for Neuroanatomy

  • Neuroanatomy, GenomeNeuroanatomyGenome
    Neuroanatomy, GenomeNeuroanatomyGenome

center


Sex All Females Males
p-value

Causal Test for Connectomic Heritability

  • Conn., Genome|CovariatesConn.|CovariatesGenome|Covariates
    Conn., Genome|CovariatesConn.|CovariatesGenome|Covariates

Sex All Females Males
p-value

https://neurodata.io/talks/tathey1/23_06_12_thesis/pres.html#2

![center h:525](../images/fiber-tract-vert.jpeg)

<footer> Athreya et al. "RDPG..." _JMLR_ (2021) </footer>

- Causal models = rigorous, interpretab

![w:450](../images/logos/brain-logo.jpeg)

- NeurIPS Workshop (x1)

- Collaboration with Alex Badea

- $P[i\rightarrow j]$ = $\langle x_i, x_j\rangle$