fundación de CDSB • 2018: ◦ primer taller ^_^, con instructores de Bioconductor: Martin Morgan & Benilton Carvalho • 2019: ◦ BioC2019: apoyo a solicitud de becas ◦ Taller con materiales adaptados de RStudio • 2020: ◦ regutools: primer paquete en Bioconductor ◦ Taller con RStudio & Bioconductor • 2021: ◦ primera vez con 2 talleres https://comunidadbioinfo.github.io/
= 36 Discovery data Postmortem Human Brain Samples Fetal Infant Child Teen Adult 50+ 6 / group, N = 36 Replication data Andrew E Jaffe @andrewejaffe Ph.D. co-advisor Developmental regulation of human cortex transcription and its clinical relevance at single base resolution doi.org/10.1038/nn.3898 github.com/leekgroup/libd_n36
(GitHub) Christopher Wilks @chrisnwilks Shannon Ellis @Shannon_E_Ellis Kasper Daniel Hansen @KasperDHansen Andrew E Jaffe @andrewejaffe Ph.D. co-advisor + LIBD former boss Jeff Leek @jtleek Ph.D. advisor
N=9,962 TCGA N=11,284 SRA N=49,848 samples expression estimates gene exon junctions ERs Answer meaningful questions about human biology and expression slide adapted from Shannon Ellis Reproducible RNA-seq analysis using #recount2 + Improving the value of public RNA-seq expression data by phenotype prediction doi.org/10.1038/nbt.3838 doi.org/10.1093/nar/gky102
and queries for large-scale RNA-seq expression and splicing Christopher Wilks @chrisnwilks research.libd.org/recount3-docs/ doi.org/10.1186/s13059-021-02533-6
different methods perform best on different data sets (Cobos et al, Nature Communications, 2020) • Benchmarking results from different papers on “real” data ◦ MuSiC paper: MuSiC > NNLS > BSEQ-sx > CIBERSORT ▪ Pancreatic Islet: Beta cells vs. HbA1c (Fig 2a) ◦ Bisque paper: Bisque > MuSiC > CIBERSORT ▪ DLPFC: Microglia vs. Braak stage, Neuron vs. Cognitive diagnostic category (Fig 4) ◦ SCDC paper: SCDC > MuSiC > Bisque > DWLS > CIBERSORT ▪ Pancreatic Islet: Beta cells vs. HbA1c (Fig 4b) ◦ Cobos benchmark: DWLS > MuSiC > Bisque > deconvoSeq ▪ Human PMBC flow sorted (Fig 7) 29 Louise A Huuki-Myers @lahuuki
- A slide contains 4 capture areas, each full of thousands of 55um-wide “spots” (often containing 1-10 cells) - Unique barcodes in each spot bind to particular genes; after sequencing, gene expression can be tied back to exact spots, forming a spatial map Kristen R. Maynard 38
• k=2: separate white vs. grey matter • k=9: best reiterated histological layers • k=16: data-driven optimal k based on fast H+ statistic 46 More Clusters = More Complexity doi.org/10.1101/2023.02.15.528722
structure • Correlate enrichment t-statistics for top marker genes of reference ◦ Cluster vs. manual annotation • Annotate with strongly associated histological layer 47 Sp k D d ~L doi.org/10.1101/2023.02.15.528722
from the literature Software name Overall approach Input Cell Counts Output Tangram (Biancalani et al.) Mapping individual cells Every spot Integer counts Cell2location (Kleshchevnikov et al.) Matching gene-expression profile Average across spots Decimal counts SPOTlight (Elosua-Bayes et al.) Matching gene-expression profile Not used Proportions 52 Excit L5 Counts
Visium - Multi-channel fluorescent images captured of the same tissue - Channels measure proteins marking for specific cell types Kristen R. Maynard 56 Sang Ho Kwon Visium-SPG = Visium SRT + immunofluorescence (using identical tissue samples) Fluorescent Protein Cell Type TMEM119 Microglia Neun Neurons OLIG2 Oligodendrocytes GFAP Astrocytes
spatial domains B. Cell-cell communication; cell-type-informed ligand-receptor interactions in the context of schizophrenia risk A 62 Boyi Guo Melissa Grant-Peters
aim to use the best methods 74 Moses, L., Pachter, L. Museum of spatial transcriptomics. Nat Methods 19, 534–546 (2022). https://doi.org/10.1038/s41592-022-01409-2
software can change dramatically (function and syntax) between versions - Promotes collaboration by allowing two researchers to share exact code and instantly run software without special set-up SpatialExperiment release 3.14 SpatialExperiment devel 3.15 module load tangram/1.0.2 module load cell2location/0.8a0 module load spagcn/1.2.0 https://github.com/LieberInstitute/jhpce_mod_source https://github.com/LieberInstitute/jhpce_module_config Nicholas J Eagles @Nick-Eagles (GitHub)
clarify functionality and report bugs - Documentation for code and author responsiveness on GitHub can be critical in successfully applying software to our data Nicholas J Eagles @Nick-Eagles (GitHub)
think adding 0, multiplying by 1 • It nearly always takes a team • Data sharing accelerates science + democratizes access to it • Zooming in allows us to reduce the heterogeneity • We can learn from each other: from uniformly processing our data & re-using it → replicate / validate?
Weber @stephaniehicks Stephanie C Hicks @abspangler Abby Spangler @martinowk Keri Martinowich @CerceoPage Stephanie C Page @kr_maynard Kristen R Maynard @lcolladotor Leonardo Collado-Torres @Nick-Eagles (GH) Nicholas J Eagles Kelsey D Montgomery Sang Ho Kwon Image Analysis Expression Analysis Data Generation Thomas M Hyde @lahuuki Louise A Huuki-Myers @BoyiGuo Boyi Guo @mattntran Matthew N Tran @sowmyapartybun Sowmya Parthiban Slides available at speakerdeck.com /lcolladotor + Many more LIBD, JHU, and external collaborators @mgrantpeters Melissa Grant-Peters @prashanthi-ravichandran (GH) Prashanthi Ravichandran