(ENS de Lyon, Physics laboratory)
Title — Sparse coding methods applied to DNA replication analysis
Abstract — Cellular replication is widely studied using modern imaging methods. This work aims to understand DNA replication at its most detailed level, which can not be achieved with microscopy images. The data combines genetic sequencing and replication progress [1]. Our goal is to estimate important parameters such as the positions of replication origins in the DNA strand, speed of replication and direction. We are going to discuss a time-scale dictionary approach with several sparse coding methods: LASSO, Matching Pursuit and Sliding Frank-Wolfe [2,4] (off-the-grid approach). A hybrid method [3], that considers speed as a discrete parameter and position as continuous, will be presented and compared to others in the case of DNA replication. We will also discuss other ways of approaching this problem by reformulating it as an inverse problem.
References
[1] B. Theulot et al. “Genome-wide mapping of individual replication fork velocities using nanopore sequencing”. In: Nature Communications 13 (2022). doi: 10.1038/s41467-022-31012-0.
[2] L. Blanc-Féraud B. Laville and G. Aubert. “Off-The-Grid Variational Sparse Spike Recovery: Methods and Algorithms”. In: Journal of Imaging 7.12 (2021). issn: 2313-433X. doi: 10 . 3390 / jimaging7120266. url: https://www.mdpi.com/2313-433X/7/12/266.
[3] Clara Lage et al. Time-Scale Hybrid Continuous-Discrete Sliding Frank-Wolfe Method. eprint: https://hal.science/hal-04146737/.
[4] G. Peyré Q. Denoyelle V. Duval and E. Soubies. “The sliding Frank–Wolfe algorithm and its application to super-resolution microscopy”. In: Inverse Problems 36.1 (2019), p. 014001. doi: 10.1088/1361-6420/ab2a29.