Taiwan University HiSeq X Ten • $1,000 genome reached – include typical instrument depreciation, DNA extraction, library preparation, and estimated labor • bundled as at least 10 HiSeq X machines – new optics and chemistry makes them run 10x faster than HiSeq 2500, 18TB in 3 days w/ 6B clusters – new flowcells (use nanowells) 2014.01 Slides by Liang Bo Wang
Taiwan University Cost Behind the $1,000 Holy Grail • $ 10M capital budget for machines only • Run all machines 24/7/365 for 4 years “to get the instrument amortization costs down to $135 per genome, they stretched out the lifecycle to four years” • You’ve got 72,000 human whole genome samples • Requires $ 67M operating cost during 4 years – library prep = reagents AND labor = $65 per sample • Who could analyze the data? 2014.01 Slides by Liang Bo Wang
Taiwan University Who has bought HiSeq X Ten? • one set + 4 by Broad Institute • one set by the Garvan Institute of Medical Research from Australia • one set by Macrogen, leading next-generation sequencing service organization based in Seoul, South Korea and its CLIA laboratory in Rockville, Maryland 2014.01 Slides by Liang Bo Wang
Taiwan University NextSeq 500 • 130M / 400M clusters per run – 120 Gb with 150bp pair end • Somewhere between HiSeq and MiSeq • New optics allow six cameras in a single unit one third of the cost of the current model, and now use an LED allowing Illumina – … ? 2014.01 Slides by Liang Bo Wang
Taiwan University Current(Previous) Technology • two-color laser, 4 bases with separate dyes. • a filter wheel to discriminate the spectra • 4 pictures are captured by CCD per SBS cycle 2014.01 Slides by Liang Bo Wang
Taiwan University Tech used by HiSeq 500 (Assumed) • only two dyes are used – two based labeled with single dye – third with both dyes – fourth with no dyes • only two pictures are taken per SBS cycle – make computation easier – low lib prep complexity – reagent and instrument will be cheaper 2014.01 Slides by Liang Bo Wang
Taiwan University Future Work • Test new RNA-Seq pipeline – Tophat + Cufflinks cannot be scaled on Hadoop – STAR, MapSplice, … • Run DNA-Seq GATK 2.x pipeline – GATK 1.x outdated, parameters change • Analysis report form – Can also be used for Phalanx service • UI design (for prototype) 2014.01 Slides by Liang Bo Wang