Motion’s CCRT miRNA Head Into R/Bioconductor 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang Brand-new slide style ~ pixiv id: 36875076
eaeabdab-0bb5-42ed-9ee9-dd3d4a58a5e2 • Accept fastq.gz output by CASAVA – can be paired • Total size of a task is limited by 3TB • I can run SNP detection of a sample now – first 10 paired reads of Prof. Chou, human, No94 – using GATK to detect SNP – total process time: approx. 3.5hr – no SNP found 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang
originally No94 has 30,563,931 paired reads – after compressed, total file size is ~800 MB – task ID: 20b45f9a-65ff-476c-8076-11c1b7bbed05 – start at 7/8 16:30 – end at 7/8 18:30 using 2 hrs 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang
dataset – containing both normal and various cancer subtypes • in silico verification by mapping candidates to breast and lung dataset • We will focus on breast dataset 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang
pathways of these miRNA candidates – wish to find breast related target genes and pathway • This prediction will be done by 建樂學長’s algorithm – most prediction algorithms are for known miRNAs only 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang
special issues – multiple alignment 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang
miRNA – # mature form: 2245 to 2801 – # precursor form: 1600 to 1872 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang
has multiple alignment position on its precursor form • Solution: go to miRBase to check the reported position 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang mature precursor # alignment on precursor hsa-miR-3142 hsa-mir-3142 2 hsa-miR-3673 hsa-mir-3673 3 hsa-miR-4487 hsa-mir-4487 2
alternative miR-3142 location • These kind of miRNAs are usually low-expressed and are not expressed in our data set. 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang u aaa cc --g a ! ucag ggccuuucugaa uucagaaaggcu cuga u! |||| |||||||||||| |||||||||||| |||| ! aguc ucggaaagacuu aagucuuuccgg gacu c! a --g cc aaa u!
reference) • Run time ~ 5mins 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang Full code on https://gist.github.com/ccwang002/5978498
too big (65,247 columns) 2013.07 Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine, National Taiwan University Slides by Liang Bo Wang