Upgrade to Pro — share decks privately, control downloads, hide ads and more …

クラウドを活用したゲノム情報解析の現状

 クラウドを活用したゲノム情報解析の現状

情報処理学会 連続セミナー 2016 第2回 クラウド http://www.ipsj.or.jp/event/seminar/2016/program02.html

Tazro Inutano Ohta

July 22, 2016
Tweet

More Decks by Tazro Inutano Ohta

Other Decks in Research

Transcript

  1. Ϋϥ΢υΛ׆༻ͨ͠ήϊϜ৘ใղੳͷݱঢ় 22 July 2016 | ৘ใॲཧֶձ ࿈ଓηϛφʔ 2016 ୈ2ճ Ϋϥ΢υ

    େా ୡ࿠! େֶڞಉར༻ػؔ๏ਓ ৘ใɾγεςϜݚڀػߏ " σʔλαΠΤϯεڞಉར༻ج൫ࢪઃ " ϥΠϑαΠΤϯε౷߹σʔλϕʔεηϯλʔ ಛ೚ݚڀһ" [email protected] Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS)
  2. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) Agenda! #

    1. ࠓੜ໋Պֶͷ෼໺ͰԿ͕ى͖͍ͯΔͷ͔" # 2. ࠓͲͷΑ͏ͳܭࢉػ͕ٻΊΒΕ͍ͯΔͷ͔" # 3. Ϋϥ΢υΛ׆༻ͯ͠໰୊Λղܾ͢Δ
  3. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ࠓੜ໋Պֶͷ෼໺ͰԿ͕ى͖͍ͯΔͷ͔ #

    ࣮ݧػցͷਐาʹΑͬͯσʔλͷαΠζͱྔ͕૿Ճ" # ήϊϜ෼໺Ͱ͸ʮ࣍ੈ୅DNAγʔΫΤϯαʔʯ͕ొ৔" # σʔλͷ஝ੵʹΑͬͯܭࢉػੜ෺ֶ͕੝Μʹͳ͍ͬͯΔ" # λϯύΫཱ࣭ମߏ଄σʔλɺը૾σʔλ" # σʔλॲཧɾղੳͷޮ཰Խ͸ࠓͳ͓ٸ຿" # ΞϧΰϦζϜͷਐาΛ଴͍ͬͯΔ࣌ؒ͸ͳ͍" # ϋʔυ΢ΣΞͷੑೳͰ໰୊Λղܾ͢Δ৔߹΋
  4. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ήϊϜՊֶͷ෼໺ͰԿ͕ى͖͍ͯΔͷ͔ #

    ࣮ݧػցͷਐาΛཚ๫ʹྫ͑ΔͳΒ…" # ւ = ήϊϜ, ڕ = Ҩ఻ࢠ" # ʮͲΜͳڕ͕͍Δ͔ௐ΂Δ͜ͱͰւΛಛ௃͚ͮΔʯ" # ٕज़ͷਐาͰಓ۩ͷੑೳ͕޲্ͨ͠" # ௼Γ؄͕ఈҾ͖໢ʹͳͬͨ
  5. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) DNAγʔέϯα͔ΒಘΒΕΔσʔλ #

    ʮήϊϜΛղಡ͢ΔʯͱҰݴͰݴ͏΋ͷͷ…" # ੜମαϯϓϧ͔ΒDNAΛநग़͢Δ" # நग़ͨ͠DNAΛ୹͍෼ࢠʹஅยԽ͢Δ" # DNAγʔέϯαͰղੳ͢Δ" # ୹͘அยԽ͞ΕͨԘج഑ྻͷϦετͰग़ྗ͞ΕΔ" # େྔͷDNAஅยͷ৘ใ͔ΒݩͷDNAΛ෮ݩ͢Δ! # de novo Assemble" # Reference Alignment" "
  6. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) σʔλղੳιϑτ΢ΣΞ (ղੳπʔϧ)

    # ଟ͘ͷղੳπʔϧ͕ΦʔϓϯιʔεͰެ։͞Ε͍ͯΔ" # ର৅σʔλͷੑ࣭ʹΑͬͯ࠷దͳπʔϧ͕ҟͳΔ" # σʔλղੳऀ (ੜ෺ֶऀ) ͕σʔλղੳΛߦ͏" # πʔϧ։ൃऀ(࣮૷ऀ)ͱར༻ऀ͸ಉҰͰ͸ͳ͍" # ར༻ऀ͕πʔϧͷڍಈΛ׬શʹ೺Ѳ͍ͯ͠Δͱ͸ݶΒͳ͍" # ղੳऀ͸ৗʹσʔλղੳΛ͍ͯ͠ΔΘ͚Ͱ͸ͳ͍" # ੜ෺࣮ݧͷยखؒʹղੳΛ͢Δݚڀऀ΋ଟ͍
  7. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ࠓੜ໋Պֶͷ෼໺ͰԿ͕ى͖͍ͯΔͷ͔! #

    ·ͱΊ" # σʔλͷྔͱ਺͕ٸܹʹ૿͓͑ͯΓɺࠓޙ΋૿͑Δ" # ໨తʹΑͬͯҟͳΔπʔϧɾΞϧΰϦζϜ͕࢖༻͞ΕΔ" # σʔλղੳऀͱπʔϧ։ൃऀ(࣮૷ऀ)͸ҟͳΔ͜ͱ͕ଟ͍
  8. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ࠓͲͷΑ͏ͳܭࢉػ͕࢖ΘΕ͍ͯΔͷ͔ #

    PC" # PCΫϥελ" # ڌ఺εύίϯ" # ࠃཱҨ఻ֶݚڀॴ εʔύʔίϯϐϡʔλγεςϜ
  9. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ͲͷΑ͏ͳܭࢉػ͕ٻΊΒΕΔͷ͔ #

    ର৅σʔλ͕େ͖͘ͳΔ/૿͑Δͱ௨ৗͷPCͰ͸ݫ͍͠" # ղੳσʔλ͕ͲΜͲΜཷ·Δ" # ಡΈॻ͖͕ߴ଎ͰڊେͳετϨʔδ! # πʔϧ͕Out of memoryͰམͪΔ" # େن໛ϝϞϦ! # όονॲཧΛେྔͷαϯϓϧʹର࣮ͯ͠ߦ͢Δ" # ෼ࢄ࣮ߦδϣϒεέδϡʔϦϯάγεςϜ! # େܕڞ༻ܭࢉػ΁ͷཁٻͷߴ·Γ" # Ҩ఻ֶݚڀॴSCͷಋೖ (2012~) => ·ͩे෼Ͱ͸ͳ͍
  10. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ݱ৔Ͱ͸Կ͕ϘτϧωοΫͳͷ͔! εύίϯϢʔβձͳͲͷώΞϦϯάΑΓ

    # ܭࢉػʹෆ׳ΕͳϢʔβͷ೰Έ" # ܭࢉػ͝ͱʹԿ͕Ͱ͖ͯԿ͕Ͱ͖ͳ͍ͷ͔Θ͔Βͳ͍" # େن໛ͳܭࢉػΛඞཁͱ͢Δ͕CUI͕࢖͑ͳ͍" # ܭࢉػΛ࢖͍͜ͳ͢ਓͷ೰Έ" # ܭࢉػ͕ࠞΜͰ͍ͯδϣϒ͕ྲྀͤͳ͍" # σʔλͷղੳ΍อଘʹे෼ʹ༧ࢉΛ౤ೖͰ͖ͳ͍! # ؀ڥߏஙʹίετ͕͔͔Δ" # ܭࢉػͷ໘౗Λݟͨ͘ͳ͍
  11. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ͲͷΑ͏ͳܭࢉػ͕ٻΊΒΕ͍ͯΔͷ͔ #

    ·ͱΊ" # ର৅σʔλͱ໨తʹΑͬͯཁٻʹ͕ࠩ͋Δ" # ήϊϜ෼໺Ͱ͸ετϨʔδ΍ϝϞϦͷ೰Έ͕ਂࠁ" # ϢʔβͷܭࢉػϦςϥγʹ΋෯͕͋Δ" # ϢʔβͷϨϕϧʹΑͬͯٻΊΔϨΠϠʔ͕ҧ͏
  12. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) Ϋϥ΢υΛ׆༻ͯ͠໰୊Λղܾ͢Δ #

    Ϋϥ΢υͰղܾͰ͖Δ໰୊" # ಋೖίετ" # ϊʔυͷࠞࡶ" # ϝϯςφϯείετ" # Ϋϥ΢υར༻ʹ͓͚Δ՝୊" # ετϨʔδͷίετ" # ݚڀඅͰͷࢧ෷͍" # ະൃදσʔλ / ݸਓ৘ใΛؚΉσʔλͷѻ͍
  13. The NIH Commons! ถࠃͰ͸ϑΝϯσΟϯάଆ͕Ϋϥ΢υར༻Λଅਐ “The Commons is a shared virtual

    space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage, share, use and reuse data, software, metadata and workflows.” - https://datascience.nih.gov/ commons
  14. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) Ϋϥ΢υ׆༻ࣄྫ (PaaS/SaaS)!

    ήϊϜղੳύΠϓϥΠϯ on ΞΧσϛοΫɾΠϯλʔΫϥ΢υ # JST CREST: ΠϯλʔΫϥ΢υΛ׆༻ͨ͠ΞϓϦέʔγϣϯத৺ܕΦʔόʔ ϨΠΫϥ΢υٕज़ʹؔ͢Δݚڀ (୅ද: NII߹ాઌੜ)" # ΞΧσϛοΫɾΠϯλʔΫϥ΢υͷࢼΈ" # Ҩ఻ݚεύίϯΛ৘ใݚΫϥ΢υଞࠃ಺ͷΞΧσϛοΫΫϥ΢υͱ࿈ܞ" # ղੳʹ༻͍ΒΕΔ֤πʔϧΛDockerԽ͢Δ͜ͱͰΞϓϦέʔγϣϯΛ ϙʔλϒϧʹ" # ༧ΊπʔϧΛ૊Έ߹ΘͤͨϫʔΫϑϩʔΛߏங͠GUIΛఏڙ" # ղੳσʔλ͝ͱʹ࠷దͳϦιʔεΛׂΓ౰ͯͨܭࢉػΛ্ཱͪ͛
  15. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) Ϋϥ΢υΛ׆༻ͯ͠໰୊Λղܾ͢Δ #

    ·ͱΊ: Ϋϥ΢υར༻ʹ͓͚Δ՝୊" # ετϨʔδͷίετ" # ܭࢉ࣌͸ߴ଎ͳI/OΛཁٻ" # อ؅࣌͸௿ίετͳετϨʔδ" # (঎༻Ϋϥ΢υͷ৔߹) ݚڀඅͰͷࢧ෷͍" # ݸਓ৘ใΛؚΉσʔλͷѻ͍" # ҆શੑͷཱ֬ - ར༻࣮੷ͷ஝ੵ" # ΨΠυϥΠϯ౳ͷࡦఆ
  16. Secure cloud computing for genomic data! Datta, Somalee, Keith Bettinger,

    and Michael Snyder. "Secure cloud computing for genomic data." Nature Biotechnology 34.6 (2016): 588-591.! ήϊϜσʔλղੳʹΫϥ΢υΛ༻͍Δ͋ͨΊʹඞཁͳηΩϡϦςΟ͸ ݚڀػؔͱΫϥ΢υϓϩόΠμͷ࿈ܞʹΑͬͯ੒͞ΕΔඞཁ͕͋Δ
  17. Secure cloud computing for genomic data! Datta, Somalee, Keith Bettinger,

    and Michael Snyder. "Secure cloud computing for genomic data." Nature Biotechnology 34.6 (2016): 588-591.! # Security requirements" # The data privacy agreement / σʔλͷऔѻʹ͍ͭͯͷݚڀػؔͱͷ߹ҙ" # Physical and logical security / ෺ཧ/࿦ཧͰͷηΩϡϦςΟ" # Encryption data / σʔλͷอ؅/సૹ࣌ͷ҉߸Խ" # Authentication / Ϣʔβೝূ " # Principle of Least Privilege / ࠷খݖݶͷݪଇ" # Firewalls / ϑΝΠϠʔ΢Υʔϧ" # Logging and monitoring / ϩΪϯάͱϞχλϦϯά" # Training / ηΩϡϦςΟ΍ೝূʹ͍ͭͯͷτϨʔχϯά" # Security and privacy / ݸਓ৘ใͷอޢ
  18. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) Summary #

    ࠓੜ໋Պֶͷ෼໺ͰԿ͕ى͖͍ͯΔͷ͔" ◦ େن໛ͳσʔλͷ஝ੵʹΑΓܭࢉػधཁ͕ߴ·͍ͬͯΔ" # ࠓͲͷΑ͏ͳܭࢉػ͕ٻΊΒΕ͍ͯΔͷ͔" ◦ ήϊϜ෼໺Ͱ͸ετϨʔδ΍ϝϞϦ͕ॏࢹ͞ΕΔ" ◦ ར༻ऀʹΑͬͯཁٻ͕ࡉ͔͘ҧ͏" # Ϋϥ΢υΛ׆༻ͯ͠໰୊Λղܾ͍͖͍ͯͨ͠" ◦ Ϋϥ΢υͷརศੑΛ͞ΒʹߴΊ͍ͯ͘" ◦ ར༻ࣄྫΛ૿΍͢͜ͱ͕ॏཁ
  19. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ϥΠϑαΠΤϯε౷߹σʔλϕʔεηϯλʔʹ͍ͭͯ #

    ϥΠϑαΠΤϯε෼໺ʹ͓͚Δσʔλϕʔε౷߹ʹࢿ͢Δٕज़ ։ൃΛ୲͏" # ج൫ٕज़։ൃ" # ηϚϯςΟοΫ΢Σϒٕज़΍ࣗવݴޠॲཧΛ༻͍ͨϑΣσ Ϩʔγϣϯܕσʔλ౷߹ͷͨΊͷٕज़։ൃ΍ࠃࡍඪ४ͷࡦ ఆʹऔΓ૊Ή" # DDBJ࿈ܞ" # େن໛ήϊϜσʔλΛ࢝Ίͱ͢Δσʔλͷ׆༻ͷ
 ͨΊͷٕज़։ൃΛߦ͏
  20. Licensed under CC-BY 4.0 ©2016 Tazro Ohta (DBCLS) ϥΠϑαΠΤϯε౷߹σʔλϕʔεηϯλʔʹ͍ͭͯ #

    JSTͷηϯλʔ NBDC ͱڞಉͰσʔλϕʔεࣄۀΛਐΊΔ" # DDBJͱ͸ಉ͡૊৫ (ROIS, NII΋ಉ͡) Ͱ࿈ܞ͍ͯ͠Δ http://dbcls.rois.ac.jp/about