Upgrade to Pro — share decks privately, control downloads, hide ads and more …

【初心者向け勉強会#9】MLOpsの基本 ~構築から運用まで~ / MLOps Basics:...

Avatar for Ogata Katsuya Ogata Katsuya
March 08, 2026
120

【初心者向け勉強会#9】MLOpsの基本 ~構築から運用まで~ / MLOps Basics: From Development to Operations

Avatar for Ogata Katsuya

Ogata Katsuya

March 08, 2026
Tweet

More Decks by Ogata Katsuya

Transcript

  1. ࣗݾ঺հ • ໊લ: ॹํ ࠀ࠸ (͓͕ͨ ͔ͭ΍) • ग़਎: ٶ࡚ݝখྛࢢ

    • ॴଐ: େࡕେֶ ৘ใՊֶݚڀՊ B4 • झຯ: ւ֎ཱྀߦ / ϙʔΧʔ / ొࢁ • X: @ogata_katsuya • Homepage: www.ogatakatsuya.com 2 ஍ݩͱ࣮ՈͱʹΌΜ͜
  2. MLOpsͱ͸ʁ Machine Learning × Operations 6 Machine Learning Operations ×

    DevOps for Machine Learning
 
 The techniques for operating the system 
 with machine learning modules.
  3. MLOpsͷྺ࢙ 2020೥͝Ζ͔Β஫໨͞Ε͍ͯΔ👀 10 ੨ઢ: MLOps
 ੺ઢ: DevOps ʮHidden Technical Debt

    in 
 Machine Learning Systemsʯ
 ͷެ։ (2015) Google Cloud NextͰ
 MLOpsͱ͍͏ݴ༿͕ॳΊͯ࢖ΘΕΔ
  4. ࣮ݧج൫ MLͷ࠶ݱੑͱ։ൃੜ࢈ੑΛߴΊΔ • ։ൃ؀ڥΛվળ͢Δ • ύοέʔδϚωʔδϟʔͷಋೖ (uv, poetry, etc.) •

    ؀ڥͷίϯςφԽ • ࣮ݧ༻ͷ؀ڥΛ༻ҙ͢Δ (Jupyter notebookͳͲͷNaaS) • ࣮ݧ؅ཧπʔϧͷಋೖ (Weights and Bias, TensorBoard, etc.) 18
  5. ֶशύΠϓϥΠϯ લॲཧ͔ΒֶशɾσϦόϦʔ·ͰͷϑϩʔΛ੔උ͢Δ • ֶशύΠϓϥΠϯ • σʔλऔಘ -> લॲཧ -> ֶश

    -> ݕূ -> σϓϩΠ • ͜ͷύΠϓϥΠϯΛ͋Β͔͡Ίߏ੒͓ͯ͘͜͠ͱʹΑΓɺ҆શʹ͔ͭߴ଎ʹ ߴ඼࣭ͳMLϞδϡʔϧΛఏڙͰ͖Δ • MLνʔϜ͸Ϟσϧͷ։ൃͷΈʹूதͰ͖Δ 19
  6. ਪ࿦ج൫ ΦϯϥΠϯਪ࿦ / όονਪ࿦ 21 Users Server Data Store Server

    ML Model ML Model Online inference Batch inference
  7. ਪ࿦ج൫ ͦΕͧΕͷਪ࿦ํ๏ͷPros/Cons • ΦϯϥΠϯਪ࿦ • Pros: ϦΞϧλΠϜʹ༧ଌ݁ՌΛฦ͢͜ͱ͕Ͱ͖Δ • Cons: αʔόʔΛৗʹཱͯͯɺϦΫΤετΛࡹ͚Δঢ়ଶʹ͓ͯ͘͠ඞཁ͕͋Δ

    • όονਪ࿦ • Pros: ߴεϧʔϓοτͰඅ༻ରޮՌ͕ߴ͍ • Cons: ϢʔβʔͷϦΫΤετ΍Πϕϯτʹରͯ͠ଈ࠲ʹਪ࿦Ͱ͖ͳ͍ 22
  8. ਪ࿦ج൫ ͦΕͧΕͷਪ࿦ํ๏ͷϢʔεέʔε • ΦϯϥΠϯਪ࿦ • ΫϨδοτΧʔυͷෆਖ਼༧ଌ • ޿ࠂαʔόʔͷ͓͢͢Ί༧ଌ • όονਪ࿦

    • γϣοϐϯάαΠτͷ͓͢͢Ί༧ଌ • إೝূͷຒΊࠐΈੜ੒ (ϦΫΤετ͕དྷͨΒ͢Ͱʹ͋Δ΋ͷͱݟൺ΂Δ͚ͩ) • RAGͷߏங (υΩϡϝϯτͱΠϯσοΫεͷಉظ͸εέδϡʔϧ͢Δ) 23
  9. ܧଓతֶश σʔλͷมԽʹؤ݈ͳϞσϧΛܧଓతʹ࡞Δ • CI/CD (Continuous Integration, Continuous Deployment/Delivery)ͱಉ͡Α͏ ʹɺCT (Continuous

    Training) ͱݺ͹ΕͨΓ͢Δ • ͳͥ͜ͷ࢓૊Έ͕ඞཁ͔ʁ • ࣮ϓϩμΫτͰ͸σʔληοτͷ܏޲͸มΘΓ͏Δ • e.g. ϢʔβʔΠϕϯτ/ ੈͷதͷ৘੎ / ग़඼͞ΕΔ঎඼ɾΧςΰϦ • ͜ΕΒͷมԽʹ௥ਵ͢ΔͨΊʹఆظతʹֶशΛ͢Δඞཁ͕͋Δ 26
  10. LLMOpsͱ͸ʁ Large Language Model × Operations 34 Large Language Model

    Operations × DevOps for Large Language Model
  11. LLMOpsͱ͸ʁ ͓͢͢Ίͷษڧձ 37 MLOps/LLMOps/AgentOpsษڧձ • MLOps/LLMOpsͷੜ͖ͨφϨοδ͸اۀͷ ࣮ϓϩμΫτʹଟ෼ʹଘࡏ͢Δ • ಛʹLLOps, AgentOpsʹؔͯ͠͸୭΋ਖ਼ղΛ

    ͓࣋ͬͯΒͣɺ໛ࡧஈ֊ • (࠶ܝ) ςοΫϒϩά΍ษڧձʹࢀՃ͍ͯ͠ ͖·͠ΐ͏ • ٯʹݴ͑͹ίϛϡχςΟʹد༩͢Δνϟϯε
  12. LLMOpsͱ͸ʁ ࣄྫ঺հ (ςετ/඼࣭อূ) 38 • גࣜձࣾIVRy͞Μ • LLMΛ༻͍ͨࣗಈి࿩ରԠαʔϏε • ϓϩόΠμʹڧ͘ґଘ͍ͯ͠ΔҎ্ɺ

    ϑΥʔϧόοΫઓུΛߟ͑Δඞཁ͕͋Δ • Ϟσϧ͕ߋ৽͞Εͨ࣌ɾϓϩϯϓτΛमਖ਼ ͨ࣌͠ͷ඼࣭Λ୲อ͢ΔςετΛͲ͏͢Δ ͔ • LLM as a Judge ࣮ӡ༻ͰֶΜͩԻ੠ର࿩γεςϜͷධՁͱςετ LLM APIΛ2೥ؒຊ൪ӡ༻ͯۤ͠࿑ͨ͠࿩
  13. LLMOpsͱ͸ʁ ࣄྫ঺հ (؂ࢹج൫) 39 • גࣜձࣾαΠόʔΤʔδΣϯτ͞Μ • LLMͷ඼࣭͸མͪΔՄೳੑ͕ଟ෼ʹ͋Δ • ໌ࣔతͳΤϥʔ΋ͳ͘མͪΔͷͰɺؔ͠ج൫͕ॏཁ

    • LLMͷ୅දతͳ؂ࢹج൫ • LangFuse • PromptOpsతͳϓϩϯϓτͷόʔδϣχϯά • LLM as a judgeΛ༻͍ͨࣗಈςετ • LLMͷೖྗɾग़ྗͷ؂ࢹ Langfuseͷߏங
  14. LLMOpsͱ͸ʁ ࣄྫ঺հ (ࣗಈԽ) 40 • גࣜձࣾαΠόʔΤʔδΣϯτ͞Μ • גࣜձࣾLayerX͞Μ • ϓϩϯϓτͷࣗಈ࠷దԽ

    • ͜Ε΋LLMOpsͷҰछͩͱࢥ͏ • ϓϩϯϓτΛίωίω͢Δͷ͸ଐਓతʹͳΓ͕͔ͪͭɺ ܾ·ͬͨܕ͕͋ΔͷͰࣗಈԽͰ͖ͦ͏ • LLM͸ϓϩϯϓτͱ͍͏Ϣʔβʔ͕ૢ࡞Ͱ͖Δॊೈͳύ ϥϝʔλ͕͋ΔͷͰ܇࿅ͳ͠ͰύʔιφϥΠζͰ͖Δ • In-Context Learning LLMͷϕϯνϚʔΫείΞΛ7෼ɺ100ԁͰ͋͛Δ AI Agent࣌୅ʹ͓͚Δʮ࢖͑͹࢖͏΄Ͳݡ͘ͳΔAIػೳʯͷ։ൃ
  15. (ؓ࿩ٳ୊) LLMΞϓϦέʔγϣϯͷצॴ ϑϥΠϗΠʔϧ 41 • ϑϥΠϗΠʔϧ • AI͕Ϣʔβʔͷߦಈ΍ᅂ޷Λֶश͢Δ͜ ͱʹΑΓɺ࢖͑͹࢖͏͚ͩݡ͘ͳ͍ͬͯ ͘޷॥؀

    • ઌड़ͷ௨ΓɺϓϩϯϓτΛ༻͍Ε͹͍ΖΜ ͳ͜ͱ͕Ͱ͖ͦ͏ • DeNA͞ΜͷYouTube΍ϓϩμΫτ͕
 ษڧʹͳΓ·͢ Edge
  16. ࠓճ࡞੒͢ΔύΠϓϥΠϯ YOLOΛ༻͍ͨը૾ೝࣝ 45 VertexAI Cloud Storage Cloud Build Artifact Registry

    Pipelines trigger Training Deployment Cloud Run Custom Jobs GitHub trigger Weights & Bias
  17. Vertex AIͱ͸ʁ اۀ޲͚ͷ౷߹ܕAI/ػցֶशϓϥοτϑΥʔϜ • ओͳػೳ • Gemini API • ADKΛ༻͍ͨAI

    Agentͷߏங • Vertex AI Search (RAGͷΠϯσοΫεͱݕࡧ) • Ϟσϧ։ൃ • τϨʔχϯάɺσʔληοτɺFeature Storeɺςετ • τϨʔχϯάͰ͸ಠࣗͷύΠϓϥΠϯߏங͕Մೳ • ϞσϧͷσϓϩΠϝϯτ (ΤϯυϙΠϯτ) 46
  18. Vertex AIͱ͸ʁ اۀ޲͚ͷ౷߹ܕAI/ػցֶशϓϥοτϑΥʔϜ 47 VertexAI Pipelines Training Deployment Custom Jobs

    • ύΠϓϥϯػೳΛ༻͍ͯ
 ֶशύΠϓϥΠϯΛ੔උ • Custom jobΛఆٛͰ͖ΔͷͰ
 ਪ࿦αʔόʔ΁ͷϞσϧͷσϓϩΠ·Ͱ • +α • σϓϩΠલʹϞσϧͷݕূΛߦ͏ • σʔλͷલॲཧΛߦ͏
  19. όʔδϣχϯάͱ࣮ݧ؅ཧ Cloud Storage / Weights and Bias 48 Cloud Storage

    Weights & Bias • Cloud StorageΛ༻͍ͯσʔληοτɾϞσϧͷ
 όʔδϣχϯάΛߦ͏ • σʔληοτ͸ࠓճ͸खಈͰόʔδϣχϯάϞσϧ͸ Cloud BuildͷBuildIDͱରԠ͚ͮͯόʔδϣϯχϯά • +α • DVCΛಋೖͯ͠ΈΔ • Vertex AIͷσʔληοτɾFeature StoreػೳΛ
 ࢖ͬͯΈΔ
  20. Vertex AIͱ͸ʁ GitHub͔ΒͷࣗಈτϦΨʔ / CloudRun΁ͷσϓϩΠ 49 • GitHub΁ͷ܇࿅ઃఆϑΝΠϧͷpushΛτϦΨʔʹֶशύΠϓ ϥΠϯશମΛτϦΨʔ •

    CloudBuildΛ༻͍ͯ܇࿅ͱਪ࿦༻ͷdocker imageΛϏϧυ • ਪ࿦ج൫ʹ͸Cloud RunΛ࠾༻ • ࠷ۙ GPU on Cloud Run͕GAʹ • +α • GitHub ActionsͰtest/linter/formatterΛೖΕͯΈΔ • Vertex AI EndpointsΛ࢖ͬͯΈΔ Cloud Run Cloud Build GitHub trigger
  21. ϋοΧιϯʹ޲͚ͯ 51 • MLΛϓϩμΫτʹ૊ΈࠐΉͱ΍ΕΔ͜ͱ͕޿͕Γ·͢ • ಛʹίϯϐϡʔλϏδϣϯܥ(YOLO΋)͸΍ΕΔ͜ͱ͕ଟ͍Ͱ͢ • ߟ͑Δ͜ͱ͸ • σʔληοτΛߏஙͰ͖Δ͔(܇࿅͠ͳ͍ͳΒ͍Βͳ͍)

    • ӡ༻Ͱ͖Δ͔ (CPU্Ͱಈ͘ͳΒͳ͓ྑ͍) • ࣮૷ͷΠϝʔδ͕͔ͭ͘ • MLOpsతͳ؍఺ΛೖΕΔ͔Ͳ͏͔͸৻ॏʹɻɻ • Ͱ΋ҙࣝͰ͖ͯΔͱධՁ͕ྑͦ͞͏ʁ • ϓϨθϯʹ͔ͬ͠Γ૊ΈࠐΈ·͠ΐ͏
  22. ;Γ͔͑Γ 52 • MLOpsͱ͸ʁ • Machine Learning Operations / DevOps

    for ML • MLOpsͷߏ੒ཁૉ • ֶशύΠϓϥΠϯɾόʔδϣχϯάɾ؂ࢹج൫ɾܧଓతֶशɾɾɾ • LLMOpsͱ͸ʁ • LLM Operations / DevOps for LLM • ϋοΧιϯؤு͍ͬͯͩ͘͞💪