【MLN】Visual Blocks for ML

Visual Blocks for ML MLN 2023/06/24

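Who am I?
・Name: 高橋かずひと (Kazuhito Takahashi)
・Twitter: KzhtTkhs
・Affiliation: CyberAgent AI Business Division, Government DX Div, GovTech Development Center
・Work: image processing, plus a bit of everything
・Other: writes for Axross and helps out with indie game development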

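Image-Processing-Node-Editor
Features
・An image processing app in node-editor form, built by Takahashi
・Nodes can be connected by drag and drop
・Each processing step can be tried out while it is visualized
・OSS (Apache-2.0 license)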

0. Overview

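What is Visual Blocks for ML?
Features
・A no-code graph editor made by Google
・Nodes can be connected by drag and drop
・ML workflow prototypes can be tried out while they are visualized
・Available as a demo that runs on the web and as a package that runs in Colaboratory
Quoted from https://visualblocks.withgoogle.com/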

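What is Visual Blocks for ML?
Caveats
・Although it is made by Google, it is not an officially supported Google product
・It is an experimental release meant to gauge interest in the product
・There is no guarantee of long-term maintenance
Note: GitHub updates are already not very active at this point. More precisely, the core is not developed in public, so its status is unclear.
Quoted from https://visualblocks.withgoogle.com/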

1. Input node

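Visual Blocks for ML: Input nodes
・Input image
・Input text
・Live camera

Input image
・Takes a still image as input
・A few test images are provided by default
・Uploading your own image is also possible

Input text
・Accepts text input
・Combine it with other nodes for overlay display or as input to an API

Live camera
・Uses a webcam as the input image
・Switching cameras is not possible (at the moment)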

2. Effect node

Visual Blocks for ML: Effect nodes
・Image Mixer
・Image processor
・Shader processing
・Text processor
・Virtual sticker

Image Mixer
・Combines two images
・Multiple blend modes
・Text overlay

Image processor
・Runs the following image operations: brightness, contrast, crop, resize, shear, rotation, blur, noise
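As a rough illustration of what brightness and contrast adjustment do to pixel values, here is a minimal pure-Python sketch. The function names `adjust_pixel` and `adjust_image` and the formula (scale around mid-gray 128, then add an offset) are assumptions chosen for illustration; the slides do not say how the Image processor node implements these operations.

```python
def adjust_pixel(value: int, brightness: float = 0.0, contrast: float = 1.0) -> int:
    """Apply contrast (scale around mid-gray), then brightness (offset), to one 8-bit value."""
    adjusted = (value - 128) * contrast + 128 + brightness
    # Clamp to the valid 8-bit range.
    return max(0, min(255, round(adjusted)))


def adjust_image(pixels, brightness=0.0, contrast=1.0):
    """Apply adjust_pixel to every value of a 2-D list of grayscale pixels."""
    return [[adjust_pixel(v, brightness, contrast) for v in row] for row in pixels]
```

Calling `adjust_image(pixels, brightness=10)` shifts every pixel up by 10 and clips anything that would exceed 255.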

Shader processing
・Runs arbitrary shader code written in the Code editor

Text processor
・Performs text processing
 - Text concatenation
 - Text replacement
 - Mustache templates
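The three text operations listed above can be sketched in plain Python. This is only an illustration: `render_template` is a hypothetical helper that handles simple `{{name}}` variable tags, and it leaves out the rest of Mustache (sections, partials, HTML escaping). It is not the node's actual implementation.

```python
import re


def render_template(template: str, values: dict) -> str:
    """Replace each {{name}} tag with values[name]; unknown names become empty strings."""
    def substitute(match):
        return str(values.get(match.group(1), ""))

    return re.sub(r"\{\{\s*(\w+)\s*\}\}", substitute, template)


# Concatenation
joined = "Hello, " + "world"
# Replacement
replaced = joined.replace("world", "MLN")
# Mustache-style template rendering
rendered = render_template("{{greeting}}, {{name}}!", {"greeting": "Hello", "name": "MLN"})
```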

Virtual sticker
・Overlays an image using the input image and face landmark coordinates

3. Model node

Visual Blocks for ML: Model nodes (1)
・Body segmentation
・Face detection
・Face landmark
・Gesture Recognition
・Hand pose detection
・Mobilenet
・Object detection
・PaLM Chat

Body segmentation
・Gets a mask that cuts out people and (some) animals

Face detection
・Gets face detection results (bounding boxes and keypoints)

Face landmark
・Gets face landmark detection results

Gesture Recognition
・Detects hand gestures (it throws an error 😇)

Hand pose detection
・Gets hand keypoint detection results

Mobilenet
・Gets classification results using Mobilenet (v1 or v2)

Object detection
・Gets object detection results (model unknown. MobileNet-SSDLite?)

PaLM Chat
・Runs a PaLM chat
Note: an API key is required

Visual Blocks for ML: Model nodes (2)
・Portrait depth
・Pose landmark
・Text Toxicity
・TFJS model runner
・TFLite model runner
・PaLM Text Generator

PaLM Text Generator
・Generates text with PaLM
Note: an API key is required

Portrait depth
・Gets segmentation and depth estimation results

Pose landmark
・Gets pose estimation results

Text Toxicity
・Classifies whether the input text is toxic

TFJS model runner
・Runs TFJS models, such as those from TensorFlow-Hub
Note: pre-processing and post-processing are done with the Tensor nodes

TFLite model runner
・Runs TFLite models, such as those from TensorFlow-Hub
Note: pre-processing and post-processing are done with the Tensor nodes

4. Output node

Visual Blocks for ML: Output nodes (1)
・3D Photo
・Bounding box visualizer
・Classification Viewer
・HTML viewer
・Image comparison
・Image viewer

3D Photo
・Displays a 3D image from a depth estimation result and an image

Bounding box visualizer
・Overlays bounding boxes, such as object detection results, on an image

Classification Viewer
・Displays classification results

HTML viewer
・Displays simple HTML styled with Tailwind CSS

Image comparison
・Displays multiple images side by side for comparison

Image viewer
・Displays an image
・Can also display an image from a URL, or several images side by side
Note: since it can show multiple images, the difference from Image comparison is not very clear.

Visual Blocks for ML: Output nodes (2)
・Landmark visualizer
・Mask visualizer
・Tensor to depthmap
・Tensor to image
・Tensor viewer

Landmark visualizer
・Visualizes keypoints, such as pose estimation results, together with the image

Mask visualizer
・Displays an image cut out with a mask, such as a segmentation result

Tensor to depthmap
・Displays a depth estimation result in pseudo-color

Tensor to image
・Converts a Tensor to an image

Tensor viewer
・Displays the values of a Tensor

5. Tensor node

Visual Blocks for ML: Tensor nodes
・Convert tensor type
・Crop and resize
・Preprocess image
・Remap value range
・Tensor picker
・Tensor to classification

Convert tensor type
・Converts the type of a Tensor

Crop and resize
・Crops and resizes a Tensor

Preprocess image
・Preprocesses an image and converts it to a Tensor
 - Resize
 - Normalization
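To make the two preprocessing steps concrete, here is a minimal pure-Python sketch on a grayscale image stored as nested lists. The helper names `resize_nearest` and `normalize`, the nearest-neighbor method, and the mean/std parameters are all assumptions for illustration; the slides do not describe which interpolation or normalization the node uses.

```python
def resize_nearest(pixels, new_h, new_w):
    """Resize a 2-D list of pixels with nearest-neighbor sampling."""
    old_h, old_w = len(pixels), len(pixels[0])
    return [
        [pixels[y * old_h // new_h][x * old_w // new_w] for x in range(new_w)]
        for y in range(new_h)
    ]


def normalize(pixels, mean=0.0, std=255.0):
    """Map each pixel to (value - mean) / std; the defaults scale 0..255 to 0..1."""
    return [[(v - mean) / std for v in row] for row in pixels]


image = [[0, 255], [255, 0]]
tensor = normalize(resize_nearest(image, 4, 4))
```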

Remap value range
・Remaps the range of values given a minimum and a maximum
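Remapping a value range is usually a linear rescale, which can be sketched as follows. The function name `remap` and its parameters are hypothetical, and a linear mapping is an assumption about what the node does.

```python
def remap(value, in_min, in_max, out_min, out_max):
    """Linearly map value from [in_min, in_max] to [out_min, out_max]."""
    if in_max == in_min:
        raise ValueError("input range must not be empty")
    scale = (value - in_min) / (in_max - in_min)
    return out_min + scale * (out_max - out_min)
```

For example, `remap(v, 0, 255, -1, 1)` turns 8-bit pixel values into the -1..1 range that some models expect as input.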

Tensor picker
・Gets a single Tensor from a Tensor[] by index

Tensor to classification
・Converts a Tensor-typed classification output into a form that the "Classification viewer" can display
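The idea behind this conversion can be sketched as pairing raw scores with class labels and keeping the best ones. The function name `to_classification` and the `{"label", "score"}` output shape are assumptions for illustration; the actual format expected by the Classification viewer is not given in the slides.

```python
def to_classification(scores, labels, top_k=3):
    """Pair each score with its label and return the top_k entries, highest score first."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    ranked = sorted(zip(labels, scores), key=lambda pair: pair[1], reverse=True)
    return [{"label": label, "score": score} for label, score in ranked[:top_k]]
```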

6. MISC node

Visual Blocks for ML: MISC nodes
・Code editor
・Custom API
・Embed website
・Get image size
・Logger
・String picker
・Text selector
・Url Param

Code editor
・A node for writing code
・Any language can be used
・It does not run on its own; connect it to a node such as Shader processing

Custom API
・Gets the result of calling a Web API

Embed website
・Embeds the content of a given URL as an iframe

Get image size
・Gets the size of an image and outputs it as JSON

Logger
・Logs the input data

String picker
・Creates a selection list of strings and outputs the selected string

Text selector
・Lets you select a range of text and outputs the selection

Url Param
・Extracts the URL parameters of a given URL and outputs them
Note: this does not work well for me yet. Still investigating.
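For reference, extracting URL parameters looks like this with Python's standard library. This only illustrates the concept; `extract_url_params` is a hypothetical helper and says nothing about how the Url Param node behaves internally.

```python
from urllib.parse import parse_qs, urlparse


def extract_url_params(url: str) -> dict:
    """Return the query parameters of url; repeated keys are kept as lists."""
    query = urlparse(url).query
    return {
        key: values[0] if len(values) == 1 else values
        for key, values in parse_qs(query).items()
    }
```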

7. Colaboratory

Visual Blocks for ML: Colaboratory
・A package called visualblocks is provided for Colaboratory
Quoted from https://colab.research.google.com/github/google/visualblocks/blob/main/examples/quick_start_style_transfer.ipynb

Visual Blocks for ML: Colaboratory
・The big difference from the web demo is that you can define a Generic function and register it as a block
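A generic function of this kind might look like the sketch below. It assumes, without confirmation from the slides, that the function receives a list of inputs and returns a list of outputs; `invert_image` is a hypothetical example, and the registration step with the visualblocks package is deliberately left out (see the quick-start notebook linked above for the actual call).

```python
def invert_image(tensors):
    """Hypothetical generic function: list of inputs in, list of outputs out."""
    # Assumption: the first input is a 2-D nested list of floats in the range 0..1.
    image = tensors[0]
    inverted = [[1.0 - value for value in row] for row in image]
    return [inverted]
```

Keeping the function free of notebook-specific code makes it easy to check on its own before wiring it into a graph.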


8. Examples

Quoted from https://github.com/google/visualblocks/tree/main/pipelines/
Many other examples are published there as well.

Thank you for listening 🙂
