Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Pelemay Backend: A memory-saving, fault-toleran...
Search
Susumu Yamazaki (ZACKY)
September 07, 2023
Programming
280
2
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Pelemay Backend: A memory-saving, fault-tolerant and distributed collection of Nx compilers and backends for embedded systems
Susumu Yamazaki (ZACKY)
September 07, 2023
More Decks by Susumu Yamazaki (ZACKY)
See All by Susumu Yamazaki (ZACKY)
新規のC言語処理系を実装することによる 組込みシステム研究にもたらす価値 についての考察
zacky1972
1
550
ザキ研Elixir研究動向2023
zacky1972
0
440
Elixir/Nerves衛星を打ち上げる日
zacky1972
1
380
Robust, Distributed, and Parallel Processing for Enormous Images Using Supervisor, Node, Flow, Nx, and Evision
zacky1972
0
430
人工衛星高速画像処理で 社会問題を解決しよう
zacky1972
0
420
長い機械学習/画像処理でも メモリ不足を起こさないElixir新技術
zacky1972
0
330
Future Possibilities and Effectiveness of JIT from Elixir Code of Image Processing and Machine Learning into Native Code with SIMD Instructions
zacky1972
0
390
世界は意外と近い!? 「遠くへ行くならみんなで行こう」 次は貴方が発表する番だ
zacky1972
0
120
スーパーコンピュータが組込みシステムに降りてくる! 〜新時代の高性能組込みシステムの SIMD/ベクトル処理の要点を押さえる
zacky1972
0
390
Other Decks in Programming
See All in Programming
Make SRE Operations Easier with Azure SRE Agent
kkamegawa
0
6.9k
1B+ /day規模のログを管理する技術
broadleaf
0
100
「AIで開発し、AIを届ける」をEvalでつなぐ 〜AIネイティブに始めるプロダクト開発の実践〜 / Connecting "Develop with AI, deliver AI" with Eval
rkaga
4
5.3k
Javaの型とAI時代に型が大事な理由 / java types and type in AI era
kishida
2
140
技術記事、AIに書かせるか、自分で書くか? 〜それでも私が自分の手で書く理由〜 / #QiitaConference
jnchito
2
1.4k
ADKを使って簡単にAIエージェントを作ってみよう
k1mu21
0
270
Datadog × OpenTelemetry 入門と実践のあいだ
kn_to_maxpno
1
160
技術的負債解消で開発者の未来を開く- AIの力でコード刷新
kmd2kmd
0
110
Webフレームワークの ベンチマークについて
yusukebe
0
170
TypeScript+Orvalで実現する型安全かつ堅牢でスケーラブルなマルチチャネル通知基盤 / TSKaigi Night talks ~after conference~
d0riven
0
350
Agentic UI
manfredsteyer
PRO
0
180
Spring Security 実践 ─ GraphQL APIで実務に役立つ 認証・認可 を学ぶ
wagyu
0
250
Featured
See All Featured
We Are The Robots
honzajavorek
0
250
Designing Experiences People Love
moore
143
24k
VelocityConf: Rendering Performance Case Studies
addyosmani
333
25k
Chasing Engaging Ingredients in Design
codingconduct
0
220
Become a Pro
speakerdeck
PRO
31
6k
Why Our Code Smells
bkeepers
PRO
340
58k
The innovator’s Mindset - Leading Through an Era of Exponential Change - McGill University 2025
jdejongh
PRO
1
200
The Invisible Side of Design
smashingmag
302
52k
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
62
44k
Learning to Love Humans: Emotional Interface Design
aarron
275
41k
From π to Pie charts
rasagy
0
210
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
10
1.2k
Transcript
Pelemay Backend: A memory-saving, fault- tolerant and distributed collection of
Nx compilers and backends for embedded systems Susumu Yamazaki (ZACKY) This work was partially supported by the Asahi Kohsan Group Research Support Program of the Kitakyushu Foundation for the Advancement of Industry Science and Technology. 1 ©︎ 2023 Susumu Yamazaki
About Susumu Yamazaki (ZACKY) • This slides are in my
Speaker Deck https://speakerdeck.com/zacky1972 • From Japan 🇯🇵. • An organizer of ElixirConf JP. • Associate Professor at Univ. of Kitakyushu. • My hobby in my childhood was to describe science fiction stories! • I wanted to write longer stories, like Perry Rhodan, but my advantage was to write shorter stories… 2 ©︎ 2023 Susumu Yamazaki
Background • You know that de facto standard frameworks of
machine learning for Elixir are Nx, Axon and their ecosystem. • The talk by Sean Moriarity at this ElixirConf showed the positioning strategy of MLOps towards distributed and parallel computing with multiple GPUs for LLM! • We are quite much inspired by his talk. • However, current focuses of Nx, Axon and their ecosystem, especially EXLA, are unsuitable for most embedded systems due to lack of GPUs. • So, we have been developing Pelemay Backend, a lightweight Nx backend specialized for embedded systems, since 2022. 3 ©︎ 2023 Susumu Yamazaki
Our Positioning Strategy 4 ©︎ 2023 Susumu Yamazaki
Lesson Learned of Pelemay Backend 1st ed. • We developed
Pelemay Backend 1st ed. in 2022. • This proves utilization of OpenBLAS as an Nx backend. • BLAS means Basic Linear Algebra Subroutines, which has been developed and sophisticated since the FORTRAN era. • OpenBLAS is an open-source software compatible with BLAS, and has faster implementation with SIMD or vector instructions for most ISAs, including ARM and RISC-V, than that written in C. • We implemented a partial builder that can compile only necessary modules of OpenBLAS, and a prototype backend using it. 5 ©︎ 2023 Susumu Yamazaki
Lesson Learned of Pelemay Backend 2nd ed. • Next, we
have developed Pelemay Backend 2nd ed. since 2023. • One of its concepts is component-based for maintainability, based on Aspect-oriented programming (AOP). • That is, we will develop a backend generator to decorate the specified based backend with the functions before and after a set of functions in the backend. • The set can be specified with the style of AspectJ, an AOP language, and with grouping written in HexDocs of Nx, for example, Aggregates, Backend, Conversion, and so on. • The another is memory-saving. We proved that converting ONNX for ResNet to Axon and loading it require 9GB memory. That is too much to execute them on an embedded system. • However, the Sean’s talk shows the roadmap to realize memory-saving processing for LLM. Then, we will wait for the realization. 6 ©︎ 2023 Susumu Yamazaki
What Pelemay Backend focuses on • Thus, now, we’ll focus
on implementation of the component-based architecture with OpenBLAS. • Some module focuses on only multiplication of matrix and matrix. • Some module focuses on only addition with vectors or matrices with scalar multiplication. • Some module focuses on only scalar multiplication. • Some module focuses on only dividing large vectors or matrices into smaller pieces. • Other unfrequent operations are delegated to the default backend. • Such many simple modules collaborate to operate given numerical functions. • This makes architecture simpler to maintain than monolith. • This approach is to accumulate shorter stories towards a longer and longer story! 7 ©︎ 2023 Susumu Yamazaki
To get source of Pelemay Backend • https://github.com/zeam-vm/pelemay_backend • Look
forward to our future progress of such accumulated stories! • Thank you! 8 ©︎ 2023 Susumu Yamazaki