Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
FlexiBO: A Decoupled Cost-Aware Multi-Objective...
Search
Pooyan Jamshidi
February 29, 2024
Science
0
140
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
AAAI 2024
Pooyan Jamshidi
February 29, 2024
Tweet
Share
More Decks by Pooyan Jamshidi
See All by Pooyan Jamshidi
Reconciling Accuracy, Cost, and Latency of Inference Serving Systems
pjamshidi
0
160
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems
pjamshidi
0
170
Learning from Valerie Issarny: Insights Gained from Program Co-Chairing SEAMS’23
pjamshidi
0
330
Artificial Intelligence and Systems Laboratory (AISys): A Research Overview
pjamshidi
0
640
Experiential Learning by Building Real-World AI Systems
pjamshidi
0
220
Understanding and Explaining the Root Causes of Performance Faults with Causal AI: A Path towards Building Dependable Computer Systems
pjamshidi
0
180
On Debugging the Performance of Configurable Software Systems: Developer Needs and Tailored Tool Support
pjamshidi
0
270
Unicorn: Reasoning about Configurable System Performance through the Lens of Causality
pjamshidi
0
460
Causal AI for Systems
pjamshidi
0
310
Other Decks in Science
See All in Science
生成検索エンジン最適化に関する研究の紹介
ynakano
2
1.1k
生成AI による論文執筆サポートの手引き(ワークショップ) / A guide to supporting dissertation writing with generative AI (workshop)
ks91
PRO
0
510
マウス肝炎ウイルス感染の遺伝子発現へのテンソル分解の適用によるSARS-CoV-2感染関連重要ヒト遺伝子と有効な薬剤の同定
tagtag
0
120
KH Coderチュートリアル(スライド版)
koichih
1
41k
機械学習 - pandas入門
trycycle
PRO
0
280
論文紹介 音源分離:SCNET SPARSE COMPRESSION NETWORK FOR MUSIC SOURCE SEPARATION
kenmatsu4
0
210
安心・効率的な医療現場の実現へ ~オンプレAI & ノーコードワークフローで進める業務改革~
siyoo
0
260
Lean4による汎化誤差評価の形式化
milano0017
1
250
トラブルがあったコンペに学ぶデータ分析
tereka114
2
1.6k
3次元点群を利用した植物の葉の自動セグメンテーションについて
kentaitakura
2
1.3k
Online Feedback Optimization
floriandoerfler
0
2.4k
データベース02: データベースの概念
trycycle
PRO
2
760
Featured
See All Featured
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
34
3.1k
4 Signs Your Business is Dying
shpigford
184
22k
Optimizing for Happiness
mojombo
379
70k
Building Better People: How to give real-time feedback that sticks.
wjessup
367
19k
Making the Leap to Tech Lead
cromwellryan
134
9.4k
KATA
mclloyd
30
14k
GraphQLの誤解/rethinking-graphql
sonatard
71
11k
How to Ace a Technical Interview
jacobian
278
23k
Typedesign – Prime Four
hannesfritz
42
2.7k
Side Projects
sachag
455
42k
Bootstrapping a Software Product
garrettdimon
PRO
307
110k
Learning to Love Humans: Emotional Interface Design
aarron
273
40k
Transcript
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
Shahriar Iqbal, Jianhai Su, Lars Kotthoff, Pooyan Jamshidi
[email protected]
AAAI, 24 February 2024 1
One Size Does Not Fit All 1 1.5 2 2.5
3 3.5 ·104 15 20 25 30 35 40 Energy Consumption (mJ) Prediction Error (%) Xception ← Energy consumption varies 4 × → ← Prediction Error varies 3 × → 2
Heterogeneous Parameters Num of Filters, Filter Size, Learning Rate, Num
of Epochs DN N Design Compiler Hardware Deployment Num of Active CPUs, CPU/ GPU/ EMC Frequency Cloud, IoT, Edge Num of Threads, GPU Threads, Memory Growth 3
Cost-Unaware Methods Waste Resources Coupled Unaware Pareto Optimal Prediction Error
(%) Log Wall Clock Time Energy Consumption (mJ) 3000 6000 9000 12000 15 25 35 45 3.65 3.50 3.35 Decoupled Aware Pareto Optimal Prediction Error (%) Log Wall Clock Time Energy Consumption (mJ) 3000 6000 9000 12000 15 25 35 45 3.65 3.50 3.35 4
Proposed Method ▷ weight expected benefit of evaluation by cost
▷ choose which objective(s) to evaluate ▷ more efficient use of resources – lower cost, more evaluations 5
Results – Computer Vision 0 50 100 150 200 Cumulative
Log WallClock Time 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error Xception PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 10000 15000 20000 25000 Energy Consumption (mJ) 15 20 25 30 35 40 Prediction Error (%) Xception PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 6
Results – NLP 0 50 100 150 200 Cumulative Log
WallClock Time 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error BERT-SQuAD PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 20000 30000 40000 50000 60000 70000 80000 90000 Energy Consumption (mJ) 20 25 30 35 Prediction Error (%) BERT-SQuAD PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 7
Results – Speech Recognition 0 50 100 150 200 250
300 Cumulative Log WallClock Time 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 20000 30000 40000 50000 60000 Energy Consumption (mJ) 17.5 20.0 22.5 25.0 27.5 30.0 32.5 35.0 Prediction Error (%) DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 8
Results – Evaluations 0 20 40 60 80 100 120
140 160 180 200 PAL 0 20 40 60 80 100 120 140 160 180 200 PESMO-DEC 2 4 6 8 0 20 40 60 80 100 120 140 160 180 200 Iteration CA-MOBO 0 20 40 60 80 100 120 140 160 180 200 Iteration FlexiBO 2 4 6 8 9
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
▷ cost-aware acquisition function decreases cost and improves results ▷ code available at https://github.com/softsys4ai/FlexiBO 0 50 100 150 200 250 300 Cumulative Log WallClock Time 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 20000 30000 40000 50000 60000 Energy Consumption (mJ) 17.5 20.0 22.5 25.0 27.5 30.0 32.5 35.0 Prediction Error (%) DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 10