Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
FlexiBO: A Decoupled Cost-Aware Multi-Objective...
Search
Pooyan Jamshidi
February 29, 2024
Science
0
180
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
AAAI 2024
Pooyan Jamshidi
February 29, 2024
Tweet
Share
More Decks by Pooyan Jamshidi
See All by Pooyan Jamshidi
Reconciling Accuracy, Cost, and Latency of Inference Serving Systems
pjamshidi
0
210
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems
pjamshidi
0
220
Learning from Valerie Issarny: Insights Gained from Program Co-Chairing SEAMS’23
pjamshidi
0
430
Artificial Intelligence and Systems Laboratory (AISys): A Research Overview
pjamshidi
0
800
Experiential Learning by Building Real-World AI Systems
pjamshidi
0
240
Understanding and Explaining the Root Causes of Performance Faults with Causal AI: A Path towards Building Dependable Computer Systems
pjamshidi
0
200
On Debugging the Performance of Configurable Software Systems: Developer Needs and Tailored Tool Support
pjamshidi
0
300
Unicorn: Reasoning about Configurable System Performance through the Lens of Causality
pjamshidi
0
490
Causal AI for Systems
pjamshidi
0
350
Other Decks in Science
See All in Science
SpatialRDDパッケージによる空間回帰不連続デザイン
saltcooky12
0
180
安心・効率的な医療現場の実現へ ~オンプレAI & ノーコードワークフローで進める業務改革~
siyoo
0
480
データマイニング - グラフデータと経路
trycycle
PRO
1
310
LayerXにおける業務の完全自動運転化に向けたAI技術活用事例 / layerx-ai-jsai2025
shimacos
7
22k
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text
rudorudo11
0
210
20251212_LT忘年会_データサイエンス枠_新川.pdf
shinpsan
0
250
次代のデータサイエンティストへ~スキルチェックリスト、タスクリスト更新~
datascientistsociety
PRO
3
31k
My Little Monster
juzishuu
0
640
やるべきときにMLをやる AIエージェント開発
fufufukakaka
2
1.2k
データマイニング - ウェブとグラフ
trycycle
PRO
0
260
AIPシンポジウム 2025年度 成果報告会 「因果推論チーム」
sshimizu2006
3
370
PPIのみを用いたAIによる薬剤–遺伝子–疾患 相互作用の同定
tagtag
PRO
0
190
Featured
See All Featured
Getting science done with accelerated Python computing platforms
jacobtomlinson
2
140
Java REST API Framework Comparison - PWX 2021
mraible
34
9.2k
Mozcon NYC 2025: Stop Losing SEO Traffic
samtorres
0
180
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.2k
DBのスキルで生き残る技術 - AI時代におけるテーブル設計の勘所
soudai
PRO
62
51k
How Software Deployment tools have changed in the past 20 years
geshan
0
32k
The Spectacular Lies of Maps
axbom
PRO
1
620
Design in an AI World
tapps
0
170
brightonSEO & MeasureFest 2025 - Christian Goodrich - Winning strategies for Black Friday CRO & PPC
cargoodrich
3
120
A Guide to Academic Writing Using Generative AI - A Workshop
ks91
PRO
0
230
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
120
Being A Developer After 40
akosma
91
590k
Transcript
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
Shahriar Iqbal, Jianhai Su, Lars Kotthoff, Pooyan Jamshidi
[email protected]
AAAI, 24 February 2024 1
One Size Does Not Fit All 1 1.5 2 2.5
3 3.5 ·104 15 20 25 30 35 40 Energy Consumption (mJ) Prediction Error (%) Xception ← Energy consumption varies 4 × → ← Prediction Error varies 3 × → 2
Heterogeneous Parameters Num of Filters, Filter Size, Learning Rate, Num
of Epochs DN N Design Compiler Hardware Deployment Num of Active CPUs, CPU/ GPU/ EMC Frequency Cloud, IoT, Edge Num of Threads, GPU Threads, Memory Growth 3
Cost-Unaware Methods Waste Resources Coupled Unaware Pareto Optimal Prediction Error
(%) Log Wall Clock Time Energy Consumption (mJ) 3000 6000 9000 12000 15 25 35 45 3.65 3.50 3.35 Decoupled Aware Pareto Optimal Prediction Error (%) Log Wall Clock Time Energy Consumption (mJ) 3000 6000 9000 12000 15 25 35 45 3.65 3.50 3.35 4
Proposed Method ▷ weight expected benefit of evaluation by cost
▷ choose which objective(s) to evaluate ▷ more efficient use of resources – lower cost, more evaluations 5
Results – Computer Vision 0 50 100 150 200 Cumulative
Log WallClock Time 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error Xception PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 10000 15000 20000 25000 Energy Consumption (mJ) 15 20 25 30 35 40 Prediction Error (%) Xception PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 6
Results – NLP 0 50 100 150 200 Cumulative Log
WallClock Time 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error BERT-SQuAD PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 20000 30000 40000 50000 60000 70000 80000 90000 Energy Consumption (mJ) 20 25 30 35 Prediction Error (%) BERT-SQuAD PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 7
Results – Speech Recognition 0 50 100 150 200 250
300 Cumulative Log WallClock Time 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 20000 30000 40000 50000 60000 Energy Consumption (mJ) 17.5 20.0 22.5 25.0 27.5 30.0 32.5 35.0 Prediction Error (%) DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 8
Results – Evaluations 0 20 40 60 80 100 120
140 160 180 200 PAL 0 20 40 60 80 100 120 140 160 180 200 PESMO-DEC 2 4 6 8 0 20 40 60 80 100 120 140 160 180 200 Iteration CA-MOBO 0 20 40 60 80 100 120 140 160 180 200 Iteration FlexiBO 2 4 6 8 9
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization of Deep Neural Networks
▷ cost-aware acquisition function decreases cost and improves results ▷ code available at https://github.com/softsys4ai/FlexiBO 0 50 100 150 200 250 300 Cumulative Log WallClock Time 0.25 0.30 0.35 0.40 0.45 0.50 0.55 Hypervolume Error DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 20000 30000 40000 50000 60000 Energy Consumption (mJ) 17.5 20.0 22.5 25.0 27.5 30.0 32.5 35.0 Prediction Error (%) DeepSpeech PAL PESMO ParEGO SMSEGO CA-MOBO PESMO-DEC FLEXIBO-GPLC 10