Linear Algebra at Large Scale
Elizabeth Ramirez
April 27, 2018
Transcript
Linear Algebra at Large Scale Elizabeth Ramirez @eramirem
Computational Engineer. We model complex systems on the planet, such as forestry and agriculture, using satellite imagery.
Top 10 Algorithms of the 20th Century
Often the most expensive computations in large-scale codes. Curse of dimensionality.
Linear Systems, Nonlinear Systems, Machine Learning, Deep Learning
Most ubiquitous problem in Scientific Computing and Data Analysis
What does it solve? Systems of equations, polynomial interpolation, linear least squares.
What do we know? Gaussian elimination. Complexity: O(n^3).
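As an illustration of the direct approach named on this slide, here is a minimal sketch of Gaussian elimination with partial pivoting in NumPy. The function name `gaussian_elimination` is an assumption made for this example, not something from the deck. The nested loops over rows and columns are where the cubic cost comes from.

```python
import numpy as np

def gaussian_elimination(A, b):
    """Solve Ax = b by Gaussian elimination with partial pivoting (dense, O(n^3))."""
    A = np.array(A, dtype=float)  # work on copies so the inputs are untouched
    b = np.array(b, dtype=float)
    n = len(b)

    # Forward elimination: reduce A to upper-triangular form.
    for k in range(n - 1):
        # Pick the row with the largest pivot candidate for numerical stability.
        p = k + np.argmax(np.abs(A[k:, k]))
        if A[p, k] == 0.0:
            raise ValueError("matrix is singular")
        if p != k:
            A[[k, p]] = A[[p, k]]
            b[[k, p]] = b[[p, k]]
        for i in range(k + 1, n):
            m = A[i, k] / A[k, k]
            A[i, k:] -= m * A[k, k:]
            b[i] -= m * b[k]

    if A[n - 1, n - 1] == 0.0:
        raise ValueError("matrix is singular")

    # Back substitution on the upper-triangular system.
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (b[i] - A[i, i + 1:] @ x[i + 1:]) / A[i, i]
    return x
```

In practice a library routine such as `numpy.linalg.solve` would normally be used instead; the sketch only makes the elimination steps visible.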
HPC Alternative: Iterative Methods. General Form
Jacobi, Gauss-Seidel
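The two methods named here can be sketched as follows. This assumes the usual textbook formulation, in which each sweep updates the iterate as x_new = G x_old + c: Jacobi uses only values from the previous sweep, while Gauss-Seidel reuses components already updated in the current sweep. The function names and the fixed iteration count are choices made for this example.

```python
import numpy as np

def jacobi(A, b, iterations=100):
    """Jacobi iteration: every component is updated from the previous iterate."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    d = np.diag(A)                 # diagonal entries of A
    R = A - np.diagflat(d)         # off-diagonal part of A
    x = np.zeros_like(b)
    for _ in range(iterations):
        x = (b - R @ x) / d
    return x

def gauss_seidel(A, b, iterations=100):
    """Gauss-Seidel iteration: new components are used as soon as they are available."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = len(b)
    x = np.zeros_like(b)
    for _ in range(iterations):
        for i in range(n):
            s = A[i, :i] @ x[:i] + A[i, i + 1:] @ x[i + 1:]
            x[i] = (b[i] - s) / A[i, i]
    return x

# Example on a small diagonally dominant system, where both methods converge.
A = np.array([[4.0, 1.0, 0.0], [1.0, 4.0, 1.0], [0.0, 1.0, 4.0]])
b = np.array([1.0, 2.0, 3.0])
x_jacobi = jacobi(A, b)
x_gs = gauss_seidel(A, b)
```

Neither sketch checks convergence; a production loop would stop on a residual tolerance rather than a fixed number of sweeps.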
Convergence of Basic Iterative Methods: spectral radius
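A stationary iteration x_new = G x_old + c converges for every starting vector exactly when the spectral radius of G (its largest eigenvalue in absolute value) is below 1. The sketch below builds the standard Jacobi and Gauss-Seidel iteration matrices and measures that radius. The helper names are assumptions made for this example.

```python
import numpy as np

def jacobi_iteration_matrix(A):
    """G = I - D^{-1} A, where D is the diagonal of A."""
    A = np.asarray(A, dtype=float)
    d = np.diag(A)
    return np.eye(A.shape[0]) - A / d[:, None]

def gauss_seidel_iteration_matrix(A):
    """G = -(D + L)^{-1} U, with L and U the strict lower/upper parts of A."""
    A = np.asarray(A, dtype=float)
    return -np.linalg.solve(np.tril(A), np.triu(A, k=1))

def spectral_radius(G):
    """Largest eigenvalue magnitude of G."""
    return np.max(np.abs(np.linalg.eigvals(G)))

A = np.array([[4.0, 1.0, 0.0], [1.0, 4.0, 1.0], [0.0, 1.0, 4.0]])
rho_jacobi = spectral_radius(jacobi_iteration_matrix(A))
rho_gs = spectral_radius(gauss_seidel_iteration_matrix(A))
converges = rho_jacobi < 1 and rho_gs < 1
```

The smaller the radius, the faster the error shrinks per sweep, which is why Gauss-Seidel typically needs fewer sweeps than Jacobi on a matrix like this one.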
Krylov Subspaces
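The Krylov subspace of order k generated by A and b is the span of b, Ab, ..., A^(k-1) b. Those raw vectors quickly become nearly parallel, so an orthonormal basis is usually built instead. The sketch below uses the Arnoldi process for that; Arnoldi is a standard technique introduced here for illustration and is not named in the deck.

```python
import numpy as np

def arnoldi(A, b, k):
    """Build an orthonormal basis Q of the order-(k+1) Krylov subspace of (A, b).

    Returns Q with shape (n, k+1) and an upper-Hessenberg H with shape (k+1, k)
    such that A @ Q[:, :k] == Q @ H up to rounding.
    """
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = len(b)
    Q = np.zeros((n, k + 1))
    H = np.zeros((k + 1, k))
    Q[:, 0] = b / np.linalg.norm(b)
    for j in range(k):
        v = A @ Q[:, j]                    # next Krylov direction
        for i in range(j + 1):             # modified Gram-Schmidt against earlier vectors
            H[i, j] = Q[:, i] @ v
            v -= H[i, j] * Q[:, i]
        H[j + 1, j] = np.linalg.norm(v)
        if H[j + 1, j] < 1e-12:            # the subspace is invariant: stop early
            return Q[:, :j + 1], H[:j + 1, :j + 1]
        Q[:, j + 1] = v / H[j + 1, j]
    return Q, H

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 6))
b = rng.standard_normal(6)
Q, H = arnoldi(A, b, 3)
```

Only matrix-vector products with A are needed, which is what makes Krylov methods attractive for large sparse systems.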
Conjugate Gradient Method (CG) i) ii)
Conjugate Gradient (CG)
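A minimal sketch of the conjugate gradient iteration follows, assuming the textbook formulation for a symmetric positive definite matrix. The function name, tolerance, and iteration cap are choices made for this example.

```python
import numpy as np

def conjugate_gradient(A, b, tol=1e-10, max_iter=None):
    """Solve Ax = b for symmetric positive definite A with conjugate gradients."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = len(b)
    if max_iter is None:
        max_iter = 10 * n
    x = np.zeros(n)
    r = b - A @ x            # residual
    p = r.copy()             # first search direction
    rs_old = r @ r
    for _ in range(max_iter):
        if np.sqrt(rs_old) < tol:
            break
        Ap = A @ p
        alpha = rs_old / (p @ Ap)         # step length along p
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs_old) * p     # new direction, A-conjugate to the previous ones
        rs_old = rs_new
    return x

A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x = conjugate_gradient(A, b)
```

Each iteration costs one matrix-vector product, and in exact arithmetic the method terminates in at most n iterations.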
Bi-conjugate gradient (BiCG): any linear system
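The following is a sketch of the unpreconditioned BiCG iteration in its textbook form, which drops the symmetry requirement by running a second "shadow" recurrence with the transpose of A. The function name and defaults are assumptions, and the sketch omits the breakdown safeguards a robust solver needs.

```python
import numpy as np

def bicg(A, b, tol=1e-10, max_iter=100):
    """Solve Ax = b for a general square A with the bi-conjugate gradient method."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    x = np.zeros_like(b)
    r = b - A @ x
    r_hat = r.copy()                   # shadow residual
    p, p_hat = r.copy(), r_hat.copy()  # search direction and its shadow
    rho = r_hat @ r
    for _ in range(max_iter):
        if np.linalg.norm(r) < tol:
            break
        q = A @ p
        q_hat = A.T @ p_hat            # the shadow recurrence uses the transpose
        alpha = rho / (p_hat @ q)
        x += alpha * p
        r -= alpha * q
        r_hat -= alpha * q_hat
        rho_new = r_hat @ r
        beta = rho_new / rho
        p = r + beta * p
        p_hat = r_hat + beta * p_hat
        rho = rho_new
    return x

A = np.array([[4.0, 1.0, 0.0], [2.0, 5.0, 1.0], [0.0, 1.0, 3.0]])  # nonsymmetric
b = np.array([1.0, 2.0, 3.0])
x = bicg(A, b)
```

When A is symmetric and the shadow residual starts equal to the residual, the two recurrences coincide and the method reduces to CG.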
Deep Learning Primitives: weights, inputs, and outputs are stored in tensors. Matrix multiplication, convolution, inner product, transposition, Rectified Linear Unit (ReLU).
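To make the listed primitives concrete, here is a small NumPy sketch. The values and the helper names `relu` and `dense_forward` are made up for illustration. Note that `np.convolve` computes a true convolution with a flipped kernel, whereas deep learning frameworks typically compute cross-correlation under the same name.

```python
import numpy as np

def relu(z):
    """Rectified linear unit: elementwise max(z, 0)."""
    return np.maximum(z, 0.0)

def dense_forward(W, x, b):
    """Forward pass of one fully connected layer: ReLU(Wx + b)."""
    return relu(W @ x + b)

W = np.array([[1.0, -2.0], [0.5, 1.0]])   # weights
x = np.array([1.0, 2.0])                  # input
b = np.array([0.0, -1.0])                 # bias

y = dense_forward(W, x, b)                # matrix multiplication followed by ReLU
inner = x @ x                             # inner product
W_t = W.T                                 # transposition

signal = np.array([1.0, 2.0, 4.0, 7.0])
kernel = np.array([1.0, -1.0])
conv = np.convolve(signal, kernel, mode="valid")  # 1-D convolution
```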
Matrix Multiplication: fundamental task. Naive: O(n^3). Strassen: O(n^log2(7)), about O(n^2.81).
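Strassen's algorithm replaces the eight half-size products of the block-wise naive method with seven, which is where the lower exponent comes from. The sketch below assumes square matrices whose size is a power of two and falls back to the ordinary product below a cutoff. The `leaf_size` parameter is an assumption of this example.

```python
import numpy as np

def strassen(A, B, leaf_size=64):
    """Multiply square matrices (size a power of two) using Strassen's seven products."""
    n = A.shape[0]
    if n <= leaf_size:
        return A @ B                      # ordinary product for small blocks
    h = n // 2
    A11, A12, A21, A22 = A[:h, :h], A[:h, h:], A[h:, :h], A[h:, h:]
    B11, B12, B21, B22 = B[:h, :h], B[:h, h:], B[h:, :h], B[h:, h:]

    M1 = strassen(A11 + A22, B11 + B22, leaf_size)
    M2 = strassen(A21 + A22, B11, leaf_size)
    M3 = strassen(A11, B12 - B22, leaf_size)
    M4 = strassen(A22, B21 - B11, leaf_size)
    M5 = strassen(A11 + A12, B22, leaf_size)
    M6 = strassen(A21 - A11, B11 + B12, leaf_size)
    M7 = strassen(A12 - A22, B21 + B22, leaf_size)

    C11 = M1 + M4 - M5 + M7
    C12 = M3 + M5
    C21 = M2 + M4
    C22 = M1 - M2 + M3 + M6
    return np.block([[C11, C12], [C21, C22]])

rng = np.random.default_rng(0)
A = rng.integers(-5, 5, size=(8, 8)).astype(float)
B = rng.integers(-5, 5, size=(8, 8)).astype(float)
C = strassen(A, B, leaf_size=2)
```

The extra additions and temporary blocks mean the method only pays off for large matrices, which is why a cutoff to the ordinary product is used.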
Low-Rank Approximation: accelerates matrix multiplication and, therefore, convolution. Requires SVD: A = U Σ V^T
Low-Rank Multiplication:
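One way to sketch low-rank multiplication: truncate the SVD of A to its k largest singular values, keep the two thin factors, and multiply through them. For an m-by-n matrix A and an n-by-p matrix B, this costs on the order of k·n·p + m·k·p operations instead of m·n·p. The function names and the choice of k are assumptions made for this example.

```python
import numpy as np

def low_rank_factors(A, k):
    """Return thin factors (left, right) with A ≈ left @ right, from a rank-k truncated SVD."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    left = U[:, :k] * s[:k]     # scale the first k left singular vectors by the singular values
    right = Vt[:k, :]
    return left, right

def low_rank_matmul(left, right, B):
    """Compute (left @ right) @ B without ever forming the full m-by-n matrix."""
    return left @ (right @ B)

rng = np.random.default_rng(0)
# A is built to have rank exactly 2, so a rank-2 truncation loses nothing.
A = rng.standard_normal((50, 2)) @ rng.standard_normal((2, 40))
B = rng.standard_normal((40, 30))
left, right = low_rank_factors(A, k=2)
C = low_rank_matmul(left, right, B)
```

The SVD itself is expensive, so the approach pays off when the factors are computed once and reused across many products, as with fixed layer weights.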
Single Instruction Multiple Data (SIMD): data-level parallelism. Incompatible with code designed for sequential processors. Instruction set available in commercial CPUs and GPGPUs.
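The contrast between sequential and data-parallel code can be illustrated at the array level. The sketch below writes the same a*x + y update twice: once as an element-by-element loop and once as a single whole-array expression. Whether the vectorized form actually executes SIMD instructions depends on the NumPy build and the CPU, so treat this as an illustration of the programming style rather than a guarantee about the hardware path.

```python
import numpy as np

def axpy_loop(a, x, y):
    """Sequential style: one scalar multiply-add per loop iteration."""
    out = np.empty_like(y)
    for i in range(len(x)):
        out[i] = a * x[i] + y[i]
    return out

def axpy_vectorized(a, x, y):
    """Data-parallel style: one operation expressed over whole arrays."""
    return a * x + y

x = np.arange(8, dtype=np.float32)
y = np.ones(8, dtype=np.float32)
a = np.float32(2.0)
result_loop = axpy_loop(a, x, y)
result_vec = axpy_vectorized(a, x, y)
```

Expressing the computation over whole arrays is what lets the underlying compiled kernels apply one instruction to several elements at a time.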
Intel® Math Kernel Library (Intel® MKL): Improved matrix multiplication performance in LAPACK LU decomposition and inverse without pivoting. Takes advantage of the SIMD instruction set.
In summary: High Performance Linear Algebra
References
http://www.siam.org/pdf/news/637.pdf
https://software.intel.com/en-us/mkl
https://software.intel.com/en-us/articles/tensorflow-optimizations-on-modern-intel-architecture