Open software for Astronomical Data Analysis
Dan Foreman-Mackey
February 28, 2023
Open software for Astronomical Data Analysis
@ NASA Goddard
Transcript
OPEN SOFTWARE FOR ASTRONOMICAL DATA ANALYSIS by Dan Foreman-Mackey
0. open software for astrophysics
credit: Adrian Price-Whelan / data: SAO/NASA ADS
many fundamental software packages have a shockingly small number of maintainers.
credit: Adrian Price-Whelan
* astronomical software can be very high impact
* we should think about career trajectories & mechanisms for supporting this work
1. case study: Gaussian processes
[figure: raw and de-trended flux [ppt] vs. time [days], N = 1000; reference: DFM+ (2017)]
[figure panels: "ignoring correlated noise" vs. "accounting for correlated noise"; reference: Aigrain & DFM (2022)]
a Gaussian Process is a drop-in replacement for chi-squared
more details: Aigrain & Foreman-Mackey (2023) arXiv:2209.08940
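One way to read the "drop-in replacement" claim, as a minimal NumPy sketch rather than the code behind the slides: the chi-squared likelihood assumes a diagonal noise covariance, and a GP likelihood keeps the same residuals but uses a dense covariance matrix. The squared-exponential kernel and the names `amp` and `scale` are assumptions chosen for illustration.

```python
import numpy as np

def chi2_log_likelihood(resid, yerr):
    """Independent Gaussian noise: -0.5 * (chi^2 + normalization)."""
    chi2 = np.sum((resid / yerr) ** 2)
    norm = np.sum(np.log(2 * np.pi * yerr**2))
    return -0.5 * (chi2 + norm)

def gp_log_likelihood(resid, yerr, t, amp, scale):
    """Same residuals, but with a dense covariance K instead of a diagonal one."""
    dt = t[:, None] - t[None, :]
    # Illustrative squared-exponential kernel plus white noise on the diagonal
    K = amp**2 * np.exp(-0.5 * (dt / scale) ** 2) + np.diag(yerr**2)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L, resid)        # alpha @ alpha == resid^T K^-1 resid
    logdet = 2 * np.sum(np.log(np.diag(L)))  # log det K from the Cholesky factor
    return -0.5 * (alpha @ alpha + logdet + len(t) * np.log(2 * np.pi))

rng = np.random.default_rng(0)
t = np.sort(rng.uniform(0, 25, 50))
yerr = np.full_like(t, 0.1)
resid = rng.normal(0, 0.1, size=t.size)

ll_chi2 = chi2_log_likelihood(resid, yerr)
# With zero kernel amplitude the GP reduces to the chi-squared likelihood
ll_gp_zero = gp_log_likelihood(resid, yerr, t, amp=0.0, scale=1.0)
ll_gp = gp_log_likelihood(resid, yerr, t, amp=0.3, scale=2.0)
```

Setting `amp=0.0` recovers the chi-squared value, which is the sense in which one function can stand in for the other inside an existing fitting pipeline.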
[1] model building
[2] computational cost
reference: Luger, DFM, Hedges (2021)
[2] computational cost
[1] bigger/better computers
[2] exploit matrix structure
[3] approximate linear algebra
[4] etc.
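As one hedged illustration of option [2], and not necessarily the structure used in the work cited on these slides: for evenly sampled data and a stationary kernel the covariance matrix is Toeplitz, so the linear solve can use Levinson recursion (O(N^2)) instead of a dense factorization (O(N^3)). The exponential kernel, the sampling, and the noise level below are assumptions.

```python
import numpy as np
from scipy.linalg import solve_toeplitz

# Evenly sampled times + stationary kernel => Toeplitz covariance matrix
n = 200
t = np.arange(n) * 0.1
first_col = np.exp(-np.abs(t - t[0]) / 1.5)  # illustrative exponential kernel
first_col[0] += 0.5**2                       # white noise on the diagonal

rng = np.random.default_rng(1)
y = rng.normal(size=n)

# Structured solve: only the first column of K is needed
alpha_fast = solve_toeplitz(first_col, y)

# Dense reference: build the full matrix and solve it directly
idx = np.abs(np.arange(n)[:, None] - np.arange(n)[None, :])
K = first_col[idx]
alpha_dense = np.linalg.solve(K, y)
```

Both solves give the same `K^-1 y`; the structured version never forms the N x N matrix.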
[figure: raw and de-trended flux [ppt] vs. time [days], N = 1000; reference: DFM+ (2017)]
reference: Gordon, Agol, DFM (2020) / tinygp.readthedocs.io
* a Gaussian Process is a drop-in replacement for chi-squared
* model building & computational cost are (solvable!) challenges
* you should check out tinygp!
2. case study: probabilistic inference
have: physics => data
want: data => physics
[1] physical models
[2] legacy code
[figure: "patience required" vs. "number of parameters" (a few / tenish / not outrageously many), with labels "emcee" and "how things should be"; reference: DFM (priv. comm.)]
[figure: WASP-39b / NIRSpec, transit depth [%] vs. wavelength [micron]; Alderson et al. 2023 compared with joint fit (N = 50); reference: Soichiro Hattori, Ruth Angus, DFM, ... (in prep)]
showing 23 of the 404 parameters (8 per channel + 4 shared)
reference: Soichiro Hattori, Ruth Angus, DFM, ... (in prep)
how?
d(physics => data) / d(physics)
automatic differentiation aka “backpropagation”
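A minimal JAX sketch of what "d(physics => data) / d(physics)" can look like in practice. The sinusoidal `forward_model` and its two parameters are hypothetical stand-ins for a real physical model; only `jax.grad` and `jax.numpy` are actual library calls.

```python
import jax
import jax.numpy as jnp

def forward_model(params, t):
    """Hypothetical 'physics => data' model: a sinusoid with amplitude and period."""
    amp, period = params
    return amp * jnp.sin(2 * jnp.pi * t / period)

def log_likelihood(params, t, y, yerr):
    resid = y - forward_model(params, t)
    return -0.5 * jnp.sum((resid / yerr) ** 2)

# jax.grad differentiates with respect to the first argument (params)
grad_log_likelihood = jax.grad(log_likelihood)

t = jnp.linspace(0.0, 10.0, 50)
true_params = jnp.array([1.0, 3.0])
y = forward_model(true_params, t)  # noise-free synthetic data

g_true = grad_log_likelihood(true_params, t, y, 0.1)            # at the optimum
g_off = grad_log_likelihood(jnp.array([1.2, 3.0]), t, y, 0.1)   # amplitude too large
```

The gradient vanishes at the parameters that generated the data and points back toward a smaller amplitude when the amplitude is too large, which is the information gradient-based samplers and optimizers rely on.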
[1] physical models
[2] legacy code
[1] domain-specific libraries
[2] emulation
* gradient-based inference using autodiff can improve efficiency
* there are practical challenges with these methods in astro
* of interest: domain-specific libraries & emulation
3. aside: JAX
import numpy as np

def linear_least_squares(x, y):
    A = np.vander(x, 2)
    return np.linalg.lstsq(A, y)[0]
import jax.numpy as jnp

def linear_least_squares(x, y):
    A = jnp.vander(x, 2)
    return jnp.linalg.lstsq(A, y)[0]
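What the switch from `np` to `jnp` buys, sketched under the assumption that the function is used unchanged: the same code can be JIT-compiled with `jax.jit` and differentiated with `jax.grad`. The `slope_sensitivity` helper and the test data are illustrative and not from the slides.

```python
import jax
import jax.numpy as jnp

def linear_least_squares(x, y):
    A = jnp.vander(x, 2)              # columns: [x, 1]
    return jnp.linalg.lstsq(A, y)[0]  # [slope, intercept]

# Compiled version of the same function
fast_fit = jax.jit(linear_least_squares)

# Illustrative helper: derivative of the fitted slope w.r.t. each data point
slope_sensitivity = jax.grad(lambda y, x: linear_least_squares(x, y)[0])

x = jnp.linspace(0.0, 1.0, 5)
y = 2.0 * x + 1.0
w = fast_fit(x, y)
dslope_dy = slope_sensitivity(y, x)
```

Here `w` holds the slope and intercept of the fitted line, and `dslope_dy` says how much each measurement pulls on the slope; its entries sum to zero because shifting every `y` by a constant leaves the slope unchanged.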
4. open research practices
open software is foundational to astrophysics research
there are opportunities at the interface of astro & applied fields
there are ways you can participate & benefit right away
I want to chat about…
[1] your data analysis problems
[2] building astronomical software
[3] writing documentation & tutorials
get in touch! dfm.io / github.com/dfm