Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Open software for Astronomical Data Analysis
Search
Dan Foreman-Mackey
February 28, 2023
Science
0
110
Open software for Astronomical Data Analysis
@ NASA Goddard
Dan Foreman-Mackey
February 28, 2023
Tweet
Share
More Decks by Dan Foreman-Mackey
See All by Dan Foreman-Mackey
Open Software for Astrophysics, AAS241
dfm
2
460
My research talk for CCA promotion
dfm
1
740
Astronomical software
dfm
1
690
emcee-odi
dfm
1
600
Exoplanet population inference: a tutorial
dfm
3
420
Data-driven discovery in the astronomical time domain
dfm
6
680
TensorFlow for astronomers
dfm
6
730
How to find a transiting exoplanets
dfm
1
440
Long-period transiting exoplanets
dfm
1
290
Other Decks in Science
See All in Science
最適化超入門
tkm2261
13
3.2k
Machine Learning for Materials (Lecture 6)
aronwalsh
0
500
General Parasitology
uni_of_nomi
0
120
機械学習による確率推定とカリブレーション/probabilistic-calibration-on-classification-model
ktgrstsh
2
210
はじめての「相関と因果とエビデンス」入門:“動機づけられた推論” に抗うために
takehikoihayashi
17
6.8k
The Incredible Machine: Developer Productivity and the Impact of AI
tomzimmermann
0
380
統計的因果探索の方法
sshimizu2006
1
1.1k
Презентация программы бакалавриата СПбГУ "Искусственный интеллект и наука о данных"
dscs
0
690
(2024) Livres, Femmes et Math
mansuy
0
100
学術講演会中央大学学員会八王子支部
tagtag
0
220
解説!データ基盤の進化を後押しする手順とタイミング
shomaekawa
1
330
Machine Learning for Materials (Lecture 9)
aronwalsh
0
200
Featured
See All Featured
Embracing the Ebb and Flow
colly
84
4.4k
Making the Leap to Tech Lead
cromwellryan
132
8.9k
Fashionably flexible responsive web design (full day workshop)
malarkey
404
65k
KATA
mclloyd
29
13k
Six Lessons from altMBA
skipperchong
26
3.5k
The Power of CSS Pseudo Elements
geoffreycrofte
72
5.3k
Optimising Largest Contentful Paint
csswizardry
33
2.9k
Code Review Best Practice
trishagee
64
17k
Build The Right Thing And Hit Your Dates
maggiecrowley
32
2.4k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
46
2.1k
[RailsConf 2023] Rails as a piece of cake
palkan
51
4.9k
Building Applications with DynamoDB
mza
90
6.1k
Transcript
OPEN SOFTWARE FOR ASTRONOMICAL DATA ANALYSIS by Dan Foreman-Mackey
None
open software for astrophysics 0
credit: Adrian Price-Whelan / / data: SAO/NASA ADS
7
many fundamental software packages have a shockingly small number of
maintainers.
7 credit: Adrian Price-Whelan
* astronomical software can be very high impact * we
should think about career trajectories & mechanisms for supporting this work
None
case study: gaussian processes 1
°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)
°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)
reference: Aigrain & DFM (2022)
reference: Aigrain & DFM (2022)
reference: Aigrain & DFM (2022) ignoring correlated noise accounting for
correlated noise
reference: Aigrain & DFM (2022)
a Gaussian Process is a drop - in replacement for
chi - squared
more details: Aigrain & Foreman-Mackey (2023) arXiv:2209.08940
None
7 [1] model building [2] computational cost
reference: Luger, DFM, Hedges (2021)
[2] computational cost
7 [1] bigger/better computers [2] exploit matrix structure [3] approximate
linear algebra [4] etc.
1 3 2
None
None
1 3 2
°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)
reference: Gordon, Agol, DFM (2020) / tinygp.readthedocs.io
* a Gaussian Process is a drop - in replacement
for chi squared * model building & computational cost are (solvable!) challenges * you should check out tinygp!
case study: probabilistic inference 2
have: physics = > data
want: data = > physics
7 [1] physical models [2] legacy code
None
number of parameters patience required a few tenish not outrageously
many reference: DFM (priv. comm.)
number of parameters patience required emcee a few tenish not
outrageously many reference: DFM (priv. comm.)
number of parameters patience required emcee a few tenish not
outrageously many how things should be reference: DFM (priv. comm.)
None
None
None
None
3.0 3.5 4.0 4.5 5.0 Wavelength [micron] 2.05 2.10 2.15
2.20 2.25 2.30 Transit Depth [%] Alderson et al. 2023 Joint Fit (N = 50) reference: Soichiro Hattori, Ruth Angus, DFM, . . . (in prep) WASP-39b / NIRSpec
reference: Soichiro Hattori, Ruth Angus, DFM, . . . (in
prep) showing 23 of the 404 parameters (8 per channel + 4 shared)
how?
d(physics = > data) / dphysics
automatic differentiation aka “backpropagation”
None
7 [1] physical models [2] legacy code
7 [1] domain - specif i c libraries [2] emulation
None
* gradient - based inference using autodiff can improve eff
i ciency * there are practical challenges with these methods in astro * of interest: domain - specif i c libraries & emulation
aside: JAX 3
None
import numpy as np def linear_least_squares(x, y) : A =
np.vander(x, 2) return np.linalg.lstsq(A, y)[0]
import jax.numpy as jnp def linear_least_squares(x, y) : A =
jnp.vander(x, 2) return jnp.linalg.lstsq(A, y)[0]
None
open research practices 4
None
None
None
None
None
None
None
open software is foundational to astrophysics research there are opportunities
at the interface of astro & applied f i elds there are ways you can participate & benef i t right away
7 I want to chat about… [1] your data analysis
problems [2] building astronomical software [3] writing documentation & tutorials
get in touch! dfm.io github.com/dfm