Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Open software for Astronomical Data Analysis
Search
Dan Foreman-Mackey
February 28, 2023
Science
0
110
Open software for Astronomical Data Analysis
@ NASA Goddard
Dan Foreman-Mackey
February 28, 2023
Tweet
Share
More Decks by Dan Foreman-Mackey
See All by Dan Foreman-Mackey
Open Software for Astrophysics, AAS241
dfm
2
460
My research talk for CCA promotion
dfm
1
740
Astronomical software
dfm
1
690
emcee-odi
dfm
1
600
Exoplanet population inference: a tutorial
dfm
3
420
Data-driven discovery in the astronomical time domain
dfm
6
680
TensorFlow for astronomers
dfm
6
730
How to find a transiting exoplanets
dfm
1
440
Long-period transiting exoplanets
dfm
1
290
Other Decks in Science
See All in Science
はじめての「相関と因果とエビデンス」入門:“動機づけられた推論” に抗うために
takehikoihayashi
17
6.8k
Snowflake上でRを使う: RStudioセットアップとShinyアプリケーションのデプロイ
ktatsuya
0
420
LIMEを用いた判断根拠の可視化
kentaitakura
0
340
いまAI組織が求める企画開発エンジニアとは?
roadroller
2
1.3k
The Incredible Machine: Developer Productivity and the Impact of AI
tomzimmermann
0
390
How were Quaternion discovered
kinakomoti321
2
1.1k
Sarcoptic Mange
uni_of_nomi
1
110
Science of Scienceおよび科学計量学に関する研究論文の俯瞰可視化_ポスター版
hayataka88
0
130
Analysis-Ready Cloud-Optimized Data for your community and the entire world with Pangeo-Forge
jbusecke
0
110
はじめてのバックドア基準:あるいは、重回帰分析の偏回帰係数を因果効果の推定値として解釈してよいのか問題
takehikoihayashi
2
740
Презентация программы бакалавриата СПбГУ "Искусственный интеллект и наука о данных"
dscs
0
720
マテリアルズ・インフォマティクスの先端で起きていること / What's Happening at the Cutting Edge of Materials Informatics
snhryt
1
130
Featured
See All Featured
XXLCSS - How to scale CSS and keep your sanity
sugarenia
246
1.3M
Site-Speed That Sticks
csswizardry
0
27
Automating Front-end Workflow
addyosmani
1366
200k
The MySQL Ecosystem @ GitHub 2015
samlambert
250
12k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
26
2.1k
Principles of Awesome APIs and How to Build Them.
keavy
126
17k
Code Review Best Practice
trishagee
64
17k
Dealing with People You Can't Stand - Big Design 2015
cassininazir
364
24k
[RailsConf 2023] Rails as a piece of cake
palkan
52
4.9k
Building a Modern Day E-commerce SEO Strategy
aleyda
38
6.9k
Learning to Love Humans: Emotional Interface Design
aarron
273
40k
Mobile First: as difficult as doing things right
swwweet
222
8.9k
Transcript
OPEN SOFTWARE FOR ASTRONOMICAL DATA ANALYSIS by Dan Foreman-Mackey
None
open software for astrophysics 0
credit: Adrian Price-Whelan / / data: SAO/NASA ADS
7
many fundamental software packages have a shockingly small number of
maintainers.
7 credit: Adrian Price-Whelan
* astronomical software can be very high impact * we
should think about career trajectories & mechanisms for supporting this work
None
case study: gaussian processes 1
°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)
°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)
reference: Aigrain & DFM (2022)
reference: Aigrain & DFM (2022)
reference: Aigrain & DFM (2022) ignoring correlated noise accounting for
correlated noise
reference: Aigrain & DFM (2022)
a Gaussian Process is a drop - in replacement for
chi - squared
more details: Aigrain & Foreman-Mackey (2023) arXiv:2209.08940
None
7 [1] model building [2] computational cost
reference: Luger, DFM, Hedges (2021)
[2] computational cost
7 [1] bigger/better computers [2] exploit matrix structure [3] approximate
linear algebra [4] etc.
1 3 2
None
None
1 3 2
°0.6 °0.3 0.0 0.3 0.6 raw [ppt] 0 5 10
15 20 25 time [days] °0.30 °0.15 0.00 de-trended [ppt] N = 1000 reference: DFM+ (2017)
reference: Gordon, Agol, DFM (2020) / tinygp.readthedocs.io
* a Gaussian Process is a drop - in replacement
for chi squared * model building & computational cost are (solvable!) challenges * you should check out tinygp!
case study: probabilistic inference 2
have: physics = > data
want: data = > physics
7 [1] physical models [2] legacy code
None
number of parameters patience required a few tenish not outrageously
many reference: DFM (priv. comm.)
number of parameters patience required emcee a few tenish not
outrageously many reference: DFM (priv. comm.)
number of parameters patience required emcee a few tenish not
outrageously many how things should be reference: DFM (priv. comm.)
None
None
None
None
3.0 3.5 4.0 4.5 5.0 Wavelength [micron] 2.05 2.10 2.15
2.20 2.25 2.30 Transit Depth [%] Alderson et al. 2023 Joint Fit (N = 50) reference: Soichiro Hattori, Ruth Angus, DFM, . . . (in prep) WASP-39b / NIRSpec
reference: Soichiro Hattori, Ruth Angus, DFM, . . . (in
prep) showing 23 of the 404 parameters (8 per channel + 4 shared)
how?
d(physics = > data) / dphysics
automatic differentiation aka “backpropagation”
None
7 [1] physical models [2] legacy code
7 [1] domain - specif i c libraries [2] emulation
None
* gradient - based inference using autodiff can improve eff
i ciency * there are practical challenges with these methods in astro * of interest: domain - specif i c libraries & emulation
aside: JAX 3
None
import numpy as np def linear_least_squares(x, y) : A =
np.vander(x, 2) return np.linalg.lstsq(A, y)[0]
import jax.numpy as jnp def linear_least_squares(x, y) : A =
jnp.vander(x, 2) return jnp.linalg.lstsq(A, y)[0]
None
open research practices 4
None
None
None
None
None
None
None
open software is foundational to astrophysics research there are opportunities
at the interface of astro & applied f i elds there are ways you can participate & benef i t right away
7 I want to chat about… [1] your data analysis
problems [2] building astronomical software [3] writing documentation & tutorials
get in touch! dfm.io github.com/dfm