Embedding Openness in Our Universities
Arfon Smith
September 15, 2015
Transcript
Embedding Openness in Our Universities
Arfon Smith (@arfon)
Creative Commons Attribution 3.0 Unported License
Three ideas we should steal from Open Source
What is a GitHub?
GitHub
[Chart: GitHub users by year, 2007 to 2014 (y-axis 0 to 9,000,000)]
[Chart: GitHub repositories by year, 2007 to 2014 (y-axis up to 20,000,000)]
Why build a GitHub?
1. Made writing code a social experience
2. Changed the collaborative model of open source
‘May I have access to your codes please?’
From 653314448c7c6f6ec2f93de346896895f786773f Mon Sep 17 00:00:00 2001
From: Arfon Smith <[email protected]>
Date: Mon, 13 Oct 2014 16:37:46 -0500
Subject: [PATCH] Bust that cache

---
 lib/linguist/repository.rb | 14 ++++++++++++--
 test/test_repository.rb    | 12 ++++++++++++
 2 files changed, 24 insertions(+), 2 deletions(-)

diff --git a/lib/linguist/repository.rb b/lib/linguist/repository.rb
index 1f9e09c..9998ee6 100644
--- a/lib/linguist/repository.rb
+++ b/lib/linguist/repository.rb
@@ -1,6 +1,6 @@
 require 'linguist/lazy_blob'
 require 'rugged'
-
+require 'pry'
 module Linguist
   # A Repository is an abstraction of a Grit::Repo or a basic file
   # system tree. It holds a list of paths pointing to Blobish objects.
@@ -128,13 +128,23 @@ def current_tree
     protected
     def compute_stats(old_commit_oid, cache = nil)
-      file_map = cache ? cache.dup : {}
       old_tree = old_commit_oid && Rugged::Commit.lookup(repository,
GitHub delivered on a theoretical promise of open source
Open source collaborations
Open Source: the right to modify, not the right to contribute.
Forking a project was done as a last resort
GitHub made forking the norm
1. Open Collaborations
Open Source vs Open Collaborations
Open Source: the right to modify
Open Collaborations: a highly collaborative development process that is receptive to contributions of code, documentation, discussion, etc. from anyone who shows competent interest.
How do 4000 people work together?
The pull request
Discuss, improve
Code first, permission later
Exposed process
Every time this happens the community learns
Academia makes the same promise
Explain what you did
So that others can repeat
Everybody learns
Open = Public? (doesn’t have to mean this)
Open (within your team, department or institution)
Electronic & Available
Exposed process
Asynchronous & Lock-free
Open, low friction collaborations
2. Culture of Reuse
A story from my life (~10 years ago)
http://amandabauer.blogspot.com/
> cat bad_pix_mask.txt
130 130 1 2048
189 189 258 258
480 562 378 378
493 521 390 397
851 851 247 274
319 319 304 580
493 511 610 636
188 188 228 228
2 days work
3 observing runs/week
52 weeks in year
15 year detector lifetime
2*3*52*15 = 4680 days (13 years)
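The arithmetic on this slide can be checked with a short sketch. It assumes the slide means that the 2 days of work are repeated for every observing run, and it converts days to years with a 365-day year; the constant names are illustrative, not from the talk.

```python
# Figures taken from the slide; the constant names are hypothetical.
DAYS_OF_WORK_PER_RUN = 2        # assumed: effort repeated for each observing run
OBSERVING_RUNS_PER_WEEK = 3
WEEKS_PER_YEAR = 52
DETECTOR_LIFETIME_YEARS = 15

# Total person-days spent over the detector's lifetime.
total_days = (
    DAYS_OF_WORK_PER_RUN
    * OBSERVING_RUNS_PER_WEEK
    * WEEKS_PER_YEAR
    * DETECTOR_LIFETIME_YEARS
)

# Convert to years of continuous effort (365-day year, an approximation).
total_years = total_days / 365

print(f"{total_days} days, about {total_years:.0f} years")
# prints "4680 days, about 13 years"
```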
A second story from my life (~6 months ago)
Software composed of many components
Your software is the thing that is different
Open Source: Ubiquitous culture of reuse
3. Verification
Robots doing work
“open source is… reproducible by necessity”
Fernando Perez, http://blog.fperez.org/2013/11/an-ambitious-experiment-in-data-science.html
Why steal ideas from open source?
Academic landscape is changing
A VISION AND STRATEGY FOR SOFTWARE FOR SCIENCE, ENGINEERING, AND EDUCATION
Open is the new normal
Software & Data Services
New tools. New ways of working.
New tools. New ways of publishing.
http://www.flickr.com/photos/tamaleaver/
Reproducibility
Data intensive
Complex (unpublished) things
Numbers, data
Science!
Likely thing #1: Verification & benchmarking services
Software is an unforgiving medium
Automating processes
Benchmarking services
Likely thing #2: Most innovation around shared challenges/data products
[Diagram: data products at Level 1 (continual) and Level 2 (periodic)]
Software composed of many components
Your software is the thing that is different
Open Source: Ubiquitous culture of reuse
Ecosystem around data products
[Diagram: ecosystem around Level 1 (continual) data products. Labels: Stars, Rocks, SN, WR, NEOs, Josh Bloom’s Type Ia supernovae]
Likely thing #3: ‘Normal’ citations won’t be sufficient for software
“Academic environments of today do not reward tool builders”
Ed Lazowska, OSTP event, http://lazowska.cs.washington.edu/MS/MS.OSTP.pdf
“publishing a paper about code is basically just advertising”
David Donoho, http://www.stanford.edu/~vcs/Video.html
Transitive Credit
[Diagram: transitive credit. Node labels: Paper, Author 1, Author 2, Paper, Software, Data with weights 0.2, 0.2, 0.4, 0.1, 0.1; Paper, Software, Software, Author 1 with weights 0.5, 0.3, 0.1, 0.1]
http://arxiv.org/abs/1407.5117, Katz & Smith
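One way to picture transitive credit is as weighted credit flowing through a chain of products. The sketch below is a minimal illustration under stated assumptions, not the scheme as specified by Katz & Smith: the product names, contributor names and weights are all hypothetical, each product's weights are assumed to sum to 1, the dependency graph is assumed to have no cycles, and only people (not intermediate products) accumulate credit.

```python
from collections import defaultdict

# Hypothetical credit maps: each product lists its contributors and the
# fraction of its credit assigned to each. Weights are illustrative only
# and are assumed to sum to 1 for every product.
CREDIT_MAPS = {
    "paper": {"author_1": 0.4, "author_2": 0.4, "software": 0.2},
    "software": {"author_3": 0.75, "library": 0.25},
    "library": {"author_4": 1.0},
}


def transitive_credit(product, credit_maps, weight=1.0, totals=None):
    """Distribute `weight` units of credit for `product` down to people.

    A contributor that is itself a product (a key in `credit_maps`) passes
    its share on to its own contributors. Assumes an acyclic graph.
    """
    if totals is None:
        totals = defaultdict(float)
    for contributor, fraction in credit_maps[product].items():
        share = weight * fraction
        if contributor in credit_maps:
            # Contributor is another product: recurse with the reduced share.
            transitive_credit(contributor, credit_maps, share, totals)
        else:
            # Contributor is a person: accumulate the share.
            totals[contributor] += share
    return dict(totals)


credit = transitive_credit("paper", CREDIT_MAPS)
for person, share in sorted(credit.items()):
    print(f"{person}: {share:.2f}")
```

In this toy example the author of the software receives 0.2 × 0.75 of the paper's credit without being an author of the paper, which is the kind of recognition for tool builders the preceding slides argue is missing.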
Authorship isn’t static
Where does progress come first?
Where do communities form?
Around a shared challenge?
Around shared data?
Be more exact
Where peers can most easily recognise value
Open source has solved much of what academia needs
The challenge is to adapt and evolve the academy in this new collaborative age
Thanks.
[email protected]
@arfon