Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Tools for Data Journalism | MediaLab Prado DDJ ...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Friedrich Lindenberg
October 23, 2015
250
0
Share
Tools for Data Journalism | MediaLab Prado DDJ Workshop
Presented Fri, 23.10.2015
Friedrich Lindenberg
October 23, 2015
More Decks by Friedrich Lindenberg
See All by Friedrich Lindenberg
Introducción a OCCRP Data
pudo
0
430
Getting started with OCCRP Data
pudo
0
1.7k
#nr16: Recherche-Tools
pudo
1
120
data.occrp.org
pudo
0
180
Digitial Research Tools for Investigative Reporters
pudo
0
11k
Grano: A Python tool for investigating influence
pudo
1
300
Data doesn't grow in tables
pudo
2
290
Dr. Freezefile
pudo
2
450
Intro presentation for Naivasha
pudo
1
180
Featured
See All Featured
Organizational Design Perspectives: An Ontology of Organizational Design Elements
kimpetersen
PRO
1
670
Agile Leadership in an Agile Organization
kimpetersen
PRO
0
120
WCS-LA-2024
lcolladotor
0
520
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
520
Groundhog Day: Seeking Process in Gaming for Health
codingconduct
0
140
SEO in 2025: How to Prepare for the Future of Search
ipullrank
3
3.4k
Being A Developer After 40
akosma
91
590k
Bridging the Design Gap: How Collaborative Modelling removes blockers to flow between stakeholders and teams @FastFlow conf
baasie
0
500
Ecommerce SEO: The Keys for Success Now & Beyond - #SERPConf2024
aleyda
1
1.9k
Test your architecture with Archunit
thirion
1
2.2k
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
140
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
170
Transcript
Tools for data journalism
I’m @pudo I make software for journalists at @occrp
None
Find data where no (wo)man has gone before
Dig into bureaucracy
None
None
Everything is data
None
Voluntary Release Involuntary Release Active acquisition FoI Scraping Passive acquisition
Open Data Leaks
All web pages are data! import.io / Chrome Scraper /
ScraperWiki
Documents, too! Tabula PDF / CometDocs / ABBYY
Collect a treasure (and share it!)
None
Interview the data
Data cleaning with Google Refine
Google Sheets & Pivot Tables
WARNING: Use your brain
Make a point
None
None
DataWrapper
CartoDB
When is a map a map?
None
None
None
If you like, learn to code
JavaScript/D3 SQL/Databases Python for scraping
[email protected]