Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Championing Analytic Infrastructure

kellobri
April 27, 2019

Championing Analytic Infrastructure

An Introduction to R Administration

kellobri

April 27, 2019
Tweet

More Decks by kellobri

Other Decks in Technology

Transcript

  1. Advocating for Analytic Infrastructure - Why it matters - Ideas

    for exploring the space and developing the skill set
  2. What happens when... Industrial Research Business Management Human Resources Government

    Work Regulated Environments Big Data Applications Cloud Infrastructure R in Production What is there to learn? What are the needs? What can we build? The R for Data Science Workflow Drops in Solutions Engineers!
  3. Things work out well when... Ideal: Someone with Linux experience

    and the vision to understand what data scientists need and how to enable them Commonly: This person does not exist
  4. Why it matters “It” = Analytic Infrastructure Meaning… All the

    How, Where and with What, that goes into your daily data science work.
  5. Lower the Cost of Turning Ideas into Realities If you

    have something of value and a PayPal account - you can: Start an online business in minutes
  6. Lower the Cost of Turning Ideas into Realities If you

    have data analysis of value and the R data product toolchain - you can: Share that analysis with everyone you know in minutes
  7. Lessons from the world of DevOps Tactical (dismissible) metric: code

    deployment lead time How long does it take you to get from raw materials (data) to some kind of finished product? How many teams do you have to traverse to make a real impact with the product of your work?
  8. Lessons from the world of DevOps 1. Architecture is what

    enables teams to deliver value through decreasing code deployment lead time 2. Architecture dominates how daily work is performed The improvement of daily work is more important than daily work itself Possibly from the DevOps handbook?? - Gene Kim @RealGeneKim
  9. “R Admin” - Analytic Administrator Role A data scientist who:

    Onboards new tools, deploys solutions, supports existing standards Works closely with IT to maintain, upgrade and scale analytic environments Influences others in the organization to be more effective Passionate about making R a legitimate analytic standard within the organization Check out Nathan Stephens on the RViews Blog - Analytics Administration for R
  10. My personal journey through platforms and tools Undergrad: R Terminal

    Grad School: RStudio IDE (local) + shinyapps.io (free account) My first “real” engineering job: - AWS Cloud $ - Open Source RStudio and Shiny Server (free!) Solutions Engineer: - All the clouds, all the products - limited by imagination (privileged)
  11. The R Admin Goals RViews: Making R legitimate part of

    or your organization - Nathan Stephens
  12. Lay of the Land RStudio - view of the world

    We build tools that you can use to design an excellent platform for data scientists - Three core products - Run on your Linux servers Open source or professional
  13. The R Admin Playbook Build a Sandbox (Proof of Concept)

    Invest in Learning Develop Best Practices Extend your Domain Integrate and Interoperate
  14. Sandboxes - Learning Resources github.com/sol-eng/data-science-lab • A modern Linux operating

    system • An internet connection • Sudo access Step-by Guide: Instance + RStudio + Integration Sean Kross
  15. github.com/sol-eng/vagrant-ansible-sandbox Graduate from Sandboxes to Data Science Labs - Start

    translating your experience into recipes - Recipes are scripts for teaching other people what you know - Configuration management tools are a powerful way to communicate with IT through recipes
  16. If doesn’t bring you joy, don’t force it Please don’t

    force yourself to become an R Admin if the work is tedious to you. But also don’t ignore it. Getting the right tools matters. Seek out an analytic administrator or encourage that growth in someone around you.