Class 2: Cancer and Causation

David Evans
January 17, 2019

https://uvammm.github.io/class2

Markets, Mechanisms, and Machines
University of Virginia
cs4501/econ4559 Spring 2019
David Evans and Denis Nekipelov
https://uvammm.github.io/

Transcript

  1. MARKETS, MECHANISMS, MACHINES

    Class 2: Cancer and Causation. University of Virginia, cs4501/econ4559, Spring 2019. David Evans and Denis Nekipelov, https://uvammm.github.io, 17 January 2019
  2. Plan

    Course; Why I’m Teaching this Class; Causation; Definitions; Correlation; Cancer. Everyone who wants to take the class (including unregistered students) should have a teammate for Project 1. If you don’t, talk to us after class today.
  6. Software Vulnerabilities as Externalities

    “According to one common view, information security comes down to technical measures. Given better access control policy models, formal proofs of cryptographic protocols, approved firewalls, better ways of detecting intrusions and malicious code, and better tools for system evaluation and assurance, the problems can be solved. In this note, I put forward a contrary view: information insecurity is at least as much due to perverse incentives. Many, if not most, of the problems can be explained more clearly and convincingly using the language of microeconomics: network externalities, asymmetric information, moral hazard, adverse selection, liability dumping and the tragedy of the commons.”
  8. How much should we spend on security?

    $124B: projected 2019 worldwide spending on information security [Gartner]. $265B: Apple’s 2018 revenues. $1700B: military spending worldwide (2017); US: $610B. $3.5B: University of Virginia, 2018 operating budget (~50% Medical). “Half the money I spend on advertising is wasted; the trouble is I don't know which half.” (John Wanamaker)
  11. Correlation

    [Chart: US spending on science, space, and technology correlates with suicides by hanging, strangulation and suffocation, 1999 to 2009. Axes: hanging suicides (4000 to 10000) and US spending on science ($15 billion to $30 billion). Source: tylervigen.com]
  13. Random Variable

    Definition: a random variable (e.g., \(X\)) is a distribution of values that is the measured outcome of some experiment. Formally, it is a function from a probability space (set of possible outcomes) to a measurable space (usually the real numbers): \(X: \Omega \to \mathbb{R}\)
  16. Covariance

    Measure of joint variability of two random variables: \(\mathrm{covariance}(X, Y) = E[(X - E[X])(Y - E[Y])]\)
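
As a minimal sketch of the covariance definition on this slide, the Python below averages the product of each variable's deviation from its mean over paired samples. The function name `covariance` and the sample lists `xs` and `ys` are hypothetical, chosen only to keep the arithmetic easy; they are not from the lecture.

```python
from statistics import fmean, pvariance


def covariance(xs, ys):
    """Population covariance of paired samples: E[(X - E[X]) * (Y - E[Y])]."""
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("xs and ys must be non-empty and of equal length")
    mean_x = fmean(xs)
    mean_y = fmean(ys)
    # Average the product of the two deviations from the mean.
    return fmean([(x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)])


# Hypothetical paired measurements (an assumption, not course data).
xs = [1, 2, 3, 4]
ys = [2, 4, 6, 8]

cov_xy = covariance(xs, ys)  # positive: ys rises whenever xs rises
cov_xx = covariance(xs, xs)  # covariance of a variable with itself
var_x = pvariance(xs)        # population variance, for comparison with cov_xx
```

Here `cov_xx` agrees with `var_x`, which illustrates that the covariance of a variable with itself is its variance.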
  18. Independent Variables don’t Covary

    Theorem: If \(X\) is independent of \(Y\), \(\mathrm{covariance}(X, Y) = 0\).

    \begin{align*}
    \mathrm{covariance}(X, Y) &= E[(X - E[X])(Y - E[Y])] \\
    &= E[XY - X \cdot E[Y] - Y \cdot E[X] + E[X] \cdot E[Y]] \\
    &= E[XY] - E[X] \cdot E[Y] - E[Y] \cdot E[X] + E[X] \cdot E[Y] \\
    &= E[XY] - E[X] \cdot E[Y] \\
    &= 0
    \end{align*}

    The last step uses independence, which gives \(E[XY] = E[X] \cdot E[Y]\).
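
The theorem can be checked exactly on a small example. The sketch below assumes two fair six-sided dice (a hypothetical example, not from the slides), builds the joint distribution of independent rolls as the product of the marginals, and computes the covariance with exact fractions. A fully dependent pair, where the second value always equals the first, is included for contrast.

```python
from fractions import Fraction
from itertools import product


def expectation(pmf, f):
    """E[f(outcome)] for a finite distribution given as {outcome: probability}."""
    return sum(p * f(outcome) for outcome, p in pmf.items())


def covariance(joint):
    """Covariance of (X, Y) for a finite joint distribution {(x, y): probability}."""
    mean_x = expectation(joint, lambda xy: xy[0])
    mean_y = expectation(joint, lambda xy: xy[1])
    return expectation(joint, lambda xy: (xy[0] - mean_x) * (xy[1] - mean_y))


# Assumption for illustration: a fair six-sided die.
die = {face: Fraction(1, 6) for face in range(1, 7)}

# Independent rolls: the joint probability is the product of the marginals.
independent = {(x, y): die[x] * die[y] for x, y in product(die, die)}

# Fully dependent pair for contrast: Y is always equal to X.
identical = {(x, x): die[x] for x in die}

cov_independent = covariance(independent)
cov_identical = covariance(identical)
```

With exact arithmetic, `cov_independent` comes out as exactly zero, while `cov_identical` equals the variance of a single die roll.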
  19. Covariance with Itself

    What is \(\mathrm{covariance}(X, X)\)? \(\mathrm{covariance}(X, X) = E[(X - E[X])(X - E[X])] = E[(X - \mu)^2] = \mathrm{variance}(X)\), where \(\mu = E[X]\).
  20. Correlation

    [Chart: US spending on science, space, and technology correlates with suicides by hanging, strangulation and suffocation, 1999 to 2009; r = 0.997. Source: tylervigen.com]
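
A correlation coefficient such as the r reported on this chart is commonly computed as the Pearson correlation: the covariance of the two series divided by the product of their standard deviations. The sketch below assumes that is the statistic in question. The two series are made up for illustration and are not the data behind the chart; they show only that any two quantities that both rise over time can yield a value of r close to 1.

```python
from math import sqrt
from statistics import fmean


def pearson_r(xs, ys):
    """Pearson correlation: covariance(X, Y) / (std(X) * std(Y))."""
    mean_x, mean_y = fmean(xs), fmean(ys)
    # The 1/n factors in the covariance and variances cancel, so sums suffice.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Made-up illustrative series (NOT the chart's data): both simply rise over time.
series_a = [10, 12, 13, 15, 18, 20]
series_b = [3.0, 3.4, 3.5, 4.1, 4.6, 5.2]

r = pearson_r(series_a, series_b)
```

A high `r` here says nothing about whether one series causes the other, which is the point of the chart.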
  21. [Chart: US spending on science, space, and technology correlates with suicides by hanging, strangulation and suffocation, 1999 to 2009. Source: http://tylervigen.com/spurious-correlations]
  22. Sir Richard Doll (1912-2005) and Sir Austin Bradford Hill (1897-1991)

    “In England and Wales the phenomenal increase in the number of deaths attributed to cancer of the lung provides one of the most striking changes in the pattern of mortality recorded by the Registrar-General. For example, in the quarter of a century between 1922 and 1947 the annual number of deaths recorded increased from 612 to 9,287, or roughly fifteenfold. ... The rise seems to have been particularly rapid since the end of the first world war, between 1921-30 and 1940-4 the death rate of men at ages 45 and over increased sixfold and of women of the same ages approximately threefold. This increase is still continuing. It has occurred, too, in Switzerland, Denmark, the U.S.A., Canada, and Australia, and has been reported from Turkey and Japan.”
  24. Study Design

    Arrange for hospitals to contact investigators when a patient is admitted with lung cancer; interview the patient about smoking; also interview a non-cancer “control” patient.
  28. Probability Test

    Probability that, if the null hypothesis were true, the measured correlation would be at least as high as the one observed.
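
One way to estimate such a probability, sketched here as an assumption rather than the method used in the lecture or in the original study, is a permutation test: shuffle one series many times, which breaks any real association, and count how often the shuffled data gives a correlation at least as high as the observed one. The function names, the sample data, and the `trials` and `seed` parameters are all hypothetical.

```python
import random
from math import sqrt
from statistics import fmean


def pearson_r(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    mean_x, mean_y = fmean(xs), fmean(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def permutation_p_value(xs, ys, trials=10_000, seed=0):
    """Estimate P(r >= observed r) under the null hypothesis of no association."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same estimate
    observed = pearson_r(xs, ys)
    shuffled = list(ys)
    at_least_as_high = 0
    for _ in range(trials):
        rng.shuffle(shuffled)  # shuffling ys simulates "no association" with xs
        if pearson_r(xs, shuffled) >= observed:
            at_least_as_high += 1
    return at_least_as_high / trials


# Made-up illustrative data (an assumption, not study data): ys tracks xs closely.
xs = list(range(10))
ys = [1.1, 1.9, 3.2, 3.8, 5.1, 6.2, 6.8, 8.1, 9.0, 9.9]

p_value = permutation_p_value(xs, ys)
```

A small `p_value` means the observed correlation would be unusual if the null hypothesis were true. This simple estimator can return exactly zero when no shuffle matches the observed value; some analysts add one to both the count and the number of trials to avoid that.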
  29. Sir Ronald Fisher (1890-1962)

    "...the null hypothesis is never proved or established, but is possibly disproved, in the course of experimentation. Every experiment may be said to exist only in order to give the facts a chance of disproving the null hypothesis."
  35. Lessons Learned?

    “Statistics has gained a place of modest usefulness in medical research. It can derive and retain this only by complete impartiality, which is not unattainable by rational minds. We should not be content to be ‘not so unfair’, for without fairness the statistician is in danger of scientific errors through his moral fault. ...” Ronald A. Fisher, Alleged Dangers of Cigarette-Smoking, British Medical Journal, 1957
  36. Hill’s Lessons

    1. Strength of association
    2. Consistency: repeated observation
    3. Specificity
    4. Temporality
    5. Gradient: more smoking → more cancer
    6. Plausibility (not required)
    7. Coherence (not conflict with known facts)
    8. Experiment
    9. Analogy
  38. Charge

    Next class: Statistical Learning Theory. Project 1: due 9:29am, Tuesday (Jan 22). Australian Cigarette Packaging.