Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Talk at West Coast Association of Shared Direct...

Talk at West Coast Association of Shared Directors meeting

Deepak Singh

October 28, 2011
Tweet

More Decks by Deepak Singh

Other Decks in Technology

Transcript

  1. Platforms for Deepak Singh P r i n c i

    p a l P r o d u c t M a n a g e r Data Science
  2. Hardware CPU, storage, memory Data management Collections, datasets, provenance Software

    parallelization, optimization Availability Backup, redundant, replicated Cost Small
  3. Netflix needed to transcode 17,000 titles (80TB of data) to

    support the launch of Sony PS3. They provisioned 1200 Amazon EC2 instances and completed the transcoding process in just days. Source: Adrian Cockroft (Netflix)
  4. “Our 40-instance (m2.2xlarge) cluster can scan, filter, and aggregate 1

    billion rows in 950 milliseconds.” Mike Driscoll - Metamarkets
  5. WIEN2K Parallel Performance H size 56,000 (25GB) Runtime (16x8 processors)

    Local (Infiniband) 3h:48 Cloud (10Gbps) 1h:30 ($40) 1200 atom unit cell; SCALAPACK+MPI diagonalization, matrix size 50k-100k Credit: K. Jorissen, F. D. Villa, and J. J. Rehr (U. Washington)
  6. “The process of moving StarMolsim over to the cloud to

    support the “Introduction to Modeling and Simulation” course at MIT was a huge success. The cloud enabled the STAR group to move away from the responsibility of owning and maintaing dedicated hardware and instead focus on their core mission of developing software and services for faculty, students, and researchers at MIT” http://web.mit.edu/stardev/cluster/about.html