Spark on Mesos - MesosCon 2016

Update on Spark on Mesos state of art

Timothy Chen

June 01, 2016


Transcript

  1. Dean Wampler • Architect for Big Data Products at Lightbend

    • Early advocate for Spark on Mesos • O’Reilly author: Programming Scala, 2nd Edition; Programming Hive; Functional Programming for Java Developers Timothy Chen • Principal Engineer at Mirantis • Previously lead engineer at Mesosphere • Apache Mesos PMC • Spark contributor; helps maintain Spark on Mesos
  2. What’s this all about, then? • Why Spark on Mesos?

    • What’s happened since last year? • Demo • What’s next for Spark and Mesos?
  3. Why Spark on Mesos • Hadoop is great, but ...

    – … resource management with YARN is limited to compute engines like MapReduce and Spark. • What if your cluster manager could run everything?
  4. Why Spark on Mesos • Hadoop is great, but ...

    – … Big Data is moving to streaming (“Fast Data”), and Spark offers micro-batch streaming. • What if your cluster manager offered dynamic, flexible resource scheduling able to meet the needs of evolving, long-running streams?
  5. Why Spark on Mesos • Hadoop is great, but ...

    – … it doesn’t support other popular tools like Cassandra, Akka, web frameworks, ... • Maybe you need the SMACK stack: – Spark – Mesos – Akka – Cassandra – Kafka There’s a Scheduler for that!
  6. What’s happened since last year? • What’s new in Mesos

    • What’s new in Spark on Mesos • Deprecating fine-grained mode
  7. What’s new in Mesos? • Resource quotas • Dynamic reservation (beta)

    • CNI network support • GPU support • Unified containerizer • More…
  8. What’s new in Spark on Mesos? • Integration test suite

    • New coarse-grained scheduler • Mesos framework authentication • Cluster mode now supports Python
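Since cluster mode now accepts Python applications, a submission can be sketched as follows; the dispatcher URL and application path are hypothetical placeholders:

```python
# Sketch: submitting a Python application in Mesos cluster mode through
# the MesosClusterDispatcher. URL and file path below are hypothetical.
def build_submit_command(dispatcher_url, app_file):
    """Assemble a spark-submit invocation for Mesos cluster mode."""
    return [
        "spark-submit",
        "--master", dispatcher_url,   # dispatcher endpoint, not the Mesos master
        "--deploy-mode", "cluster",   # driver is launched inside the cluster
        app_file,                     # a .py application now works here too
    ]

cmd = build_submit_command("mesos://dispatcher-host:7077", "hdfs:///apps/job.py")
print(" ".join(cmd))
```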
  9. Integration Test Suite • A recent release candidate for Spark

    broke Mesos integration completely. – Better integration testing clearly needed. – Lightbend and Mesosphere collaborated on an automated integration test suite. https://github.com/typesafehub/mesos-spark-integration-tests
  10. Integration Test Suite • “mesos-docker” subproject: – Builds Docker image

    with Ubuntu, Mesos, Spark, and HDFS. – Scripts to run cluster with 1 master and N slaves, configurable #s of CPUs, memory, etc. • (Not needed if you already have a Mesos cluster ;^)
  11. Integration Test Suite • “test-runner” subproject: – Executes a suite

    of tests on your Mesos or DC/OS cluster. – Currently exercises dynamic allocation, coarse-grain and fine-grain modes, etc.
  12. New Coarse-Grained Scheduler How does the old coarse-grained

    scheduler work? It launches one Spark executor per agent. Rough steps: - Evaluate offers as they come in from the master - Accept offers that meet the minimum CPU (1) and minimum memory requirements - Take as many cores as offered until spark.cores.max is reached - Every executor requests a fixed amount of memory
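The steps above can be modeled as a small simulation; this is an illustrative sketch, not Spark's actual code, and the agent names and numbers are made up:

```python
# Toy model of the OLD coarse-grained allocation (one executor per agent):
# accept each offer that meets the minimums, take as many cores as the
# offer has (capped by the remaining spark.cores.max budget), and always
# request the fixed executor memory.
def allocate_old(offers, cores_max, executor_mem, min_cores=1):
    remaining = cores_max
    executors = []  # (agent, cores, mem), at most one per agent
    for agent, cpus, mem in offers:
        if remaining <= 0:
            break
        if cpus < min_cores or mem < executor_mem:
            continue  # offer does not meet the minimums
        cores = min(cpus, remaining)  # greedily take what the offer has
        executors.append((agent, cores, executor_mem))
        remaining -= cores
    return executors

# Three agents with 8 CPUs / 8 GB each, cores_max=12, executor memory 4 GB
offers = [("agent1", 8, 8), ("agent2", 8, 8), ("agent3", 8, 8)]
print(allocate_old(offers, cores_max=12, executor_mem=4))
# one 8-core executor lands on agent1, one 4-core executor on agent2
```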
  13. New Coarse-Grained Scheduler Example: three agents, each with

    8 CPUs and 8 GB of memory; spark.cores.max=12 and spark.executor.memory=4gb. The CoarseMesosSchedulerBackend launches one executor with 8 CPUs / 4 GB on agent 1 and one with 4 CPUs / 4 GB on agent 2; agent 3 gets nothing.
  14. New Coarse-Grained Scheduler Example: agents with 8, 2, and

    2 CPUs (8 GB of memory each); spark.cores.max=12 and spark.executor.memory=4gb. Result: executors with 8, 2, and 2 CPUs, each taking 4 GB; small offers produce small, uneven executors.
  15. New Coarse-Grained Scheduler Example: agents with 8, 2, and

    2 CPUs and 64 GB of memory each; spark.cores.max=12 and spark.executor.memory=64gb. The CoarseMesosSchedulerBackend launches executors with 8, 2, and 2 CPUs, each requesting the full 64 GB of memory.
  16. New Coarse-Grained Scheduler Problems with the old scheduler: -

    Allows only one executor per agent - Unpredictable executor performance - Unpredictable allocations
  17. New Coarse-Grained Scheduler Example: three agents, each with

    8 CPUs and 8 GB of memory; spark.cores.max=12, spark.executor.memory=4gb, and spark.executor.cores=4. The CoarseMesosSchedulerBackend launches three uniform executors, each with 4 CPUs and 4 GB of memory.
  18. New Coarse-Grained Scheduler - Allows multiple executors per agent

    - More predictable executor performance - (Soon) Better allocation
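A sketch of the new behavior, assuming spark.executor.cores is set; the toy allocator below carves each offer into fixed-size executors (a simplification: the real scheduler's placement across agents depends on offer order):

```python
# Toy model of the NEW behavior with spark.executor.cores set: carve each
# offer into fixed-size executors, allowing several per agent. Illustrative
# only, not Spark's actual CoarseMesosSchedulerBackend code.
def allocate_new(offers, cores_max, executor_cores, executor_mem):
    remaining = cores_max
    executors = []
    for agent, cpus, mem in offers:
        # launch as many fixed-size executors as the offer and budget allow
        while (cpus >= executor_cores and mem >= executor_mem
               and remaining >= executor_cores):
            executors.append((agent, executor_cores, executor_mem))
            cpus -= executor_cores
            mem -= executor_mem
            remaining -= executor_cores
    return executors

offers = [("agent1", 8, 8), ("agent2", 8, 8), ("agent3", 8, 8)]
print(allocate_new(offers, cores_max=12, executor_cores=4, executor_mem=4))
# three uniform 4-core / 4 GB executors, instead of one lopsided pair
```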
  19. Mesos Framework Authentication • Mesos supports framework authentication. • Roles

    can be set per framework – Roles impact the relative weight of resource allocation • Optional authentication information (principal and secret) lets the framework authenticate when connecting to the master.
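A minimal sketch of the Spark configuration keys involved (spark.mesos.principal, spark.mesos.secret, spark.mesos.role); the values are placeholders:

```python
# Sketch: Spark configuration keys for Mesos framework authentication
# and roles. All values below are placeholders.
auth_conf = {
    "spark.mesos.principal": "spark-framework",  # identity presented to the master
    "spark.mesos.secret":    "s3cret",           # shared secret for authentication
    "spark.mesos.role":      "spark",            # role used for allocation weights
}
for key, value in auth_conf.items():
    print(f"--conf {key}={value}")
```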
  20. Getting rid of fine-grained mode? • Why two modes? –

    FG uses resources more efficiently, because executors start on demand and the Spark executor and task resources are released when no longer needed. – CG holds onto all allocated resources until the job finishes. – But that makes CG faster at starting tasks; nice for interactive jobs (e.g., SQL queries). – FG, by contrast, has longer task start-up times.
  21. Getting rid of fine-grained mode? • Today: – Dynamic Allocation

    reclaims unused executors. • (Although it requires running the external shuffle service on every node, which is a disadvantage.) • Hence, the advantages of FG are becoming less important.
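A sketch of the configuration this implies, assuming the external shuffle service is running on every node; the min/max executor values are illustrative:

```python
# Sketch: enabling dynamic allocation on Mesos. Dynamic allocation depends
# on the external shuffle service running on every node; the executor
# bounds below are illustrative, not recommended defaults.
dyn_conf = {
    "spark.dynamicAllocation.enabled":      "true",
    "spark.shuffle.service.enabled":        "true",  # external shuffle service
    "spark.dynamicAllocation.minExecutors": "1",
    "spark.dynamicAllocation.maxExecutors": "10",
}
for key, value in dyn_conf.items():
    print(f"--conf {key}={value}")
```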
  22. Getting rid of fine-grained mode? • Spark has lots of

    redundant code to implement both modes. • So, to simplify the code base and operations, FG is now deprecated, but it can’t be removed yet.
  23. Demo Running Deep Learning on Mesos with TensorFlow on top of

    Spark using GPUs in the Cloud!
  24. What’s Next for Mesos? • Pod support • Multiple roles

    support • Event Bus • Improved container security (capabilities, etc.) • More…
  25. What’s Next for Spark on Mesos? • GPU support on

    Mesos • Multi-tenant cluster mode • Use revocable resources • Better scheduling – Strategies (e.g., spread, bin-pack) – Scheduling metrics • More integration test coverage: – More cluster and job configuration options. – Roles and authentication scenarios.
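The spread and bin-pack strategies mentioned above can be illustrated with a toy placement function (a hypothetical helper, not a Spark or Mesos API):

```python
# Illustrative sketch of the two placement strategies: "spread" puts each
# executor on the agent with the most free cores, while "binpack" fills
# the fullest agent that still fits before moving on. Hypothetical helper.
def place(agents, n_executors, executor_cores, strategy):
    free = dict(agents)  # agent -> free cores
    placement = []
    for _ in range(n_executors):
        if strategy == "spread":
            agent = max(free, key=free.get)  # emptiest agent first
        else:  # "binpack"
            fitting = [a for a, c in free.items() if c >= executor_cores]
            if not fitting:
                break
            agent = min(fitting, key=lambda a: free[a])  # fullest fitting agent
        if free[agent] < executor_cores:
            break
        free[agent] -= executor_cores
        placement.append(agent)
    return placement

agents = {"a1": 8, "a2": 8}
print(place(agents, 3, 2, "spread"))   # alternates between agents
print(place(agents, 3, 2, "binpack"))  # fills a1 first
```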
  26. What’s Next for Spark on Mesos? • Make “production” easier:

    – Easier overriding of configuration with config files outside the jars. – Better documentation. – Easier access to Spark UIs and logs from Mesos UIs – Improved metrics and UI. – Smarter acceptance of resources offered.
  27. What’s this all about, then? • Why Spark on Mesos?

    • What’s happened since last year? • Demo • What’s next for Spark and Mesos?