Developer Productivity Engineering: What's in it for me?

Enterprise Developer Productivity Engineering What’s in it for me?

⬢ Lead Developer Advocate ⬢ Java Champion ⬢ 20+ years
Java experience ⬢ …and author Trisha Gee

https://trishagee.com/books/

But Bottlenecks to Productivity are Everywhere Code Code Wait Time
for Local Build Debug Build Failure Lunch Code Wait Time for Local Build Investigate/Fix Flaky Tests Sprint Waiting time for CI Build

“Bottlenecks in the toolchain are holding back the rockstar 10x
developers” Pete Smoot, Software Architect, Dell Technologies

The “best” programmers outperformed the worst by roughly a 10:1
ratio

What Mattered?

⬢ Paired programmers performed at roughly the same level What
Mattered?

⬢ Paired programmers performed at roughly the same level ⬢
The average difference was only 21% between paired participants What Mattered?

The average difference was only 21% between paired participants ⬢ They didn’t work together on the task, but they came from the same organization What Mattered?

The average difference was only 21% between paired participants ⬢ They didn’t work together on the task, but they came from the same organization ⬢ The best organization performed 11.1x better than the worst What Mattered?

“While this productivity differential among programmers is understandable, there is
also a 10 to 1 difference in productivity among software organizations.” Software Productivity in the Enterprise Harlan (HD) Mills https://trace.tennessee.edu/cgi/viewcontent.cgi?article=1010&context=utk_harlan

“The bald fact is that many companies provide developers with
a workplace that is so crowded, noisy, and interruptive as to fill their days with frustration. That alone could explain reduced efficiency as well as a tendency for good people to migrate elsewhere.” Peopleware: Productive Projects and Teams, Third Edition Tom DeMarco, Tim Lister

Though the phrase had not yet been coined, increased productivity
came down to developer experience.

Gradle is Pioneering DPE DPE is a new software development
practice used by leading software development organizations to maximize developer productivity and happiness.

What Problems Does DPE Solve?

DevOps, 12-Factor, Agile, etc, have still not captured all bottlenecks,
friction, and obstacles to throughput Many are hiding in plain sight, in the developer experience itself

A 10x organization should be reducing build and test feedback
times and improving the consistency and reliability of builds

Pain Point: Waiting for Builds & Tests to Complete

Are you tracking local build and test times?

The only initiatives that will positively impact performance are ones
which increase throughput while simultaneously decreasing cost

Faster Builds Improve Creative Flow Team 1 Team 2 No.
of Devs 11 6 Build Time 4 mins 1 mins No. of local builds 850 1010

Very Fast Feedback Is Important

Solution: Acceleration Technologies

Build Caching Speeds up Builds and Tests

⬢ Introduced to the Java world by Gradle in 2017
⬢ Used by leading technology companies like Google and Facebook ⬢ Can support both user local and remote caching for distributed teams Build Caching

Build Caching When the inputs have not changed, the outputs
can be reused from a previous run.

Demo: Build Cache for Maven and Gradle

Remote Build Cache ⬢ Shared among different machines ⬢ Speeds
up development for the whole team ⬢ Reuses build results among CI agents/jobs and individual developers

Test Distribution Parallelizes Test Execution

Existing solutions: Single machine parallelism Parallelism in Gradle is controlled
by these flags: -- parallel / org.gradle.parallel   Controls project parallelism, defaults to false -- max-workers / org.gradle.workers.max   Controls the maximum number of workers, defaults to the number of processors/cores test.maxParallelForks   Controls how many VMs are forked by an individual test task, defaults to 1 See https://guides.gradle.org/performance/#parallel_execution for more information

Existing solutions: CI fanout See https://builds.gradle.org/project/Gradle for an example of
this strategy Test execution is distributed by manually partitioning the test set and then running partitions in parallel on several CI nodes. pipeline {  stage('compile') { ... }  parallelStage('test') {  step {  sh './gradlew :testGroup1'   }  step {  sh './gradlew :testGroup2'   }  step {  sh './gradlew :testGroup3'   }  }   }

Assessment of existing solutions ⬢ Build Caching is great in
many cases but doesn’t help when test inputs have changed. ⬢ Single machine parallelism is limited by that machine’s resources. ⬢ CI fanout does not help during local development, requires manual setup and test partitioning, and result collection/aggregation

Test Distribution in Gradle Enterprise

Test Distribution Results ‑ ~50% ‑ ~50% ‑ ~50% Measurements
from the demo project Doubling the number of executors cuts build time in half

Netflix reduced a 62-minute test cycle time down to just
under 5 minutes!

Machine learning leads to greater efficiencies

Predictive Test Selection 01 Instead of trying to analyze which
tests could possibly be impacted by developer changes, Predictive Test Selection looks at the history of changes and what has happened to tests in the past 02 When tests complete, they can either FAIL, SUCCEED, or be FLAKY. Predictive Test Selection will predict the outcome of the test based on the history it is analyzing 03 PTS will recommend skipping tests that are successful, and will only run tests that are likely to provide valuable feedback https://arxiv.org/pdf/1810.05286.pdf

Force multiplier when used in combination 1. Build Cache. Avoid
unnecessarily running components of builds and tests whose inputs have not changed. 2. Predictive Test Selection. Run only the relevant subset of test tasks likely to provide useful feedback. 3. Test Distribution. Speed up the execution of the necessary and relevant remaining tests by running them in parallel. 4. Performance Continuity. Sustain Test Distribution and other performance improvements over time with data analytic and performance profiling capabilities.

Is the build and test cycle fast enough?

Is the build and test cycle as fast as it
can possibly be?

Pain Point: Inefficient troubleshooting of broken builds

“ You can observe a lot by just watching.” Yogi
Berra, Catcher and Philosopher Blank background use at will

Build Scan: scans.gradle.com

Learn more https://bit.ly/grdl-scan

DPE Organizations Track Failure Rates

Pain Point: Flaky Tests & Other Avoidable Failures

Flaky builds and tests are maddening

⬢ Try it again ⬢ Re-run it ⬢ Re-run it
again ⬢ Ignore it and approve PR ⬢ All of the above The test is flaky. What do you do now?

Identify and Track Flaky Tests

https://youtu.be/vHBzZHE4tJ0

Pain Point: No Metric/KPI Observability

Without focus, problems can sneak back in

Continuous Improvement: It doesn’t really matter what you improve as
long as you are constantly improving something, because… …entropy denotes that if you aren’t doing anything, you’re always getting worse.

“The tools, services, and environments that developers need to do
their jobs should be treated with production-level SLAs. The development platform is the production environment for the job of creating software” Release It! Second Edition Michael Nygard

Pain Point: Inefficient use of CI Resources

All Of This Will Improve CI Body text

In Summary

⬢ 10x Developers might be a myth, but 10x Organisations
are real In Summary

are real ⬢ Developer Productivity is deeply linked to Developer Experience In Summary

are real ⬢ Developer Productivity is deeply linked to Developer Experience ⬢ If you do nothing about productivity, life will get worse In Summary

are real ⬢ Developer Productivity is deeply linked to Developer Experience ⬢ If you do nothing about productivity, life will get worse ⬢ Fast feedback, efficient troubleshooting, and reliable cycles are key In Summary

are real ⬢ Developer Productivity is deeply linked to Developer Experience ⬢ If you do nothing about productivity, life will get worse ⬢ Fast feedback, efficient troubleshooting, and reliable cycles are key ⬢ Start with observation, and then take action on data In Summary

are real ⬢ Developer Productivity is deeply linked to Developer Experience ⬢ If you do nothing about productivity, life will get worse ⬢ Fast feedback, efficient troubleshooting, and reliable cycles are key ⬢ Start with observation, and then take action on data ⬢ Proactively solve problems for the whole team In Summary

Source: TechValidate. TVID: 066-EEE-DB1

DPE Transforms Every Business Layer

Next Steps

https://bit.ly/speed-build Build speed challenge

There’s a Book for This

https://bit.ly/dpe-4me

Thank you!

How it works… 1. When a test run starts, the
build tool submits a test input snapshot and test set to a machine learning model. 2. PTS automatically develops a test selection strategy by learning from historical code changes and test outcomes from your Build Scan data to predict a subset of relevant tests, which are then executed by your build. 3. Code change and test results data are processed immediately after a Build Scan is uploaded to PTS and updates the test selection strategy based on new results.

Cache Key/Value Calculation The cacheKey for Gradle Tasks/Maven Goals is
based on the Inputs: cacheKey(javaCompile) = hash(sourceFiles, jdk version, classpath, compiler args) The cacheEntry contains the output: cacheEntry[cacheKey(javaCompile)] = fileTree(classFiles) For more information, see: https://docs.gradle.org/current/userguide/build_cache.html

Developer Productivity Engineering: What's in i...

Developer Productivity Engineering: What's in it for me?

More Decks by Trisha Gee

Other Decks in Technology

Featured

Transcript