Scaling Python for Machine Learning: Beyond Data Parallelism • Holden Karau • GOTO 2023
This presentation was recorded at GOTO Chicago 2023. #GOTOcon #GOTOchgo
Holden Karau - Open Source Engineer at Netflix @HoldenKarau
RESOURCES
@holden
ABSTRACT
Data parallelism can be amazing: it frees us from many fiddly, complicated tasks (like dealing with locks). On the other hand, as training large machine learning models becomes increasingly popular, we're seeing the need to move beyond purely data-parallel techniques. Relying exclusively on recomputation to recover from failures is no longer sufficient when our operations are not idempotent.
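To make the idempotency point concrete, here is a minimal sketch in plain Python. It is not taken from the talk, and `square` and `apply_gradient` are hypothetical names chosen for illustration. Rerunning a pure task after a failure gives the same answer, while rerunning an in-place weight update applies it a second time.

```python
def square(x):
    """Pure task: rerunning it after a failure always yields the same answer."""
    return x * x


def apply_gradient(weights, grad, lr=0.5):
    """Stateful update (hypothetical): mutates weights in place, so a retry applies it again."""
    for i, g in enumerate(grad):
        weights[i] -= lr * g
    return weights


# Recompute is safe for the pure task.
first = square(3)
retry = square(3)

# Recompute is not safe for the in-place update.
weights = [1.0, 1.0]
grad = [1.0, 2.0]
apply_gradient(weights, grad)
after_one = list(weights)

apply_gradient(weights, grad)  # simulated blind retry after a suspected failure
after_retry = list(weights)
```

Here `after_one` and `after_retry` differ, which is why a scheduler that simply reruns failed work is not enough once tasks mutate shared state.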
In this talk we will look at Spark, Dask, and Ray in the context of scaling machine learning models and how you can take advantage of other types of distributed parallelism (including the actor model for managing model weights during training). [...]
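As a rough illustration of the actor idea mentioned above, the sketch below keeps model weights inside a single actor and has several workers send it gradient updates. It uses only the Python standard library and is an assumption-laden toy, not the Ray or Dask actor API (both provide their own actor abstractions); `WeightActor` and its methods are hypothetical names.

```python
import queue
import threading


class WeightActor:
    """Hypothetical actor: owns the weights; all access goes through one mailbox."""

    def __init__(self, weights):
        self._weights = list(weights)
        self._mailbox = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def _run(self):
        # A single thread handles messages one at a time, so callers need no locks.
        while True:
            kind, payload, reply = self._mailbox.get()
            if kind == "update":
                self._weights = [w - g for w, g in zip(self._weights, payload)]
                reply.put(None)
            elif kind == "get":
                reply.put(list(self._weights))
            elif kind == "stop":
                reply.put(None)
                return

    def _ask(self, kind, payload=None):
        # Send a message and block until the actor has processed it.
        reply = queue.Queue(maxsize=1)
        self._mailbox.put((kind, payload, reply))
        return reply.get()

    def update(self, grad):
        self._ask("update", grad)

    def get(self):
        return self._ask("get")

    def stop(self):
        self._ask("stop")
        self._thread.join()


actor = WeightActor([0.0, 0.0])

# Four workers each send the same gradient concurrently.
workers = [threading.Thread(target=actor.update, args=([1.0, 2.0],)) for _ in range(4)]
for w in workers:
    w.start()
for w in workers:
    w.join()

final_weights = actor.get()
actor.stop()
```

Because the actor serializes updates, the workers never touch the weights directly. The trade-off is that the actor's state lives in one place, so recovering it after a crash needs something more than rerunning a task.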
TIMECODES
00:00 Intro
04:16 Probable (relevant) biases
06:14 Quick refresher on data parallelism
11:52 What do (distributed) tasks look like?
12:49 Dask distributed tasks
13:42 Ray distributed tasks
14:03 How are they different & same?
18:41 Task fault tolerance
21:53 Does Spark have tasks & actors?
27:39 Ray Diagram
31:33 Ray actor fault tolerance
33:43 What’s up with Ray & Netflix?
34:39 Dask actor fault tolerance
37:28 Outro
RECOMMENDED BOOKS
Holden Karau • Distributed Computing 4 Kids
Holden Karau • Scaling Python with Dask
Holden Karau & Boris Lublinsky • Scaling Python with Ray
Holden Karau & Rachel Warren • High Performance Spark
Holden Karau, Konwinski, Wendell & Zaharia • Learning Spark
Holden Karau & Krishna Sankar • Fast Data Processing with Spark 2nd Edition
Holden Karau • Fast Data Processing with Spark 1st Edition
#MachineLearning #PythonProgramming #Parallelism #DataParallelism #HoldenKarau #PythonForML #ApacheRay #ApacheDask #ApacheSpark #Joblib #FaultTolerance