Previous | Next --- Slide 33 of 55
Back to Lecture Thumbnails
abraoliv

I am still a little confused on where this code is run? Is it run on a master node and distributed to workers or is it run on each worker on its subset of the data?

michzrrr

So there will be multiple RDD's as this program is created, is that an excess use of space or can users choose to delete ones they won't need

Please log in to leave a comment.