Improve developing efficiency in pySpark? [Discussion]

Davidat0r · 10 months ago

I’m sorry… I still don’t understand. I thought that if I sampled the data, it would be faster. Isn’t that what people do with large datasets? And if it’s like you say, what’s the option during the development phase? I can’t really wait 15 minutes between instructions (if I want to keep my job haha)