Background info:

Greetings. I am a student who attentds computer science uni and as part of my dissertation I have to train some models. The thing is,I’m quite new with machine learning and my knowledge is limited so far.

Main: I’m trying to open a 10gb dataset in Google colab to sanitize and preprocess the data before feeding them into a CNN model and I don’t know which is the best way to do it. Thanks for your time