- on big data sets you need to run computations simultaneously (vectorization), or else training takes way too long
- avoid explicit for loops wherever possible; use vectorized operations instead
- vectorization can be done on both CPU & GPU, but GPUs are generally faster at it
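
A minimal sketch of the point about for loops, assuming NumPy on the CPU (the notes do not name a library). The function names `dot_loop` and `dot_vectorized` and the array size `n` are made up for illustration. Both functions compute the same dot product; the vectorized one should typically run much faster, though the exact speedup depends on the machine.

```python
import time
import numpy as np


def dot_loop(w, x):
    """Dot product with an explicit Python for loop (the slow way)."""
    total = 0.0
    for i in range(len(w)):
        total += w[i] * x[i]
    return total


def dot_vectorized(w, x):
    """Same dot product as a single vectorized NumPy call."""
    return float(np.dot(w, x))


# Hypothetical data: two random vectors of an arbitrary size.
rng = np.random.default_rng(0)
n = 100_000
w = rng.random(n)
x = rng.random(n)

t0 = time.perf_counter()
loop_result = dot_loop(w, x)
t1 = time.perf_counter()
vec_result = dot_vectorized(w, x)
t2 = time.perf_counter()

# Timings vary by machine, so none are quoted here.
print(f"loop:       {loop_result:.4f} in {(t1 - t0) * 1e3:.2f} ms")
print(f"vectorized: {vec_result:.4f} in {(t2 - t1) * 1e3:.2f} ms")
```

The same idea carries over to the GPU: array libraries with a GPU backend expose similar whole-array operations, so code written without explicit loops is the version that can take advantage of that hardware.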