Great stuff!! I am just a bit confused about this line at the Mini-BGD section near the bottom: "Since in SGD, only one example is used simultaneously, so vectorized implementation cannot be implemented."
the "vectorized implementation" referred to SGD or mini-BGD? Thanks!