Two interesting points to record in this post

  • MLlib
  • real-time data analysis with Spark Streaming

MLlib

MLlib is handy: just import the module and use it.

Worth mentioning is the root-mean-squared error, aka RMSE. With this error we can compare two predictions with each other: the lower the RMSE, the closer the predictions are to the true values, and if the RMSE is similar, the two predictions are about the same in accuracy.
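To make the comparison concrete, here is a minimal sketch of RMSE in plain Python, with no Spark involved. The `rmse` helper and the sample numbers are made up for illustration; in MLlib itself the same metric is available through `RegressionEvaluator(metricName="rmse")`.

```python
import math


def rmse(predictions, labels):
    """Root-mean-squared error between predicted and true values."""
    if len(predictions) != len(labels) or not labels:
        raise ValueError("need two non-empty sequences of equal length")
    # Square each error so positive and negative misses do not cancel out
    squared_errors = [(p - y) ** 2 for p, y in zip(predictions, labels)]
    # Average the squared errors, then take the root to get back to label units
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical true values and two sets of predictions to compare
labels = [3.0, 4.0, 5.0]
model_a = [2.0, 4.0, 6.0]
model_b = [3.0, 4.0, 8.0]

print("model A RMSE:", rmse(model_a, labels))
print("model B RMSE:", rmse(model_b, labels))
```

Here model A is off by 1 on two points while model B is off by 3 on a single point. Because the errors are squared before averaging, the one large miss costs model B more, so it ends up with the higher RMSE.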

real-time data analysis

When reading in files, use readStream instead of read.
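A rough sketch of what that swap looks like in PySpark, assuming a directory of CSV files with a header row. The function name `start_csv_stream`, the app name, and the choice of the console sink are my own picks for illustration, not something from the original setup.

```python
def start_csv_stream(input_dir, schema_ddl):
    """Start a streaming query that prints rows from CSV files arriving in input_dir."""
    # Imported here so this module still loads where PySpark is not installed
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("readStreamSketch").getOrCreate()

    # Batch version for comparison: spark.read.schema(schema_ddl).csv(input_dir)
    stream_df = (
        spark.readStream
        .schema(schema_ddl)        # streaming file sources need the schema up front
        .option("header", "true")  # assumption: the CSV files have a header row
        .csv(input_dir)
    )

    # A streaming DataFrame is started through writeStream rather than show()
    return (
        stream_df.writeStream
        .format("console")         # print each micro-batch to stdout
        .outputMode("append")
        .start()
    )
```

Calling it would look something like `query = start_csv_stream("/tmp/incoming", "user_id INT, rating DOUBLE")` followed by `query.awaitTermination()`, where both the path and the schema string are hypothetical. New files dropped into the directory are then picked up as they arrive.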

Spark Streaming can be explained by the following image:

[Image: read stream]