Two interesting points to record in this post
- MLlib
- real-time data analysis with Spark Streaming
MLlib
MLlib is handy: just import the module you need and use it.
Worth mentioning is root-mean-squared error, aka RMSE. Using this error we can compare two predictions with each other: the lower the RMSE, the closer a prediction is to the actual values, and if two RMSE values are similar, the two predictions are about the same in accuracy.
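To make the RMSE comparison concrete, here is a minimal sketch in plain Python. The `rmse` helper and the sample numbers are made up for illustration and are not from any real dataset. In MLlib itself the same metric is available through `RegressionEvaluator` with `metricName="rmse"`, so this is only meant to show what the number means.

```python
import math


def rmse(predictions, actuals):
    """Root-mean-squared error between two equal-length lists of numbers."""
    if len(predictions) != len(actuals) or not actuals:
        raise ValueError("inputs must be non-empty and have the same length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predictions, actuals)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical ratings and two hypothetical sets of predictions.
actual = [4.0, 3.0, 5.0, 2.0]
model_a = [3.5, 3.0, 4.5, 2.5]
model_b = [2.0, 4.0, 3.0, 4.0]

rmse_a = rmse(model_a, actual)
rmse_b = rmse(model_b, actual)

# The prediction with the lower RMSE is, on average, closer to the actual values.
print(f"model_a RMSE: {rmse_a:.3f}")
print(f"model_b RMSE: {rmse_b:.3f}")
```

Here `model_a` ends up with the smaller RMSE, so it would be the more accurate of the two on this toy data.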
real-time data analysis
When reading in files, use readStream instead of read.
Spark Streaming can be explained by the following image:
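A rough PySpark sketch of the difference between `read` and `readStream` might look like this. The schema, the directory path, and the helper names (`read_batch`, `read_stream`, `run_demo`) are all assumptions made up for this example. One thing to keep in mind is that file-based streaming sources need an explicit schema by default, which is why it is passed in here.

```python
# Hypothetical schema (as a DDL string) for CSV files of movie ratings.
RATINGS_SCHEMA = "userId INT, movieId INT, rating DOUBLE"


def read_batch(spark, path):
    """One-off read: loads only the files that exist in `path` right now."""
    return spark.read.schema(RATINGS_SCHEMA).csv(path)


def read_stream(spark, path):
    """Streaming read: keeps picking up new files as they land in `path`."""
    return spark.readStream.schema(RATINGS_SCHEMA).csv(path)


def run_demo(path="/tmp/ratings_in"):
    """Counts ratings per movie and prints updated counts to the console.

    `path` is a placeholder directory; point it at wherever files arrive.
    """
    # Imported here so the helpers above can be loaded without Spark installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("readStreamSketch").getOrCreate()
    counts = read_stream(spark, path).groupBy("movieId").count()

    # "complete" mode re-emits the full aggregated table on every trigger.
    query = (
        counts.writeStream
        .outputMode("complete")
        .format("console")
        .start()
    )
    query.awaitTermination()
```

Calling `run_demo()` in an environment with PySpark installed starts the query and blocks until it is stopped. Apart from `readStream` and the `writeStream` sink at the end, the DataFrame code in the middle is the same as it would be for a batch job.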