* Implement GPU-accelerated coordinate descent algorithm
* Exclude external memory tests for GPU
* Refactor linear modelling and add new coordinate descent updater
* Allow unsorted column iterator
* Add prediction caching to gblinear