Jiaming Yuan
d9a47794a5
Fix CPU hist init for sparse dataset. ( #4625 )
...
* Fix CPU hist init for sparse dataset.
* Implement sparse histogram cut.
* Allow empty features.
* Fix windows build, don't use sparse in distributed environment.
* Comments.
* Smaller threshold.
* Fix windows omp.
* Fix msvc lambda capture.
* Fix MSVC macro.
* Fix MSVC initialization list.
* Fix MSVC initialization list x2.
* Preserve categorical feature behavior.
* Rename matrix to sparse cuts.
* Reuse UseGroup.
* Check for categorical data when adding cut.
Co-Authored-By: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
* Sanity check.
* Fix comments.
* Fix comment.
2019-07-04 16:27:03 -07:00
Rory Mitchell
5e582b0fa7
Combine thread launches into single launch per tree for gpu_hist ( #4343 )
...
* Combine thread launches into single launch per tree for gpu_hist
algorithm.
* Address deprecation warning
* Add manual column sampler constructor
* Turn off omp dynamic to get a guaranteed number of threads
* Enable openmp in cuda code
2019-04-29 09:58:34 +12:00
Rory Mitchell
00465d243d
Optimisations for gpu_hist. ( #4248 )
...
* Optimisations for gpu_hist.
* Use streams to overlap operations.
* ColumnSampler now uses HostDeviceVector to prevent repeatedly copying feature vectors to the device.
2019-03-20 13:30:06 +13:00
Andy Adinets
42bf90eb8f
Column sampling at individual nodes (splits). ( #3971 )
...
* Column sampling at individual nodes (splits).
* Documented colsample_bynode parameter.
- also updated documentation for colsample_by* parameters
* Updated documentation.
* GetFeatureSet() returns shared pointer to std::vector.
* Sync sampled columns across multiple processes.
2018-12-14 22:37:35 +08:00
Rory Mitchell
78bea0d204
Add google test for a column sampling, restore metainfo tests ( #3637 )
...
* Add google test for a column sampling, restore metainfo tests
* Update metainfo test for visual studio
* Fix multi-GPU bug introduced in #3635
2018-08-28 16:10:26 +12:00