- thrust::copy() called from dvec::copy() for gpairs invoked a GPU kernel instead of
cudaMemcpy()
- this resulted in an illegal memory access if the GPU running the kernel could not access
the data being copied
- the new version of dvec::copy() for thrust::device_ptr iterators calls cudaMemcpy(),
avoiding the problem.
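A minimal sketch of the idea, using an illustrative free function rather than the actual dvec::copy() code:

    #include <cuda_runtime.h>
    #include <thrust/device_ptr.h>

    // Sketch only: copy device-to-device with cudaMemcpy() rather than
    // thrust::copy(), so no kernel runs on a GPU that may lack access to the
    // source buffer. Function name is illustrative.
    template <typename T>
    void CopyDeviceRange(thrust::device_ptr<T> begin, thrust::device_ptr<T> end,
                         thrust::device_ptr<T> out) {
      size_t n = end - begin;
      cudaMemcpy(thrust::raw_pointer_cast(out), thrust::raw_pointer_cast(begin),
                 n * sizeof(T), cudaMemcpyDeviceToDevice);
    }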
* Added GPU objective function and no-copy interface.
- xgboost::HostDeviceVector<T> syncs automatically between host and device
- no-copy interfaces have been added
- default implementations just sync the data to host
and call the implementations with std::vector
- GPU objective function, predictor, histogram updater process data
directly on GPU
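A rough sketch of the pattern with simplified names and signatures (the real xgboost::HostDeviceVector and objective-function API differ in detail):

    #include <vector>

    // Stand-in for xgboost::HostDeviceVector<T>: keeps host and device copies
    // and syncs lazily; only the host side is sketched here.
    template <typename T>
    class HostDeviceVector {
     public:
      std::vector<T>& HostVector() { return h_vec_; }  // would sync device->host first
     private:
      std::vector<T> h_vec_;
    };

    // No-copy interface: a GPU implementation overrides the HostDeviceVector
    // overload and works directly on device memory; the default just syncs to
    // host and forwards to the std::vector overload.
    class ObjFunctionSketch {
     public:
      virtual ~ObjFunctionSketch() = default;
      virtual void GetGradient(HostDeviceVector<float>* preds) {
        GetGradient(&preds->HostVector());
      }
      virtual void GetGradient(std::vector<float>* preds) = 0;
    };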
* Fix #2905
* Fix gpu_exact test failures
* Fix bug in GPU prediction where multiple calls to batch prediction can produce incorrect results
* Fix GPU documentation formatting
- Implement colsampling, subsampling for gpu_hist_experimental
- Optimised multi-GPU implementation for gpu_hist_experimental
- Make nccl optional
- Add Volta architecture flag
- Optimise RegLossObj
- Add timing utilities for debug verbose mode
- Bump required cuda version to 8.0
* Fatal error if GPU algorithm selected without GPU support compiled
* Resolve type conversion warnings
* Fix gpu unit test failure
* Fix compressed iterator edge case
* Fix python unit test failures due to flake8 update on pip
Problem:
Fast histogram updater crashes whenever subsampling picks zero rows
Diagnosis:
The row set data structure uses "nullptr" internally to indicate a non-existent
row set. Since you cannot take the address of the first element of an empty
vector, a valid but empty row set ends up with "nullptr" as well.
Fix:
Use an arbitrary value (not equal to "nullptr") to bypass the nullptr check.
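A small illustration of the pitfall and the sentinel fix; the struct and function names are made up for the example:

    #include <cstddef>
    #include <vector>

    struct RowSetSketch {
      const int* begin;  // nullptr is reserved to mean "row set does not exist"
      std::size_t size;
    };

    RowSetSketch MakeRowSet(const std::vector<int>& rows) {
      // &rows[0] is undefined for an empty vector, so a valid-but-empty row set
      // would otherwise end up with nullptr too. Point it at a dummy sentinel
      // instead so the nullptr check only matches truly non-existent sets.
      static const int kEmptySentinel = 0;
      const int* begin = rows.empty() ? &kEmptySentinel : &rows[0];
      return RowSetSketch{begin, rows.size()};
    }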
* SHAP values for feature contributions
* Fix commenting error
* New polynomial time SHAP value estimation algorithm
* Update API to support SHAP values
* Fix merge conflicts with updates in master
* Correct submodule hashes
* Fix variable sized stack allocation
* Make lint happy
* Add docs
* Fix typo
* Adjust tolerances
* Remove unneeded def
* Fixed cpp test setup
* Updated R API and cleaned up
* Fixed test typo
* [R] MSVC compatibility
* [GPU] allow seed in BernoulliRng up to size_t and scale to uint32_t
* R package build with cmake and CUDA
* R package CUDA build fixes and cleanups
* always export the R package native initialization routine on windows
* update the install instructions doc
* fix lint
* use static_cast directly to set BernoulliRng seed
* [R] demo for GPU accelerated algorithm
* tidy up the R package cmake stuff
* R package cmake: installs main dependency packages if needed
* [R] version bump in DESCRIPTION
* update NEWS
* added short explanations of missing/sparse values to the FAQ
* Removal of redundant code/files.
* Removal of exact namespace in GPU plugin
* Revert double precision histograms to single precision for performance on Maxwell/Kepler
* repaired serialization after the update process; fixes #2545
* non-stratified folds in python could omit some data instances
* Makefile: fixes for older makes on windows; clean R-package too
* make cub to be a shallow submodule
* improve $(MAKE) recovery
* for MinGW, drop the 'lib' prefix from shared library name
* fix defines for 'g++ 4.8 or higher' to include g++ >= 5
* fix compile warnings
* [Appveyor] add MinGW with python; remove redundant jobs
* [Appveyor] also do python build for one of msvc jobs
It has been reported that the new parallel algorithm (#2493) results in excessive
memory usage (see issue #2326). Until the issues are resolved, XGBoost should use
the old parallel algorithm by default. The user would have to specify
`enable_feature_grouping=1` manually to enable the new algorithm.
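For illustration, assuming `booster` is an existing BoosterHandle, the flag could be set through the C API like this:

    #include <xgboost/c_api.h>

    // Opt back in to the new feature-grouping algorithm explicitly.
    XGBoosterSetParam(booster, "enable_feature_grouping", "1");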
* Patch to improve multithreaded performance scaling
Change the parallel strategy for histogram construction.
Instead of partitioning data rows among multiple threads, partition feature
columns. Useful heuristics for assigning partitions have been adopted
from the LightGBM project.
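A schematic of the column-wise strategy; the CSC-style arrays and the one-bin-per-column accumulation are simplifications, not the actual xgboost data structures:

    #include <cstddef>
    #include <vector>

    // Each thread owns a disjoint set of feature columns, so no two threads
    // write to the same histogram entries and no atomics are needed.
    void BuildHistByColumns(const std::vector<std::size_t>& col_ptr,
                            const std::vector<std::size_t>& row_index,
                            const std::vector<double>& grad,
                            std::vector<double>* hist) {
      const int ncol = static_cast<int>(col_ptr.size()) - 1;
      #pragma omp parallel for schedule(static)
      for (int c = 0; c < ncol; ++c) {
        for (std::size_t i = col_ptr[c]; i < col_ptr[c + 1]; ++i) {
          (*hist)[c] += grad[row_index[i]];  // bin lookup collapsed to one bin per column
        }
      }
    }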
* Add missing header to satisfy MSVC
* Restore max_bin and related parameters to TrainParam
* Fix lint error
* inline functions do not require static keyword
* Feature grouping algorithm accepting FastHistParam
The feature grouping algorithm accepts many parameters (3+), and it gets annoying to
pass them one by one. Instead, simply pass a reference to FastHistParam. The
definition of FastHistParam has been moved to a separate header file to
accommodate this change.
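Schematically, with illustrative field and function names:

    // Bundle the knobs once...
    struct FastHistParamSketch {
      int max_bin;
      double sparse_threshold;
      bool enable_feature_grouping;
    };

    // ...and pass the struct by const reference instead of 3+ loose arguments.
    // before: FindGroups(..., max_bin, sparse_threshold, enable_feature_grouping)
    void FindGroups(const FastHistParamSketch& param) {
      // uses param.max_bin, param.sparse_threshold, ...
    }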
Use int32_t explicitly when serializing the version field of dmatrix in binary
format. On ILP64 architectures, rare as they are, the size of int is 64 bits.
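A minimal sketch of the fixed-width write, with stream handling simplified:

    #include <cstdint>
    #include <ostream>

    // Always write exactly 4 bytes for the version field, regardless of the
    // platform's sizeof(int).
    void WriteVersion(std::ostream* fo, int version) {
      const int32_t v = static_cast<int32_t>(version);
      fo->write(reinterpret_cast<const char*>(&v), sizeof(v));
    }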
* [gblinear] add features contribution prediction; fix DumpModel bug
* [gbtree] minor changes to PredContrib
* [R] add feature contribution prediction to R
* [R] bump up version; update NEWS
* [gblinear] fix the base_margin issue; fixes #1969
* [R] list of matrices as output of multiclass feature contributions
* [gblinear] make order of DumpModel coefficients consistent: group index changes the fastest
* Fix compilation on OS X with GCC 7
Compilation failed with
In file included from src/tree/tree_updater.cc:6:0:
include/xgboost/tree_updater.h:75:46: error: 'function' is not a member of 'std'
std::function<TreeUpdater* ()> > {
caused by a missing <functional> include.
* Fixed another occurrence of that issue spotted by @ClimberPG
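The fix amounts to including <functional> directly in the header that names std::function; schematically (the struct name is illustrative):

    #include <functional>  // previously only pulled in transitively on some platforms

    class TreeUpdater;  // forward declaration for the sketch

    struct TreeUpdaterRegSketch {
      std::function<TreeUpdater* ()> factory;  // now compiles with GCC 7 on OS X
    };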
* Add prediction of feature contributions
This implements the idea described at http://blog.datadive.net/interpreting-random-forests/
which tries to give insight into how a prediction is composed of its feature contributions
and a bias (a sketch follows the list of changes below).
* Support multi-class models
* Calculate learning_rate per-tree instead of using the one from the first tree
* Do not rely on node.base_weight * learning_rate having the same value as the node mean value (aka leaf value, if it were a leaf); instead calculate them (lazily) on-the-fly
* Add simple test for contributions feature
* Check against param.num_nodes instead of checking for non-zero length
* Loop over all roots instead of only the first
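A compact sketch of the per-tree decomposition described above; the node layout and split test are simplified, a single root is assumed (the change above loops over all roots), and the real implementation computes node mean values lazily as noted:

    #include <vector>

    struct NodeSketch {
      int split_feature;   // -1 marks a leaf
      double split_value;
      int left, right;     // child indices
      double mean_value;   // expected value under this node, scaled by learning_rate
      bool GoLeft(const std::vector<double>& x) const {
        return x[split_feature] < split_value;
      }
    };

    // contribs has num_features + 1 entries; the last one is the bias term.
    void TreeContribs(const std::vector<NodeSketch>& nodes,
                      const std::vector<double>& x,
                      std::vector<double>* contribs) {
      int nid = 0;                                 // start at the root
      contribs->back() += nodes[nid].mean_value;   // bias = root expectation
      while (nodes[nid].split_feature >= 0) {
        const int next = nodes[nid].GoLeft(x) ? nodes[nid].left : nodes[nid].right;
        // credit the change in expected value to the feature that was split on
        (*contribs)[nodes[nid].split_feature] +=
            nodes[next].mean_value - nodes[nid].mean_value;
        nid = next;
      }
    }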
Reported in issue #2165. Dynamic scheduling of OpenMP loops involves
implicit synchronization. To implement synchronization, libgomp uses futex
(fast userspace mutex), whereas MinGW uses a kernel-space mutex, which is more
costly. With a chunk size of 1, the synchronization overhead may become prohibitive
on Windows machines.
Solution: use the 'guided' schedule to minimize the number of syncs
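Schematically, the scheduling clause changes along these lines:

    // before: #pragma omp parallel for schedule(dynamic, 1)
    // 'guided' hands out progressively smaller chunks, so threads synchronize
    // far less often than with chunk-size-1 dynamic scheduling.
    void ProcessRows(int nrows) {
      #pragma omp parallel for schedule(guided)
      for (int i = 0; i < nrows; ++i) {
        // per-row work ...
      }
    }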
Storing and then loading a model loses any eval_metric that was
provided. This causes implementations that always store/load, like
xgboost4j-spark, to be unable to eval with the desired metric.
This log appears to fire every time I ask the python package to make a prediction. It's the only log that fires from XGBoost. When we're getting predictions on millions of items a day in production, this log seems out of place.
* Bugfix 1: Fix segfault in multithreaded ApplySplitSparseData()
When there are more threads than rows in the rowset, some threads end up
with empty ranges, causing them to crash. (iend - 1 needs to be
accessible as part of the algorithm.)
Fix: run only those threads with nonempty ranges.
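A sketch of the guard; the range arithmetic is simplified and the real code partitions a row-set structure:

    #include <algorithm>
    #include <omp.h>

    void ApplySplitParallelSketch(int nrows) {
      #pragma omp parallel
      {
        const int nthread = omp_get_num_threads();
        const int tid = omp_get_thread_num();
        const int chunk = (nrows + nthread - 1) / nthread;
        const int ibegin = tid * chunk;
        const int iend = std::min(nrows, ibegin + chunk);
        if (ibegin < iend) {  // skip threads that got an empty range
          // ... process rows [ibegin, iend); iend - 1 is now always valid ...
        }
      }
    }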
* Add regression test for Bugfix 1
* Moving python_omp_test to existing python test group
Turns out you don't need to set "OMP_NUM_THREADS" to enable
multithreading. Just add the nthread parameter.
* Bugfix 2: Fix corner case of ApplySplitSparseData() for categorical feature
When the split value is less than all cut points, split_cond is set
incorrectly.
Fix: set split_cond = -1 to indicate this scenario
* Bugfix 3: Initialize data layout indicator before using it
data_layout_ is accessed before being set; this variable determines
whether feature 0 is included in feat_set.
Fix: re-order code in InitData() to initialize data_layout_ first
* Adding regression test for Bugfix 2
Unfortunately, there is no regression test for Bugfix 3, as there is no
way to deterministically control the value of an uninitialized variable.