xgboost

Author	SHA1	Message	Date
ShvetsKS	956beead70	Thread local memory allocation for BuildHist (#6358 ) * thread mem locality * fix apply * cleanup * fix lint * fix tests * simple try * fix * fix * apply comments * fix comments * fix * apply simple comment Co-authored-by: ShvetsKS <kirill.shvets@intel.com>	2020-11-25 17:50:12 +03:00
ShvetsKS	512b464cfa	Disable HT for DMatrix creation (#6386 ) Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com>	2020-11-14 22:18:33 +08:00
Rory Mitchell	29745c6df2	Fix inclusive scan for large sizes (#6234 )	2020-11-03 17:01:43 +13:00
Igor Moura	5e1e972aea	Clean up warnings (#6325 )	2020-10-30 23:50:29 +08:00
Sergio Gavilán	b181a88f9f	Reduced some C++ compiler warnings (#6197 ) * Removed some warnings * Rebase with master * Solved C++ Google Tests errors made by refactoring in order to remove warnings * Undo renaming path -> path_ * Fix style check Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-29 12:36:00 -07:00
Jiaming Yuan	b180223d18	Cleanup RABIT. (#6290 ) * Remove recovery and MPI speed tests. * Remove readme. * Remove Python binding. * Add checks in C API.	2020-10-27 08:48:22 +08:00
Igor Moura	d1254808d5	Clean up C++ warnings (#6213 )	2020-10-19 23:02:33 +08:00
Jiaming Yuan	ddf37cca30	Unify thread configuration. (#6186 )	2020-10-19 16:05:42 +08:00
Jiaming Yuan	bed7ae4083	Loop over `thrust::reduce`. (#6229 ) * Check input chunk size of dqdm. * Add doc for current limitation.	2020-10-14 10:40:56 +13:00
Rory Mitchell	734a911a26	Loop over copy_if (#6201 ) * Loop over copy_if * Catch OOM. Co-authored-by: fis <jm.yuan@outlook.com>	2020-10-14 10:23:16 +13:00
Jiaming Yuan	b05073bda5	[dask] Test for data initializaton. (#6226 )	2020-10-13 11:08:35 +08:00
Jiaming Yuan	2241563f23	Handle duplicated values in sketching. (#6178 ) * Accumulate weights in duplicated values. * Fix device id in iterative dmatrix.	2020-10-10 19:32:44 +08:00
Jiaming Yuan	b5b24354b8	More categorical tests and disable shap sparse test. (#6219 ) * Fix tree load with 32 category.	2020-10-10 16:12:37 +08:00
Jiaming Yuan	70ce5216b5	Add high level tests for categorical data. (#6179 ) * Fix unique.	2020-10-09 09:27:23 +08:00
vcarpani	6bc9747df5	Reduce compile warnings (#6198 ) Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-08 23:14:59 +08:00
Jiaming Yuan	f0c63902ff	Use default allocator in sketching. (#6182 )	2020-09-30 14:55:59 +08:00
Jiaming Yuan	444131a2e6	Add categorical data support to GPU Hist. (#6164 )	2020-09-29 11:27:25 +08:00
Jiaming Yuan	798af22ff4	Add categorical data support to GPU predictor. (#6165 )	2020-09-29 11:25:34 +08:00
Philip Hyunsu Cho	72ef553550	Fall back to CUB allocator if RMM memory pool is not set up (#6150 ) * Fall back to CUB allocator if RMM memory pool is not set up * Fix build * Prevent memory leak * Add note about lack of memory initialisation * Add check for other fast allocators * Set use_cub_allocator_ to true when RMM is not enabled * Fix clang-tidy * Do not demangle symbol; add check to ensure Linux+Clang/GCC combo	2020-09-24 11:04:50 -07:00
Jiaming Yuan	210c131ce7	Support categorical data in GPU sketching. (#6137 )	2020-09-21 13:53:06 +08:00
Jiaming Yuan	e319b63f9e	Merge extract cuts into QuantileContainer. (#6125 ) * Use pruning for initial summary construction.	2020-09-18 16:36:39 +08:00
Jiaming Yuan	5384ed85c8	Use caching allocator from RMM, when RMM is enabled (#6131 )	2020-09-17 21:51:49 -07:00
Philip Hyunsu Cho	d0ccb13d09	Work around a compiler bug in MacOS AppleClang 11 (#6103 ) * Workaround a compiler bug in MacOS AppleClang * [CI] Run C++ test with MacOS Catalina + AppleClang 11.0.3 * [CI] Migrate cmake_test on MacOS from Travis CI to GitHub Actions * Install OpenMP runtime * [CI] Use CMake to locate lz4 lib	2020-09-09 21:21:55 -07:00
Jiaming Yuan	93e9af43bb	Unify set index data. (#6062 )	2020-09-08 11:38:41 +08:00
Rory Mitchell	2e907abdb8	Updates to GPUTreeShap (#6087 ) * Extract paths on device * Update GPUTreeShap	2020-09-06 13:39:08 +12:00
Jiaming Yuan	2fcc4f2886	Unify evaluation functions. (#6037 )	2020-08-26 14:23:27 +08:00
Jiaming Yuan	80c8547147	Make binary bin search reusable. (#6058 ) * Move binary search row to hist util. * Remove dead code.	2020-08-26 05:05:11 +08:00
Jiaming Yuan	20c95be625	Expand categorical node. (#6028 ) Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-08-25 18:53:57 +08:00
Rory Mitchell	9a4e8b1d81	GPUTreeShap (#6038 )	2020-08-25 12:47:41 +12:00
ShvetsKS	24f2e6c97e	Optimize DMatrix build time. (#5877 ) Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com>	2020-08-20 01:37:03 +08:00
Jiaming Yuan	29b7fea572	Optimize cpu sketch allreduce for sparse data. (#6009 ) * Bypass RABIT serialization reducer and use custom allgather based merging.	2020-08-19 10:03:45 +08:00
Qi Zhang	989ddd036f	Swap byte-order in binary serializer to support big-endian arch (#5813 ) * fixed some endian issues * Use dmlc::ByteSwap() to simplify code * Fix lint check * [CI] Add test for s390x * Download latest CMake on s390x * Fix a bug in my code * Save magic number in dmatrix with byteswap on big-endian machine * Save version in binary with byteswap on big-endian machine * Load scalar with byteswap in MetaInfo * Add a debugging message * Handle arrays correctly when byteswapping * EOF can also be 255 * Handle magic number in MetaInfo carefully * Skip Tree.Load test for big-endian, since the test manually builds little-endian binary model * Handle missing packages in Python tests * Don't use boto3 in model compatibility tests * Add s390 Docker file for local testing * Add model compatibility tests * Add R compatibility test * Revert "Add R compatibility test" This reverts commit c2d2bdcb7dbae133cbb927fcd20f7e83ee2b18a8. Co-authored-by: Qi Zhang <q.zhang@ibm.com> Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-08-18 14:47:17 -07:00
Jiaming Yuan	4d99c58a5f	Feature weights (#5962 )	2020-08-18 19:55:41 +08:00
Jiaming Yuan	d240463b38	Revert "Remove warning about memset. (#6003 )" (#6020 ) This reverts commit `12e3fb6a6c`.	2020-08-17 20:10:15 +08:00
Jiaming Yuan	12e3fb6a6c	Remove warning about memset. (#6003 )	2020-08-13 08:25:46 +08:00
Philip Hyunsu Cho	9adb812a0a	RMM integration plugin (#5873 ) * [CI] Add RMM as an optional dependency * Replace caching allocator with pool allocator from RMM * Revert "Replace caching allocator with pool allocator from RMM" This reverts commit e15845d4e72e890c2babe31a988b26503a7d9038. * Use rmm::mr::get_default_resource() * Try setting default resource (doesn't work yet) * Allocate pool_mr in the heap * Prevent leaking pool_mr handle * Separate EXPECT_DEATH() in separate test suite suffixed DeathTest * Turn off death tests for RMM * Address reviewer's feedback * Prevent leaking of cuda_mr * Fix Jenkinsfile syntax * Remove unnecessary function in Jenkinsfile * [CI] Install NCCL into RMM container * Run Python tests * Try building with RMM, CUDA 10.0 * Do not use RMM for CUDA 10.0 target * Actually test for test_rmm flag * Fix TestPythonGPU * Use CNMeM allocator, since pool allocator doesn't yet support multiGPU * Use 10.0 container to build RMM-enabled XGBoost * Revert "Use 10.0 container to build RMM-enabled XGBoost" This reverts commit 789021fa31112e25b683aef39fff375403060141. * Fix Jenkinsfile * [CI] Assign larger /dev/shm to NCCL * Use 10.2 artifact to run multi-GPU Python tests * Add CUDA 10.0 -> 11.0 cross-version test; remove CUDA 10.0 target * Rename Conda env rmm_test -> gpu_test * Use env var to opt into CNMeM pool for C++ tests * Use identical CUDA version for RMM builds and tests * Use Pytest fixtures to enable RMM pool in Python tests * Move RMM to plugin/CMakeLists.txt; use PLUGIN_RMM * Use per-device MR; use command arg in gtest * Set CMake prefix path to use Conda env * Use 0.15 nightly version of RMM * Remove unnecessary header * Fix a unit test when cudf is missing * Add RMM demos * Remove print() * Use HostDeviceVector in GPU predictor * Simplify pytest setup; use LocalCUDACluster fixture * Address reviewers' commments Co-authored-by: Hyunsu Cho <chohyu01@cs.wasshington.edu>	2020-08-12 01:26:02 -07:00
Jiaming Yuan	ee70a2380b	Unify CPU hist sketching (#5880 )	2020-08-12 01:33:06 +08:00
Vladislav Epifanov	388f975cf5	Introducing DPC++-based plugin (predictor, objective function) supporting oneAPI programming model (#5825 ) * Added plugin with DPC++-based predictor and objective function * Update CMakeLists.txt * Update regression_obj_oneapi.cc * Added README.md for OneAPI plugin * Added OneAPI predictor support to gbtree * Update README.md * Merged kernels in gradient computation. Enabled multiple loss functions with DPC++ backend * Aligned plugin CMake files with latest master changes. Fixed whitespace typos * Removed debug output * [CI] Make oneapi_plugin a CMake target * Added tests for OneAPI plugin for predictor and obj. functions * Temporarily switched to default selector for device dispacthing in OneAPI plugin to enable execution in environments without gpus * Updated readme file. * Fixed USM usage in predictor * Removed workaround with explicit templated names for DPC++ kernels * Fixed warnings in plugin tests * Fix CMake build of gtest Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-08-08 18:40:40 -07:00
Jiaming Yuan	6c0c87216f	Fix Windows 2016 build. (#5902 )	2020-07-18 05:50:17 +08:00
Philip Hyunsu Cho	71b0528a2f	GPU implementation of AFT survival objective and metric (#5714 ) * Add interval accuracy * De-virtualize AFT functions * Lint * Refactor AFT metric using GPU-CPU reducer * Fix R build * Fix build on Windows * Fix copyright header * Clang-tidy * Fix crashing demo * Fix typos in comment; explain GPU ID * Remove unnecessary #include * Add C++ test for interval accuracy * Fix a bug in accuracy metric: use log pred * Refactor AFT objective using GPU-CPU Transform * Lint * Fix lint * Use Ninja to speed up build * Use time, not /usr/bin/time * Add cpu_build worker class, with concurrency = 1 * Use concurrency = 1 only for CUDA build * concurrency = 1 for clang-tidy * Address reviewer's feedback * Update link to AFT paper	2020-07-17 01:18:13 -07:00
Jiaming Yuan	7c2686146e	Dask device dmatrix (#5901 ) * Fix softprob with empty dmatrix.	2020-07-17 13:17:43 +08:00
Jiaming Yuan	e471056ec4	Fix sketch size calculation. (#5898 )	2020-07-17 08:33:16 +08:00
Jiaming Yuan	dd445af56e	Cleanup on device sketch. (#5874 ) * Remove old functions. * Merge weighted and un-weighted into a common interface.	2020-07-14 10:15:54 +08:00
Rong Ou	06320729d4	fix device sketch with weights in external memory mode (#5870 )	2020-07-08 08:44:07 +08:00
Jiaming Yuan	048d969be4	Implement GK sketching on GPU. (#5846 ) * Implement GK sketching on GPU. * Strong tests on quantile building. * Handle sparse dataset by binary searching the column index. * Hypothesis test on dask.	2020-07-07 12:16:21 +08:00
Jiaming Yuan	93c44a9a64	Move feature names and types of DMatrix from Python to C++. (#5858 ) * Add thread local return entry for DMatrix. * Save feature name and feature type in binary file. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-07-07 09:40:13 +08:00
Jiaming Yuan	4b0852ee41	Use dmlc stream when URI protocol is not local file. (#5857 )	2020-07-07 03:07:12 +08:00
Philip Hyunsu Cho	efe3e48ae2	Ensure that LoadSequentialFile() actually read the whole file (#5831 )	2020-07-04 16:17:11 +08:00
Jiaming Yuan	1a0801238e	Implement iterative DMatrix. (#5837 )	2020-07-03 11:44:52 +08:00
Jiaming Yuan	8234091368	Remove unweighted GK quantile. (#5816 )	2020-06-23 14:27:46 +08:00

1 2 3 4 5 ...

286 Commits