* [CI] Clean up build for JVM packages
* Use correct path for saving native lib
* Fix groupId of maven-surefire-plugin
* Fix stashing of xgboost4j_jar_gpu
* [CI] Don't run xgboost4j-tester with GPU, since it doesn't use gpu_hist
* Modin DataFrame support (a usage sketch follows this changeset)
* Mode change
* Add tests; extend the CI environment
* Mode change
* Remove redundant installation of Modin
* Add a pytest skip marker for Modin
* Install Modin[ray] from PyPI
* Fix interference
* Avoid extra conversion
* Delete the CV test for Modin
* Revert the CV function
Co-authored-by: ShvetsKS <kirill.shvets@intel.com>
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
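A minimal usage sketch of the Modin support added above, assuming `modin[ray]` and a Modin-aware XGBoost build are installed; the data, column names, and parameters are illustrative only:

```python
import modin.pandas as mpd
import numpy as np
import xgboost as xgb

# A small Modin DataFrame; with this change XGBoost accepts it much like a pandas one.
X = mpd.DataFrame(np.random.rand(100, 4), columns=["f0", "f1", "f2", "f3"])
y = mpd.Series(np.random.randint(0, 2, size=100))

dtrain = xgb.DMatrix(X, label=y)
booster = xgb.train({"objective": "binary:logistic"}, dtrain, num_boost_round=10)
```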
* Fix a unit test on CLI, to handle RC versions
* [CI] Use mgpu machine to run gpu hist unit tests
* [CI] Build GPU-enabled JAR artifact and deploy to xgboost-maven-repo
* [CI] Move lint to GitHub Actions
* [CI] Move Doxygen to GitHub Actions
* [CI] Move Sphinx build test to GitHub Actions
* [CI] Reduce workload for Windows R tests
* [CI] Move clang-tidy to Build stage
* Fix some endian issues
* Use dmlc::ByteSwap() to simplify code
* Fix lint check
* [CI] Add test for s390x
* Download latest CMake on s390x
* Fix a bug in my code
* Save magic number in dmatrix with byteswap on big-endian machine
* Save version in binary with byteswap on big-endian machine
* Load scalar with byteswap in MetaInfo
* Add a debugging message
* Handle arrays correctly when byteswapping
* EOF can also be 255
* Handle magic number in MetaInfo carefully
* Skip Tree.Load test for big-endian, since the test manually builds little-endian binary model
* Handle missing packages in Python tests
* Don't use boto3 in model compatibility tests
* Add s390 Docker file for local testing
* Add model compatibility tests
* Add R compatibility test
* Revert "Add R compatibility test"
This reverts commit c2d2bdcb7dbae133cbb927fcd20f7e83ee2b18a8.
Co-authored-by: Qi Zhang <q.zhang@ibm.com>
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
* [CI] Add RMM as an optional dependency
* Replace caching allocator with pool allocator from RMM
* Revert "Replace caching allocator with pool allocator from RMM"
This reverts commit e15845d4e72e890c2babe31a988b26503a7d9038.
* Use rmm::mr::get_default_resource()
* Try setting default resource (doesn't work yet)
* Allocate pool_mr in the heap
* Prevent leaking pool_mr handle
* Move EXPECT_DEATH() tests into a separate test suite suffixed DeathTest
* Turn off death tests for RMM
* Address reviewer's feedback
* Prevent leaking of cuda_mr
* Fix Jenkinsfile syntax
* Remove unnecessary function in Jenkinsfile
* [CI] Install NCCL into RMM container
* Run Python tests
* Try building with RMM, CUDA 10.0
* Do not use RMM for CUDA 10.0 target
* Actually test for test_rmm flag
* Fix TestPythonGPU
* Use CNMeM allocator, since the pool allocator doesn't yet support multi-GPU
* Use 10.0 container to build RMM-enabled XGBoost
* Revert "Use 10.0 container to build RMM-enabled XGBoost"
This reverts commit 789021fa31112e25b683aef39fff375403060141.
* Fix Jenkinsfile
* [CI] Assign larger /dev/shm to NCCL
* Use 10.2 artifact to run multi-GPU Python tests
* Add CUDA 10.0 -> 11.0 cross-version test; remove CUDA 10.0 target
* Rename Conda env rmm_test -> gpu_test
* Use env var to opt into CNMeM pool for C++ tests
* Use identical CUDA version for RMM builds and tests
* Use pytest fixtures to enable the RMM pool in Python tests (a sketch follows this changeset)
* Move RMM to plugin/CMakeLists.txt; use PLUGIN_RMM
* Use per-device MR; use command arg in gtest
* Set CMake prefix path to use Conda env
* Use 0.15 nightly version of RMM
* Remove unnecessary header
* Fix a unit test when cudf is missing
* Add RMM demos
* Remove print()
* Use HostDeviceVector in GPU predictor
* Simplify pytest setup; use LocalCUDACluster fixture
* Address reviewers' comments
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
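A hedged sketch of the pytest-fixture approach for opting into the RMM pool mentioned above; the fixture name, scope, and the dummy test are assumptions for illustration, not the exact fixtures used in the test suite:

```python
import numpy as np
import pytest
import xgboost as xgb

try:
    import rmm  # RMM is an optional dependency (PLUGIN_RMM)
except ImportError:
    rmm = None

@pytest.fixture(scope="session")
def rmm_pool():
    """Switch RMM to its pool allocator for the duration of the test session."""
    if rmm is None:
        pytest.skip("rmm is not installed")
    rmm.reinitialize(pool_allocator=True)
    yield
    rmm.reinitialize(pool_allocator=False)

def test_gpu_hist_with_rmm_pool(rmm_pool):
    dtrain = xgb.DMatrix(np.random.rand(64, 4), label=np.random.rand(64))
    xgb.train({"tree_method": "gpu_hist"}, dtrain, num_boost_round=2)
```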
* [CI] Move lint to a separate script
* [CI] Improved lintr launcher
* Add lintr as a separate action
* Add custom parsing logic to print out logs
* Fix lintr issues in demos
* Run R demos
* Fix CRAN checks
* Install XGBoost into R env before running lintr
* Install devtools (needed to run demos)
* [jvm-packages] Add gpu_hist tree method
* Change the hist updater to grow_quantile_histmaker
* Add GPU scheduling
* Pass correct parameters to the XGBoost library
* Remove debug info
* Add use.cuda to the POM
* Add CI for gpu_hist in the JVM packages
* Add GPU unit tests
* Use a GPU node to build the JVM packages
* Use nvidia-docker
* Add a CLI interface to create_jni.py using argparse (a sketch follows this changeset)
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
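A hedged sketch of what an argparse-based CLI for `create_jni.py` can look like; the flag names and the CMake invocation below are assumptions for illustration, not the script's actual options:

```python
import argparse
import subprocess

def main():
    parser = argparse.ArgumentParser(description="Build the XGBoost4J native (JNI) library")
    parser.add_argument("--use-cuda", choices=["ON", "OFF"], default="OFF",
                        help="Build the GPU-enabled native library")
    parser.add_argument("--jobs", type=int, default=4, help="Number of parallel build jobs")
    args = parser.parse_args()

    # Configure and build with CMake, forwarding the CLI choices.
    subprocess.check_call(["cmake", "..", "-DJVM_BINDINGS:BOOL=ON",
                           "-DUSE_CUDA:BOOL=" + args.use_cuda])
    subprocess.check_call(["cmake", "--build", ".", "--", "-j" + str(args.jobs)])

if __name__ == "__main__":
    main()
```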
* Publish artifacts only on the master and release branches
* Build CUDA only for Compute Capability 7.5 when building PRs
* Run all Windows jobs in a single worker image
* Build nightly XGBoost4J SNAPSHOT JARs with Scala 2.12 only
* Show skipped Python tests on Windows
* Make Graphviz optional for Python tests
* Add back C++ tests
* Unstash xgboost_cpp_tests
* Fix label to CUDA 10.1
* Install CuPy for CUDA 10.1
* Install jsonschema
* Address reviewer's feedback
* Add interval accuracy (a usage sketch follows this changeset)
* De-virtualize AFT functions
* Lint
* Refactor AFT metric using GPU-CPU reducer
* Fix R build
* Fix build on Windows
* Fix copyright header
* Clang-tidy
* Fix crashing demo
* Fix typos in comment; explain GPU ID
* Remove unnecessary #include
* Add C++ test for interval accuracy
* Fix a bug in accuracy metric: use log pred
* Refactor AFT objective using GPU-CPU Transform
* Lint
* Fix lint
* Use Ninja to speed up build
* Use time, not /usr/bin/time
* Add cpu_build worker class, with concurrency = 1
* Use concurrency = 1 only for CUDA build
* concurrency = 1 for clang-tidy
* Address reviewer's feedback
* Update link to AFT paper
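A hedged sketch of using the AFT objective together with the interval accuracy metric added above; the synthetic interval-censored labels and parameter values are illustrative:

```python
import numpy as np
import xgboost as xgb

X = np.random.rand(100, 4)
lower = np.random.rand(100) * 10
upper = lower + np.random.rand(100) * 5  # interval-censored labels: [lower, upper]

dtrain = xgb.DMatrix(X)
dtrain.set_float_info("label_lower_bound", lower)
dtrain.set_float_info("label_upper_bound", upper)

params = {
    "objective": "survival:aft",
    "aft_loss_distribution": "normal",
    "eval_metric": "interval-regression-accuracy",  # fraction of predictions inside the label interval
}
booster = xgb.train(params, dtrain, num_boost_round=10, evals=[(dtrain, "train")])
```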
* Use Hypothesis
* Allow int64 array interface for groups (a usage sketch follows this changeset)
* Add packages to Windows CI
* Add to Travis
* Make sure the device index is set correctly
* Fix dask-cudf test
* AppVeyor
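A hedged sketch of passing query-group sizes as an int64 array, which the change above allows; the data and group sizes are illustrative:

```python
import numpy as np
import xgboost as xgb

X = np.random.rand(8, 3)
y = np.random.randint(0, 3, size=8)

dtrain = xgb.DMatrix(X, label=y)
# Two query groups of sizes 3 and 5, provided as an int64 array.
dtrain.set_group(np.array([3, 5], dtype=np.int64))

booster = xgb.train({"objective": "rank:pairwise"}, dtrain, num_boost_round=5)
```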
* [CI] Use Vault repository to re-gain access to devtoolset-4
* Use manylinux2010 tag
* Update Dockerfile.jvm
* Fix rename_whl.py
* Upgrade Pip, to handle manylinux2010 tag
* Update insert_vcomp140.py
* Update test_python.sh
* Add inplace prediction for dask-cudf (a usage sketch follows this changeset).
* Remove Dockerfile.release, since it's not used anywhere
* Use Conda exclusively in CUDF and GPU containers
* Improve CuPy memory copying.
* Add skip marks to tests.
* Add mgpu-cudf category on the CI to run all distributed tests.
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
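A hedged sketch of in-place prediction on a dask-cudf DataFrame, as added above; the cluster configuration, data, and parameters are illustrative:

```python
import numpy as np
import cudf
import dask_cudf
import xgboost as xgb
from dask.distributed import Client
from dask_cuda import LocalCUDACluster

with LocalCUDACluster(n_workers=1) as cluster, Client(cluster) as client:
    df = cudf.DataFrame({"x0": np.random.rand(1000), "x1": np.random.rand(1000)})
    y = cudf.Series(np.random.randint(0, 2, 1000))
    ddf = dask_cudf.from_cudf(df, npartitions=4)
    dy = dask_cudf.from_cudf(y, npartitions=4)

    dtrain = xgb.dask.DaskDMatrix(client, ddf, dy)
    out = xgb.dask.train(client, {"tree_method": "gpu_hist"}, dtrain, num_boost_round=10)

    # Predict directly on the dask-cudf collection, without building another DaskDMatrix.
    preds = xgb.dask.inplace_predict(client, out["booster"], ddf)
```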
* Ensure that configured header (build_config.h) from dmlc-core is picked up by Rabit and XGBoost
* Check which Rabit target is being used
* Use CMake 3.13 in all Jenkins tests
* Upgrade CMake in Travis CI
* Install CMake using Kitware installer
* Remove existing CMake (3.12.4)
* Use devtoolset-6.
* [CI] Use devtoolset-6 because devtoolset-4 is EOL and no longer available
* CUDA 9.0 doesn't work with devtoolset-6; use devtoolset-4 for GPU build only
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
* Remove f-string, since it's not supported by Python 3.5 (#5330)
* Remove f-string, since it's not supported by Python 3.5 (an illustrative snippet follows this changeset)
* Add Python 3.5 to CI, to ensure compatibility
* Remove duplicated matplotlib
* Show deprecation notice for Python 3.5
* Fix lint
* Fix lint
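For context, the kind of rewrite the f-string removal entails: an f-string (Python 3.6+) is replaced with `str.format()` so the code still runs on Python 3.5. The snippet is illustrative:

```python
version = "1.1.0"

# f-strings require Python 3.6+ and raise a SyntaxError on 3.5:
# msg = f"XGBoost {version}"

# Equivalent form that also works on Python 3.5:
msg = "XGBoost {}".format(version)
print(msg)
```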
* Fix a unit test that mistook MINOR ver for PATCH ver
* Enforce only major version in JSON model schema
* Bump version to 1.1.0-SNAPSHOT
* Add OpenMP as CMake target
* Require CMake 3.12, to allow linking OpenMP target to objxgboost
* Specify OpenMP compiler flag for CUDA host compiler
* Require CMake 3.16+ if the OS is Mac OSX
* Use AppleClang in Mac tests.
* Update dmlc-core
This makes GPU Hist robust in a distributed environment, as some workers might not be associated with any data in either training or evaluation (a sketch of this scenario follows at the end of this changeset).
* Disable the rabit mock test for now; see #5012.
* Disable the dask-cudf prediction test for now; see #5003.
* Launch the dask job on all workers, even if some of them have no data.
* Check for 0 rows in elementwise evaluation metrics.
Using AUC and AUC-PR still throws an error; see #4663 for a robust fix.
* Add tests for edge cases.
* Add a `LaunchKernel` wrapper that handles zero-sized grids.
* Move some parts of the allreducer into a .cu file.
* Don't validate feature names when the booster is empty.
* Sync the number of columns in DMatrix, as num_feature is required to be the same across all workers in data-split mode.
* Booster filtering in the dask interface now syncs every non-empty booster by default, instead of always using rank 0.
* Fix Jenkins' GPU tests.
* Install dask-cuda from source in Jenkins' tests; now all tests are actually running.
* Restore GPU Hist tree synchronization test.
* Check the UUID of running devices.
The check is only performed on CUDA >= 10.x, as 9.x doesn't have the UUID field.
* Fix CMake policy and project variables.
Use xgboost_SOURCE_DIR uniformly and add a policy for CMake >= 3.13.
* Fix copying data to CPU
* Fix a race condition in the CPU predictor.
* Fix duplicated DMatrix construction.
* Don't download extra NCCL in the CI script.
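A hedged sketch of the empty-worker scenario the entries above make robust: training on a cluster with more workers than data partitions, so some workers hold no rows. The cluster size, chunking, and parameters are illustrative:

```python
import numpy as np
import dask.array as da
import xgboost as xgb
from dask.distributed import Client, LocalCluster

with LocalCluster(n_workers=4, threads_per_worker=1) as cluster, Client(cluster) as client:
    # A single chunk means at most one worker actually holds data;
    # the remaining workers still participate in the job.
    X = da.from_array(np.random.rand(64, 8), chunks=(64, 8))
    y = da.from_array(np.random.randint(0, 2, 64), chunks=(64,))

    dtrain = xgb.dask.DaskDMatrix(client, X, y)
    out = xgb.dask.train(client, {"objective": "binary:logistic"}, dtrain,
                         num_boost_round=5, evals=[(dtrain, "train")])
    print(out["history"])
```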