xgboost

Author	SHA1	Message	Date
Philip Hyunsu Cho	366f3cb9d8	Add use_rmm flag to global configuration (#6656 ) * Ensure RMM is 0.18 or later * Add use_rmm flag to global configuration * Modify XGBCachingDeviceAllocatorImpl to skip CUB when use_rmm=True * Update the demo * [CI] Pin NumPy to 1.19.4, since NumPy 1.19.5 doesn't work with latest Shap	2021-03-09 14:53:05 -08:00
MBSMachineLearning	95cbfad990	"featue_map" typo changed to "feature_map" (#6540 )	2020-12-21 22:11:11 +08:00
Philip Hyunsu Cho	cd0821500c	Add Saturn Cloud Dask XGBoost tutorial to Awesome XGBoost [skip ci] (#6532 )	2020-12-19 15:57:05 -08:00
hzy001	c2ba4fb957	Fix broken links. (#6455 ) Co-authored-by: Hao Ziyu <haoziyu@qiyi.com> Co-authored-by: fis <jm.yuan@outlook.com>	2020-12-02 17:39:12 +08:00
Jiaming Yuan	f4ff1c53fd	Fix CLI ranking demo. (#6439 ) Save model at final round.	2020-11-29 03:12:06 +08:00
Jiaming Yuan	c90f968d92	Update Python documents. (#6376 )	2020-11-12 17:51:32 +08:00
Jiaming Yuan	dfac5f89e9	Group CLI demo into subdirectory. (#6258 ) CLI is not most developed interface. Putting them into correct directory can help new users to avoid it as most of the use cases are from a language binding.	2020-10-28 14:40:44 -07:00
Rory Mitchell	f0c3ff313f	Update GPUTreeShap, add docs (#6281 ) * Update GPUTreeShap, add docs * Fix test Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-27 18:22:12 +13:00
Jiaming Yuan	81c37c28d5	Time the CPU tests on Jenkins. (#6257 ) * Time the CPU tests on Jenkins. * Reduce thread contention. * Add doc. * Skip heavy tests on ARM.	2020-10-20 17:19:07 -07:00
Manikya Bardhan	549f361b71	Updated winning solutions list (#6254 )	2020-10-19 04:06:48 +08:00
Wittty-Panda	0fc263ead5	Update the list of winning solutions (#6222 )	2020-10-13 20:05:12 +08:00
Jiaming Yuan	ab5b35134f	Rework Python callback functions. (#6199 ) * Define a new callback interface for Python. * Deprecate the old callbacks. * Enable early stopping on dask.	2020-10-10 17:52:36 +08:00
DIVYA CHAUHAN	750bd0ae9a	Update the list of winning solutions using XGBoost (#6192 ) Co-authored-by: divya <divyachauhan661@gmail.com> Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-03 13:39:58 -07:00
Christian Lorentzen	cf4f019ed6	[Breaking] Change default evaluation metric for classification to logloss / mlogloss (#6183 ) * Change DefaultEvalMetric of classification from error to logloss * Change default binary metric in plugin/example/custom_obj.cc * Set old error metric in python tests * Set old error metric in R tests * Fix missed eval metrics and typos in R tests * Fix setting eval_metric twice in R tests * Add warning for empty eval_metric for classification * Fix Dask tests Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-02 12:06:47 -07:00
John Quitto-Graham	e0e4f15d0e	Fix a comment in demo to use correct reference (#6190 ) Co-authored-by: John Quitto Graham <johnq@dgx07.aselab.nvidia.com>	2020-10-01 13:16:04 -07:00
lacrosse91	6bc41df2fe	[Doc] Add list of winning solutions in data science competitions using XGBoost (#6177 )	2020-09-30 14:41:29 -07:00
Alexander Gugel	03b8fdec74	Add DMatrix usage examples to c-api-demo (#5854 ) * Add DMatrix usage examples to c-api-demo * Add XGDMatrixCreateFromCSREx example * Add XGDMatrixCreateFromCSCEx example	2020-09-26 02:10:12 -07:00
Philip Hyunsu Cho	2c4dedb7a0	[CI] Test C API demo (#6159 ) * Fix CMake install config to use dependencies * [CI] Test C API demo * Explicitly cast num_feature, to avoid warning in Linux	2020-09-25 14:49:01 -07:00
Jiaming Yuan	78d72ef936	Add DaskDeviceQuantileDMatrix demo. (#6156 )	2020-09-24 14:08:28 +08:00
Jiaming Yuan	b5f52f0b1b	Validate weights are positive values. (#6115 )	2020-09-15 09:03:55 +08:00
Jiaming Yuan	e5d40b39cd	[Breaking] Don't save leaf child count in JSON. (#6094 ) The field is deprecated and not used anywhere in XGBoost.	2020-09-08 11:11:13 +08:00
Daniel Steinberg	68c55a37d9	Add cache name back to external_memory.py files. (#6088 )	2020-09-06 16:01:09 +08:00
Jiaming Yuan	4d99c58a5f	Feature weights (#5962 )	2020-08-18 19:55:41 +08:00
Philip Hyunsu Cho	511bb22ffd	[Doc] Add dtreeviz as a showcase example of integration with 3rd-party software (#6013 )	2020-08-13 20:53:59 -07:00
Philip Hyunsu Cho	9adb812a0a	RMM integration plugin (#5873 ) * [CI] Add RMM as an optional dependency * Replace caching allocator with pool allocator from RMM * Revert "Replace caching allocator with pool allocator from RMM" This reverts commit e15845d4e72e890c2babe31a988b26503a7d9038. * Use rmm::mr::get_default_resource() * Try setting default resource (doesn't work yet) * Allocate pool_mr in the heap * Prevent leaking pool_mr handle * Separate EXPECT_DEATH() in separate test suite suffixed DeathTest * Turn off death tests for RMM * Address reviewer's feedback * Prevent leaking of cuda_mr * Fix Jenkinsfile syntax * Remove unnecessary function in Jenkinsfile * [CI] Install NCCL into RMM container * Run Python tests * Try building with RMM, CUDA 10.0 * Do not use RMM for CUDA 10.0 target * Actually test for test_rmm flag * Fix TestPythonGPU * Use CNMeM allocator, since pool allocator doesn't yet support multiGPU * Use 10.0 container to build RMM-enabled XGBoost * Revert "Use 10.0 container to build RMM-enabled XGBoost" This reverts commit 789021fa31112e25b683aef39fff375403060141. * Fix Jenkinsfile * [CI] Assign larger /dev/shm to NCCL * Use 10.2 artifact to run multi-GPU Python tests * Add CUDA 10.0 -> 11.0 cross-version test; remove CUDA 10.0 target * Rename Conda env rmm_test -> gpu_test * Use env var to opt into CNMeM pool for C++ tests * Use identical CUDA version for RMM builds and tests * Use Pytest fixtures to enable RMM pool in Python tests * Move RMM to plugin/CMakeLists.txt; use PLUGIN_RMM * Use per-device MR; use command arg in gtest * Set CMake prefix path to use Conda env * Use 0.15 nightly version of RMM * Remove unnecessary header * Fix a unit test when cudf is missing * Add RMM demos * Remove print() * Use HostDeviceVector in GPU predictor * Simplify pytest setup; use LocalCUDACluster fixture * Address reviewers' commments Co-authored-by: Hyunsu Cho <chohyu01@cs.wasshington.edu>	2020-08-12 01:26:02 -07:00
Jiaming Yuan	9c93531709	Update Python custom objective demo. (#5981 )	2020-08-05 12:27:19 +08:00
Jiaming Yuan	18349a7ccf	[Breaking] Fix custom metric for multi output. (#5954 ) * Set output margin to true for custom metric. This fixes only R and Python.	2020-07-29 19:25:27 +08:00
Jiaming Yuan	75b8c22b0b	Fix prediction heuristic (#5955 ) * Relax check for prediction. * Relax test in spark test. * Add tests in C++.	2020-07-29 19:24:07 +08:00
Alexander Gugel	970b4b3fa2	Add XGBoosterGetNumFeature (#5856 ) - add GetNumFeature to Learner - add XGBoosterGetNumFeature to C API - update c-api-demo accordingly	2020-07-13 23:25:17 -07:00
Jiaming Yuan	a3ec964346	Accept iterator in device dmatrix. (#5783 ) * Remove Device DMatrix.	2020-07-07 21:44:48 +08:00
Alexander Gugel	0f17e35bce	Add c-api-demo to .gitignore (#5855 )	2020-07-05 04:35:22 +08:00
James Lamb	c35be9dc40	[R] replace uses of T and F with TRUE and FALSE (#5778 ) * [R-package] replace uses of T and F with TRUE and FALSE * enable linting * Remove skip Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-06-11 06:08:02 -04:00
Jiaming Yuan	5af8161a1a	Implement Python data handler. (#5689 ) * Define data handlers for DMatrix. * Throw ValueError in scikit learn interface.	2020-05-22 11:53:55 +08:00
Jiaming Yuan	7903286961	Remove silent from R demos. (#5675 ) * Remove silent from R demos. * Vignettes.	2020-05-19 18:20:46 +08:00
Jiaming Yuan	2c1a439869	Update Python demos with tests. (#5651 ) * Remove GPU memory usage demo. * Add tests for demos. * Remove `silent`. * Remove shebang as it's not portable.	2020-05-12 12:04:42 +08:00
Jiaming Yuan	9c1103e06c	[Breaking] Set output margin to True for custom objective. (#5564 ) * Set output margin to True for custom objective in Python and R. * Add a demo for writing multi-class custom objective function. * Run tests on selected demos.	2020-04-20 20:44:12 +08:00
Kamil A. Kaczmarek	2809fb8b6f	Add Neptune and Optuna to list of examples (#5528 )	2020-04-14 11:00:50 -07:00
Nicolas Scozzaro	04f69b43e6	fix typo "customized" (#5515 )	2020-04-12 14:43:48 +08:00
Jiaming Yuan	a3db79df22	Remove makefiles. (#5513 )	2020-04-11 13:25:53 +08:00
Philip Hyunsu Cho	5fc5ec539d	Implement robust regularization in 'survival:aft' objective (#5473 ) * Robust regularization of AFT gradient and hessian * Fix AFT doc; expose it to tutorial TOC * Apply robust regularization to uncensored case too * Revise unit test slightly * Fix lint * Update test_survival.py * Use GradientPairPrecise * Remove unused variables	2020-04-04 12:21:24 -07:00
Avinash Barnwal	dcf439932a	Add Accelerated Failure Time loss for survival analysis task (#4763 ) * [WIP] Add lower and upper bounds on the label for survival analysis * Update test MetaInfo.SaveLoadBinary to account for extra two fields * Don't clear qids_ for version 2 of MetaInfo * Add SetInfo() and GetInfo() method for lower and upper bounds * changes to aft * Add parameter class for AFT; use enum's to represent distribution and event type * Add AFT metric * changes to neg grad to grad * changes to binomial loss * changes to overflow * changes to eps * changes to code refactoring * changes to code refactoring * changes to code refactoring * Re-factor survival analysis * Remove aft namespace * Move function bodies out of AFTNormal and AFTLogistic, to reduce clutter * Move function bodies out of AFTLoss, to reduce clutter * Use smart pointer to store AFTDistribution and AFTLoss * Rename AFTNoiseDistribution enum to AFTDistributionType for clarity The enum class was not a distribution itself but a distribution type * Add AFTDistribution::Create() method for convenience * changes to extreme distribution * changes to extreme distribution * changes to extreme * changes to extreme distribution * changes to left censored * deleted cout * changes to x,mu and sd and code refactoring * changes to print * changes to hessian formula in censored and uncensored * changes to variable names and pow * changes to Logistic Pdf * changes to parameter * Expose lower and upper bound labels to R package * Use example weights; normalize log likelihood metric * changes to CHECK * changes to logistic hessian to standard formula * changes to logistic formula * Comply with coding style guideline * Revert back Rabit submodule * Revert dmlc-core submodule * Comply with coding style guideline (clang-tidy) * Fix an error in AFTLoss::Gradient() * Add missing files to amalgamation * Address @RAMitchell's comment: minimize future change in MetaInfo interface * Fix lint * Fix compilation error on 32-bit target, when size_t == bst_uint * Allocate sufficient memory to hold extra label info * Use OpenMP to speed up * Fix compilation on Windows * Address reviewer's feedback * Add unit tests for probability distributions * Make Metric subclass of Configurable * Address reviewer's feedback: Configure() AFT metric * Add a dummy test for AFT metric configuration * Complete AFT configuration test; remove debugging print * Rename AFT parameters * Clarify test comment * Add a dummy test for AFT loss for uncensored case * Fix a bug in AFT loss for uncensored labels * Complete unit test for AFT loss metric * Simplify unit tests for AFT metric * Add unit test to verify aggregate output from AFT metric * Use EXPECT_* instead of ASSERT_, so that we run all unit tests Use aft_loss_param when serializing AFTObj This is to be consistent with AFT metric * Add unit tests for AFT Objective * Fix OpenMP bug; clarify semantics for shared variables used in OpenMP loops * Add comments * Remove AFT prefix from probability distribution; put probability distribution in separate source file * Add comments * Define kPI and kEulerMascheroni in probability_distribution.h * Add probability_distribution.cc to amalgamation * Remove unnecessary diff * Address reviewer's feedback: define variables where they're used * Eliminate all INFs and NANs from AFT loss and gradient * Add demo * Add tutorial * Fix lint * Use 'survival:aft' to be consistent with 'survival:cox' * Move sample data to demo/data * Add visual demo with 1D toy data * Add Python tests Co-authored-by: Philip Cho <chohyu01@cs.washington.edu>	2020-03-25 13:52:51 -07:00
Jiaming Yuan	761a5dbdfc	[dask] Honor `nthreads` from dask worker. (#5414 )	2020-03-16 04:51:24 +08:00
Bart Broere	a931589c96	Fix typo (#5399 )	2020-03-09 19:41:39 +08:00
mattn	ff1342b252	Fix compilation error (#5215 )	2020-01-18 23:51:07 +08:00
Philip Hyunsu Cho	9b0af6e882	Enable OpenMP with Apple Clang (Mac default compiler) (#5146 ) * Add OpenMP as CMake target * Require CMake 3.12, to allow linking OpenMP target to objxgboost * Specify OpenMP compiler flag for CUDA host compiler * Require CMake 3.16+ if the OS is Mac OSX * Use AppleClang in Mac tests. * Update dmlc-core	2019-12-26 16:53:12 +08:00
Jiaming Yuan	73b1bd2789	Update demo for ranking. (#5154 )	2019-12-24 13:39:07 +08:00
Jiaming Yuan	0202e04a8e	Add base margin to sklearn interface. (#5151 )	2019-12-24 09:43:41 +08:00
Jiaming Yuan	1d0ca49761	Example JSON model parser and Schema. (#5137 )	2019-12-23 19:47:35 +08:00
Rory Mitchell	e67388fb8f	Some guidelines on device memory usage (#5038 ) * Add memory usage demo * Update documentation	2019-11-17 07:48:24 +13:00
Jiaming Yuan	7e72a12871	Don't `set_params` at the end of `set_state`. (#4947 ) * Don't set_params at the end of set_state. * Also fix another issue found in dask prediction. * Add note about prediction. Don't support other prediction modes at the moment.	2019-10-15 10:08:26 -04:00

1 2 3 4 5 ...

439 Commits