xgboost

Author	SHA1	Message	Date
Jiaming Yuan	8760ec4827	Ensure predict leaf output 1-dim vector where there's only 1 tree. (#6889 )	2021-04-23 15:07:48 +08:00
Jiaming Yuan	896aede340	Reorganize the installation documents. (#6877 ) * Split up installation and building from source. * Use consistent section titles. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2021-04-22 04:48:32 +08:00
Jiaming Yuan	74b41637de	Revert "[jvm-packages] Add `XGBOOST_RABIT_TRACKER_IP_FOR_TEST` to set rabit tracker IP. (#6869 )" (#6886 ) This reverts commit `2828da3c4c`.	2021-04-21 11:20:10 -07:00
Kai Fricke	c8cc3eacc9	[docs] Add tutorial for XGBoost-Ray (#6884 ) * Add XGBoost-Ray tutorial * Add link to modin	2021-04-22 02:07:13 +08:00
Bobby Wang	2828da3c4c	[jvm-packages] Add `XGBOOST_RABIT_TRACKER_IP_FOR_TEST` to set rabit tracker IP. (#6869 ) * Add `XGBOOST_RABIT_TRACKER_IP_FOR_TEST` to set rabit tracker IP * change spark and rabit tracker IP to 127.0.0.1on GitHub Action. Co-authored-by: fis <jm.yuan@outlook.com>	2021-04-22 02:00:22 +08:00
Jiaming Yuan	a5d7094a45	Update documents. (#6856 ) * Add early stopping section to prediction doc. * Remove best_ntree_limit. * Better doxygen output.	2021-04-16 12:41:03 +08:00
Philip Hyunsu Cho	ea7a6a0321	[CI] Pack R package tarball with pre-built xgboost.so (with GPU support) (#6827 ) * Add scripts for packaging R package with GPU-enabled libxgboost.so * [CI] Automatically build R package tarball * Add comments * Don't build tarball for pull requests * Update the installation doc	2021-04-07 21:15:34 -07:00
Jiaming Yuan	0cced530ea	[doc] Clarify prediction function. (#6813 )	2021-04-03 02:12:04 +08:00
Jiaming Yuan	bcc0277338	Re-implement ROC-AUC. (#6747 ) * Re-implement ROC-AUC. * Binary * MultiClass * LTR * Add documents. This PR resolves a few issues: - Define a value when the dataset is invalid, which can happen if there's an empty dataset, or when the dataset contains only positive or negative values. - Define ROC-AUC for multi-class classification. - Define weighted average value for distributed setting. - A correct implementation for learning to rank task. Previous implementation is just binary classification with averaging across groups, which doesn't measure ordered learning to rank.	2021-03-20 16:52:40 +08:00
Philip Hyunsu Cho	366f3cb9d8	Add use_rmm flag to global configuration (#6656 ) * Ensure RMM is 0.18 or later * Add use_rmm flag to global configuration * Modify XGBCachingDeviceAllocatorImpl to skip CUB when use_rmm=True * Update the demo * [CI] Pin NumPy to 1.19.4, since NumPy 1.19.5 doesn't work with latest Shap	2021-03-09 14:53:05 -08:00
Jiaming Yuan	9da2287ab8	[breaking] Save booster feature info in JSON, remove feature name generation. (#6605 ) * Save feature info in booster in JSON model. * [breaking] Remove automatic feature name generation in `DMatrix`. This PR is to enable reliable feature validation in Python package.	2021-02-25 18:54:16 +08:00
Jiaming Yuan	1335db6113	[dask] Improve documents. (#6687 ) * Add tag for versions. * use autoclass in sphinx build. Made some class methods to be private to avoid exporting documents.	2021-02-09 09:20:58 +08:00
Jiaming Yuan	9d62b14591	Fix document. [skip ci] (#6669 )	2021-02-02 20:43:31 +08:00
Jiaming Yuan	87ab1ad607	[dask] Accept `Future` of model for prediction. (#6650 ) This PR changes predict and inplace_predict to accept a Future of model, to avoid sending models to workers repeatably. * Document is updated to reflect functionality additions in recent changes.	2021-02-02 08:45:52 +08:00
Jiaming Yuan	d8ec7aad5a	[dask] Add a 1 line sample to infer output shape. (#6645 ) * [dask] Use a 1 line sample to infer output shape. This is for inferring shape with direct prediction (without DaskDMatrix). There are a few things that requires known output shape before carrying out actual prediction, including dask meta data, output dataframe columns. * Infer output shape based on local prediction. * Remove set param in predict function as it's not thread safe nor necessary as we now let dask to decide the parallelism. * Simplify prediction on `DaskDMatrix`.	2021-01-30 18:55:50 +08:00
Jiaming Yuan	4bf23c2391	Specify shape in prediction contrib and interaction. (#6614 )	2021-01-26 02:08:22 +08:00
Jiaming Yuan	561809200a	Fix document for tree methods. (#6633 )	2021-01-25 15:52:08 +08:00
Jiaming Yuan	f0fd7629ae	Add helper script and doc for releasing pip package. (#6613 ) * Fix `long_description_content_type`.	2021-01-21 14:46:52 +08:00
Jiaming Yuan	89a00a5866	[dask] Random forest estimators (#6602 )	2021-01-13 20:59:20 +08:00
Jiaming Yuan	2b049b32e9	Document various tree methods. (#6564 )	2021-01-02 15:40:46 +08:00
Jiaming Yuan	de8fd852a5	[dask] Add type hints. (#6519 ) * Add validate_features. * Show type hints in doc. Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-12-29 19:41:02 +08:00
Jiaming Yuan	c5876277a8	Drop saving binary format for memory snapshot. (#6513 )	2020-12-17 00:14:57 +08:00
James Lamb	1e2c3ade9e	[doc] [dask] Add example on early stopping with Dask (#6501 ) Co-authored-by: fis <jm.yuan@outlook.com>	2020-12-15 22:23:23 +08:00
James Lamb	afc4567268	[doc] [dask] fix partitioning in Dask example (#6389 )	2020-12-14 18:37:49 +08:00
Jiaming Yuan	a30461cf87	[dask] Support all parameters in regressor and classifier. (#6471 ) * Add eval_metric. * Add callback. * Add feature weights. * Add custom objective.	2020-12-14 07:35:56 +08:00
Philip Hyunsu Cho	55bdf084cb	[Doc] Document that AUC and AUCPR are for binary classification/ranking [skip ci] (#5899 )	2020-12-06 22:17:20 -08:00
Philip Hyunsu Cho	fb56da5e8b	Add global configuration (#6414 ) * Add management functions for global configuration: XGBSetGlobalConfig(), XGBGetGlobalConfig(). * Add Python interface: set_config(), get_config(), and config_context(). * Add unit tests for Python * Add R interface: xgb.set.config(), xgb.get.config() * Add unit tests for R Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2020-12-03 00:05:18 -08:00
hzy001	c2ba4fb957	Fix broken links. (#6455 ) Co-authored-by: Hao Ziyu <haoziyu@qiyi.com> Co-authored-by: fis <jm.yuan@outlook.com>	2020-12-02 17:39:12 +08:00
Jiaming Yuan	00218d065a	[dask] Update document. [skip ci] (#6413 )	2020-11-20 19:16:19 +08:00
Philip Hyunsu Cho	5cb24d0d39	Fix broken link in CLI doc (#6396 )	2020-11-14 17:58:07 -08:00
Philip Hyunsu Cho	5a33c2f3a0	[CI] Add noLD R test (#6382 ) * [CI] Add noLD test * Make noLD test only trigger with a PR comment * [CI] Don't install stringi * Add the Titanic example as a unit test * Document trigger * add to index * Clarify that it needs to be a review comment	2020-11-12 12:41:25 -08:00
Jiaming Yuan	c90f968d92	Update Python documents. (#6376 )	2020-11-12 17:51:32 +08:00
James Lamb	12d27f43ff	[doc] make Dask distributed example copy-pastable (#6345 )	2020-11-11 20:22:17 -08:00
Jean Lescut-Muller	9564886d9f	Update custom_metric_obj.rst (#6367 )	2020-11-10 22:29:22 +08:00
Jiaming Yuan	e65e3cf36e	Support shared library in system path. (#6362 )	2020-11-10 16:04:25 +08:00
Jiaming Yuan	519cee115a	Avoid resetting seed for every configuration. (#6349 )	2020-11-06 10:28:35 +08:00
Jiaming Yuan	2cc9662005	Support slicing tree model (#6302 ) This PR is meant the end the confusion around best_ntree_limit and unify model slicing. We have multi-class and random forests, asking users to understand how to set ntree_limit is difficult and error prone. * Implement the save_best option in early stopping. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-11-02 23:27:39 -08:00
Naveed Ahmed Saleem Janvekar	608bda7052	[jvm-packages] add example to handle missing value other than 0 (#5677 ) add example to handle missing value other than 0 under Dealing with missing values section	2020-10-28 17:24:35 -07:00
Jiaming Yuan	e8884c4637	Document tree method for feature weights. (#6312 )	2020-10-28 13:42:13 -07:00
DIVYA CHAUHAN	4e9c4f2d73	Create a tutorial for using the C API in a C/C++ application (#6285 ) Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-27 12:19:20 -07:00
Rory Mitchell	f0c3ff313f	Update GPUTreeShap, add docs (#6281 ) * Update GPUTreeShap, add docs * Fix test Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-27 18:22:12 +13:00
Jiaming Yuan	81c37c28d5	Time the CPU tests on Jenkins. (#6257 ) * Time the CPU tests on Jenkins. * Reduce thread contention. * Add doc. * Skip heavy tests on ARM.	2020-10-20 17:19:07 -07:00
Jiaming Yuan	bed7ae4083	Loop over `thrust::reduce`. (#6229 ) * Check input chunk size of dqdm. * Add doc for current limitation.	2020-10-14 10:40:56 +13:00
Jiaming Yuan	ab5b35134f	Rework Python callback functions. (#6199 ) * Define a new callback interface for Python. * Deprecate the old callbacks. * Enable early stopping on dask.	2020-10-10 17:52:36 +08:00
Jiaming Yuan	ddc4f20e54	Add JSON schema for categorical splits. (#6194 )	2020-10-07 17:33:31 +08:00
Igor Moura	5908598666	[Doc] Add info on GPU compiler (#6204 ) * Add note about the required compiler version for CUDA. * Also added a link that gives a short explanation on compute capability version	2020-10-06 11:35:18 +08:00
Christian Lorentzen	cf4f019ed6	[Breaking] Change default evaluation metric for classification to logloss / mlogloss (#6183 ) * Change DefaultEvalMetric of classification from error to logloss * Change default binary metric in plugin/example/custom_obj.cc * Set old error metric in python tests * Set old error metric in R tests * Fix missed eval metrics and typos in R tests * Fix setting eval_metric twice in R tests * Add warning for empty eval_metric for classification * Fix Dask tests Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-02 12:06:47 -07:00
Zeno Gantner	5b05f88ba9	Cosmetic fixes in faq.rst (#6161 )	2020-09-24 21:05:10 +08:00
Jiaming Yuan	a069a21e03	Implement intrusive ptr (#6129 ) * Use intrusive ptr for JSON.	2020-09-20 20:07:16 +08:00
neko	6bc9b9dc4f	Fix doc for CMake requirement. (#6123 )	2020-09-16 17:59:43 +08:00

1 2 3 4 5 ...

466 Commits