xgboost

Author	SHA1	Message	Date
Jiaming Yuan	ba50e6eb62	[backport] [CI] Require C++17 + CMake 3.18; Use CUDA 11.8 in CI (#8853 ) (#8971 ) Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2023-03-26 00:10:03 +08:00
Rong Ou	b3208aac4e	Fix NVFLARE demo (#8340 )	2022-10-14 12:18:34 +08:00
Jiaming Yuan	4633b476e9	[doc] Display survival demos in sphinx doc. [skip ci] (#8328 )	2022-10-13 20:51:23 +08:00
Rory Mitchell	ce0382dcb0	[CI] Refactor tests to reduce CI time. (#8312 )	2022-10-12 11:32:06 +02:00
Rong Ou	668b8a0ea4	[Breaking] Switch from rabit to the collective communicator (#8257 ) * Switch from rabit to the collective communicator * fix size_t specialization * really fix size_t * try again * add include * more include * fix lint errors * remove rabit includes * fix pylint error * return dict from communicator context * fix communicator shutdown * fix dask test * reset communicator mocklist * fix distributed tests * do not save device communicator * fix jvm gpu tests * add python test for federated communicator * Update gputreeshap submodule Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-05 14:39:01 -08:00
Jiaming Yuan	97c3a80a34	Add C document to sphinx, fix arrow. (#8300 ) - Group C API. - Add C API sphinx doc. - Consistent use of `OptionalArg` and the parameter name `config`. - Remove call to deprecated functions in demo. - Fix some formatting errors. - Add links to c examples in the document (only visible with doxygen pages) - Fix arrow.	2022-10-05 09:52:15 +08:00
Rory Mitchell	d686bf52a6	Reduce time for some multi-gpu tests (#8288 ) * Faster dask tests * Reuse AllReducer objects in tests. * Faster boost from prediction tests. * Use rmm dask fixture. * Speed up dask demo. * mypy * Format with black. * mypy * Clang-tidy Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-04 02:49:33 -08:00
Jiaming Yuan	570f8ae4ba	Use black on more Python files. (#8137 )	2022-08-11 01:38:11 +08:00
Jiaming Yuan	9ae547f994	Use config_context in sklearn interface. (#8141 )	2022-08-09 14:48:54 +08:00
Praateek Mahajan	ff471b3fab	In PySpark Estimator example use the model with validation_indicator (#8131 ) * use the validation_indicator model * use the validation_indicator model for regression	2022-08-03 13:57:41 +08:00
WeichenXu	f23cc92130	[pyspark] User guide doc and tutorials (#8082 ) Co-authored-by: Bobby Wang <wbo4958@gmail.com>	2022-07-19 22:25:14 +08:00
Jiaming Yuan	8fccc3c4ad	[dask] Fix potential error in demo. (#8079 ) * Use dask_cudf instead.	2022-07-15 18:42:29 +08:00
Rong Ou	6eb23353d7	Update nvflare demo for release 2.1.2 (#8038 )	2022-06-29 17:58:06 +08:00
Rong Ou	31e6902e43	Support GPU training in the NVFlare demo (#7965 )	2022-06-02 21:52:36 +08:00
Rong Ou	af907e2d0d	Demo of federated learning using NVFlare (#7879 ) Co-authored-by: jiamingy <jm.yuan@outlook.com>	2022-05-14 22:45:41 +08:00
Amit Bera	1823db53f2	updated winning solution under readme.md (#7862 )	2022-05-06 17:38:07 +08:00
Jiaming Yuan	52d4eda786	Deprecate `use_label_encoder` in XGBClassifier. (#7822 ) * Deprecate `use_label_encoder` in XGBClassifier. * We have removed the encoder, now prepare to remove the indicator.	2022-04-21 13:14:02 +08:00
Jiaming Yuan	bcce17e688	Remove text loading in basic walk through demo. (#7753 )	2022-04-01 00:59:42 +08:00
Jiaming Yuan	4d81c741e9	External memory support for hist (#7531 ) * Generate column matrix from gHistIndex. * Avoid synchronization with the sparse page once the cache is written. * Cleanups: Remove member variables/functions, change the update routine to look like approx and gpu_hist. * Remove pruner.	2022-03-22 00:13:20 +08:00
Jiaming Yuan	cd55823112	Demo for using custom objective with multi-target regression. (#7736 )	2022-03-20 17:44:25 +08:00
Jiaming Yuan	1d468e20a4	Optimize GPU evaluation function for categorical data. (#7705 ) * Use transform and cache.	2022-02-28 17:46:29 +08:00
Jiaming Yuan	0da7d872ef	[doc] Update for prediction. (#7648 )	2022-02-15 05:01:55 +08:00
Jiaming Yuan	0d0abe1845	Support optimal partitioning for GPU hist. (#7652 ) * Implement `MaxCategory` in quantile. * Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function. * Extract an evaluator from GPU Hist to store the needed states. * Added some CUDA stream/event utilities. * Update document with references. * Fixed a bug in approx evaluator where the number of data points is less than the number of categories.	2022-02-15 03:03:12 +08:00
Ed Shee	d152c59a9c	fixed broken link to Seldon XGBoost server (#7628 )	2022-02-05 01:03:29 +08:00
Philip Hyunsu Cho	c621775f34	Replace all uses of deprecated function sklearn.datasets.load_boston (#7373 ) * Replace all uses of deprecated function sklearn.datasets.load_boston * More renaming * Fix bad name * Update assertion * Fix n boosted rounds. * Avoid over regularization. * Rebase. * Avoid over regularization. * Whac-a-mole Co-authored-by: fis <jm.yuan@outlook.com>	2022-01-30 04:27:57 -08:00
Jiaming Yuan	b4ec1682c6	Update document for multi output and categorical. (#7574 ) * Group together categorical related parameters. * Update documents about multioutput and categorical.	2022-01-19 04:35:17 +08:00
Jiaming Yuan	001503186c	Rewrite approx (#7214 ) This PR rewrites the approx tree method to use codebase from hist for better performance and code sharing. The rewrite has many benefits: - Support for both `max_leaves` and `max_depth`. - Support for `grow_policy`. - Support for mono constraint. - Support for feature weights. - Support for easier bin configuration (`max_bin`). - Support for categorical data. - Faster performance for most of the datasets. (many times faster) - Support for prediction cache. - Significantly better performance for external memory. - Unites the code base between approx and hist.	2022-01-10 21:15:05 +08:00
Jiaming Yuan	ec56d5869b	[doc] Include dask examples into doc. (#7530 )	2022-01-05 03:27:22 +08:00
Jiaming Yuan	58a6723eb1	Initial support for multioutput regression. (#7514 ) * Add num target model parameter, which is configured from input labels. * Change elementwise metric and indexing for weights. * Add demo. * Add tests.	2021-12-18 09:28:38 +08:00
Qingyun Wu	b4a1236cfc	[doc] Update the link to the tuning example in FLAML	2021-12-17 14:31:00 +08:00
Jiaming Yuan	c024c42dce	Modernize XGBoost Python document. (#7468 ) * Use sphinx gallery to integrate examples. * Remove mock objects. * Add dask doc inventory.	2021-11-23 23:24:52 +08:00
Jiaming Yuan	e6ab594e14	Change shebang used in CLI demo. (#7389 ) Change from system Python to environment python3. For Ubuntu 20.04, only `python3` is available and there's no `python`. So at least `python3` is consistent with Python virtual env, Ubuntu and anaconda.	2021-11-02 22:11:19 +08:00
Jiaming Yuan	45aef75cca	Move skl `eval_metric` and `early_stopping rounds` to model params. (#6751 ) A new parameter `custom_metric` is added to `train` and `cv` to distinguish the behaviour from the old `feval`. And `feval` is deprecated. The new `custom_metric` receives transformed prediction when the built-in objective is used. This enables XGBoost to use cost functions from other libraries like scikit-learn directly without going through the definition of the link function. `eval_metric` and `early_stopping_rounds` in sklearn interface are moved from `fit` to `__init__` and is now saved as part of the scikit-learn model. The old ones in `fit` function are now deprecated. The new `eval_metric` in `__init__` has the same new behaviour as `custom_metric`. Added more detailed documents for the behaviour of custom objective and metric.	2021-10-28 17:20:20 +08:00
Jiaming Yuan	2eee87423c	Remove old custom objective demo. (#7369 ) We have 2 new custom objective demos covering both regression and classification with accompanying tutorials in documents.	2021-10-27 16:31:48 +08:00
Jiaming Yuan	15685996fc	[doc] Small improvements for categorical data document. (#7330 )	2021-10-20 18:04:32 +08:00
Jiaming Yuan	6cdcfe8128	Improve external memory demo. (#7320 ) * Use npy format. * Add evaluation. * Use make_regression.	2021-10-17 11:25:24 +08:00
Jiaming Yuan	0bd8f21e4e	Add document for categorical data. (#7307 )	2021-10-12 16:10:59 +08:00
Jiaming Yuan	0ed979b096	Support more input types for categorical data. (#7220 ) * Support more input types for categorical data. * Shorten the type name from "categorical" to "c". * Tests for np/cp array and scipy csr/csc/coo. * Specify the type for feature info.	2021-09-16 20:39:30 +08:00
Jiaming Yuan	d997c967d5	Demo for experimental categorical data support. (#7213 )	2021-09-15 08:20:12 +08:00
Jiaming Yuan	68a2c7b8d6	Fix memory leak in demo. (#7216 )	2021-09-09 13:51:03 +08:00
Jiaming Yuan	7bdedacb54	Document for `process_type`. (#7135 ) * Update document for prune and refresh. * Add demo.	2021-08-03 13:11:52 +08:00
Jiaming Yuan	36346f8f56	C API demo for inference. (#7151 )	2021-08-03 00:46:47 +08:00
Jiaming Yuan	778135f657	Fix parameter loading with training continuation. (#7121 ) * Add a demo for training continuation.	2021-07-23 10:51:47 +08:00
Jiaming Yuan	e6088366df	Export Python Interface for external memory. (#7070 ) * Add Python iterator interface. * Add tests. * Add demo. * Add documents. * Handle empty dataset.	2021-07-22 15:15:53 +08:00
Jiaming Yuan	345796825f	Optional find dependency in installed cmake config. (#7099 ) * Find dependency only when xgboost is built as static library. * Resolve msvc warning. * Add test for linking shared library.	2021-07-11 17:20:55 +08:00
Jiaming Yuan	663136aa08	Implement feature score for linear model. (#7048 ) * Add feature score support for linear model. * Port R interface to the new implementation. * Add linear model support in Python. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2021-06-25 14:34:02 +08:00
Philip Hyunsu Cho	655e6992f6	[Dask] Add example of using custom callback in Dask (#6995 )	2021-06-03 07:05:55 +08:00
Andrew Ziem	3e7e426b36	Fix spelling in documents (#6948 ) * Update roxygen2 doc. Co-authored-by: fis <jm.yuan@outlook.com>	2021-05-11 20:44:36 +08:00
Philip Hyunsu Cho	4224c08cac	Add demo for using AFT survival with Dask (#6853 )	2021-04-13 16:18:33 -07:00
Jiaming Yuan	ca998df912	Clarify the behavior of `use_rmm`. (#6808 ) * Clarify the `use_rmm` flag in document and demo.	2021-03-31 15:43:11 +08:00

1 2 3 4 5 ...

391 Commits