xgboost

Author	SHA1	Message	Date
Jiaming Yuan	228a46e8ad	Support learning rate for zero-hessian objectives. (#8866 )	2023-03-06 20:33:28 +08:00
Jiaming Yuan	6a892ce281	Specify src path for isort. (#8867 )	2023-03-06 17:30:27 +08:00
Philip Hyunsu Cho	6d8afb2218	[CI] Require C++17 + CMake 3.18; Use CUDA 11.8 in CI (#8853 ) * Update to C++17 * Turn off unity build * Update CMake to 3.18 * Use MSVC 2022 + CUDA 11.8 * Re-create stack for worker images * Allocate more disk space for Windows * Tempiorarily disable clang-tidy * RAPIDS now requires Python 3.10+ * Unpin cuda-python * Use latest NCCL * Use Ubuntu 20.04 in RMM image * Mark failing mgpu test as xfail	2023-03-01 09:22:24 -08:00
Jiaming Yuan	cce4af4acf	Initial support for quantile loss. (#8750 ) - Add support for Python. - Add objective.	2023-02-16 02:30:18 +08:00
Jiaming Yuan	e9c178f402	[doc] Document update [skip ci] (#8784 ) - Remove version specifics in cat demo. - Remove aws yarn. - Update faq. - Stop mentioning MPI. - Update sphinx inventory links. - Fix typo.	2023-02-12 04:25:22 +08:00
Jiaming Yuan	7b3d473593	[doc] Add demo for inference using individual tree. (#8752 )	2023-02-07 04:40:18 +08:00
Jiaming Yuan	c1786849e3	Use array interface for CSC matrix. (#8672 ) * Use array interface for CSC matrix. Use array interface for CSC matrix and align the interface with CSR and dense. - Fix nthread issue in the R package DMatrix. - Unify the behavior of handling `missing` with other inputs. - Unify the behavior of handling `missing` around R, Python, Java, and Scala DMatrix. - Expose `num_non_missing` to the JVM interface. - Deprecate old CSR and CSC constructors.	2023-02-05 01:59:46 +08:00
James Lamb	0d8248ddcd	[R] discourage use of regex for fixed string comparisons (#8736 )	2023-01-30 18:47:21 +08:00
Jiaming Yuan	d6018eb4b9	Remove all use of `DeviceQuantileDMatrix`. (#8665 )	2023-01-17 00:04:10 +08:00
Jiaming Yuan	badeff1d74	Init estimation for regression. (#8272 )	2023-01-11 02:04:56 +08:00
James Lamb	c7e82b5914	[R] enforce lintr checks (fixes #8012 ) (#8613 )	2022-12-25 05:02:56 +08:00
James Lamb	17ce1f26c8	[R] address some lintr warnings (#8609 )	2022-12-17 18:36:14 +08:00
James Lamb	53e6e32718	[R] resolve assignment_linter warnings (#8599 )	2022-12-17 01:22:41 +08:00
Rong Ou	0caf2be684	Update NVFlare demo to work with the latest release (#8576 )	2022-12-09 02:48:20 +08:00
James Lamb	ffee35e0f0	[R] [ci] remove dependency on {devtools} (#8563 )	2022-12-09 01:21:28 +08:00
James Lamb	fbe40d00d8	[R] resolve brace_linter warnings (#8564 )	2022-12-08 23:01:00 +08:00
Jiaming Yuan	0d3da9869c	Require isort on all Python files. (#8420 )	2022-11-08 12:59:06 +08:00
Jiaming Yuan	a408c34558	Update JSON parser demo with categorical feature. (#8401 ) - Parse categorical features in the Python example. - Add tests. - Update document.	2022-10-28 20:57:43 +08:00
Rong Ou	b3208aac4e	Fix NVFLARE demo (#8340 )	2022-10-14 12:18:34 +08:00
Jiaming Yuan	4633b476e9	[doc] Display survival demos in sphinx doc. [skip ci] (#8328 )	2022-10-13 20:51:23 +08:00
Rory Mitchell	ce0382dcb0	[CI] Refactor tests to reduce CI time. (#8312 )	2022-10-12 11:32:06 +02:00
Rong Ou	668b8a0ea4	[Breaking] Switch from rabit to the collective communicator (#8257 ) * Switch from rabit to the collective communicator * fix size_t specialization * really fix size_t * try again * add include * more include * fix lint errors * remove rabit includes * fix pylint error * return dict from communicator context * fix communicator shutdown * fix dask test * reset communicator mocklist * fix distributed tests * do not save device communicator * fix jvm gpu tests * add python test for federated communicator * Update gputreeshap submodule Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-05 14:39:01 -08:00
Jiaming Yuan	97c3a80a34	Add C document to sphinx, fix arrow. (#8300 ) - Group C API. - Add C API sphinx doc. - Consistent use of `OptionalArg` and the parameter name `config`. - Remove call to deprecated functions in demo. - Fix some formatting errors. - Add links to c examples in the document (only visible with doxygen pages) - Fix arrow.	2022-10-05 09:52:15 +08:00
Rory Mitchell	d686bf52a6	Reduce time for some multi-gpu tests (#8288 ) * Faster dask tests * Reuse AllReducer objects in tests. * Faster boost from prediction tests. * Use rmm dask fixture. * Speed up dask demo. * mypy * Format with black. * mypy * Clang-tidy Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-04 02:49:33 -08:00
Jiaming Yuan	570f8ae4ba	Use black on more Python files. (#8137 )	2022-08-11 01:38:11 +08:00
Jiaming Yuan	9ae547f994	Use config_context in sklearn interface. (#8141 )	2022-08-09 14:48:54 +08:00
Praateek Mahajan	ff471b3fab	In PySpark Estimator example use the model with validation_indicator (#8131 ) * use the validation_indicator model * use the validation_indicator model for regression	2022-08-03 13:57:41 +08:00
WeichenXu	f23cc92130	[pyspark] User guide doc and tutorials (#8082 ) Co-authored-by: Bobby Wang <wbo4958@gmail.com>	2022-07-19 22:25:14 +08:00
Jiaming Yuan	8fccc3c4ad	[dask] Fix potential error in demo. (#8079 ) * Use dask_cudf instead.	2022-07-15 18:42:29 +08:00
Rong Ou	6eb23353d7	Update nvflare demo for release 2.1.2 (#8038 )	2022-06-29 17:58:06 +08:00
Rong Ou	31e6902e43	Support GPU training in the NVFlare demo (#7965 )	2022-06-02 21:52:36 +08:00
Rong Ou	af907e2d0d	Demo of federated learning using NVFlare (#7879 ) Co-authored-by: jiamingy <jm.yuan@outlook.com>	2022-05-14 22:45:41 +08:00
Amit Bera	1823db53f2	updated winning solution under readme.md (#7862 )	2022-05-06 17:38:07 +08:00
Jiaming Yuan	52d4eda786	Deprecate `use_label_encoder` in XGBClassifier. (#7822 ) * Deprecate `use_label_encoder` in XGBClassifier. * We have removed the encoder, now prepare to remove the indicator.	2022-04-21 13:14:02 +08:00
Jiaming Yuan	bcce17e688	Remove text loading in basic walk through demo. (#7753 )	2022-04-01 00:59:42 +08:00
Jiaming Yuan	4d81c741e9	External memory support for hist (#7531 ) * Generate column matrix from gHistIndex. * Avoid synchronization with the sparse page once the cache is written. * Cleanups: Remove member variables/functions, change the update routine to look like approx and gpu_hist. * Remove pruner.	2022-03-22 00:13:20 +08:00
Jiaming Yuan	cd55823112	Demo for using custom objective with multi-target regression. (#7736 )	2022-03-20 17:44:25 +08:00
Jiaming Yuan	1d468e20a4	Optimize GPU evaluation function for categorical data. (#7705 ) * Use transform and cache.	2022-02-28 17:46:29 +08:00
Jiaming Yuan	0da7d872ef	[doc] Update for prediction. (#7648 )	2022-02-15 05:01:55 +08:00
Jiaming Yuan	0d0abe1845	Support optimal partitioning for GPU hist. (#7652 ) * Implement `MaxCategory` in quantile. * Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function. * Extract an evaluator from GPU Hist to store the needed states. * Added some CUDA stream/event utilities. * Update document with references. * Fixed a bug in approx evaluator where the number of data points is less than the number of categories.	2022-02-15 03:03:12 +08:00
Ed Shee	d152c59a9c	fixed broken link to Seldon XGBoost server (#7628 )	2022-02-05 01:03:29 +08:00
Philip Hyunsu Cho	c621775f34	Replace all uses of deprecated function sklearn.datasets.load_boston (#7373 ) * Replace all uses of deprecated function sklearn.datasets.load_boston * More renaming * Fix bad name * Update assertion * Fix n boosted rounds. * Avoid over regularization. * Rebase. * Avoid over regularization. * Whac-a-mole Co-authored-by: fis <jm.yuan@outlook.com>	2022-01-30 04:27:57 -08:00
Jiaming Yuan	b4ec1682c6	Update document for multi output and categorical. (#7574 ) * Group together categorical related parameters. * Update documents about multioutput and categorical.	2022-01-19 04:35:17 +08:00
Jiaming Yuan	001503186c	Rewrite approx (#7214 ) This PR rewrites the approx tree method to use codebase from hist for better performance and code sharing. The rewrite has many benefits: - Support for both `max_leaves` and `max_depth`. - Support for `grow_policy`. - Support for mono constraint. - Support for feature weights. - Support for easier bin configuration (`max_bin`). - Support for categorical data. - Faster performance for most of the datasets. (many times faster) - Support for prediction cache. - Significantly better performance for external memory. - Unites the code base between approx and hist.	2022-01-10 21:15:05 +08:00
Jiaming Yuan	ec56d5869b	[doc] Include dask examples into doc. (#7530 )	2022-01-05 03:27:22 +08:00
Jiaming Yuan	58a6723eb1	Initial support for multioutput regression. (#7514 ) * Add num target model parameter, which is configured from input labels. * Change elementwise metric and indexing for weights. * Add demo. * Add tests.	2021-12-18 09:28:38 +08:00
Qingyun Wu	b4a1236cfc	[doc] Update the link to the tuning example in FLAML	2021-12-17 14:31:00 +08:00
Jiaming Yuan	c024c42dce	Modernize XGBoost Python document. (#7468 ) * Use sphinx gallery to integrate examples. * Remove mock objects. * Add dask doc inventory.	2021-11-23 23:24:52 +08:00
Jiaming Yuan	e6ab594e14	Change shebang used in CLI demo. (#7389 ) Change from system Python to environment python3. For Ubuntu 20.04, only `python3` is available and there's no `python`. So at least `python3` is consistent with Python virtual env, Ubuntu and anaconda.	2021-11-02 22:11:19 +08:00
Jiaming Yuan	45aef75cca	Move skl `eval_metric` and `early_stopping rounds` to model params. (#6751 ) A new parameter `custom_metric` is added to `train` and `cv` to distinguish the behaviour from the old `feval`. And `feval` is deprecated. The new `custom_metric` receives transformed prediction when the built-in objective is used. This enables XGBoost to use cost functions from other libraries like scikit-learn directly without going through the definition of the link function. `eval_metric` and `early_stopping_rounds` in sklearn interface are moved from `fit` to `__init__` and is now saved as part of the scikit-learn model. The old ones in `fit` function are now deprecated. The new `eval_metric` in `__init__` has the same new behaviour as `custom_metric`. Added more detailed documents for the behaviour of custom objective and metric.	2021-10-28 17:20:20 +08:00

1 2 3 4 5 ...

408 Commits