xgboost

Author	SHA1	Message	Date
Jiaming Yuan	871fabeee3	[doc][dask] Update notes about k8s. (#10271 )	2024-05-14 04:21:02 +08:00
Jiaming Yuan	0715ab3c10	Use `dlopen` to load NCCL. (#9796 ) This PR adds optional support for loading nccl with `dlopen` as an alternative of compile time linking. This is to address the size bloat issue with the PyPI binary release. - Add CMake option to load `nccl` at runtime. - Add an NCCL stub. After this, `nccl` will be fetched from PyPI when using pip to install XGBoost, either by a user or by `pyproject.toml`. Others who want to link the nccl at compile time can continue to do so without any change. At the moment, this is Linux only since we only support MNMG on Linux.	2023-11-22 19:27:31 +08:00
Jiaming Yuan	98238d63fa	[dask] Change document to avoid using default import. (#9742 ) This aligns dask with pyspark, users need to explicitly call: ``` from xgboost.dask import DaskXGBClassifier from xgboost import dask as dxgb ``` In future releases, we might stop using the default import and remove the lazy loader.	2023-11-07 02:44:39 +08:00
James Lamb	9dbb71490c	[Doc] fix typos in documentation (#9458 )	2023-08-10 19:26:36 +08:00
Jiaming Yuan	16eb41936d	Handle the new `device` parameter in dask and demos. (#9386 ) * Handle the new `device` parameter in dask and demos. - Check no ordinal is specified in the dask interface. - Update demos. - Update dask doc. - Update the condition for QDM.	2023-07-15 19:11:20 +08:00
Jiaming Yuan	04aff3af8e	Define the new `device` parameter. (#9362 )	2023-07-13 19:30:25 +08:00
Jiaming Yuan	39390cc2ee	[breaking] Remove the `predictor` param, allow fallback to prediction using `DMatrix`. (#9129 ) - A `DeviceOrd` struct is implemented to indicate the device. It will eventually replace the `gpu_id` parameter. - The `predictor` parameter is removed. - Fallback to `DMatrix` when `inplace_predict` is not available. - The heuristic for choosing a predictor is only used during training.	2023-07-03 19:23:54 +08:00
Jiaming Yuan	bc267dd729	Use ptr from `mmap` for `GHistIndexMatrix` and `ColumnMatrix`. (#9315 ) * Use ptr from mmap for `GHistIndexMatrix` and `ColumnMatrix`. - Define a resource for holding various types of memory pointers. - Define ref vector for holding resources. - Swap the underlying resources for GHist and ColumnM. - Add documentation for current status. - s390x support is removed. It should work if you can compile XGBoost, all the old workaround code does is to get GCC to compile.	2023-06-27 19:05:46 +08:00
Jiaming Yuan	7f20eaed93	[doc] Troubleshoot nccl shared memory. [skip ci] (#9206 )	2023-05-31 05:00:02 +08:00
Jiaming Yuan	96d3f8a6f3	[doc] Update document. (#9098 ) - Mention flink is still under construction. - Update doxygen version. - Fix warnings from doxygen about defgroup title and mismatched parameter name.	2023-04-27 19:29:03 +08:00
Sarah Charlotte Johnson	ebd64f6e22	[doc] Update Dask deployment options (#9008 )	2023-04-07 01:09:15 +08:00
Jiaming Yuan	ea04d4c46c	[doc] [dask] Troubleshooting NCCL errors. (#8943 )	2023-03-22 22:17:26 +08:00
Jiaming Yuan	95e2baf7c2	[doc] Fix typo [skip ci] (#8907 )	2023-03-15 00:55:17 +08:00
Jiaming Yuan	bbee355b45	[doc][dask] Note on reproducible result. [skip ci] (#8903 )	2023-03-13 19:30:35 +08:00
Jiaming Yuan	54e001bbf4	[doc][dask] Reference examples from coiled. [skip ci] (#8891 )	2023-03-09 20:03:24 -08:00
Jiaming Yuan	8c16da8863	[doc] Add note for rabit port. [skip ci] (#8879 )	2023-03-08 19:00:10 +08:00
Jiaming Yuan	0b36f8fba1	[R] Fix CRAN test notes. (#8428 ) - Limit the number of used CPU cores in examples. - Add a note for the constraint. - Bring back the cleanup script.	2022-11-09 02:03:30 +08:00
Thomas Stanley	ba9cc43464	Fix acronym (#8386 )	2022-10-26 06:22:30 +08:00
Jiaming Yuan	f835368bcf	Mark next release as 1.7 instead of 2.0 (#8281 )	2022-09-28 14:33:37 +08:00
Jiaming Yuan	b791446623	Initial support for IPv6 (#8225 ) - Merge rabit socket into XGBoost. - Dask interface support. - Add test to the socket.	2022-09-21 18:06:50 +08:00
Jiaming Yuan	9ae547f994	Use config_context in sklearn interface. (#8141 )	2022-08-09 14:48:54 +08:00
Jiaming Yuan	5cd1f71b51	[dask] Improve configuration for port. (#7645 ) - Try port 0 to let the OS return the available port. - Add port configuration.	2022-02-14 21:34:34 +08:00
Jiaming Yuan	ef4dae4c0e	[dask] Add scheduler address to dask config. (#7581 ) - Add user configuration. - Bring back to the logic of using scheduler address from dask. This was removed when we were trying to support GKE, now we bring it back and let xgboost try it if direct guess or host IP from user config failed.	2022-01-22 01:56:32 +08:00
Jiaming Yuan	ec56d5869b	[doc] Include dask examples into doc. (#7530 )	2022-01-05 03:27:22 +08:00
Jiaming Yuan	a512b4b394	[doc] Promote dask from experimental. [skip ci] (#7509 )	2021-12-16 14:17:06 +08:00
Jiaming Yuan	cf06a266a8	[dask][doc] Wrap the example in main guard. (#6979 )	2021-05-25 08:24:47 +08:00
Jiaming Yuan	5cb51a191e	[dask][doc] Add small example for sklearn interface. (#6970 )	2021-05-19 13:50:45 +08:00
Andrew Ziem	3e7e426b36	Fix spelling in documents (#6948 ) * Update roxygen2 doc. Co-authored-by: fis <jm.yuan@outlook.com>	2021-05-11 20:44:36 +08:00
Jiaming Yuan	a5d7094a45	Update documents. (#6856 ) * Add early stopping section to prediction doc. * Remove best_ntree_limit. * Better doxygen output.	2021-04-16 12:41:03 +08:00
Jiaming Yuan	9d62b14591	Fix document. [skip ci] (#6669 )	2021-02-02 20:43:31 +08:00
Jiaming Yuan	87ab1ad607	[dask] Accept `Future` of model for prediction. (#6650 ) This PR changes predict and inplace_predict to accept a Future of model, to avoid sending models to workers repeatably. * Document is updated to reflect functionality additions in recent changes.	2021-02-02 08:45:52 +08:00
Jiaming Yuan	d8ec7aad5a	[dask] Add a 1 line sample to infer output shape. (#6645 ) * [dask] Use a 1 line sample to infer output shape. This is for inferring shape with direct prediction (without DaskDMatrix). There are a few things that requires known output shape before carrying out actual prediction, including dask meta data, output dataframe columns. * Infer output shape based on local prediction. * Remove set param in predict function as it's not thread safe nor necessary as we now let dask to decide the parallelism. * Simplify prediction on `DaskDMatrix`.	2021-01-30 18:55:50 +08:00
Jiaming Yuan	4bf23c2391	Specify shape in prediction contrib and interaction. (#6614 )	2021-01-26 02:08:22 +08:00
James Lamb	1e2c3ade9e	[doc] [dask] Add example on early stopping with Dask (#6501 ) Co-authored-by: fis <jm.yuan@outlook.com>	2020-12-15 22:23:23 +08:00
James Lamb	afc4567268	[doc] [dask] fix partitioning in Dask example (#6389 )	2020-12-14 18:37:49 +08:00
Jiaming Yuan	a30461cf87	[dask] Support all parameters in regressor and classifier. (#6471 ) * Add eval_metric. * Add callback. * Add feature weights. * Add custom objective.	2020-12-14 07:35:56 +08:00
Jiaming Yuan	00218d065a	[dask] Update document. [skip ci] (#6413 )	2020-11-20 19:16:19 +08:00
James Lamb	12d27f43ff	[doc] make Dask distributed example copy-pastable (#6345 )	2020-11-11 20:22:17 -08:00
Jiaming Yuan	08bdb2efc8	Fix dask doc. [skip ci] (#6108 )	2020-09-11 10:56:12 +08:00
James Bourbeau	3b88bc948f	Update XGBoost + Dask overview documentation (#5961 ) * Add imports to code snippet * Better writing.	2020-07-31 09:58:50 +08:00
Jiaming Yuan	fa3715f584	[Dask] Asyncio support. (#5862 )	2020-07-30 06:23:58 +08:00
Jiaming Yuan	529b5c2cfd	[DOC] Mention dask blog post in doc. [skip ci] (#5789 )	2020-06-14 13:00:19 +08:00
Jiaming Yuan	cd7d6f7d59	[dask] Fix missing value for scikit-learn interface. (#5435 )	2020-03-20 10:56:01 -04:00
Jiaming Yuan	761a5dbdfc	[dask] Honor `nthreads` from dask worker. (#5414 )	2020-03-16 04:51:24 +08:00
Samrat Pandiri	2d76d40dfd	Update dask.rst to correct a spelling mistake (#5371 ) Change `signle-node` to `single-node`	2020-02-27 20:46:41 +08:00
Jiaming Yuan	a4b929385e	Note for `DaskDMatrix`. (#5144 ) * Brief introduction to `DaskDMatrix`. * Add xgboost.dask.train to API doc	2019-12-23 18:55:32 +08:00
Jiaming Yuan	b8433c455a	Rewrite Dask interface. (#4819 )	2019-09-25 01:30:14 -04:00

47 Commits