xgboost

Author	SHA1	Message	Date
Jiaming Yuan	8189126d51	Add CUDA iterator to tensor view. (#10074 )	2024-03-01 14:15:31 +08:00
Jiaming Yuan	5ac233280e	Require context in aggregators. (#10075 )	2024-02-28 03:12:42 +08:00
Jiaming Yuan	2e4ea5ecc0	Support f64 for ubjson. (#10055 )	2024-02-21 02:18:42 +08:00
david-cortes	a730c7e67e	[R] allow using seed with regular RNG (#10029 )	2024-02-04 16:22:22 +08:00
Jiaming Yuan	cacb4b1fdd	Fix gain calculation in multi-target tree. (#9978 )	2024-01-17 13:18:44 +08:00
rpopescu	73b3955dd4	Fix FieldEntry ctor specialisation syntax error (#9980 ) Co-authored-by: Radu Popescu <radu.popescu@aptportfolio.com>	2024-01-11 12:10:21 +08:00
Jiaming Yuan	c03a4d5088	Check support status for categorical features. (#9946 )	2024-01-04 16:51:33 +08:00
david-cortes	252e018275	correct name of function in header (#9905 )	2023-12-20 10:52:00 +08:00
Dmitry Razdoburdin	43897b8296	Sycl implementation for objective functions (#9846 ) --------- Co-authored-by: Dmitry Razdoburdin <>	2023-12-12 14:41:50 +08:00
Jiaming Yuan	faf0f2df10	Support dataframe data format in native XGBoost. (#9828 ) - Implement a columnar adapter. - Refactor Python pandas handling code to avoid converting into a single numpy array. - Add support in R for transforming columns. - Support R data.frame and factor type.	2023-12-12 09:56:31 +08:00
Jiaming Yuan	39c637ee19	Use array interface in Python prediction return. (#9855 )	2023-12-08 03:42:14 +08:00
Dmitry Razdoburdin	381f1d3dc9	Add support inference on SYCL devices (#9800 ) --------- Co-authored-by: Dmitry Razdoburdin <> Co-authored-by: Nikolay Petrov <nikolay.a.petrov@intel.com> Co-authored-by: Alexandra <alexandra.epanchinzeva@intel.com>	2023-12-04 16:15:57 +08:00
Jiaming Yuan	0715ab3c10	Use `dlopen` to load NCCL. (#9796 ) This PR adds optional support for loading nccl with `dlopen` as an alternative of compile time linking. This is to address the size bloat issue with the PyPI binary release. - Add CMake option to load `nccl` at runtime. - Add an NCCL stub. After this, `nccl` will be fetched from PyPI when using pip to install XGBoost, either by a user or by `pyproject.toml`. Others who want to link the nccl at compile time can continue to do so without any change. At the moment, this is Linux only since we only support MNMG on Linux.	2023-11-22 19:27:31 +08:00
Jiaming Yuan	ada377c57e	[coll] Reduce the scope of lock in the event loop. (#9784 )	2023-11-15 14:16:19 +08:00
Jiaming Yuan	44099f585d	[coll] Add C API for the tracker. (#9773 )	2023-11-08 18:17:14 +08:00
Jiaming Yuan	06bdc15e9b	[coll] Pass context to various functions. (#9772 ) * [coll] Pass context to various functions. In the future, the `Context` object would be required for collective operations, this PR passes the context object to some required functions to prepare for swapping out the implementation.	2023-11-08 09:54:05 +08:00
Dmitry Razdoburdin	f41a08fda8	Add 'sycl' devices to the context (#9691 ) Co-authored-by: Dmitry Razdoburdin <>	2023-10-26 22:17:56 +08:00
Jiaming Yuan	7a02facc9d	Serialize expand entry for allgather. (#9702 )	2023-10-24 14:33:28 +08:00
Dmitry Razdoburdin	ea9f09716b	Reorder if-else statements to allow using of cpu branches for sycl-devices (#9682 )	2023-10-18 10:55:33 +08:00
Rong Ou	da6803b75b	Support column-wise data split with in-memory inputs (#9628 ) --------- Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2023-10-17 12:16:39 +08:00
Jiaming Yuan	946ae1c440	[coll] Implement a new tracker and a communicator. (#9650 ) * [coll] Implement a new tracker and a communicator. The new tracker and communicators communicate through the use of JSON documents. Along with which, communicators are aware of each other.	2023-10-12 12:49:16 +08:00
Rong Ou	0ecb4de963	[breaking] Change DMatrix construction to be distributed (#9623 ) * Change column-split DMatrix construction to be distributed * remove splitting code for row split	2023-10-10 23:35:57 +08:00
Jiaming Yuan	b14e535e78	[Coll] Implement get host address in libxgboost. (#9644 ) - Port `xgboost.tracker.get_host_ip` in C++.	2023-10-10 10:01:14 +08:00
Jiaming Yuan	680d53db43	Extract JSON utils. (#9645 )	2023-10-10 07:15:14 +08:00
Jiaming Yuan	4d7a187cb0	Remove `XGBoosterGetModelRaw`. (#9617 ) Deprecated in 1.6.	2023-09-29 02:29:33 +08:00
Jiaming Yuan	60526100e3	Support arrow through pandas ext types. (#9612 ) - Use pandas extension type for pyarrow support. - Additional support for QDM. - Additional support for inplace_predict.	2023-09-28 17:00:16 +08:00
Jiaming Yuan	c75a3bc0a9	[breaking] [jvm-packages] Remove rabit check point. (#9599 ) - Add `numBoostedRound` to jvm packages - Remove rabit checkpoint version. - Change the starting version of training continuation in JVM [breaking]. - Redefine the checkpoint version policy in jvm package. [breaking] - Rename the Python check point callback parameter. [breaking] - Unifies the checkpoint policy between Python and JVM.	2023-09-26 18:06:34 +08:00
Jiaming Yuan	8c676c889d	Remove internal use of gpu_id. (#9568 )	2023-09-20 23:29:51 +08:00
Jiaming Yuan	b438d684d2	Utilities and cleanups for socket. (#9576 ) - Use c++-17 nodiscard and nested ns. - Add bind method to socket. - Remove rabit parameters.	2023-09-14 01:41:42 +08:00
Jiaming Yuan	ccfc90e4c6	[rabit] Improved connection handling. (#9531 ) - Enable timeout. - Report connection error from the system. - Handle retry for both tracker connection and peer connection.	2023-08-30 13:00:04 +08:00
Jiaming Yuan	ddf2e68821	Use the new `DeviceOrd` in the linalg module. (#9527 )	2023-08-29 13:37:29 +08:00
Jiaming Yuan	be6a552956	[R] Support multi-class custom objective. (#9526 )	2023-08-29 08:27:13 +08:00
Jiaming Yuan	972730cde0	Use matrix for gradient. (#9508 ) - Use the `linalg::Matrix` for storing gradients. - New API for the custom objective. - Custom objective for multi-class/multi-target is now required to return the correct shape. - Custom objective for Python can accept arrays with any strides. (row-major, column-major)	2023-08-24 05:29:52 +08:00
Jiaming Yuan	3c09399f29	Fix device dispatch for linear updater. (#9507 )	2023-08-23 00:17:35 +08:00
Jiaming Yuan	58530b1bc4	Bump version to 2.1. (#9498 )	2023-08-18 01:04:04 +08:00
Jiaming Yuan	f05294a6f2	Fix clang warnings. (#9447 ) - static function in header. (which is marked as unused due to translation unit visibility). - Implicit copy operator is deprecated. - Unused lambda capture. - Moving a temporary variable prevents copy elision.	2023-08-09 15:34:45 +08:00
Philip Hyunsu Cho	7ce090e775	Handle UTF-8 paths correctly on Windows platform (#9443 ) * Fix round-trip serialization with UTF-8 paths * Add compiler version check * Add comment to C API functions * Add Python tests * [CI] Updatre MacOS deployment target * Use std::filesystem instead of dmlc::TemporaryDirectory	2023-08-07 23:27:25 -07:00
Jiaming Yuan	54029a59af	Bound the size of the histogram cache. (#9440 ) - A new histogram collection with a limit in size. - Unify histogram building logic between hist, multi-hist, and approx.	2023-08-08 03:21:26 +08:00
Jiaming Yuan	e93a274823	Small cleanup for histogram routines. (#9427 ) * Small cleanup for histogram routines. - Extract hist train param from GPU hist. - Make histogram const after construction. - Unify parameter names.	2023-08-02 18:28:26 +08:00
Jiaming Yuan	a196443a07	Implement sketching with Hessian on GPU. (#9399 ) - Prepare for implementing approx on GPU. - Unify the code path between weighted and uniform sketching on DMatrix.	2023-07-24 15:43:03 +08:00
Jiaming Yuan	16eb41936d	Handle the new `device` parameter in dask and demos. (#9386 ) * Handle the new `device` parameter in dask and demos. - Check no ordinal is specified in the dask interface. - Update demos. - Update dask doc. - Update the condition for QDM.	2023-07-15 19:11:20 +08:00
Jiaming Yuan	04aff3af8e	Define the new `device` parameter. (#9362 )	2023-07-13 19:30:25 +08:00
Jiaming Yuan	20c52f07d2	Support exporting cut values (#9356 )	2023-07-08 15:32:41 +08:00
Jiaming Yuan	645037e376	Improve test coverage with predictor configuration. (#9354 ) * Improve test coverage with predictor configuration. - Test with ext memory. - Test with QDM. - Test with dart.	2023-07-05 15:17:22 +08:00
Jiaming Yuan	39390cc2ee	[breaking] Remove the `predictor` param, allow fallback to prediction using `DMatrix`. (#9129 ) - A `DeviceOrd` struct is implemented to indicate the device. It will eventually replace the `gpu_id` parameter. - The `predictor` parameter is removed. - Fallback to `DMatrix` when `inplace_predict` is not available. - The heuristic for choosing a predictor is only used during training.	2023-07-03 19:23:54 +08:00
Rong Ou	962a20693f	More support for column split in cpu predictor (#9244 ) - Added column split support to `PredictInstance` and `PredictLeaf`. - Refactoring of tests.	2023-06-05 08:05:38 +08:00
Jiaming Yuan	17fd3f55e9	Optimize adapter element counting on GPU. (#9209 ) - Implement a simple `IterSpan` for passing iterators with size. - Use shared memory for column size counts. - Use one thread for each sample in row count to reduce atomic operations.	2023-05-30 23:28:43 +08:00
Stephan T. Lavavej	59edfdb315	Fix typo: `_defined` => `defined` (#9153 )	2023-05-11 16:34:45 -07:00
Jiaming Yuan	08ce495b5d	Use Booster context in DMatrix. (#8896 ) - Pass context from booster to DMatrix. - Use context instead of integer for `n_threads`. - Check the consistency configuration for `max_bin`. - Test for all combinations of initialization options.	2023-04-28 21:47:14 +08:00
Jiaming Yuan	1f9a57d17b	[Breaking] Require format to be specified in input URI. (#9077 ) Previously, we use `libsvm` as default when format is not specified. However, the dmlc data parser is not particularly robust against errors, and the most common type of error is undefined format. Along with which, we will recommend users to use other data loader instead. We will continue the maintenance of the parsers as it's currently used for many internal tests including federated learning.	2023-04-28 19:45:15 +08:00

1 2 3 4 5 ...

459 Commits