xgboost

Author	SHA1	Message	Date
Jiaming Yuan	96bbf80457	[EM] Support quantile objectives for GPU-based external memory. (#10820 ) - Improved error message for memory usage. - Support quantile-based objectives for GPU external memory.	2024-09-17 13:27:02 +08:00
Jiaming Yuan	d94f6679fc	[EM] Avoid synchronous calls and unnecessary ATS access. (#10811 ) - Pass context into various functions. - Factor out some CUDA algorithms. - Use ATS only for update position.	2024-09-10 14:33:14 +08:00
Jiaming Yuan	5f7f31d464	[EM] Refactor ellpack construction. (#10810 ) - Remove the calculation of n_symbols in the accessor. - Pack initialization steps into the parameter list. - Pass the context into various ctors. - Specialization for dense data to prepare for further compression.	2024-09-09 14:10:10 +08:00
Jiaming Yuan	e1a2c1bbb3	[EM] Merge GPU partitioning with histogram building. (#10766 ) - Stop concatenating pages if there's no subsampling. - Use a single iteration for histogram build and partitioning.	2024-08-31 03:25:37 +08:00
Jiaming Yuan	61dd854a52	[EM] Refactor GPU histogram builder. (#10764 ) - Expose the maximum number of cached nodes to be consistent with the CPU implementation. Also easier for testing. - Extract the subtraction trick for easier testing. - Split up the `GradientQuantiser` to avoid circular dependency.	2024-08-30 02:39:14 +08:00
Jiaming Yuan	4fe67f10b4	[EM] Have one partitioner for each batch. (#10760 ) - Initialize one partitioner for each batch. - Collect partition size during initialization. - Support base ridx in the finalization.	2024-08-29 01:35:17 +08:00
Jiaming Yuan	bde1265caf	[EM] Return a full DMatrix instead of an Ellpack from the GPU sampler. (#10753 )	2024-08-28 01:05:11 +08:00
Jiaming Yuan	55aef8f546	[EM] Avoid resizing host cache. (#10734 ) - Add SAM allocator and resource. - Use page-based cache instead of stream-based cache.	2024-08-23 06:34:01 +08:00
Jiaming Yuan	582ea104b5	[EM] Enable prediction cache for GPU. (#10707 ) - Use `UpdatePosition` for all nodes and skip `FinalizePosition` when external memory is used. - Create `encode/decode` for node position; this is just a refactor. - Reuse code between update position and finalization.	2024-08-15 21:41:59 +08:00
Jiaming Yuan	cc3b56fc37	Cleanup GPU Hist tests. (#10677 ) - Remove GPU Hist gradient sampling test. The same properties are tested in the gradient sampler test suite. - Move basic histogram tests into the histogram test suite. - Remove the header inclusion of the `updater_gpu_hist.cu` in tests.	2024-08-06 11:50:44 +08:00
Jiaming Yuan	a19bbc9be5	Avoid caching allocator for large allocations. (#10582 )	2024-07-23 03:48:03 +08:00
Jiaming Yuan	6d9fcb771e	Move device histogram storage into `histogram.cuh`. (#10608 )	2024-07-21 14:10:13 +08:00
Jiaming Yuan	292bb677e5	[EM] Support mmap backed ellpack. (#10602 ) - Support resource view in ellpack. - Define the CUDA version of MMAP resource. - Define the CUDA version of malloc resource. - Refactor CUDA runtime API wrappers, and add memory access related wrappers. - Gather Windows macros into a single header.	2024-07-18 08:20:21 +08:00
Jiaming Yuan	5a92ffe3ca	Partial fix for CTK 12.5 (#10574 )	2024-07-16 17:41:50 +08:00
Jiaming Yuan	1ca4bfd20e	Avoid thrust vector initialization. (#10544 ) - Add a wrapper for rmm device uvector. - Split up the `Resize` method for HDV.	2024-07-11 17:29:27 +08:00
Jiaming Yuan	5f910cd4ff	[EM] Handle base idx in GPU histogram. (#10549 )	2024-07-11 03:26:30 +08:00
Jiaming Yuan	620b2b155a	Cache GPU histogram kernel configuration. (#10538 )	2024-07-04 15:38:59 +08:00
Jiaming Yuan	a5a58102e5	Revamp the rabit implementation. (#10112 ) This PR replaces the original RABIT implementation with a new one, which has already been partially merged into XGBoost. The new one features: - Federated learning for both CPU and GPU. - NCCL. - More data types. - A unified interface for all the underlying implementations. - Improved timeout handling for both tracker and workers. - Exhaustive tests with metrics (fixed a couple of bugs along the way). - A reusable tracker for Python and JVM packages.	2024-05-20 11:56:23 +08:00
Jiaming Yuan	53fc17578f	Use `std::uint64_t` for row index. (#10120 ) - Use std::uint64_t instead of size_t to avoid implementation-defined type. - Rename to bst_idx_t, to account for other types of indexing. - Small cleanup to the base header.	2024-03-15 18:43:49 +08:00
Jiaming Yuan	5ac233280e	Require context in aggregators. (#10075 )	2024-02-28 03:12:42 +08:00
Jiaming Yuan	06bdc15e9b	[coll] Pass context to various functions. (#9772 ) In the future, the `Context` object will be required for collective operations. This PR passes the context object to some required functions to prepare for swapping out the implementation.	2023-11-08 09:54:05 +08:00
Jiaming Yuan	6755179e77	[coll] Add nccl. (#9726 )	2023-10-28 16:33:58 +08:00
Jiaming Yuan	8c676c889d	Remove internal use of gpu_id. (#9568 )	2023-09-20 23:29:51 +08:00
Rong Ou	9bab06cbca	Support column split in gpu hist updater (#9384 )	2023-08-31 18:09:35 +08:00
Jiaming Yuan	ddf2e68821	Use the new `DeviceOrd` in the linalg module. (#9527 )	2023-08-29 13:37:29 +08:00
Jiaming Yuan	942b957eef	Fix GPU categorical split memory allocation. (#9529 )	2023-08-29 10:06:03 +08:00
Jiaming Yuan	972730cde0	Use matrix for gradient. (#9508 ) - Use the `linalg::Matrix` for storing gradients. - New API for the custom objective. - Custom objective for multi-class/multi-target is now required to return the correct shape. - Custom objective for Python can accept arrays with any strides. (row-major, column-major)	2023-08-24 05:29:52 +08:00
Rong Ou	6103dca0bb	Support column split in GPU evaluate splits (#9511 )	2023-08-23 16:33:43 +08:00
Jiaming Yuan	1caa93221a	Use `realloc` for histogram cache and expose the cache limit. (#9455 )	2023-08-10 14:05:27 +08:00
Jiaming Yuan	e93a274823	Small cleanup for histogram routines. (#9427 ) - Extract hist train param from GPU hist. - Make histogram const after construction. - Unify parameter names.	2023-08-02 18:28:26 +08:00
Jiaming Yuan	912e341d57	Initial GPU support for the approx tree method. (#9414 )	2023-07-31 15:50:28 +08:00
Jiaming Yuan	20c52f07d2	Support exporting cut values (#9356 )	2023-07-08 15:32:41 +08:00
Jiaming Yuan	39390cc2ee	[breaking] Remove the `predictor` param, allow fallback to prediction using `DMatrix`. (#9129 ) - A `DeviceOrd` struct is implemented to indicate the device. It will eventually replace the `gpu_id` parameter. - The `predictor` parameter is removed. - Fallback to `DMatrix` when `inplace_predict` is not available. - The heuristic for choosing a predictor is only used during training.	2023-07-03 19:23:54 +08:00
Jiaming Yuan	ee6809e642	Use mmap for external memory. (#9282 ) - Have basic infrastructure for mmap. - Release file write handle.	2023-06-19 18:52:55 +08:00
Rong Ou	e70810be8a	Refactor device communicator to make allreduce more flexible (#9295 )	2023-06-14 03:53:03 +08:00
Jiaming Yuan	ea0deeca68	Disable dense optimization in hist for distributed training. (#9272 )	2023-06-10 02:31:34 +08:00
Jiaming Yuan	08ce495b5d	Use Booster context in DMatrix. (#8896 ) - Pass context from booster to DMatrix. - Use context instead of integer for `n_threads`. - Check the consistency configuration for `max_bin`. - Test for all combinations of initialization options.	2023-04-28 21:47:14 +08:00
Jiaming Yuan	acc110c251	[MT-TREE] Support prediction cache and model slicing. (#8968 ) - Fix prediction range. - Support prediction cache in mt-hist. - Support model slicing. - Make the booster a Python iterable by defining `__iter__`. - Cleanup removed/deprecated parameters. - A new field in the output model `iteration_indptr` for pointing to the ranges of trees for each iteration.	2023-03-27 23:10:54 +08:00
Jiaming Yuan	6deaec8027	Pass obj info by reference instead of by value. (#8889 ) - Pass obj info into tree updater as const pointer. This way we don't have to initialize the learner model param before configuring gbm, hence breaking up the dependency of configurations.	2023-03-11 01:38:28 +08:00
Jiaming Yuan	5feee8d4a9	Define core multi-target regression tree structure. (#8884 ) - Define a new tree struct embedded in the `RegTree`. - Provide dispatching functions in `RegTree`. - Fix some c++-17 warnings about the use of nodiscard (currently we disable the warning on the CI). - Use uint32_t instead of size_t for `bst_target_t` as it has a defined size and can be used as part of dmlc parameter. - Hide the `Segment` struct inside the categorical split matrix.	2023-03-09 19:03:06 +08:00
Jiaming Yuan	228a46e8ad	Support learning rate for zero-hessian objectives. (#8866 )	2023-03-06 20:33:28 +08:00
Jiaming Yuan	4d665b3fb0	Restore clang tidy test. (#8861 )	2023-03-03 13:47:04 -08:00
Rory Mitchell	69a50248b7	Fix scope of feature set pointers (#8850 ) Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2023-03-02 12:37:14 +08:00
Jiaming Yuan	282b1729da	Specify the number of threads for parallel sort. (#8735 ) - Pass context object into argsort. - Replace macros with inline functions.	2023-02-16 00:20:19 +08:00
Rory Mitchell	7214a45e83	Fix different number of features in gpu_hist evaluator. (#8754 )	2023-02-06 23:15:16 +08:00
Jiaming Yuan	0e61ba57d6	Fix GPU L1 error. (#8749 )	2023-02-04 03:02:00 +08:00
Jiaming Yuan	c6a8754c62	Define CUDA Context. (#8604 ) We will transition to non-default and non-blocking CUDA stream.	2022-12-20 15:15:07 +08:00
Jiaming Yuan	43a647a4dd	Fix inference with categorical feature. (#8591 )	2022-12-15 17:57:26 +08:00
Jiaming Yuan	3e26107a9c	Rename and extract `Context`. (#8528 ) * Rename `GenericParameter` to `Context`. * Rename header file to reflect the change. * Rename all references.	2022-12-07 04:58:54 +08:00
Rory Mitchell	210915c985	Use integer gradients in gpu_hist split evaluation (#8274 )	2022-10-11 12:16:27 +02:00


202 Commits