xgboost

Author	SHA1	Message	Date
Jiaming Yuan	508ac13243	Check cub errors. (#10721 ) - Make sure cuda error returned by cub scan is caught. - Avoid temporary buffer allocation in thrust device vector.	2024-08-21 02:50:26 +08:00
Jiaming Yuan	ec3f327c20	Add managed memory allocator. (#10711 )	2024-08-17 03:02:34 +08:00
Jiaming Yuan	8d7fe262d9	[EM] Enable access to the number of batches. (#10691 ) - Expose `NumBatches` in `DMatrix`. - Small cleanup for removing legacy CUDA stream and ~force CUDA context initialization~. - Purge old external memory data generation code.	2024-08-17 02:59:45 +08:00
Jiaming Yuan	abe65e3769	Reduce thread contention in column split histogram test. (#10708 )	2024-08-17 01:00:32 +08:00
Jiaming Yuan	582ea104b5	[EM] Enable prediction cache for GPU. (#10707 ) - Use `UpdatePosition` for all nodes and skip `FinalizePosition` when external memory is used. - Create `encode/decode` for node position, this is just as a refactor. - Reuse code between update position and finalization.	2024-08-15 21:41:59 +08:00
Dmitry Razdoburdin	773ded684b	[sycl] Add depth-wise policy (#10690 ) Co-authored-by: Dmitry Razdoburdin <>	2024-08-13 18:12:35 +08:00
Jiaming Yuan	2ecc85ffad	[EM] Support ExtMemQdm in the GPU predictor. (#10694 )	2024-08-13 12:21:11 +08:00
Jiaming Yuan	43704549a2	[coll] Reduce the amount of open files (socket). (#10693 ) Reduce the chance of hitting `Failed to call `socket`: Too many open files`.	2024-08-13 05:23:49 +08:00
Jiaming Yuan	d414fdf2e7	[EM] Add GPU version of the external memory QDM. (#10689 )	2024-08-10 10:49:43 +08:00
Jiaming Yuan	7bccc1ea2c	[EM] CPU implementation for external memory QDM. (#10682 ) - A new DMatrix type. - Extract common code into a new QDM base class. Not yet working: - Not exposed to the interface yet, will wait for the GPU implementation. - ~No meta info yet, still working on the source.~ - Exporting data to CSR is not supported yet.	2024-08-09 09:38:02 +08:00
Dmitry Razdoburdin	e555a238bc	[SYCL]. Add implementation for loss-guided policy (#10681 ) --------- Co-authored-by: Dmitry Razdoburdin <>	2024-08-09 09:04:46 +08:00
Jiaming Yuan	cc3b56fc37	Cleanup GPU Hist tests. (#10677 ) * Cleanup GPU Hist tests. - Remove GPU Hist gradient sampling test. The same properties are tested in the gradient sampler test suite. - Move basic histogram tests into the histogram test suite. - Remove the header inclusion of the `updater_gpu_hist.cu` in tests.	2024-08-06 11:50:44 +08:00
Jiaming Yuan	77c844cef7	Reduce thread contention in column split tests. (#10658 ) --------- Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2024-08-01 18:36:46 +08:00
Dmitry Razdoburdin	7720272870	[sycl] add split applications and tests (#10636 ) Co-authored-by: Dmitry Razdoburdin <>	2024-07-26 15:25:49 +08:00
Jiaming Yuan	a19bbc9be5	Avoid caching allocator for large allocations. (#10582 )	2024-07-23 03:48:03 +08:00
Dmitry Razdoburdin	f6cae4da85	[SYCL] Add splits evaluation (#10605 ) --------- Co-authored-by: Dmitry Razdoburdin <>	2024-07-22 18:14:06 +08:00
Jiaming Yuan	6d9fcb771e	Move device histogram storage into `histogram.cuh`. (#10608 )	2024-07-21 14:10:13 +08:00
Jiaming Yuan	292bb677e5	[EM] Support mmap backed ellpack. (#10602 ) - Support resource view in ellpack. - Define the CUDA version of MMAP resource. - Define the CUDA version of malloc resource. - Refactor cuda runtime API wrappers, and add memory access related wrappers. - gather windows macros into a single header.	2024-07-18 08:20:21 +08:00
Jiaming Yuan	e9fbce9791	Refactor `DeviceUVector`. (#10595 ) Create a wrapper instead of using inheritance to avoid inconsistent interface of the class.	2024-07-18 03:33:01 +08:00
Jiaming Yuan	a6a8a55ffa	Merge approx tests. (#10583 )	2024-07-16 19:03:48 +08:00
Jiaming Yuan	6c403187ec	Fix column split race condition. (#10572 )	2024-07-12 01:07:12 +08:00
Jiaming Yuan	1ca4bfd20e	Avoid thrust vector initialization. (#10544 ) * Avoid thrust vector initialization. - Add a wrapper for rmm device uvector. - Split up the `Resize` method for HDV.	2024-07-11 17:29:27 +08:00
Jiaming Yuan	89da9f9741	[fed] Split up federated test CMake file. (#10566 ) - Collect all federated test files into the same directory. - Independently list the files.	2024-07-11 13:09:18 +08:00
Jiaming Yuan	5f910cd4ff	[EM] Handle base idx in GPU histogram. (#10549 )	2024-07-11 03:26:30 +08:00
Jiaming Yuan	34b154c284	Avoid the use of size_t in the partitioner. (#10541 ) - Avoid the use of size_t in the partitioner. - Use `Span` instead of `Elem` where `node_id` is not needed. - Remove the `const_cast`. - Make sure the constness is not removed in the `Elem` by making it reference only. size_t is implementation-defined, which causes issue when we want to pass pointer or span.	2024-07-11 00:43:08 +08:00
Jiaming Yuan	620b2b155a	Cache GPU histogram kernel configuration. (#10538 )	2024-07-04 15:38:59 +08:00
Jiaming Yuan	628411a654	Enhance the threadpool implementation. (#10531 ) - Accept an initialization function. - Support void return tasks.	2024-07-03 12:13:27 +08:00
Jiaming Yuan	9cb4c938da	[EM] Move prefetch in reset into the end of the iteration. (#10529 )	2024-07-03 03:48:18 +08:00
Philip Hyunsu Cho	09d32f1f2b	Fix build and C++ tests for FreeBSD (#10480 )	2024-06-28 01:47:55 -07:00
Jiaming Yuan	e8a962575a	[EM] Allow staging ellpack on host for GPU external memory. (#10488 ) - New parameter `on_host`. - Abstract format creation and stream creation into policy classes.	2024-06-28 04:42:18 +08:00
Jiaming Yuan	26eb68859f	Consistently report error in tests. (#10453 )	2024-06-21 14:35:22 +08:00
Jiaming Yuan	e5f1720656	[EM] Avoid writing cut matrix to cache. (#10444 )	2024-06-19 18:03:38 +08:00
Jiaming Yuan	b9e5229ff2	Update rapids (#10435 ) * [CI] Update RAPIDS to latest stable * RMM. --------- Co-authored-by: hcho3 <2532981+hcho3@users.noreply.github.com> Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2024-06-18 05:01:57 +08:00
Jiaming Yuan	c9f5fcaf21	[col] Small cleanup to federated comm. (#10397 )	2024-06-07 21:19:04 +08:00
Jiaming Yuan	d2d01d977a	Remove unnecessary fetch operations in external memory. (#10342 )	2024-05-31 13:16:40 +08:00
Jiaming Yuan	e6eefea5e2	[coll] Move the rabit poll helper. (#10349 )	2024-05-31 08:02:21 +08:00
Jiaming Yuan	d5fcbee44b	Add timeout for distributed tests. (#10315 )	2024-05-23 11:11:49 +08:00
Jiaming Yuan	1b25d23583	[JVM-packages] Prevent memory leak. (#10307 )	2024-05-22 13:47:59 +08:00
Dmitry Razdoburdin	c7e7ce7569	[SYCL] Add nodes initialisation (#10269 ) --------- Co-authored-by: Dmitry Razdoburdin <> Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2024-05-21 23:38:52 +08:00
Jiaming Yuan	a5a58102e5	Revamp the rabit implementation. (#10112 ) This PR replaces the original RABIT implementation with a new one, which has already been partially merged into XGBoost. The new one features: - Federated learning for both CPU and GPU. - NCCL. - More data types. - A unified interface for all the underlying implementations. - Improved timeout handling for both tracker and workers. - Exhausted tests with metrics (fixed a couple of bugs along the way). - A reusable tracker for Python and JVM packages.	2024-05-20 11:56:23 +08:00
Jiaming Yuan	835e59e538	Use a thread pool for external memory. (#10288 )	2024-05-16 19:32:12 +08:00
Dmitry Razdoburdin	f588252481	[sycl] add loss guided hist building (#10251 ) Co-authored-by: Dmitry Razdoburdin <>	2024-05-10 22:35:13 +08:00
Dmitry Razdoburdin	dcc9639b91	[sycl] add data initialisation for training (#10222 ) Co-authored-by: Dmitry Razdoburdin <> Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu> Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2024-05-05 12:07:10 +08:00
Jiaming Yuan	5e64276a9b	Update nvtx. (#10227 )	2024-04-29 06:33:46 +08:00
Dmitry Razdoburdin	58513dc288	[SYCL] Add sampling initialization (#10216 ) --------- Co-authored-by: Dmitry Razdoburdin <>	2024-04-25 04:35:52 +08:00
Jiaming Yuan	3fbb221fec	[coll] Implement shutdown for tracker and comm. (#10208 ) - Force shutdown the tracker. - Implement shutdown notice for error handling thread in comm.	2024-04-20 04:08:17 +08:00
Jiaming Yuan	3f64b4fde3	[coll] Add global functions. (#10203 )	2024-04-19 03:17:23 +08:00
Jiaming Yuan	4b10200456	[coll] Improve event loop. (#10199 ) - Add a test for blocking calls. - Do not require the queue to be empty after waking up; this frees up the thread to answer blocking calls. - Handle EOF in read. - Improve the error message in the result. Allow concatenation of multiple results.	2024-04-18 03:29:52 +08:00
Dmitry Razdoburdin	6e5c335cea	[SYCL] Add basic features for QuantileHistMaker (#10174 ) --------- Co-authored-by: Dmitry Razdoburdin <>	2024-04-15 21:24:46 +08:00
Jiaming Yuan	8bad677c2f	Update collective implementation. (#10152 ) * Update collective implementation. - Cleanup resource during `Finalize` to avoid handling threads in destructor. - Calculate the size for allgather automatically. - Use simple allgather for small (smaller than the number of worker) allreduce.	2024-03-30 18:57:31 +08:00

1 2 3 4 5 ...

771 Commits