xgboost

Author	SHA1	Message	Date
Jiaming Yuan	8c676c889d	Remove internal use of gpu_id. (#9568 )	2023-09-20 23:29:51 +08:00
Jiaming Yuan	adea842c83	Fix inplace predict with fallback when base margin is used. (#9536 ) - Copy meta info from proxy DMatrix. - Use `std::call_once` to emit less warnings.	2023-09-05 01:04:24 +08:00
Jiaming Yuan	ddf2e68821	Use the new `DeviceOrd` in the linalg module. (#9527 )	2023-08-29 13:37:29 +08:00
Jiaming Yuan	972730cde0	Use matrix for gradient. (#9508 ) - Use the `linalg::Matrix` for storing gradients. - New API for the custom objective. - Custom objective for multi-class/multi-target is now required to return the correct shape. - Custom objective for Python can accept arrays with any strides. (row-major, column-major)	2023-08-24 05:29:52 +08:00
Jiaming Yuan	bb56183396	Normalize file system path. (#9463 )	2023-08-11 21:26:46 +08:00
Jiaming Yuan	f05294a6f2	Fix clang warnings. (#9447 ) - static function in header. (which is marked as unused due to translation unit visibility). - Implicit copy operator is deprecated. - Unused lambda capture. - Moving a temporary variable prevents copy elision.	2023-08-09 15:34:45 +08:00
Jiaming Yuan	54029a59af	Bound the size of the histogram cache. (#9440 ) - A new histogram collection with a limit in size. - Unify histogram building logic between hist, multi-hist, and approx.	2023-08-08 03:21:26 +08:00
Jiaming Yuan	912e341d57	Initial GPU support for the approx tree method. (#9414 )	2023-07-31 15:50:28 +08:00
Jiaming Yuan	a196443a07	Implement sketching with Hessian on GPU. (#9399 ) - Prepare for implementing approx on GPU. - Unify the code path between weighted and uniform sketching on DMatrix.	2023-07-24 15:43:03 +08:00
Jiaming Yuan	275da176ba	Document for device ordinal. (#9398 ) - Rewrite GPU demos. notebook is converted to script to avoid committing additional png plots. - Add GPU demos into the sphinx gallery. - Add RMM demos into the sphinx gallery. - Test for firing threads with different device ordinals.	2023-07-22 15:26:29 +08:00
Jiaming Yuan	04aff3af8e	Define the new `device` parameter. (#9362 )	2023-07-13 19:30:25 +08:00
Rong Ou	3632242e0b	Support column split with GPU quantile (#9370 )	2023-07-11 12:15:56 +08:00
Jiaming Yuan	20c52f07d2	Support exporting cut values (#9356 )	2023-07-08 15:32:41 +08:00
Jiaming Yuan	59787b23af	Allow empty page in external memory. (#9361 )	2023-07-08 09:24:35 +08:00
Jiaming Yuan	41c6813496	Preserve order of saved updaters config. (#9355 ) - Save the updater sequence as an array instead of object. - Warn only once. The compatibility is kept, but we should be able to break it as the config is not loaded in pickle model and it's declared to be not stable.	2023-07-05 20:20:07 +08:00
Jiaming Yuan	645037e376	Improve test coverage with predictor configuration. (#9354 ) * Improve test coverage with predictor configuration. - Test with ext memory. - Test with QDM. - Test with dart.	2023-07-05 15:17:22 +08:00
Jiaming Yuan	39390cc2ee	[breaking] Remove the `predictor` param, allow fallback to prediction using `DMatrix`. (#9129 ) - A `DeviceOrd` struct is implemented to indicate the device. It will eventually replace the `gpu_id` parameter. - The `predictor` parameter is removed. - Fallback to `DMatrix` when `inplace_predict` is not available. - The heuristic for choosing a predictor is only used during training.	2023-07-03 19:23:54 +08:00
Jiaming Yuan	bc267dd729	Use ptr from `mmap` for `GHistIndexMatrix` and `ColumnMatrix`. (#9315 ) * Use ptr from mmap for `GHistIndexMatrix` and `ColumnMatrix`. - Define a resource for holding various types of memory pointers. - Define ref vector for holding resources. - Swap the underlying resources for GHist and ColumnM. - Add documentation for current status. - s390x support is removed. It should work if you can compile XGBoost, all the old workaround code does is to get GCC to compile.	2023-06-27 19:05:46 +08:00
Jiaming Yuan	54da4b3185	Cleanup to prepare for using mmap pointer in external memory. (#9317 ) - Update SparseDMatrix comment. - Use a pointer in the bitfield. We will replace the `std::vector<bool>` in `ColumnMatrix` with bitfield. - Clean up the page source. The timer is removed as it's inaccurate once we swap the mmap pointer into the page.	2023-06-22 06:43:11 +08:00
Jiaming Yuan	ee6809e642	Use mmap for external memory. (#9282 ) - Have basic infrastructure for mmap. - Release file write handle.	2023-06-19 18:52:55 +08:00
amdsc21	af8845405a	sync Jun 5	2023-06-07 02:43:21 +02:00
Jiaming Yuan	0cba2cdbb0	Support linalg data structures in check device. (#9243 )	2023-06-06 09:47:24 +08:00
Jiaming Yuan	9fbde21e9d	Rework the precision metric. (#9222 ) - Rework the precision metric for both CPU and GPU. - Mention it in the document. - Cleanup old support code for GPU ranking metric. - Deterministic GPU implementation. * Drop support for classification. * type. * use batch shape. * lint. * cpu build. * cpu build. * lint. * Tests. * Fix. * Cleanup error message.	2023-06-02 20:49:43 +08:00
Your Name	42867a4805	sync Jun 1	2023-06-01 15:55:06 -07:00
Jiaming Yuan	17fd3f55e9	Optimize adapter element counting on GPU. (#9209 ) - Implement a simple `IterSpan` for passing iterators with size. - Use shared memory for column size counts. - Use one thread for each sample in row count to reduce atomic operations.	2023-05-30 23:28:43 +08:00
Jiaming Yuan	097f11b6e0	Support CUDA f16 without transformation. (#9207 ) - Support f16 from cupy. - Include CUDA header explicitly. - Cleanup cmake nvtx support.	2023-05-30 20:54:31 +08:00
Jiaming Yuan	053aababd4	Avoid thrust logical operation. (#9199 ) Thrust implementation of `thrust::all_of/any_of/none_of` adopts an early stopping strategy to bailout early by dividing the input into small batches. This is not ideal for data validation as we expect all data to be valid. The strategy leads to excessive kernel launches and stream synchronization. * Use reduce from dh instead.	2023-05-27 01:36:58 +08:00
Rong Ou	5b69534b43	Support column split in multi-target `hist` (#9171 )	2023-05-26 16:56:05 +08:00
amdsc21	b22644fc10	add hip.h	2023-05-20 01:25:33 +02:00
amdsc21	8cad8c693c	sync up May15 2023	2023-05-15 18:59:18 +02:00
Rong Ou	52311dcec9	Fix multi-threaded gtests (#9148 )	2023-05-10 19:15:32 +08:00
Jiaming Yuan	85988a3178	Wait for data CUDA stream instead of sync. (#9144 ) --------- Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2023-05-09 09:52:21 +08:00
amdsc21	b324d51f14	fix array_interface.h half type	2023-05-02 20:50:50 +02:00
amdsc21	65097212b3	fix IterativeDeviceDMatrix, support HIP	2023-05-02 20:20:11 +02:00
amdsc21	e4538cb13c	fix, to support hip	2023-05-02 17:43:11 +02:00
amdsc21	5446c501af	merge 23Mar01	2023-05-02 00:05:58 +02:00
Jiaming Yuan	08ce495b5d	Use Booster context in DMatrix. (#8896 ) - Pass context from booster to DMatrix. - Use context instead of integer for `n_threads`. - Check the consistency configuration for `max_bin`. - Test for all combinations of initialization options.	2023-04-28 21:47:14 +08:00
Jiaming Yuan	1f9a57d17b	[Breaking] Require format to be specified in input URI. (#9077 ) Previously, we use `libsvm` as default when format is not specified. However, the dmlc data parser is not particularly robust against errors, and the most common type of error is undefined format. Along with which, we will recommend users to use other data loader instead. We will continue the maintenance of the parsers as it's currently used for many internal tests including federated learning.	2023-04-28 19:45:15 +08:00
Jiaming Yuan	17ff471616	Optimize array interface input. (#9090 )	2023-04-28 18:01:58 +08:00
Jiaming Yuan	0e470ef606	Optimize prediction with QuantileDMatrix. (#9096 ) - Reduce overhead in `FVecDrop`. - Reduce overhead caused by `HostVector()` calls.	2023-04-28 00:51:41 +08:00
Rong Ou	a320b402a5	More refactoring to take advantage of collective aggregators (#9081 )	2023-04-26 03:36:09 +08:00
amdsc21	acad01afc9	sync Mar 29	2023-03-30 00:46:50 +02:00
Rong Ou	ff26cd3212	More tests for column split and vertical federated learning (#8985 ) Added some more tests for the learner and fit_stump, for both column-wise distributed learning and vertical federated learning. Also moved the `IsRowSplit` and `IsColumnSplit` methods from the `DMatrix` to the `MetaInfo` since in some places we only have access to the `MetaInfo`. Added a new convenience method `IsVerticalFederatedLearning`. Some refactoring of the testing fixtures.	2023-03-28 16:40:26 +08:00
amdsc21	7ee4734d3a	rm device_helpers.hip.h from cu	2023-03-26 00:24:11 +01:00
amdsc21	ee582f03c3	rm device_helpers.hip.h from cuh	2023-03-25 23:35:57 +01:00
amdsc21	7fbc561e17	initial merge	2023-03-25 04:31:55 +01:00
Jiaming Yuan	151882dd26	Initial support for multi-target tree. (#8616 ) * Implement multi-target for hist. - Add new hist tree builder. - Move data fetchers for tests. - Dispatch function calls in gbm base on the tree type.	2023-03-22 23:49:56 +08:00
Rong Ou	b240f055d3	Support vertical federated learning (#8932 )	2023-03-22 14:25:26 +08:00
Jiaming Yuan	f186c87cf9	Check inf in data for all types of DMatrix. (#8911 )	2023-03-15 11:24:35 +08:00
Jiaming Yuan	36a7396658	Replace dmlc any with std any. (#8892 )	2023-03-11 06:11:04 +08:00

1 2 3 4 5 ...

411 Commits