xgboost

Author	SHA1	Message	Date
amdsc21	65d83e288f	fix device query	2023-04-19 19:53:26 +02:00
amdsc21	18034a4291	tune histogram	2023-03-26 01:42:51 +01:00
amdsc21	7ee4734d3a	rm device_helpers.hip.h from cu	2023-03-26 00:24:11 +01:00
amdsc21	ee582f03c3	rm device_helpers.hip.h from cuh	2023-03-25 23:35:57 +01:00
amdsc21	595cd81251	add max shared mem workaround	2023-03-19 20:08:42 +01:00
amdsc21	a79a35c22c	add warp size	2023-03-15 22:00:26 +01:00
amdsc21	4484c7f073	disable Optin Shared Mem	2023-03-15 02:10:16 +01:00
amdsc21	364df7db0f	fix ../tree/gpu_hist/evaluate_splits.hip bugs, size 64	2023-03-14 06:17:21 +01:00
amdsc21	7d96758382	macro format	2023-03-11 06:57:24 +01:00
amdsc21	500428cc0f	finish row_partitioner.cu	2023-03-09 22:31:11 +01:00
amdsc21	495816f694	finished gradient_based_sampler.cu	2023-03-09 22:26:08 +01:00
amdsc21	df42dd2c53	finished evaluator.cu	2023-03-09 22:22:05 +01:00
amdsc21	f55243fda0	finish evaluate_splits.cu	2023-03-09 22:15:10 +01:00
amdsc21	1e09c21456	finished feature_groups.cu	2023-03-09 21:31:00 +01:00
amdsc21	0ed5d3c849	finished histogram.cu	2023-03-09 21:28:37 +01:00
amdsc21	270c7b4802	enable rocm, fix row_partitioner.cuh	2023-03-08 06:22:25 +01:00
amdsc21	f2009533e1	rm hip.h	2023-03-08 06:04:01 +01:00
amdsc21	ed45aa2816	Merge branch 'master' into dev-hui	2023-03-08 00:39:33 +01:00
amdsc21	c51a1c9aae	rename hip.cc to hip	2023-03-07 05:39:53 +01:00
amdsc21	cafbfce51f	add hip.h	2023-03-07 03:46:26 +01:00
amdsc21	6039a71e6c	add hip structure	2023-03-07 02:17:19 +01:00
Jiaming Yuan	4d665b3fb0	Restore clang tidy test. (#8861 )	2023-03-03 13:47:04 -08:00
Jiaming Yuan	594371e35b	Fix CPP lint. (#8807 )	2023-02-15 20:16:35 +08:00
Jiaming Yuan	70c9b885ef	Extract floating point rounding routines. (#8771 )	2023-02-12 04:26:41 +08:00
Rory Mitchell	7214a45e83	Fix different number of features in gpu_hist evaluator. (#8754 )	2023-02-06 23:15:16 +08:00
Jiaming Yuan	c6a8754c62	Define CUDA Context. (#8604 ) We will transition to non-default and non-blocking CUDA stream.	2022-12-20 15:15:07 +08:00
Jiaming Yuan	3e26107a9c	Rename and extract `Context`. (#8528 ) * Rename `GenericParameter` to `Context`. * Rename header file to reflect the change. * Rename all references.	2022-12-07 04:58:54 +08:00
Robert Maynard	16f96b6cfb	Work with newer thrust and libcudacxx (#8454 ) * Thrust 1.17 removes the experimental/pinned_allocator. When xgboost is brought into a large project it can be compiled against Thrust 1.17+ which don't offer this experimental allocator. To ensure that going forward xgboost works in all environments we provide a xgboost namespaced version of the pinned_allocator that previously was in Thrust.	2022-11-11 04:22:53 +08:00
Rory Mitchell	210915c985	Use integer gradients in gpu_hist split evaluation (#8274 )	2022-10-11 12:16:27 +02:00
Rong Ou	668b8a0ea4	[Breaking] Switch from rabit to the collective communicator (#8257 ) * Switch from rabit to the collective communicator * fix size_t specialization * really fix size_t * try again * add include * more include * fix lint errors * remove rabit includes * fix pylint error * return dict from communicator context * fix communicator shutdown * fix dask test * reset communicator mocklist * fix distributed tests * do not save device communicator * fix jvm gpu tests * add python test for federated communicator * Update gputreeshap submodule Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-05 14:39:01 -08:00
Rory Mitchell	8f77677193	Use quantised gradients in gpu_hist histograms (#8246 )	2022-09-26 17:35:35 +02:00
Jiaming Yuan	b5eb36f1af	Add `max_cat_threshold` to GPU and handle missing cat values. (#8212 )	2022-09-07 00:57:51 +08:00
Philip Hyunsu Cho	56395d120b	Work around MSVC behavior wrt constexpr capture (#8211 ) * Work around MSVC behavior wrt constexpr capture * Fix lint	2022-08-31 11:42:08 -08:00
Rory Mitchell	1703dc330f	Optimise histogram kernels (#8118 )	2022-08-18 14:07:26 +02:00
Rory Mitchell	1be09848a7	Refactor split valuation kernel (#8073 )	2022-07-21 15:41:50 +02:00
Jiaming Yuan	abaa593aa0	Fix compiler warnings. (#8059 ) - Remove unused parameters. - Avoid comparison of different signedness.	2022-07-14 05:29:56 +08:00
Rory Mitchell	0bdaca25ca	Use single precision in gain calculation, use pointers instead of span. (#8051 )	2022-07-12 21:56:27 +02:00
Rory Mitchell	794cbaa60a	Fuse split evaluation kernels (#8026 )	2022-07-05 10:24:31 +02:00
Rory Mitchell	bc4f802b17	Batch UpdatePosition using cudaMemcpy (#7964 )	2022-06-30 17:52:40 +02:00
Jiaming Yuan	142a208a90	Fix compiler warnings. (#8022 ) - Remove/fix unused parameters - Remove deprecated code in rabit. - Update dmlc-core.	2022-06-22 21:29:10 +08:00
Jiaming Yuan	1a33b50a0d	Fix compiler warnings. (#7974 ) - Remove unused parameters. There are still many warnings that are not yet addressed. Currently, the warnings in dmlc-core dominate the error log. - Remove `distributed` parameter from metric. - Fixes some warnings about signed comparison.	2022-06-06 22:56:25 +08:00
Rory Mitchell	71d3b2e036	Fuse gpu_hist all-reduce calls where possible (#7867 )	2022-05-17 13:27:50 +02:00
Rory Mitchell	7ef54e39ec	Small refactor to categoricals (#7858 )	2022-05-05 17:47:02 +02:00
Jiaming Yuan	317d7be6ee	Always use partition based categorical splits. (#7857 )	2022-05-03 22:30:32 +08:00
Jiaming Yuan	fdf533f2b9	[POC] Experimental support for l1 error. (#7812 ) Support adaptive tree, a feature supported by both sklearn and lightgbm. The tree leaf is recomputed based on residue of labels and predictions after construction. For l1 error, the optimal value is the median (50 percentile). This is marked as experimental support for the following reasons: - The value is not well defined for distributed training, where we might have empty leaves for local workers. Right now I just use the original leaf value for computing the average with other workers, which might cause significant errors. - Some follow-ups are required, for exact, pruner, and optimization for quantile function. Also, we need to calculate the initial estimation.	2022-04-26 21:41:55 +08:00
Jiaming Yuan	1d468e20a4	Optimize GPU evaluation function for categorical data. (#7705 ) * Use transform and cache.	2022-02-28 17:46:29 +08:00
Jiaming Yuan	d625dc2047	Work around nvcc error. (#7673 )	2022-02-19 01:41:46 +08:00
Jiaming Yuan	0d0abe1845	Support optimal partitioning for GPU hist. (#7652 ) * Implement `MaxCategory` in quantile. * Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function. * Extract an evaluator from GPU Hist to store the needed states. * Added some CUDA stream/event utilities. * Update document with references. * Fixed a bug in approx evaluator where the number of data points is less than the number of categories.	2022-02-15 03:03:12 +08:00
Ginko Balboa	29bfa94bb6	Fix external memory with gpu_hist and subsampling combination bug. (#7481 ) Instead of accessing data from the `original_page_`, access the data from the first page of the available batch. fix #7476 Co-authored-by: jiamingy <jm.yuan@outlook.com>	2021-12-24 11:15:35 +08:00
Jiaming Yuan	7f399eac8b	Use double for GPU Hist node sum. (#7507 )	2021-12-22 08:41:35 +08:00

1 2

90 Commits