Hendrik Groove
94ffd57641
try direct instantiation of EvaluateSplitsKernel
2024-10-21 21:22:57 +02:00
Hendrik Groove
8c15f3b665
revert
2024-10-21 11:43:29 +02:00
Hendrik Groove
bb2feab0b2
try
2024-10-21 01:55:41 +02:00
Hui Liu
b27f35e270
rm hip from src
2024-04-22 12:31:14 -07:00
Hui Liu
8b75204fed
merge latest change from upstream
2024-04-22 09:35:31 -07:00
Jiaming Yuan
3f64b4fde3
[coll] Add global functions. ( #10203 )
2024-04-19 03:17:23 +08:00
Hui Liu
ff549ae933
sync upstream code
2024-03-20 16:14:38 -07:00
Jiaming Yuan
53fc17578f
Use std::uint64_t for row index. ( #10120 )
...
- Use std::uint64_t instead of size_t to avoid implementation-defined type.
- Rename to bst_idx_t, to account for other types of indexing.
- Small cleanup to the base header.
2024-03-15 18:43:49 +08:00
Hui Liu
968dbf25fb
merge latest changes
2024-03-12 09:13:09 -07:00
Jiaming Yuan
5ac233280e
Require context in aggregators. ( #10075 )
2024-02-28 03:12:42 +08:00
Hui Liu
3fe874078c
merge latest changes
2024-01-24 13:30:08 -08:00
Hui Liu
2d7ffbdf3d
merge latest changes
2023-12-13 21:06:28 -08:00
Jiaming Yuan
fedd9674c8
Implement column sampler in CUDA. ( #9785 )
...
- CUDA implementation.
- Extract the broadcasting logic, we will need the context parameter after revamping the collective implementation.
- Some changes to the event loop for fixing a deadlock in CI.
- Move argsort into algorithms.cuh, add support for cuda stream.
2023-11-17 04:29:08 +08:00
Jiaming Yuan
06bdc15e9b
[coll] Pass context to various functions. ( #9772 )
...
* [coll] Pass context to various functions.
In the future, the `Context` object would be required for collective operations, this PR
passes the context object to some required functions to prepare for swapping out the
implementation.
2023-11-08 09:54:05 +08:00
Hui Liu
8fab17ae8f
rm hip.h files
2023-10-30 21:20:28 -07:00
Hui Liu
6762230d9a
namespace to reduce code
2023-10-27 10:51:32 -07:00
Hui Liu
4a4b528d54
add namespace aliases to reduce code
2023-10-27 09:11:55 -07:00
Hui Liu
3752b06550
Merge branch 'master' into sync-condition-2023Oct11
2023-10-24 10:46:38 -07:00
Jiaming Yuan
7a02facc9d
Serialize expand entry for allgather. ( #9702 )
2023-10-24 14:33:28 +08:00
Hui Liu
79319dfd4d
format
2023-10-23 22:29:48 -07:00
Hui Liu
15421e40d9
enable ROCm on latest XGBoost
2023-10-23 11:07:08 -07:00
Your Name
ffbbc9c968
add cuda to hip wrapper
2023-10-17 12:42:37 -07:00
Your Name
ea19555474
temp merge, disable 1 line, SetValid
2023-10-12 16:16:44 -07:00
Jiaming Yuan
8c676c889d
Remove internal use of gpu_id. ( #9568 )
2023-09-20 23:29:51 +08:00
Rong Ou
9bab06cbca
Support column split in gpu hist updater ( #9384 )
2023-08-31 18:09:35 +08:00
Jiaming Yuan
942b957eef
Fix GPU categorical split memory allocation. ( #9529 )
2023-08-29 10:06:03 +08:00
Rong Ou
6103dca0bb
Support column split in GPU evaluate splits ( #9511 )
2023-08-23 16:33:43 +08:00
Jiaming Yuan
912e341d57
Initial GPU support for the approx tree method. ( #9414 )
2023-07-31 15:50:28 +08:00
Rong Ou
7579905e18
Retry switching to per-thread default stream ( #9416 )
2023-07-26 07:09:12 +08:00
Jiaming Yuan
3a9996173e
Revert "Switch to per-thread default stream ( #9396 )" ( #9413 )
...
This reverts commit f7f673b00c15458fb4dd74a2a0d2ba80369c5faf.
2023-07-24 12:03:28 -07:00
Rong Ou
f7f673b00c
Switch to per-thread default stream ( #9396 )
2023-07-20 08:21:00 +08:00
Jiaming Yuan
ee6809e642
Use mmap for external memory. ( #9282 )
...
- Have basic infrastructure for mmap.
- Release file write handle.
2023-06-19 18:52:55 +08:00
amdsc21
5ca7daaa13
merge latest changes
2023-06-15 21:39:14 +02:00
ZHAOKAI WANG
2b76061659
remove redundant method in expand_entry ( #9283 )
2023-06-10 05:18:21 +08:00
amdsc21
b22644fc10
add hip.h
2023-05-20 01:25:33 +02:00
amdsc21
5446c501af
merge 23Mar01
2023-05-02 00:05:58 +02:00
amdsc21
313a74b582
add Shap Magic to check if use cat
2023-05-01 21:55:14 +02:00
Jiaming Yuan
08ce495b5d
Use Booster context in DMatrix. ( #8896 )
...
- Pass context from booster to DMatrix.
- Use context instead of integer for `n_threads`.
- Check the consistency configuration for `max_bin`.
- Test for all combinations of initialization options.
2023-04-28 21:47:14 +08:00
amdsc21
65d83e288f
fix device query
2023-04-19 19:53:26 +02:00
amdsc21
18034a4291
tune histogram
2023-03-26 01:42:51 +01:00
amdsc21
7ee4734d3a
rm device_helpers.hip.h from cu
2023-03-26 00:24:11 +01:00
amdsc21
ee582f03c3
rm device_helpers.hip.h from cuh
2023-03-25 23:35:57 +01:00
amdsc21
595cd81251
add max shared mem workaround
2023-03-19 20:08:42 +01:00
amdsc21
a79a35c22c
add warp size
2023-03-15 22:00:26 +01:00
amdsc21
4484c7f073
disable Optin Shared Mem
2023-03-15 02:10:16 +01:00
amdsc21
364df7db0f
fix ../tree/gpu_hist/evaluate_splits.hip bugs, size 64
2023-03-14 06:17:21 +01:00
amdsc21
7d96758382
macro format
2023-03-11 06:57:24 +01:00
amdsc21
500428cc0f
finish row_partitioner.cu
2023-03-09 22:31:11 +01:00
amdsc21
495816f694
finished gradient_based_sampler.cu
2023-03-09 22:26:08 +01:00
amdsc21
df42dd2c53
finished evaluator.cu
2023-03-09 22:22:05 +01:00