Jiaming Yuan
5f0c1e902b
Small cleanup for error message. ( #10502 )
...
- The `Fail` function can handle file location automatically.
- Report concatenated error for connection poll.
- Typos.
2024-07-02 13:36:41 +08:00
Philip Hyunsu Cho
09d32f1f2b
Fix build and C++ tests for FreeBSD ( #10480 )
2024-06-28 01:47:55 -07:00
Jiaming Yuan
e8a962575a
[EM] Allow staging ellpack on host for GPU external memory. ( #10488 )
...
- New parameter `on_host`.
- Abstract format creation and stream creation into policy classes.
2024-06-28 04:42:18 +08:00
Jiaming Yuan
2b400b18d5
Small cleanup for rowset collection. ( #10401 )
2024-06-19 18:06:23 +08:00
Jiaming Yuan
e5f1720656
[EM] Avoid writing cut matrix to cache. ( #10444 )
2024-06-19 18:03:38 +08:00
Jiaming Yuan
b4cc350ec5
Fix categorical data with external memory. ( #10433 )
2024-06-18 04:34:54 +08:00
Jiaming Yuan
49e25cfb36
Allow unaligned pointer if the array is empty. ( #10418 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2024-06-15 19:10:21 +08:00
Philip Hyunsu Cho
1ace9c66ec
[CI] Fix JVM tests on Windows ( #10404 )
2024-06-15 00:21:40 -07:00
Jiaming Yuan
0808e50ae8
Sync stream in ellpack format. ( #10374 )
2024-06-04 12:58:26 +08:00
Jiaming Yuan
4f48647932
Fix typo. ( #10353 )
2024-06-02 02:07:55 +08:00
Jiaming Yuan
d2d01d977a
Remove unnecessary fetch operations in external memory. ( #10342 )
2024-05-31 13:16:40 +08:00
Jiaming Yuan
e6eefea5e2
[coll] Move the rabit poll helper. ( #10349 )
2024-05-31 08:02:21 +08:00
Jiaming Yuan
2de67f0050
[coll] Prevent race during error check. ( #10319 )
2024-05-28 15:43:16 -07:00
Philip Hyunsu Cho
7ae5c972f9
[CI] Upgrade github workflows to use latest Conda setup action ( #10320 )
...
Co-authored-by: Christian Clauss <cclauss@me.com>
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>
2024-05-28 10:23:07 -07:00
Jiaming Yuan
5627af6b21
[coll] Increase timeout limit. ( #10332 )
2024-05-28 10:20:49 +08:00
Jiaming Yuan
966dc81788
[coll] Keep the tracker alive during initialization error. ( #10306 )
2024-05-23 11:13:59 +08:00
Jiaming Yuan
d5fcbee44b
Add timeout for distributed tests. ( #10315 )
2024-05-23 11:11:49 +08:00
Jiaming Yuan
a5a58102e5
Revamp the rabit implementation. ( #10112 )
...
This PR replaces the original RABIT implementation with a new one, which has already been partially merged into XGBoost. The new one features:
- Federated learning for both CPU and GPU.
- NCCL.
- More data types.
- A unified interface for all the underlying implementations.
- Improved timeout handling for both tracker and workers.
- Exhausted tests with metrics (fixed a couple of bugs along the way).
- A reusable tracker for Python and JVM packages.
2024-05-20 11:56:23 +08:00
Jiaming Yuan
835e59e538
Use a thread pool for external memory. ( #10288 )
2024-05-16 19:32:12 +08:00
Jiaming Yuan
5de57435c7
Be more lenient on floating point error for AUC. ( #10264 )
2024-05-11 08:48:11 +08:00
Jiaming Yuan
5e64276a9b
Update nvtx. ( #10227 )
2024-04-29 06:33:46 +08:00
Jiaming Yuan
3fbb221fec
[coll] Implement shutdown for tracker and comm. ( #10208 )
...
- Force shutdown the tracker.
- Implement shutdown notice for error handling thread in comm.
2024-04-20 04:08:17 +08:00
Jiaming Yuan
3f64b4fde3
[coll] Add global functions. ( #10203 )
2024-04-19 03:17:23 +08:00
Jiaming Yuan
4b10200456
[coll] Improve event loop. ( #10199 )
...
- Add a test for blocking calls.
- Do not require the queue to be empty after waking up; this frees up the thread to answer blocking calls.
- Handle EOF in read.
- Improve the error message in the result. Allow concatenation of multiple results.
2024-04-18 03:29:52 +08:00
Jiaming Yuan
1022909bbe
Fix global config for external memory. ( #10173 )
...
Pass the thread-local configuration between threads.
2024-04-11 01:29:28 +08:00
Jiaming Yuan
f0a138f33a
Fix pyspark with verbosity=3. ( #10172 )
2024-04-09 23:18:56 +08:00
Jiaming Yuan
8bad677c2f
Update collective implementation. ( #10152 )
...
* Update collective implementation.
- Cleanup resource during `Finalize` to avoid handling threads in destructor.
- Calculate the size for allgather automatically.
- Use simple allgather for small (smaller than the number of worker) allreduce.
2024-03-30 18:57:31 +08:00
Jiaming Yuan
230010d9a0
Cleanup set info. ( #10139 )
...
- Use the array interface internally.
- Deprecate `XGDMatrixSetDenseInfo`.
- Deprecate `XGDMatrixSetUIntInfo`.
- Move the handling of `DataType` into the deprecated C function.
---------
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2024-03-26 23:26:24 +08:00
Jiaming Yuan
ca4801f81d
Work with IPv6 in the new tracker. ( #10125 )
2024-03-20 05:19:23 +08:00
Jiaming Yuan
53fc17578f
Use std::uint64_t for row index. ( #10120 )
...
- Use std::uint64_t instead of size_t to avoid implementation-defined type.
- Rename to bst_idx_t, to account for other types of indexing.
- Small cleanup to the base header.
2024-03-15 18:43:49 +08:00
Jiaming Yuan
56b1868278
Fix compilation with the latest ctk. ( #10123 )
2024-03-15 08:04:41 +08:00
Jiaming Yuan
1450aebb74
Fix pairwise objective with NDCG metric along with custom gain. ( #10100 )
...
* Fix pairwise objective with NDCG metric.
- Allow setting `ndcg_exp_gain` for `rank:pairwise`.
This is useful when using pairwise for objective but ndcg for metric.
2024-03-11 14:54:10 +08:00
Jiaming Yuan
2c13f90384
Support graphviz plot for multi-target tree. ( #10093 )
2024-03-09 05:35:25 +08:00
Jiaming Yuan
e14c3b9325
Optional normalization for learning to rank. ( #10094 )
2024-03-08 12:41:21 +08:00
Jiaming Yuan
3941b31ade
Disable column sample by node for the exact tree method. ( #10083 )
2024-03-01 14:16:10 +08:00
Jiaming Yuan
8189126d51
Add CUDA iterator to tensor view. ( #10074 )
2024-03-01 14:15:31 +08:00
Jiaming Yuan
5ac233280e
Require context in aggregators. ( #10075 )
2024-02-28 03:12:42 +08:00
Jiaming Yuan
0ce4372bd4
Use UBJSON for serializing splits for vertical data split. ( #10059 )
2024-02-25 00:18:23 +08:00
Jiaming Yuan
2e4ea5ecc0
Support f64 for ubjson. ( #10055 )
2024-02-21 02:18:42 +08:00
Jiaming Yuan
d37b83e8d9
Fix UBJSON with boolean value. ( #10054 )
2024-02-20 22:13:51 +08:00
david-cortes
6e3c899ba7
[R] Don't cap global number of threads for serialization ( #10028 )
2024-02-20 11:13:00 +08:00
Louis Desreumaux
edf501d227
Implement contribution prediction with QuantileDMatrix ( #10043 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>
2024-02-19 21:03:29 +08:00
david-cortes
a730c7e67e
[R] allow using seed with regular RNG ( #10029 )
2024-02-04 16:22:22 +08:00
Philip Hyunsu Cho
4dfbe2a893
[CI] Test building for 32-bit arch ( #10021 )
...
* [CI] Test building for 32-bit arch
* Update CMakeLists.txt
* Fix yaml
* Use Debian container
* Remove -Werror for 32-bit
* Revert "Remove -Werror for 32-bit"
This reverts commit c652bc6a037361bcceaf56fb01863210b462793d.
* Don't error for overloaded-virtual warning
* Ignore some warnings from dmlc-core
* Fix compiler warnings
* Fix formatting
* Apply suggestions from code review
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>
* Add more cast
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>
2024-01-31 13:20:51 -08:00
Jiaming Yuan
a76d6c6131
Fix cpp deprecation. ( #10010 )
2024-01-26 02:13:40 +08:00
Philip Hyunsu Cho
c8f5d190c6
[CI] Stop Windows pipeline upon a failing pytest ( #10003 )
2024-01-24 22:54:21 -08:00
Jiaming Yuan
cacb4b1fdd
Fix gain calculation in multi-target tree. ( #9978 )
2024-01-17 13:18:44 +08:00
Jiaming Yuan
85d09245f6
Fix error handling in the event loop. ( #9990 )
2024-01-17 05:35:35 +08:00
Jiaming Yuan
2f57bbde3c
Additional tests for attributes and model booosted rounds. ( #9962 )
2024-01-09 09:54:39 +08:00
Jiaming Yuan
38dd91f491
Save model in ubj as the default. ( #9947 )
2024-01-05 17:53:36 +08:00