Jiaming Yuan
e1a2c1bbb3
[EM] Merge GPU partitioning with histogram building. (#10766)
- Stop concatenating pages if there's no subsampling.
- Use a single iteration for histogram build and partitioning.
2024-08-31 03:25:37 +08:00
Jiaming Yuan
4fe67f10b4
[EM] Have one partitioner for each batch. (#10760)
- Initialize one partitioner for each batch.
- Collect partition size during initialization.
- Support base ridx in the finalization.
2024-08-29 01:35:17 +08:00
Jiaming Yuan
d6ebcfb032
[EM] Support CPU quantile objective for external memory. (#10751)
2024-08-27 04:16:57 +08:00
Jiaming Yuan
e8a962575a
[EM] Allow staging ellpack on host for GPU external memory. (#10488)
- New parameter `on_host`.
- Abstract format creation and stream creation into policy classes.
2024-06-28 04:42:18 +08:00
Jiaming Yuan
6a892ce281
Specify src path for isort. (#8867)
2023-03-06 17:30:27 +08:00
Jiaming Yuan
cf70864fa3
Move Python testing utilities into xgboost module. (#8379)
- Add typehints.
- Fixes for pylint.
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>
2022-10-26 16:56:11 +08:00
Rory Mitchell
ce0382dcb0
[CI] Refactor tests to reduce CI time. (#8312)
2022-10-12 11:32:06 +02:00
Jiaming Yuan
2e63af6117
Mitigate flaky data iter test. (#8244)
- Reduce the number of batches.
- Verify labels.
2022-09-14 17:54:14 +08:00
Jiaming Yuan
8b3ecfca25
Mitigate flaky tests. (#7749)
* Skip non-increasing test with external memory when subsample is used.
* Increase bin numbers for boost from prediction test. This mitigates the effect of non-deterministic partitioning.
2022-03-28 21:20:50 +08:00
Ginko Balboa
29bfa94bb6
Fix external memory with gpu_hist and subsampling combination bug. (#7481)
Instead of accessing data from the `original_page_`, access the data from the first page of the available batch.
fix #7476
Co-authored-by: jiamingy <jm.yuan@outlook.com>
2021-12-24 11:15:35 +08:00
Jiaming Yuan
e6088366df
Export Python Interface for external memory. (#7070)
* Add Python iterator interface.
* Add tests.
* Add demo.
* Add documents.
* Handle empty dataset.
2021-07-22 15:15:53 +08:00