Jiaming Yuan
4ccf92ea34
[dask] Fix union of workers. ( #6375 )
2020-11-13 16:55:05 +08:00
Jiaming Yuan
fcfeb4959c
Deprecate positional arguments. ( #6365 )
...
Deprecate positional arguments in following functions:
- `__init__` for all classes in sklearn module.
- `fit` method for all classes in sklearn module.
- dask interface.
- `set_info` for `DMatrix` class.
Refactor the evaluation matrices handling.
2020-11-13 11:10:30 +08:00
Philip Hyunsu Cho
e5193c21a1
[dask] Allow empty data matrix in AFT survival ( #6379 )
...
* [dask] Allow empty data matrix in AFT survival
* Add unit test
2020-11-12 17:49:58 -08:00
Jiaming Yuan
c1a62b5fa2
Expect gpu external memory to fail. ( #6381 )
2020-11-12 19:24:48 +08:00
Jiaming Yuan
c90f968d92
Update Python documents. ( #6376 )
2020-11-12 17:51:32 +08:00
Jiaming Yuan
d711d648cb
Fix label errors in graph visualization ( #6369 )
2020-11-11 17:44:59 -08:00
Jiaming Yuan
6e12c2a6f8
[dask] Supoort running on GKE. ( #6343 )
...
* Avoid accessing `scheduler_info()['workers']`.
* Avoid calling `client.gather` inside task.
* Avoid using `client.scheduler_address`.
2020-11-11 18:04:34 +08:00
Jiaming Yuan
8a17610666
Implement GPU predict leaf. ( #6187 )
2020-11-11 17:33:47 +08:00
Philip Hyunsu Cho
7f101d1b33
[CI] Remove R check from Jenkins ( #6372 )
...
* Remove R check from Jenkins
* Print stacktrace when CRAN test fail in GitHub Actions
* Add verbose flag in tests/ci_build/print_r_stacktrace.sh
* Fix path in tests/ci_build/print_r_stacktrace.sh
2020-11-10 22:46:54 -08:00
Jiaming Yuan
43efadea2e
Deterministic data partitioning for external memory ( #6317 )
...
* Make external memory data partitioning deterministic.
* Change the meaning of `page_size` from bytes to number of rows.
* Design a data pool.
* Note for external memory.
* Enable unity build on Windows CI.
* Force garbage collect on test.
2020-11-11 06:11:06 +08:00
Jiaming Yuan
184e2eac7d
Add period to evaluation monitor. ( #6348 )
2020-11-10 07:47:48 +08:00
Jiaming Yuan
519cee115a
Avoid resetting seed for every configuration. ( #6349 )
2020-11-06 10:28:35 +08:00
Jiaming Yuan
2cc9662005
Support slicing tree model ( #6302 )
...
This PR is meant the end the confusion around best_ntree_limit and unify model slicing. We have multi-class and random forests, asking users to understand how to set ntree_limit is difficult and error prone.
* Implement the save_best option in early stopping.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2020-11-02 23:27:39 -08:00
Rory Mitchell
29745c6df2
Fix inclusive scan for large sizes ( #6234 )
2020-11-03 17:01:43 +13:00
Jiaming Yuan
7756192906
[dask] Fix prediction on DaskDMatrix with multiple meta data. ( #6333 )
...
* Unify the meta handling methods.
2020-11-02 19:18:44 -05:00
Jiaming Yuan
048acf81cd
Enable shap sparse test. ( #6332 )
2020-11-01 20:59:27 +08:00
Jiaming Yuan
6ff331b705
Fix Python callback. ( #6320 )
2020-10-30 05:03:44 +08:00
Sergio Gavilán
b181a88f9f
Reduced some C++ compiler warnings ( #6197 )
...
* Removed some warnings
* Rebase with master
* Solved C++ Google Tests errors made by refactoring in order to remove warnings
* Undo renaming path -> path_
* Fix style check
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-10-29 12:36:00 -07:00
Jiaming Yuan
c80657b542
Fix flaky data initialization test. ( #6318 )
2020-10-30 03:11:22 +08:00
Jiaming Yuan
dfac5f89e9
Group CLI demo into subdirectory. ( #6258 )
...
CLI is not most developed interface. Putting them into correct directory can help new users to avoid it as most of the use cases are from a language binding.
2020-10-28 14:40:44 -07:00
Philip Hyunsu Cho
143b278267
Mark flaky tests as XFAIL ( #6299 )
...
* Temporarily skip TestGPUUpdaters::test_categorical
* Temporarily skip test_boost_from_prediction[approx]
2020-10-28 11:50:57 -07:00
Jiaming Yuan
c4da967b5c
Support unity build. ( #6295 )
...
* Support unity build.
* Setup on Windows Jenkins.
* Revert "Setup on Windows Jenkins."
This reverts commit 8345cb8d2b009eec8ae9fa6f16412a7c9b6ec12c.
2020-10-28 11:49:28 -07:00
Philip Hyunsu Cho
f6169c0b16
[CI] Use separate Docker cache for each CUDA version ( #6305 )
2020-10-28 11:07:00 -07:00
Jiaming Yuan
3310e208fd
Fix inplace prediction interval. ( #6259 )
...
* Add back the interval in call.
* Make the interval non-optional.
2020-10-28 13:13:59 +08:00
James Lamb
e1de390e6e
[ci] replace 'egrep' with 'grep -E' ( #6287 )
2020-10-27 12:05:48 -07:00
Philip Hyunsu Cho
c8ec62103a
Deprecate LabelEncoder in XGBClassifier; Enable cuDF/cuPy inputs in XGBClassifier ( #6269 )
...
* Deprecate LabelEncoder in XGBClassifier; skip LabelEncoder for cuDF/cuPy inputs
* Add unit tests for cuDF and cuPy inputs with XGBClassifier
* Fix lint
* Clarify warning
* Move use_label_encoder option to XGBClassifier constructor
* Add a test for cudf.Series
* Add use_label_encoder to XGBRFClassifier doc
* Address reviewer feedback
2020-10-26 13:20:51 -07:00
Jiaming Yuan
bcfab4d726
Revert "Disable JSON full serialization for now. ( #6248 )" ( #6266 )
...
This reverts commit 6d293020fbfa2c67b532d550fe5d55689662caac.
2020-10-27 03:30:47 +08:00
Jiaming Yuan
2686d32a36
Skip dask tests on ARM. ( #6267 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-10-26 15:09:05 +08:00
Philip Hyunsu Cho
677f676172
Use UserWarning for old callback, as DeprecationWarning is not visible ( #6270 )
2020-10-22 01:10:52 -07:00
Philip Hyunsu Cho
1300467d36
Fix a typo in is_arm() in testing.py [skip ci] ( #6271 )
2020-10-22 13:07:14 +08:00
Jiaming Yuan
b5c2a47b20
Drop single point model recovery ( #6262 )
...
* Pass rabit params in JVM package.
* Implement timeout using poll timeout parameter.
* Remove OOB data check.
2020-10-21 15:27:03 +08:00
Jiaming Yuan
81c37c28d5
Time the CPU tests on Jenkins. ( #6257 )
...
* Time the CPU tests on Jenkins.
* Reduce thread contention.
* Add doc.
* Skip heavy tests on ARM.
2020-10-20 17:19:07 -07:00
Jiaming Yuan
ddf37cca30
Unify thread configuration. ( #6186 )
2020-10-19 16:05:42 +08:00
Philip Hyunsu Cho
7f6ed5780c
[CI] Build a Python wheel for aarch64 platform ( #6253 )
2020-10-18 22:35:19 -07:00
Jiaming Yuan
5037abeb86
Fix linear gpu input ( #6255 )
2020-10-19 12:02:36 +08:00
Philip Hyunsu Cho
65ea42bd42
[CI] Reduce testing load with RMM ( #6249 )
...
* [CI] Reduce testing load with RMM
* Address reviewer's comment
2020-10-18 19:16:46 -07:00
Jiaming Yuan
6d293020fb
Disable JSON full serialization for now. ( #6248 )
...
* Disable JSON serialization for now.
* Multi-class classification is checkpointing for each iteration.
This brings significant overhead.
Revert: 90355b4f007ae
* Set R tests to use binary.
2020-10-16 17:59:54 +08:00
Jiaming Yuan
3da5a69dc9
Fix typo in dask interface. ( #6240 )
2020-10-15 15:26:29 +08:00
Jiaming Yuan
bed7ae4083
Loop over thrust::reduce. ( #6229 )
...
* Check input chunk size of dqdm.
* Add doc for current limitation.
2020-10-14 10:40:56 +13:00
Rory Mitchell
734a911a26
Loop over copy_if ( #6201 )
...
* Loop over copy_if
* Catch OOM.
Co-authored-by: fis <jm.yuan@outlook.com>
2020-10-14 10:23:16 +13:00
Jiaming Yuan
b05073bda5
[dask] Test for data initializaton. ( #6226 )
2020-10-13 11:08:35 +08:00
Jiaming Yuan
2443275891
Cleanup Python code. ( #6223 )
...
* Remove pathlike as XGBoost 1.2 requires Python 3.6.
* Move conditional import of dask/distributed into dask module.
2020-10-12 15:44:41 +08:00
Jiaming Yuan
70c2039748
Catch all standard exceptions in C API. ( #6220 )
...
* `std::bad_alloc` is not guaranteed to be caught.
2020-10-12 14:01:46 +08:00
Jiaming Yuan
2241563f23
Handle duplicated values in sketching. ( #6178 )
...
* Accumulate weights in duplicated values.
* Fix device id in iterative dmatrix.
2020-10-10 19:32:44 +08:00
Jiaming Yuan
ab5b35134f
Rework Python callback functions. ( #6199 )
...
* Define a new callback interface for Python.
* Deprecate the old callbacks.
* Enable early stopping on dask.
2020-10-10 17:52:36 +08:00
Jiaming Yuan
b5b24354b8
More categorical tests and disable shap sparse test. ( #6219 )
...
* Fix tree load with 32 category.
2020-10-10 16:12:37 +08:00
Philip Hyunsu Cho
c991eb612d
[jvm-packages] Fix up build for xgboost4j-gpu, xgboost4j-spark-gpu ( #6216 )
...
* [CI] Clean up build for JVM packages
* Use correct path for saving native lib
* Fix groupId of maven-surefire-plugin
* Fix stashing of xgboost4j_jar_gpu
* [CI] Don't run xgboost4j-tester with GPU, since it doesn't use gpu_hist
2020-10-09 14:08:15 -07:00
Jiaming Yuan
70ce5216b5
Add high level tests for categorical data. ( #6179 )
...
* Fix unique.
2020-10-09 09:27:23 +08:00
odidev
a2fea33103
Added arm64 job in Travis-CI ( #6200 )
...
Signed-off-by: odidev <odidev@puresoftware.com>
2020-10-07 15:02:09 +08:00
Philip Hyunsu Cho
f121f2738f
[CI] Fix Docker build for CUDA 11 ( #6202 )
2020-10-05 17:54:14 -07:00