Hui Liu
11afdc957e
fix memory type
2024-01-26 15:43:24 -08:00
Hui Liu
56d2821bb9
Merge tag 'v2.0.2' into release_2.0.2-rocm
2024-01-25 07:42:34 -08:00
Hui Liu
dc7ee041cc
use __HIPCC__ for device code
2024-01-24 12:32:51 -08:00
Hui Liu
7dc152450e
workaround memoryType and change rccl config
2024-01-11 13:13:14 -08:00
Philip Hyunsu Cho
41ce8f28b2
[jvm-packages] Add Scala version suffix to xgboost-jvm package ( #9776 )
...
* Update JVM script (#9714 )
* Bump version to 2.0.2; revamp pom.xml
* Update instructions in prepare_jvm_release.py
* Fix formatting
2023-11-08 10:17:26 -08:00
Jiaming Yuan
0ffc52e05c
[backport] Fix using categorical data with the ranker. ( #9753 ) ( #9778 )
2023-11-09 01:20:52 +08:00
Hui Liu
82d81bca94
rm hip.h files
2023-10-30 21:54:00 -07:00
Hui Liu
6ec5cf26fc
enable 3 more tests
2023-10-30 15:27:02 -07:00
Hui Liu
1ec57fd1a3
enable ROCm support, rm un-necessary code
2023-10-30 12:39:30 -07:00
Hui Liu
d0774a78e4
add hip to config
2023-10-30 12:01:24 -07:00
Hui Liu
8d160a206e
add jvm rocm support
2023-10-30 11:49:47 -07:00
Hui Liu
a41bc0975c
rocm enable for v2.0.1, rm setup.py
2023-10-27 18:53:16 -07:00
Hui Liu
782b73f2bb
rocm enable for v2.0.1
2023-10-27 18:50:28 -07:00
Philip Hyunsu Cho
a408254c2f
Use sys.base_prefix instead of sys.prefix ( #9711 )
...
* Use sys.base_prefix instead of sys.prefix
* Update libpath.py too
2023-10-23 23:31:40 -07:00
Philip Hyunsu Cho
22e891dafa
[jvm-packages] Remove hard dependency on libjvm ( #9698 ) ( #9705 )
2023-10-23 21:21:14 -07:00
Philip Hyunsu Cho
89530c80a7
[CI] Build libxgboost4j.dylib for Intel Mac ( #9704 )
2023-10-23 20:45:01 -07:00
Philip Hyunsu Cho
946ab53b57
Fix libpath logic for Windows ( #9687 )
2023-10-19 10:42:46 -07:00
Philip Hyunsu Cho
afd03a6934
Fix build for AppleClang 11 ( #9684 )
2023-10-18 09:35:59 -07:00
Jiaming Yuan
f7da938458
[backport][pyspark] Support stage-level scheduling ( #9519 ) ( #9686 )
...
Co-authored-by: Bobby Wang <wbo4958@gmail.com >
2023-10-18 14:05:08 +08:00
Philip Hyunsu Cho
6ab6577511
Fix build for GCC 8.x ( #9670 )
2023-10-12 23:36:41 -07:00
Philip Hyunsu Cho
8c57558d74
[backport] [CI] Pull CentOS 7 images from NGC ( #9666 ) ( #9668 )
2023-10-13 14:09:54 +08:00
Jiaming Yuan
58aa98a796
Bump version to 2.0.1. ( #9660 )
2023-10-13 08:47:32 +08:00
Jiaming Yuan
92273b39d8
[backport] Add support for cgroupv2. ( #9651 ) ( #9656 )
2023-10-12 11:39:27 +08:00
Jiaming Yuan
e824b18bf6
[backport] Support pandas 2.1.0. ( #9557 ) ( #9655 )
2023-10-12 11:29:59 +08:00
Jiaming Yuan
66ee89d8b4
[backport] Workaround Apple clang issue. ( #9615 ) ( #9636 )
2023-10-08 15:42:15 +08:00
Jiaming Yuan
54d1d72d01
[backport] Use array interface for testing numpy arrays. ( #9602 ) ( #9635 )
2023-10-08 11:45:49 +08:00
Jiaming Yuan
032bcc57f9
[backport][R] Fix method name. ( #9577 ) ( #9592 )
2023-09-19 02:08:46 +08:00
Jiaming Yuan
ace7713201
[backport] Fix default metric configuration. ( #9575 ) ( #9590 )
2023-09-18 23:40:43 +08:00
Jiaming Yuan
096047c547
Make 2.0 release. ( #9567 )
2023-09-12 00:20:49 +08:00
Jiaming Yuan
e75dd75bb2
[backport] [pyspark] support gpu transform ( #9542 ) ( #9559 )
...
---------
Co-authored-by: Bobby Wang <wbo4958@gmail.com >
2023-09-07 17:21:09 +08:00
Jiaming Yuan
4d387cbfbf
[backport] [pyspark] rework transform to reuse same code ( #9292 ) ( #9558 )
...
Co-authored-by: Bobby Wang <wbo4958@gmail.com >
2023-09-07 15:26:24 +08:00
Jiaming Yuan
3fde9361d7
[backport] Fix inplace predict with fallback when base margin is used. ( #9536 ) ( #9548 )
...
- Copy meta info from proxy DMatrix.
- Use `std::call_once` to emit less warnings.
2023-09-05 23:38:06 +08:00
Jiaming Yuan
b67c2ed96d
[backport] [CI] bump setup-r action version. ( #9544 ) ( #9551 )
2023-09-05 22:10:30 +08:00
Jiaming Yuan
177fd79864
[backport] Fix read the doc configuration. [skip ci] ( #9549 )
2023-09-05 17:32:00 +08:00
Jiaming Yuan
06487d3896
[backport] Fix GPU categorical split memory allocation. ( #9529 ) ( #9535 )
2023-08-29 21:14:43 +08:00
Jiaming Yuan
e50ccc4d3c
[R] Fix integer inputs with NA. ( #9522 ) ( #9534 )
2023-08-29 19:52:13 +08:00
Jiaming Yuan
add57f8880
[backport] Delay the check for vector leaf. ( #9509 ) ( #9533 )
2023-08-29 18:25:59 +08:00
Jiaming Yuan
a0d3573c74
[backport] Fix device dispatch for linear updater. ( #9507 ) ( #9532 )
2023-08-29 15:10:43 +08:00
Jiaming Yuan
4301558a57
Make 2.0.0 RC1. ( #9492 )
2023-08-17 16:16:51 +08:00
Bobby Wang
68be454cfa
[pyspark] hotfix for GPU setup validation ( #9495 )
...
* [pyspark] fix a bug of validating gpu configuration
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-08-17 16:01:39 +08:00
Jiaming Yuan
5188e27513
Fix version parsing with rc release. ( #9493 )
2023-08-16 22:44:58 +08:00
Jiaming Yuan
f380c10a93
Use hint for find nccl. ( #9490 )
2023-08-16 16:08:41 +08:00
Sean Yang
12fe2fc06c
Fix federated learning demos and tests ( #9488 )
2023-08-16 15:25:05 +08:00
Jiaming Yuan
b2e93d2742
[doc] Quick note for the device parameter. [skip ci] ( #9483 )
2023-08-16 13:35:55 +08:00
Jiaming Yuan
c061e3ae50
[jvm-packages] Bump rapids version. ( #9482 )
2023-08-15 16:26:42 -07:00
James Lamb
b82e78c169
[R] remove commented-out code ( #9481 )
2023-08-15 13:44:08 +08:00
Boris
8463107013
Updated versions. Reorganised dependencies. ( #9479 )
2023-08-14 14:28:28 -07:00
Jiaming Yuan
19b59938b7
Convert input to str for hypothesis note. ( #9480 )
2023-08-15 02:27:58 +08:00
James Lamb
e3f624d8e7
[R] remove more uses of default values in internal functions ( #9476 )
2023-08-14 22:18:33 +08:00
James Lamb
2c84daeca7
[R] [doc] remove documentation index entries for internal functions ( #9477 )
2023-08-14 22:18:02 +08:00
Bobby Wang
344f90b67b
[jvm-packages] throw exception when tree_method=approx and device=cuda ( #9478 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-08-14 17:52:14 +08:00
Jiaming Yuan
05d7000096
Handle special characters in JSON model dump. ( #9474 )
2023-08-14 15:49:00 +08:00
github-actions[bot]
f03463c45b
[CI] Update RAPIDS to latest stable ( #9464 )
...
* [CI] Update RAPIDS to latest stable
* [CI] Use CMake 3.26.4
---------
Co-authored-by: hcho3 <hcho3@users.noreply.github.com >
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2023-08-13 18:54:37 -07:00
Jiaming Yuan
fd4335d0bf
[doc] Document the current status of some features. ( #9469 )
2023-08-13 23:42:27 +08:00
Jiaming Yuan
801116c307
Test scikit-learn model IO with gblinear. ( #9459 )
2023-08-13 23:41:49 +08:00
Jiaming Yuan
bb56183396
Normalize file system path. ( #9463 )
2023-08-11 21:26:46 +08:00
Jiaming Yuan
bdc1a3c178
Fix pyspark parameter. ( #9460 )
...
- Don't pass the `use_gpu` parameter to the learner.
- Fix GPU approx with PySpark.
2023-08-11 19:07:50 +08:00
James Lamb
428f6cbbe2
[R] remove default values in internal booster manipulation functions ( #9461 )
2023-08-11 15:07:18 +08:00
ShaneConneely
d638535581
Update README.md ( #9462 )
2023-08-11 04:02:04 +08:00
James Lamb
44bd2981b2
[R] remove default values in internal utility functions ( #9457 )
2023-08-10 21:40:59 +08:00
James Lamb
9dbb71490c
[Doc] fix typos in documentation ( #9458 )
2023-08-10 19:26:36 +08:00
James Lamb
4359356d46
[R] [CI] use lintr 3.1.0 ( #9456 )
2023-08-10 17:49:16 +08:00
Jiaming Yuan
1caa93221a
Use realloc for histogram cache and expose the cache limit. ( #9455 )
2023-08-10 14:05:27 +08:00
Jiaming Yuan
a57371ef7c
Fix links in R doc. ( #9450 )
2023-08-10 02:38:14 +08:00
Jiaming Yuan
f05a23b41c
Use weakref instead of id for DataIter cache. ( #9445 )
...
- Fix case where Python reuses id from freed objects.
- Small optimization to column matrix with QDM by using `realloc` instead of copying data.
2023-08-10 00:40:06 +08:00
Bobby Wang
d495a180d8
[pyspark] add logs for training ( #9449 )
2023-08-09 18:32:23 +08:00
joshbrowning2358
7f854848d3
Update R docs based on deprecated parameters/behaviour ( #9437 )
2023-08-09 17:04:28 +08:00
Jiaming Yuan
f05294a6f2
Fix clang warnings. ( #9447 )
...
- static function in header. (which is marked as unused due to translation unit
visibility).
- Implicit copy operator is deprecated.
- Unused lambda capture.
- Moving a temporary variable prevents copy elision.
2023-08-09 15:34:45 +08:00
Philip Hyunsu Cho
819098a48f
[R] Handle UTF-8 paths on Windows ( #9448 )
2023-08-08 21:29:19 -07:00
Jiaming Yuan
c1b2cff874
[CI] Check compiler warnings. ( #9444 )
2023-08-08 12:02:45 -07:00
Philip Hyunsu Cho
7ce090e775
Handle UTF-8 paths correctly on Windows platform ( #9443 )
...
* Fix round-trip serialization with UTF-8 paths
* Add compiler version check
* Add comment to C API functions
* Add Python tests
* [CI] Updatre MacOS deployment target
* Use std::filesystem instead of dmlc::TemporaryDirectory
2023-08-07 23:27:25 -07:00
Jiaming Yuan
97fd5207dd
Use lambda function in ParallelFor2D. ( #9441 )
2023-08-08 14:04:46 +08:00
Jiaming Yuan
54029a59af
Bound the size of the histogram cache. ( #9440 )
...
- A new histogram collection with a limit in size.
- Unify histogram building logic between hist, multi-hist, and approx.
2023-08-08 03:21:26 +08:00
Philip Hyunsu Cho
5bd163aa25
Explicitly specify libcudart_static in CMake config ( #9436 )
2023-08-05 14:15:44 -07:00
Philip Hyunsu Cho
7fc57f3974
Remove Koffie Labs from Sponsors list ( #9434 )
2023-08-04 06:52:27 -07:00
Rong Ou
bde1ebc209
Switch back to the GPUIDX macro ( #9438 )
2023-08-04 15:14:31 +08:00
Philip Hyunsu Cho
1aabc690ec
[Doc] Clarify the output behavior of reg:logistic ( #9435 )
2023-08-03 20:42:07 -07:00
jinmfeng001
04c99683c3
Change training stage from ResultStage to ShuffleMapStage ( #9423 )
2023-08-03 23:40:04 +08:00
Jiaming Yuan
1332ff787f
Unify the code path between local and distributed training. ( #9433 )
...
This removes the need for a local histogram space during distributed training, which cuts the cache size by half.
2023-08-03 21:46:36 +08:00
Hendrik Makait
f958e32683
Raise if expected workers are not alive in xgboost.dask.train ( #9421 )
2023-08-03 20:14:07 +08:00
Jiaming Yuan
7129988847
Accept only keyword arguments in data iterator. ( #9431 )
2023-08-03 12:44:16 +08:00
Jiaming Yuan
e93a274823
Small cleanup for histogram routines. ( #9427 )
...
* Small cleanup for histogram routines.
- Extract hist train param from GPU hist.
- Make histogram const after construction.
- Unify parameter names.
2023-08-02 18:28:26 +08:00
Rong Ou
c2b85ab68a
Clean up MGPU C++ tests ( #9430 )
2023-08-02 14:31:18 +08:00
Jiaming Yuan
a9da2e244a
[CI] Update github actions. ( #9428 )
2023-08-01 23:03:53 +08:00
Jiaming Yuan
912e341d57
Initial GPU support for the approx tree method. ( #9414 )
2023-07-31 15:50:28 +08:00
Bobby Wang
8f0efb4ab3
[jvm-packages] automatically set the max/min direction for best score ( #9404 )
2023-07-27 11:09:55 +08:00
Rong Ou
7579905e18
Retry switching to per-thread default stream ( #9416 )
2023-07-26 07:09:12 +08:00
Nicholas Hilton
54579da4d7
[doc] Fix typo in prediction.rst ( #9415 )
...
Typo for `pred_contribs` and `pred_interactions`
2023-07-26 07:03:04 +08:00
Jiaming Yuan
3a9996173e
Revert "Switch to per-thread default stream ( #9396 )" ( #9413 )
...
This reverts commit f7f673b00c .
2023-07-24 12:03:28 -07:00
Bobby Wang
1b657a5513
[jvm-packages] set device to cuda when tree method is "gpu_hist" ( #9412 )
2023-07-24 18:32:25 +08:00
Jiaming Yuan
a196443a07
Implement sketching with Hessian on GPU. ( #9399 )
...
- Prepare for implementing approx on GPU.
- Unify the code path between weighted and uniform sketching on DMatrix.
2023-07-24 15:43:03 +08:00
Jiaming Yuan
851cba931e
Define best_iteration only if early stopping is used. ( #9403 )
...
* Define `best_iteration` only if early stopping is used.
This is the behavior specified by the document but not honored in the actual code.
- Don't set the attributes if there's no early stopping.
- Clean up the code for callbacks, and replace assertions with proper exceptions.
- Assign the attributes when early stopping `save_best` is used.
- Turn the attributes into Python properties.
---------
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2023-07-24 12:43:35 +08:00
Jiaming Yuan
01e00efc53
[breaking] Remove support for single string feature info. ( #9401 )
...
- Input must be a sequence of strings.
- Improve validation error message.
2023-07-24 11:06:30 +08:00
Jiaming Yuan
275da176ba
Document for device ordinal. ( #9398 )
...
- Rewrite GPU demos. notebook is converted to script to avoid committing additional png plots.
- Add GPU demos into the sphinx gallery.
- Add RMM demos into the sphinx gallery.
- Test for firing threads with different device ordinals.
2023-07-22 15:26:29 +08:00
Jiaming Yuan
22b0a55a04
Remove hist builder class. ( #9400 )
...
* Remove hist build class.
* Cleanup this stateless class.
* Add comment to thread block.
2023-07-22 10:43:12 +08:00
Jiaming Yuan
0de7c47495
Fix metric serialization. ( #9405 )
2023-07-22 08:39:21 +08:00
Jiaming Yuan
dbd5309b55
Fix warning message for device. ( #9402 )
2023-07-20 23:30:04 +08:00
Rong Ou
f7f673b00c
Switch to per-thread default stream ( #9396 )
2023-07-20 08:21:00 +08:00
Jiaming Yuan
7a0ccfbb49
Add compute 90. ( #9397 )
2023-07-19 13:42:38 +08:00
Jiaming Yuan
0897477af0
Remove unmaintained jvm readme and dev scripts. ( #9395 )
2023-07-18 18:23:43 +08:00
Philip Hyunsu Cho
e082718c66
[CI] Build pip wheel with RMM support ( #9383 )
2023-07-18 01:52:26 -07:00
Jiaming Yuan
6e18d3a290
[pyspark] Handle the device parameter in pyspark. ( #9390 )
...
- Handle the new `device` parameter in PySpark.
- Deprecate the old `use_gpu` parameter.
2023-07-18 08:47:03 +08:00
Philip Hyunsu Cho
2a0ff209ff
[CI] Block CI from running for dependabot PRs ( #9394 )
2023-07-17 10:53:57 -07:00
Jiaming Yuan
f4fb2be101
[jvm-packages] Add the new device parameter. ( #9385 )
2023-07-17 18:40:39 +08:00
Jiaming Yuan
2caceb157d
[jvm-packages] Reduce log verbosity for GPU tests. ( #9389 )
2023-07-17 13:25:46 +08:00
Jiaming Yuan
b342ef951b
Make feature validation immutable. ( #9388 )
2023-07-16 06:52:55 +08:00
Jiaming Yuan
0a07900b9f
Fix integer overflow. ( #9380 )
2023-07-15 21:11:02 +08:00
Jiaming Yuan
16eb41936d
Handle the new device parameter in dask and demos. ( #9386 )
...
* Handle the new `device` parameter in dask and demos.
- Check no ordinal is specified in the dask interface.
- Update demos.
- Update dask doc.
- Update the condition for QDM.
2023-07-15 19:11:20 +08:00
Jiaming Yuan
9da5050643
Turn warning messages into Python warnings. ( #9387 )
2023-07-15 07:46:43 +08:00
Jiaming Yuan
04aff3af8e
Define the new device parameter. ( #9362 )
2023-07-13 19:30:25 +08:00
Cássia Sampaio
2d0cd2817e
[doc] Fux learning_to_rank.rst ( #9381 )
...
just adding one missing bracket
2023-07-13 11:00:24 +08:00
jinmfeng001
a1367ea1f8
Set feature_names and feature_types in jvm-packages ( #9364 )
...
* 1. Add parameters to set feature names and feature types
2. Save feature names and feature types to native json model
* Change serialization and deserialization format to ubj.
2023-07-12 15:18:46 +08:00
Rong Ou
3632242e0b
Support column split with GPU quantile ( #9370 )
2023-07-11 12:15:56 +08:00
Jiaming Yuan
97ed944209
Unify the hist tree method for different devices. ( #9363 )
2023-07-11 10:04:39 +08:00
Jiaming Yuan
20c52f07d2
Support exporting cut values ( #9356 )
2023-07-08 15:32:41 +08:00
edumugi
c3124813e8
Support numpy vertical split ( #9365 )
2023-07-08 13:18:12 +08:00
Jiaming Yuan
59787b23af
Allow empty page in external memory. ( #9361 )
2023-07-08 09:24:35 +08:00
Rong Ou
15ca12a77e
Fix NCCL test hang ( #9367 )
2023-07-07 11:21:35 +08:00
Jiaming Yuan
41c6813496
Preserve order of saved updaters config. ( #9355 )
...
- Save the updater sequence as an array instead of object.
- Warn only once.
The compatibility is kept, but we should be able to break it as the config is not loaded
in pickle model and it's declared to be not stable.
2023-07-05 20:20:07 +08:00
Jiaming Yuan
b572a39919
[doc] Fix removed reference. ( #9358 )
2023-07-05 16:49:25 +08:00
Jiaming Yuan
645037e376
Improve test coverage with predictor configuration. ( #9354 )
...
* Improve test coverage with predictor configuration.
- Test with ext memory.
- Test with QDM.
- Test with dart.
2023-07-05 15:17:22 +08:00
Oliver Holworthy
6c9c8a9001
Enable Installation of Python Package with System lib in a Virtual Environment ( #9349 )
2023-07-05 05:46:17 +08:00
Boris
bb2de1fd5d
xgboost4j-gpu_2.12-2.0.0: added libxgboost4j.so back. ( #9351 )
2023-07-04 03:31:33 +08:00
Jiaming Yuan
d0916849a6
Remove unused weight from buffer for cat features. ( #9341 )
2023-07-04 01:07:09 +08:00
Jiaming Yuan
6155394a06
Update news for 1.7.6 [skip ci] ( #9350 )
2023-07-04 01:04:34 +08:00
Jiaming Yuan
e964654b8f
[skl] Enable cat feature without specifying tree method. ( #9353 )
2023-07-03 22:06:17 +08:00
Jiaming Yuan
39390cc2ee
[breaking] Remove the predictor param, allow fallback to prediction using DMatrix. ( #9129 )
...
- A `DeviceOrd` struct is implemented to indicate the device. It will eventually replace the `gpu_id` parameter.
- The `predictor` parameter is removed.
- Fallback to `DMatrix` when `inplace_predict` is not available.
- The heuristic for choosing a predictor is only used during training.
2023-07-03 19:23:54 +08:00
Rong Ou
3a0f787703
Support column split in GPU predictor ( #9343 )
2023-07-03 04:05:34 +08:00
Rong Ou
f90771eec6
Fix device communicator dependency ( #9346 )
2023-06-29 10:34:30 +08:00
Jiaming Yuan
f4798718c7
Use hist as the default tree method. ( #9320 )
2023-06-27 23:04:24 +08:00
Jiaming Yuan
bc267dd729
Use ptr from mmap for GHistIndexMatrix and ColumnMatrix. ( #9315 )
...
* Use ptr from mmap for `GHistIndexMatrix` and `ColumnMatrix`.
- Define a resource for holding various types of memory pointers.
- Define ref vector for holding resources.
- Swap the underlying resources for GHist and ColumnM.
- Add documentation for current status.
- s390x support is removed. It should work if you can compile XGBoost, all the old workaround code does is to get GCC to compile.
2023-06-27 19:05:46 +08:00
jasjung
96c3071a8a
[doc] Update learning_to_rank.rst ( #9336 )
2023-06-27 13:56:18 +08:00
Jiaming Yuan
cfa9c42eb4
Fix callback in AFT viz demo. ( #9333 )
...
* Fix callback in AFT viz demo.
- Update the callback function.
- Add lint check.
2023-06-26 22:35:02 +08:00
Jiaming Yuan
6efe7c129f
[doc] Update reference in R vignettes. ( #9323 )
2023-06-26 18:32:11 +08:00
amdsc21
2e7e9d3b2d
update rocgputreeshap branch
2023-06-23 19:50:08 +02:00
amdsc21
3e0c7d1dee
new url for rocgputreeshap
2023-06-23 19:46:45 +02:00
amdsc21
2f47a1ebe6
rm warp-primitives
2023-06-22 21:43:00 +02:00
Jiaming Yuan
54da4b3185
Cleanup to prepare for using mmap pointer in external memory. ( #9317 )
...
- Update SparseDMatrix comment.
- Use a pointer in the bitfield. We will replace the `std::vector<bool>` in `ColumnMatrix` with bitfield.
- Clean up the page source. The timer is removed as it's inaccurate once we swap the mmap pointer into the page.
2023-06-22 06:43:11 +08:00
Jiaming Yuan
4066d68261
[doc] Clarify early stopping. ( #9304 )
2023-06-20 17:56:47 +08:00
Jiaming Yuan
6d22ea793c
Test QDM with sparse data on CPU. ( #9316 )
2023-06-19 21:27:03 +08:00
Jiaming Yuan
ee6809e642
Use mmap for external memory. ( #9282 )
...
- Have basic infrastructure for mmap.
- Release file write handle.
2023-06-19 18:52:55 +08:00
Rong Ou
d8beb517ed
Support bitwise allreduce in NCCL communicator ( #9300 )
2023-06-17 01:56:50 +08:00
George Othon
2718ff530c
[doc] Variable 'label' is not defined in the pyspark application example ( #9302 )
2023-06-16 05:06:52 +08:00
amdsc21
5ca7daaa13
merge latest changes
2023-06-15 21:39:14 +02:00
Jacek Laskowski
0df1272695
[docs] How to build the docs using conda ( #9276 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-06-15 07:39:26 +08:00
Rong Ou
e70810be8a
Refactor device communicator to make allreduce more flexible ( #9295 )
2023-06-14 03:53:03 +08:00
Philip Hyunsu Cho
c2f0486d37
[CI] Run two pipeline loaders for responsiveness ( #9294 )
2023-06-12 09:52:40 -07:00
Jake Blitch
aad1313154
Fix community.rst typos. ( #9291 )
2023-06-11 09:09:27 +08:00
ZHAOKAI WANG
2b76061659
remove redundant method in expand_entry ( #9283 )
2023-06-10 05:18:21 +08:00
amdsc21
5f78360949
merge changes Jun092023
2023-06-09 22:41:33 +02:00
Jiaming Yuan
152e2fb072
Unify test helpers for creating ctx. ( #9274 )
2023-06-10 03:35:22 +08:00
Jiaming Yuan
ea0deeca68
Disable dense optimization in hist for distributed training. ( #9272 )
2023-06-10 02:31:34 +08:00
github-actions[bot]
8c1065f645
[CI] Update RAPIDS to latest stable ( #9278 )
...
Co-authored-by: hcho3 <hcho3@users.noreply.github.com >
2023-06-09 09:55:08 -07:00
Jiaming Yuan
1fcc26a6f8
Set ndcg to default for LTR. ( #8822 )
...
- Add document.
- Add tests.
- Use `ndcg` with `topk` as default.
2023-06-09 23:31:33 +08:00
Philip Hyunsu Cho
e4dd6051a0
Use good commit message when updating Rapids
2023-06-08 19:30:25 -07:00
Philip Hyunsu Cho
2ec2ecf013
Allow admin to manually trigger update_rapids workflow
2023-06-08 19:21:36 -07:00
Philip Hyunsu Cho
181dee13e9
Update update_rapids.yml
2023-06-08 19:11:49 -07:00
Rong Ou
ff122d61ff
More tests for cpu predictor with column split ( #9270 )
2023-06-08 22:47:19 +08:00
ZHAOKAI WANG
84d3fcb7ea
Fix cpu_predictor categorical feature disaptch ( #9256 )
2023-06-08 01:24:04 +08:00
dependabot[bot]
e229692572
Bump maven-surefire-plugin from 3.1.0 to 3.1.2 in /jvm-packages ( #9265 )
...
Bumps [maven-surefire-plugin](https://github.com/apache/maven-surefire ) from 3.1.0 to 3.1.2.
- [Release notes](https://github.com/apache/maven-surefire/releases )
- [Commits](https://github.com/apache/maven-surefire/compare/surefire-3.1.0...surefire-3.1.2 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-surefire-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-06-07 20:53:20 +08:00
dependabot[bot]
4a5802ed2c
Bump maven-project-info-reports-plugin in /jvm-packages ( #9268 )
...
Bumps [maven-project-info-reports-plugin](https://github.com/apache/maven-project-info-reports-plugin ) from 3.4.4 to 3.4.5.
- [Commits](https://github.com/apache/maven-project-info-reports-plugin/compare/maven-project-info-reports-plugin-3.4.4...maven-project-info-reports-plugin-3.4.5 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-project-info-reports-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-06-07 19:07:36 +08:00
amdsc21
35cde3b1b2
remove some hip.h
2023-06-07 04:48:09 +02:00
amdsc21
ce345c30a8
remove some hip.h
2023-06-07 03:39:01 +02:00
amdsc21
af8845405a
sync Jun 5
2023-06-07 02:43:21 +02:00
Jiaming Yuan
0cba2cdbb0
Support linalg data structures in check device. ( #9243 )
2023-06-06 09:47:24 +08:00
Jiaming Yuan
fc8110ef79
Remove document and demo in RABIT. ( #9246 )
2023-06-06 08:20:10 +08:00
Boris
7f9cb921f4
Rearranged maven profiles so that scala-2.13 artifacts are published without gpu-related libraries ( #9253 )
2023-06-05 13:52:10 -07:00
dependabot[bot]
a474a66573
Bump maven-release-plugin from 3.0.0 to 3.0.1 in /jvm-packages ( #9252 )
...
Bumps [maven-release-plugin](https://github.com/apache/maven-release ) from 3.0.0 to 3.0.1.
- [Release notes](https://github.com/apache/maven-release/releases )
- [Commits](https://github.com/apache/maven-release/compare/maven-release-3.0.0...maven-release-3.0.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-release-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-06-05 21:29:59 +08:00
Rong Ou
962a20693f
More support for column split in cpu predictor ( #9244 )
...
- Added column split support to `PredictInstance` and `PredictLeaf`.
- Refactoring of tests.
2023-06-05 08:05:38 +08:00
Philip Hyunsu Cho
3bf0f145bb
Update update_rapids.yml
2023-06-03 13:12:12 -07:00
Philip Hyunsu Cho
a1fad72ab3
Update outdated build badges ( #9232 )
2023-06-02 08:22:25 -07:00
Philip Hyunsu Cho
288539ac78
[CI] Automatically bump Rapids version in containers ( #9234 )
...
* [CI] Use RAPIDS 23.04
* [CI] Remove outdated filters in dependabot
* [CI] Automatically bump Rapids version in containers
* Automate pull request
2023-06-02 08:17:41 -07:00
Jiaming Yuan
9fbde21e9d
Rework the precision metric. ( #9222 )
...
- Rework the precision metric for both CPU and GPU.
- Mention it in the document.
- Cleanup old support code for GPU ranking metric.
- Deterministic GPU implementation.
* Drop support for classification.
* type.
* use batch shape.
* lint.
* cpu build.
* cpu build.
* lint.
* Tests.
* Fix.
* Cleanup error message.
2023-06-02 20:49:43 +08:00
amdsc21
9ee1852d4e
restore device helper
2023-06-02 02:55:13 +02:00
Your Name
6ecd7903f2
Merge branch 'master' into sync-condition-2023Jun01
2023-06-01 15:58:31 -07:00
Your Name
42867a4805
sync Jun 1
2023-06-01 15:55:06 -07:00
Philip Hyunsu Cho
db8288121d
Revert "Publishing scala-2.13 artifacts to the maven S3 repo. ( #9224 )" ( #9233 )
...
This reverts commit bb2a17b90c .
2023-06-01 14:39:39 -07:00
Boris
bb2a17b90c
Publishing scala-2.13 artifacts to the maven S3 repo. ( #9224 )
2023-06-01 10:45:18 -07:00
dependabot[bot]
e93b805a75
Bump scala.version from 2.12.17 to 2.12.18 in /jvm-packages ( #9230 )
...
Bumps `scala.version` from 2.12.17 to 2.12.18.
Updates `scala-compiler` from 2.12.17 to 2.12.18
- [Release notes](https://github.com/scala/scala/releases )
- [Commits](https://github.com/scala/scala/compare/v2.12.17...v2.12.18 )
Updates `scala-library` from 2.12.17 to 2.12.18
- [Release notes](https://github.com/scala/scala/releases )
- [Commits](https://github.com/scala/scala/compare/v2.12.17...v2.12.18 )
---
updated-dependencies:
- dependency-name: org.scala-lang:scala-compiler
dependency-type: direct:production
update-type: version-update:semver-patch
- dependency-name: org.scala-lang:scala-library
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-06-01 10:44:43 -07:00
ZHAOKAI WANG
fa2ab1f021
TreeRefresher note word spelling modification ( #9223 )
2023-05-31 20:27:27 +08:00
Jiaming Yuan
aba4559c4f
[doc] Update dask demo. ( #9201 )
2023-05-31 05:01:02 +08:00
Jiaming Yuan
7f20eaed93
[doc] Troubleshoot nccl shared memory. [skip ci] ( #9206 )
2023-05-31 05:00:02 +08:00
Jiaming Yuan
62e9387cd5
[ci] Update PySpark version. ( #9214 )
2023-05-31 03:00:44 +08:00
Jiaming Yuan
17fd3f55e9
Optimize adapter element counting on GPU. ( #9209 )
...
- Implement a simple `IterSpan` for passing iterators with size.
- Use shared memory for column size counts.
- Use one thread for each sample in row count to reduce atomic operations.
2023-05-30 23:28:43 +08:00
Jiaming Yuan
097f11b6e0
Support CUDA f16 without transformation. ( #9207 )
...
- Support f16 from cupy.
- Include CUDA header explicitly.
- Cleanup cmake nvtx support.
2023-05-30 20:54:31 +08:00
dependabot[bot]
6f83d9c69a
Bump maven-project-info-reports-plugin in /jvm-packages ( #9219 )
...
Bumps [maven-project-info-reports-plugin](https://github.com/apache/maven-project-info-reports-plugin ) from 3.4.3 to 3.4.4.
- [Commits](https://github.com/apache/maven-project-info-reports-plugin/compare/maven-project-info-reports-plugin-3.4.3...maven-project-info-reports-plugin-3.4.4 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-project-info-reports-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-30 19:10:07 +08:00
Jiaming Yuan
ae7450ce54
Skip optional synchronization in thrust. ( #9212 )
2023-05-30 17:23:09 +08:00
Jean Lescut-Muller
ddec0f378c
[doc] Show derivative of the custom objective ( #9213 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-05-30 04:07:12 +08:00
Bobby Wang
320323f533
[pyspark] add parameters in the ctor of all estimators. ( #9202 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-05-29 05:58:16 +08:00
Jiaming Yuan
03bc6e6427
Remove unused variables. ( #9210 )
...
- remove used variables.
- Remove signed comparison warnings.
2023-05-28 05:24:15 +08:00
dependabot[bot]
d563d6d8f4
Bump scala-collection-compat_2.12 from 2.9.0 to 2.10.0 in /jvm-packages ( #9208 )
...
Bumps [scala-collection-compat_2.12](https://github.com/scala/scala-collection-compat ) from 2.9.0 to 2.10.0.
- [Release notes](https://github.com/scala/scala-collection-compat/releases )
- [Commits](https://github.com/scala/scala-collection-compat/compare/v2.9.0...v2.10.0 )
---
updated-dependencies:
- dependency-name: org.scala-lang.modules:scala-collection-compat_2.12
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-28 00:22:28 +08:00
Boris
a01df102c9
Scala 2.13 support. ( #9099 )
...
1. Updated the test logic
2. Added smoke tests for Spark examples.
3. Added integration tests for Spark with Scala 2.13
2023-05-27 19:34:02 +08:00
Jiaming Yuan
8c174ef2d3
[CI] Update images that are not related to binary release. ( #9205 )
...
* [CI] Update images that are not related to the binary release.
- Update clang-tidy, prefer tools from the Ubuntu repository.
- Update GPU image to 22.04.
- Small cleanup to the tidy script.
- Remove gpu_jvm, which seems to be unused.
2023-05-27 17:40:46 +08:00
michael-gendy-mention-me
c5677a2b2c
Remove type: ignore hints ( #9197 )
2023-05-27 07:48:28 +08:00
Jiaming Yuan
053aababd4
Avoid thrust logical operation. ( #9199 )
...
Thrust implementation of `thrust::all_of/any_of/none_of` adopts an early stopping strategy
to bailout early by dividing the input into small batches. This is not ideal for data
validation as we expect all data to be valid. The strategy leads to excessive kernel
launches and stream synchronization.
* Use reduce from dh instead.
2023-05-27 01:36:58 +08:00
dependabot[bot]
614f47c477
Bump flink-clients from 1.17.0 to 1.17.1 in /jvm-packages ( #9203 )
...
Bumps [flink-clients](https://github.com/apache/flink ) from 1.17.0 to 1.17.1.
- [Commits](https://github.com/apache/flink/compare/release-1.17.0...release-1.17.1 )
---
updated-dependencies:
- dependency-name: org.apache.flink:flink-clients
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-26 18:42:24 +08:00
Rong Ou
5b69534b43
Support column split in multi-target hist ( #9171 )
2023-05-26 16:56:05 +08:00
Rong Ou
acd363033e
Fix running MGPU gtests ( #9200 )
2023-05-26 05:26:38 +08:00
amdsc21
c5b575e00e
fix host __assert_fail
2023-05-24 19:40:24 +02:00
amdsc21
1354138b7d
Merge branch 'master' into sync-condition-2023May15
2023-05-24 17:44:16 +02:00
dependabot[bot]
5d99b441d5
Bump scalatest_2.12 from 3.2.15 to 3.2.16 in /jvm-packages/xgboost4j ( #9160 )
...
Bumps [scalatest_2.12](https://github.com/scalatest/scalatest ) from 3.2.15 to 3.2.16.
- [Release notes](https://github.com/scalatest/scalatest/releases )
- [Commits](https://github.com/scalatest/scalatest/compare/release-3.2.15...release-3.2.16 )
---
updated-dependencies:
- dependency-name: org.scalatest:scalatest_2.12
dependency-type: direct:development
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-24 09:09:25 +08:00
dependabot[bot]
e38e94ba4d
Bump rapids-4-spark_2.12 from 23.04.0 to 23.04.1 in /jvm-packages ( #9158 )
...
Bumps rapids-4-spark_2.12 from 23.04.0 to 23.04.1.
---
updated-dependencies:
- dependency-name: com.nvidia:rapids-4-spark_2.12
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-24 07:15:46 +08:00
dependabot[bot]
d6d83c818f
Bump maven-assembly-plugin from 3.5.0 to 3.6.0 in /jvm-packages ( #9163 )
...
Bumps [maven-assembly-plugin](https://github.com/apache/maven-assembly-plugin ) from 3.5.0 to 3.6.0.
- [Commits](https://github.com/apache/maven-assembly-plugin/compare/maven-assembly-plugin-3.5.0...maven-assembly-plugin-3.6.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-assembly-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-23 13:56:12 -07:00
dependabot[bot]
22b0fc0992
Bump maven-source-plugin from 3.2.1 to 3.3.0 in /jvm-packages ( #9184 )
...
Bumps [maven-source-plugin](https://github.com/apache/maven-source-plugin ) from 3.2.1 to 3.3.0.
- [Commits](https://github.com/apache/maven-source-plugin/compare/maven-source-plugin-3.2.1...maven-source-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-source-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-24 03:29:44 +08:00
dependabot[bot]
e67a0b8599
Bump maven-checkstyle-plugin from 3.2.2 to 3.3.0 in /jvm-packages ( #9192 )
...
Bumps [maven-checkstyle-plugin](https://github.com/apache/maven-checkstyle-plugin ) from 3.2.2 to 3.3.0.
- [Commits](https://github.com/apache/maven-checkstyle-plugin/compare/maven-checkstyle-plugin-3.2.2...maven-checkstyle-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-checkstyle-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-24 01:43:47 +08:00
amdsc21
b994a38b28
Merge branch 'master' into sync-condition-2023May15
2023-05-23 01:07:50 +02:00
Jiaming Yuan
3913ff470f
Import data lazily during tests. ( #9176 )
2023-05-23 03:58:31 +08:00
amdsc21
3a834c4992
change workflow
2023-05-20 07:04:06 +02:00
amdsc21
b22644fc10
add hip.h
2023-05-20 01:25:33 +02:00
amdsc21
7663d47383
Merge branch 'master' into sync-condition-2023May15
2023-05-19 20:30:35 +02:00
Bobby Wang
6274fba0a5
[pyspark] support tying ( #9172 )
2023-05-19 14:39:26 +08:00
amdsc21
88fc8badfa
Merge branch 'master' into sync-condition-2023May15
2023-05-17 19:55:50 +02:00
Bobby Wang
caf326d508
[pyspark] Refactor and typing support for models ( #9156 )
2023-05-17 16:38:51 +08:00
Bobby Wang
cb370c4f7d
[jvm] separate spark.version for cpu and gpu ( #9166 )
2023-05-17 07:12:20 +08:00
amdsc21
8cad8c693c
sync up May15 2023
2023-05-15 18:59:18 +02:00
Stephan T. Lavavej
7375bd058b
Fix IndexTransformIter. ( #9155 )
2023-05-12 21:25:54 +08:00
Stephan T. Lavavej
59edfdb315
Fix typo: _defined => defined ( #9153 )
2023-05-11 16:34:45 -07:00
Stephan T. Lavavej
779b82c098
Avoid redefining macros. ( #9154 )
2023-05-11 15:59:25 -07:00
Rong Ou
603f8ce2fa
Support hist in the partition builder under column split ( #9120 )
2023-05-11 05:24:29 +08:00
Rong Ou
52311dcec9
Fix multi-threaded gtests ( #9148 )
2023-05-10 19:15:32 +08:00
Jiaming Yuan
e4129ed6ee
[jvm-packages] Remove akka in tester. ( #9149 )
2023-05-10 14:10:58 +08:00
dependabot[bot]
2ab6660943
Bump maven-surefire-plugin in /jvm-packages/xgboost4j-spark ( #9131 )
...
Bumps [maven-surefire-plugin](https://github.com/apache/maven-surefire ) from 3.0.0 to 3.1.0.
- [Release notes](https://github.com/apache/maven-surefire/releases )
- [Commits](https://github.com/apache/maven-surefire/compare/surefire-3.0.0...surefire-3.1.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-surefire-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-10 12:21:36 +08:00
dependabot[bot]
d21e7e5f82
Bump maven-gpg-plugin from 3.0.1 to 3.1.0 in /jvm-packages ( #9136 )
...
Bumps [maven-gpg-plugin](https://github.com/apache/maven-gpg-plugin ) from 3.0.1 to 3.1.0.
- [Commits](https://github.com/apache/maven-gpg-plugin/compare/maven-gpg-plugin-3.0.1...maven-gpg-plugin-3.1.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-gpg-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-10 10:21:36 +08:00
Philip Hyunsu Cho
0cd4382d72
Fix config-settings handling in pip install ( #9115 )
...
* Fix config_settings handling in pip install
* Fix formatting
* Fix flag use_system_libxgboost
* Add setuptools to doc requirements.txt
* Fix mypy
2023-05-09 17:54:20 -07:00
Jiaming Yuan
09b44915e7
[doc] Replace recommonmark with myst-parser. ( #9125 )
2023-05-10 08:11:36 +08:00
Jiaming Yuan
85988a3178
Wait for data CUDA stream instead of sync. ( #9144 )
...
---------
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2023-05-09 09:52:21 +08:00
Uriya Harpeness
a075aa24ba
Move python tool configurations to pyproject.toml, and add the python 3.11 classifier. ( #9112 )
2023-05-06 02:59:06 +08:00
Jiaming Yuan
55968ed3fa
Fix monotone constraints on CPU. ( #9122 )
2023-05-06 01:07:54 +08:00
Rong Ou
250b22dd22
Fix nvflare horizontal demo ( #9124 )
2023-05-05 16:48:22 +08:00
Jiaming Yuan
47b3cb6fb7
Remove unused parameters in RABIT. ( #9108 )
2023-05-05 05:26:24 +08:00
Philip Hyunsu Cho
07b2d5a26d
Add useful links to pyproject.toml ( #9114 )
2023-05-02 12:47:15 -07:00
amdsc21
b066accad6
fix lambdarank_obj
2023-05-02 21:06:22 +02:00
amdsc21
b324d51f14
fix array_interface.h half type
2023-05-02 20:50:50 +02:00
amdsc21
65097212b3
fix IterativeDeviceDMatrix, support HIP
2023-05-02 20:20:11 +02:00
amdsc21
4a24ca2f95
fix helpers.h, enable HIP
2023-05-02 20:04:23 +02:00
amdsc21
83e6fceb5c
fix lambdarank_obj.cc, support HIP
2023-05-02 19:03:18 +02:00
amdsc21
e4538cb13c
fix, to support hip
2023-05-02 17:43:11 +02:00
amdsc21
5446c501af
merge 23Mar01
2023-05-02 00:05:58 +02:00
amdsc21
313a74b582
add Shap Magic to check if use cat
2023-05-01 21:55:14 +02:00
Jiaming Yuan
08ce495b5d
Use Booster context in DMatrix. ( #8896 )
...
- Pass context from booster to DMatrix.
- Use context instead of integer for `n_threads`.
- Check the consistency configuration for `max_bin`.
- Test for all combinations of initialization options.
2023-04-28 21:47:14 +08:00
Jiaming Yuan
1f9a57d17b
[Breaking] Require format to be specified in input URI. ( #9077 )
...
Previously, we use `libsvm` as default when format is not specified. However, the dmlc
data parser is not particularly robust against errors, and the most common type of error
is undefined format.
Along with which, we will recommend users to use other data loader instead. We will
continue the maintenance of the parsers as it's currently used for many internal tests
including federated learning.
2023-04-28 19:45:15 +08:00
Bobby Wang
e922004329
[doc] fix the cudf installation [skip ci] ( #9106 )
2023-04-28 19:43:58 +08:00
Jiaming Yuan
17ff471616
Optimize array interface input. ( #9090 )
2023-04-28 18:01:58 +08:00
Rong Ou
fb941262b4
Add demo for vertical federated learning ( #9103 )
2023-04-28 16:03:21 +08:00
Jiaming Yuan
e206b899ef
Rework MAP and Pairwise for LTR. ( #9075 )
2023-04-28 02:39:12 +08:00
Jiaming Yuan
0e470ef606
Optimize prediction with QuantileDMatrix. ( #9096 )
...
- Reduce overhead in `FVecDrop`.
- Reduce overhead caused by `HostVector()` calls.
2023-04-28 00:51:41 +08:00
Jiaming Yuan
fa267ad093
[CI] Freeze R version to 4.2.0 with MSVC. ( #9104 )
2023-04-27 22:48:31 +08:00
Jiaming Yuan
96d3f8a6f3
[doc] Update document. ( #9098 )
...
- Mention flink is still under construction.
- Update doxygen version.
- Fix warnings from doxygen about defgroup title and mismatched parameter name.
2023-04-27 19:29:03 +08:00
Rong Ou
511d4996b5
Rely on gRPC to generate random port ( #9102 )
2023-04-27 09:48:26 +08:00
Jiaming Yuan
101a2e643d
[jvm-packages] Bump rapids version. ( #9097 )
2023-04-27 09:46:46 +08:00
Scott Gustafson
353ed5339d
Convert `DaskXGBClassifier.classes_` to an array ( #8452 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-04-27 02:23:35 +08:00
Boris
0e7377ba9c
Updated flink 1.8 -> 1.17. Added smoke tests for Flink ( #9046 )
2023-04-26 18:41:11 +08:00
Rong Ou
a320b402a5
More refactoring to take advantage of collective aggregators ( #9081 )
2023-04-26 03:36:09 +08:00
dependabot[bot]
49ccae7fb9
Bump spark.version from 3.1.1 to 3.4.0 in /jvm-packages ( #9039 )
...
Bumps `spark.version` from 3.1.1 to 3.4.0.
Updates `spark-mllib_2.12` from 3.1.1 to 3.4.0
Updates `spark-core_2.12` from 3.1.1 to 3.4.0
Updates `spark-sql_2.12` from 3.1.1 to 3.4.0
---
updated-dependencies:
- dependency-name: org.apache.spark:spark-mllib_2.12
dependency-type: direct:production
update-type: version-update:semver-minor
- dependency-name: org.apache.spark:spark-core_2.12
dependency-type: direct:production
update-type: version-update:semver-minor
- dependency-name: org.apache.spark:spark-sql_2.12
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-26 01:32:06 +08:00
Bobby Wang
17add4776f
[pyspark] Don't stack for non feature columns ( #9088 )
2023-04-25 23:09:12 +08:00
dependabot[bot]
a2cc78c1fb
Bump scala.version from 2.12.8 to 2.12.17 in /jvm-packages ( #9083 )
...
Bumps `scala.version` from 2.12.8 to 2.12.17.
Updates `scala-compiler` from 2.12.8 to 2.12.17
- [Release notes](https://github.com/scala/scala/releases )
- [Commits](https://github.com/scala/scala/compare/v2.12.8...v2.12.17 )
Updates `scala-library` from 2.12.8 to 2.12.17
- [Release notes](https://github.com/scala/scala/releases )
- [Commits](https://github.com/scala/scala/compare/v2.12.8...v2.12.17 )
---
updated-dependencies:
- dependency-name: org.scala-lang:scala-compiler
dependency-type: direct:production
update-type: version-update:semver-patch
- dependency-name: org.scala-lang:scala-library
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-25 21:30:38 +08:00
Bobby Wang
339f21e1bf
[pyspark] fix a type hint with old pyspark release ( #9079 )
2023-04-24 20:04:14 +08:00
Bobby Wang
d237378452
[jvm-packages] Clean up the dependencies after removing scala versioned tracker ( #9078 )
2023-04-24 17:49:08 +08:00
Jiaming Yuan
c512c3f46b
[jvm-packages] Bump rapids version. ( #9056 )
2023-04-22 15:46:44 +08:00
Rong Ou
8dbe0510de
More collective aggregators ( #9060 )
2023-04-22 03:32:05 +08:00
Jiaming Yuan
7032981350
Fix timer annotation. ( #9057 )
2023-04-21 22:53:58 +08:00
austinzh
3b742dc4f1
Stop using Rabit in predition ( #9054 )
2023-04-21 19:38:07 +08:00
dependabot[bot]
39b0fde0e7
Bump kryo from 5.4.0 to 5.5.0 in /jvm-packages ( #9070 )
...
Bumps [kryo](https://github.com/EsotericSoftware/kryo ) from 5.4.0 to 5.5.0.
- [Release notes](https://github.com/EsotericSoftware/kryo/releases )
- [Commits](https://github.com/EsotericSoftware/kryo/compare/kryo-parent-5.4.0...kryo-parent-5.5.0 )
---
updated-dependencies:
- dependency-name: com.esotericsoftware:kryo
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-21 18:16:34 +08:00
dependabot[bot]
ee84e22c8d
Bump maven-checkstyle-plugin from 3.2.1 to 3.2.2 in /jvm-packages ( #9073 )
...
Bumps [maven-checkstyle-plugin](https://github.com/apache/maven-checkstyle-plugin ) from 3.2.1 to 3.2.2.
- [Release notes](https://github.com/apache/maven-checkstyle-plugin/releases )
- [Commits](https://github.com/apache/maven-checkstyle-plugin/compare/maven-checkstyle-plugin-3.2.1...maven-checkstyle-plugin-3.2.2 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-checkstyle-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-21 18:16:08 +08:00
Jiaming Yuan
b908680bec
Fix race condition in cpp metric tests. ( #9058 )
2023-04-21 05:24:10 +08:00
Philip Hyunsu Cho
a5cd2412de
Replace setup.py with pyproject.toml ( #9021 )
...
* Create pyproject.toml
* Implement a custom build backend (see below) in packager directory. Build logic from setup.py has been refactored and migrated into the new backend.
* Tested: pip wheel . (build wheel), python -m build --sdist . (source distribution)
2023-04-20 13:51:39 -07:00
Jiaming Yuan
a7b3dd3176
Fix compiler warnings. ( #9055 )
2023-04-21 02:26:47 +08:00
dependabot[bot]
2acd78b44b
Bump maven-project-info-reports-plugin in /jvm-packages/xgboost4j ( #9049 )
...
Bumps [maven-project-info-reports-plugin](https://github.com/apache/maven-project-info-reports-plugin ) from 3.4.2 to 3.4.3.
- [Release notes](https://github.com/apache/maven-project-info-reports-plugin/releases )
- [Commits](https://github.com/apache/maven-project-info-reports-plugin/compare/maven-project-info-reports-plugin-3.4.2...maven-project-info-reports-plugin-3.4.3 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-project-info-reports-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-04-21 00:10:45 +08:00
Emil Ejbyfeldt
a84a1fde02
[jvm-packages] Update scalatest to 3.2.15 ( #8925 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-04-20 22:16:56 +08:00
Jiaming Yuan
564df59204
[breaking] [jvm-packages] Remove scala-implemented tracker. ( #9045 )
2023-04-20 16:29:35 +08:00
amdsc21
65d83e288f
fix device query
2023-04-19 19:53:26 +02:00
Rong Ou
42d100de18
Make sure metrics work with federated learning ( #9037 )
2023-04-19 15:39:11 +08:00
Jiaming Yuan
ef13dd31b1
Rework the NDCG objective. ( #9015 )
2023-04-18 21:16:06 +08:00
Rong Ou
ba9d24ff7b
Make sure metrics work with column-wise distributed training ( #9020 )
2023-04-18 03:48:23 +08:00
amdsc21
f645cf51c1
Merge branch 'master' into sync-condition-2023Apr11
2023-04-17 18:33:00 +02:00
WeichenXu
191d0aa5cf
[spark] Make spark model have the same UID with its estimator ( #9022 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2023-04-14 02:53:30 +08:00
Philip Hyunsu Cho
8e0f320db3
[CI] Don't run CI automatically for dependabot ( #9034 )
2023-04-13 08:19:56 -07:00
amdsc21
db8420225b
fix RCCL
2023-04-12 01:09:14 +02:00
amdsc21
843fdde61b
sync Apr 11 2023
2023-04-11 20:03:25 +02:00
amdsc21
08bc4b0c0f
Merge branch 'master' into sync-condition-2023Apr11
2023-04-11 19:38:38 +02:00
amdsc21
6825d986fd
move Dockerfile to ci
2023-04-11 19:34:23 +02:00
Jiaming Yuan
fe9dff339c
Convert federated learner test into test suite. ( #9018 )
...
* Convert federated learner test into test suite.
- Add specialization to learning to rank.
2023-04-11 09:52:55 +08:00
Jiaming Yuan
2c8d735cb3
Fix tests with pandas 2.0. ( #9014 )
...
* Fix tests with pandas 2.0.
- `is_categorical` is replaced by `is_categorical_dtype`.
- one hot encoding returns boolean type instead of integer type.
2023-04-11 00:17:34 +08:00
Sarah Charlotte Johnson
ebd64f6e22
[doc] Update Dask deployment options ( #9008 )
2023-04-07 01:09:15 +08:00
Jiaming Yuan
1cf4d93246
Convert federated tests into test suite. ( #9006 )
...
- Add specialization for learning to rank.
2023-04-04 01:29:47 +08:00
Rong Ou
15e073ca9d
Make objectives work with vertical distributed and federated learning ( #9002 )
2023-04-03 17:07:42 +08:00
Jiaming Yuan
720a8c3273
[doc] Remove parameter type in Python doc strings. ( #9005 )
2023-04-01 04:04:30 +08:00
Jiaming Yuan
4caca2947d
Improve helper script for making release. [skip ci] ( #9004 )
...
* Merge source tarball generation script.
* Generate Python source wheel.
* Generate hashes and release note.
2023-03-31 23:14:58 +08:00
Jiaming Yuan
bcb55d3b6a
Portable macro definition. ( #8999 )
2023-03-31 20:48:59 +08:00
Jiaming Yuan
bac22734fb
Remove ntree limit in python package. ( #8345 )
...
- Remove `ntree_limit`. The parameter has been deprecated since 1.4.0.
- The SHAP package compatibility is broken.
2023-03-31 19:01:55 +08:00
paklui
d155ec77f9
building docker for xgboost-amd-condition
2023-03-30 13:36:39 -07:00
Jiaming Yuan
b647403baa
Update release news. [skip ci] ( #9000 )
2023-03-31 03:52:09 +08:00
Jiaming Yuan
cd05e38533
[doc][R] Update link. ( #8998 )
2023-03-30 19:09:07 +08:00
Jiaming Yuan
d062a9e009
Define pair generation strategies for LTR. ( #8984 )
2023-03-30 12:00:35 +08:00
amdsc21
991738690f
Merge branch 'sync-condition-2023Mar27' into amd-condition
2023-03-30 05:16:36 +02:00
amdsc21
aeb3fd1c95
Merge branch 'master' into sync-condition-2023Mar27
2023-03-30 05:15:55 +02:00
Rong Ou
d385cc64e2
Fix aft_loss_distribution documentation ( #8995 )
2023-03-29 19:13:23 -07:00
amdsc21
141a062e00
Merge branch 'sync-condition-2023Mar27' into amd-condition
2023-03-30 00:47:16 +02:00
amdsc21
acad01afc9
sync Mar 29
2023-03-30 00:46:50 +02:00
Jiaming Yuan
a58055075b
[dask] Return the first valid booster instead of all valid ones. ( #8993 )
...
* [dask] Return the first valid booster instead of all valid ones.
- Reduce memory footprint of the returned model.
* mypy error.
* lint.
* duplicated.
2023-03-30 03:16:18 +08:00
Philip Hyunsu Cho
6676c28cbc
[CI] Fix Windows wheel to be compatible with Poetry ( #8991 )
...
* [CI] Fix Windows wheel to be compatible with Poetry
* Typo
* Eagerly scan globs to avoid patching same file twice
2023-03-28 21:32:54 -07:00
Rong Ou
ff26cd3212
More tests for column split and vertical federated learning ( #8985 )
...
Added some more tests for the learner and fit_stump, for both column-wise distributed learning and vertical federated learning.
Also moved the `IsRowSplit` and `IsColumnSplit` methods from the `DMatrix` to the `MetaInfo` since in some places we only have access to the `MetaInfo`. Added a new convenience method `IsVerticalFederatedLearning`.
Some refactoring of the testing fixtures.
2023-03-28 16:40:26 +08:00
amdsc21
f289e5001d
Merge branch 'sync-condition-2023Mar27' into amd-condition
2023-03-28 00:24:12 +02:00
amdsc21
06d9b998ce
fix CAPI BuildInfo
2023-03-28 00:14:18 +02:00
amdsc21
c50cc424bc
sync Mar 27 2023
2023-03-27 18:54:41 +02:00
Jiaming Yuan
401ce5cf5e
Run linters with the multi output demo. ( #8966 )
2023-03-28 00:47:28 +08:00
Jiaming Yuan
acc110c251
[MT-TREE] Support prediction cache and model slicing. ( #8968 )
...
- Fix prediction range.
- Support prediction cache in mt-hist.
- Support model slicing.
- Make the booster a Python iterable by defining `__iter__`.
- Cleanup removed/deprecated parameters.
- A new field in the output model `iteration_indptr` for pointing to the ranges of trees for each iteration.
2023-03-27 23:10:54 +08:00
Jiaming Yuan
c2b3a13e70
[breaking][skl] Remove parameter serialization. ( #8963 )
...
- Remove parameter serialization in the scikit-learn interface.
The scikit-lear interface `save_model` will save only the model and discard all
hyper-parameters. This is to align with the native XGBoost interface, which distinguishes
the hyper-parameter and model parameters.
With the scikit-learn interface, model parameters are attributes of the estimator. For
instance, `n_features_in_`, `n_classes_` are always accessible with
`estimator.n_features_in_` and `estimator.n_classes_`, but not with the
`estimator.get_params`.
- Define a `load_model` method for classifier to load its own attributes.
- Set n_estimators to None by default.
2023-03-27 21:34:10 +08:00
dependabot[bot]
90645c4957
Bump maven-resources-plugin from 3.3.0 to 3.3.1 in /jvm-packages ( #8980 )
...
Bumps [maven-resources-plugin](https://github.com/apache/maven-resources-plugin ) from 3.3.0 to 3.3.1.
- [Release notes](https://github.com/apache/maven-resources-plugin/releases )
- [Commits](https://github.com/apache/maven-resources-plugin/compare/maven-resources-plugin-3.3.0...maven-resources-plugin-3.3.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-resources-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-27 16:03:45 +08:00
dependabot[bot]
43878b10b6
Bump maven-deploy-plugin in /jvm-packages/xgboost4j-spark-gpu ( #8973 )
...
Bumps [maven-deploy-plugin](https://github.com/apache/maven-deploy-plugin ) from 3.0.0 to 3.1.1.
- [Release notes](https://github.com/apache/maven-deploy-plugin/releases )
- [Commits](https://github.com/apache/maven-deploy-plugin/compare/maven-deploy-plugin-3.0.0...maven-deploy-plugin-3.1.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-deploy-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-27 12:47:13 +08:00
amdsc21
8c77e936d1
tune grid size
2023-03-26 17:45:19 +02:00
amdsc21
18034a4291
tune histogram
2023-03-26 01:42:51 +01:00
amdsc21
7ee4734d3a
rm device_helpers.hip.h from cu
2023-03-26 00:24:11 +01:00
amdsc21
ee582f03c3
rm device_helpers.hip.h from cuh
2023-03-25 23:35:57 +01:00
amdsc21
f3286bac04
rm warp header
2023-03-25 23:01:44 +01:00
amdsc21
3ee3bea683
fix warp header
2023-03-25 22:37:37 +01:00
amdsc21
5098735698
Merge branch 'condition-sync-Mar24-23' into hui-condition
2023-03-25 05:28:40 +01:00
amdsc21
e74b3bbf3c
fix macro
2023-03-25 05:17:39 +01:00
amdsc21
22525c002a
fix macro
2023-03-25 05:08:30 +01:00
amdsc21
80961039d7
fix macro
2023-03-25 05:00:55 +01:00
amdsc21
1474789787
add new file
2023-03-25 04:54:02 +01:00
amdsc21
1dc138404a
initial merge, fix linalg.h
2023-03-25 04:48:47 +01:00
amdsc21
e1d050f64e
initial merge, fix linalg.h
2023-03-25 04:37:43 +01:00
amdsc21
7fbc561e17
initial merge
2023-03-25 04:31:55 +01:00
amdsc21
d97be6f396
enable last 3 tests
2023-03-25 04:05:05 +01:00
amdsc21
f1211cffca
enable last 3 tests
2023-03-25 00:45:52 +01:00
amdsc21
e0716afabf
fix objective/objective.cc, CMakeFile and setup.py
2023-03-23 20:22:34 +01:00
dependabot[bot]
cff50fe3ef
Bump hadoop.version from 3.3.4 to 3.3.5 in /jvm-packages ( #8962 )
...
Bumps `hadoop.version` from 3.3.4 to 3.3.5.
Updates `hadoop-hdfs` from 3.3.4 to 3.3.5
Updates `hadoop-common` from 3.3.4 to 3.3.5
---
updated-dependencies:
- dependency-name: org.apache.hadoop:hadoop-hdfs
dependency-type: direct:production
update-type: version-update:semver-patch
- dependency-name: org.apache.hadoop:hadoop-common
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-23 16:12:04 +08:00
Jiaming Yuan
21a52c7f98
[doc] Add introduction and notes for the sklearn interface. ( #8948 )
2023-03-23 13:30:42 +08:00
Jiaming Yuan
bf88dadb61
[doc] Fix callback example. ( #8944 )
2023-03-23 03:27:04 +08:00
Jiaming Yuan
15a2724ff7
Removed outdated configuration serialization logic. ( #8942 )
...
- `saved_params` is empty.
- `saved_configs_` contains `num_round`, which is not used anywhere inside xgboost.
2023-03-23 01:31:46 +08:00
Jiaming Yuan
151882dd26
Initial support for multi-target tree. ( #8616 )
...
* Implement multi-target for hist.
- Add new hist tree builder.
- Move data fetchers for tests.
- Dispatch function calls in gbm base on the tree type.
2023-03-22 23:49:56 +08:00
Jiaming Yuan
ea04d4c46c
[doc] [dask] Troubleshooting NCCL errors. ( #8943 )
2023-03-22 22:17:26 +08:00
Jiaming Yuan
a551bed803
Remove duplicated learning rate parameter. ( #8941 )
2023-03-22 20:51:14 +08:00
Jiaming Yuan
a05799ed39
Specify char type in JSON. ( #8949 )
...
char is defined as signed on x86 but unsigned on arm64
- Use `std::int8_t` instead of char.
- Fix include when clang is pretending to be gcc.
2023-03-22 19:13:44 +08:00
Jiaming Yuan
5891f752c8
Rework the MAP metric. ( #8931 )
...
- The new implementation is more strict as only binary labels are accepted. The previous implementation converts values greater than 1 to 1.
- Deterministic GPU. (no atomic add).
- Fix top-k handling.
- Precise definition of MAP. (There are other variants on how to handle top-k).
- Refactor GPU ranking tests.
2023-03-22 17:45:20 +08:00
Rong Ou
b240f055d3
Support vertical federated learning ( #8932 )
2023-03-22 14:25:26 +08:00
Philip Hyunsu Cho
8dc1e4b3ea
Improve doxygen ( #8959 )
...
* Remove Sphinx build from GH Action
* Build Doxygen as part of RTD build
* Add jQuery
2023-03-21 09:22:11 -07:00
dependabot[bot]
34092d7fd0
Bump maven-release-plugin in /jvm-packages/xgboost4j-spark ( #8952 )
...
Bumps [maven-release-plugin](https://github.com/apache/maven-release ) from 2.5.3 to 3.0.0.
- [Release notes](https://github.com/apache/maven-release/releases )
- [Commits](https://github.com/apache/maven-release/compare/maven-release-2.5.3...maven-release-3.0.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-release-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-21 15:34:43 +08:00
amdsc21
595cd81251
add max shared mem workaround
2023-03-19 20:08:42 +01:00
amdsc21
0325ce0bed
update gputreeshap
2023-03-19 20:07:36 +01:00
Jiaming Yuan
9b6cc0ed07
Refactor hist to prepare for multi-target builder. ( #8928 )
...
- Extract the builder from the updater class. We need a new builder for multi-target.
- Extract `UpdateTree`, it can be reused for different builders. Eventually, other tree
updaters can use it as well.
2023-03-17 17:21:04 +08:00
Philip Hyunsu Cho
36263dd109
[jvm-packages] Use akka 2.6 ( #8920 )
2023-03-16 20:06:42 -07:00
Quentin Fiard
55ed50c860
Fix a few typos in the C API tutorial ( #8926 )
2023-03-16 20:24:03 +08:00
Jiaming Yuan
a093770f36
Partitioner for multi-target tree. ( #8922 )
2023-03-16 18:49:34 +08:00
amdsc21
a79a35c22c
add warp size
2023-03-15 22:00:26 +01:00
Jiaming Yuan
26209a42a5
Define git attributes for renormalization. ( #8921 )
2023-03-16 02:43:11 +08:00
Philip Hyunsu Cho
a2cdba51ce
Use hi-res SVG logo ( #8923 )
2023-03-15 10:02:38 -07:00
dependabot[bot]
fd016e43c6
Bump maven-surefire-plugin from 2.22.2 to 3.0.0 in /jvm-packages ( #8917 )
...
Bumps [maven-surefire-plugin](https://github.com/apache/maven-surefire ) from 2.22.2 to 3.0.0.
- [Release notes](https://github.com/apache/maven-surefire/releases )
- [Commits](https://github.com/apache/maven-surefire/compare/surefire-2.22.2...surefire-3.0.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-surefire-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-15 18:51:46 +08:00
Jiaming Yuan
f186c87cf9
Check inf in data for all types of DMatrix. ( #8911 )
2023-03-15 11:24:35 +08:00
amdsc21
4484c7f073
disable Optin Shared Mem
2023-03-15 02:10:16 +01:00
amdsc21
8207015e48
fix ../tests/cpp/common/test_span.h
2023-03-14 22:19:06 +01:00
Jiaming Yuan
72e8331eab
Reimplement the NDCG metric. ( #8906 )
...
- Add support for non-exp gain.
- Cache the DMatrix object to avoid re-calculating the IDCG.
- Make GPU implementation deterministic. (no atomic add)
2023-03-15 03:26:17 +08:00
Jiaming Yuan
8685556af2
Implement hist evaluator for multi-target tree. ( #8908 )
2023-03-15 01:42:51 +08:00
Jiaming Yuan
95e2baf7c2
[doc] Fix typo [skip ci] ( #8907 )
2023-03-15 00:55:17 +08:00
Jiaming Yuan
910ce580c8
Clear all cache after model load. ( #8904 )
2023-03-14 22:09:36 +08:00
Jiaming Yuan
c400fa1e8d
Predictor for vector leaf. ( #8898 )
2023-03-14 19:07:10 +08:00
amdsc21
364df7db0f
fix ../tree/gpu_hist/evaluate_splits.hip bugs, size 64
2023-03-14 06:17:21 +01:00
amdsc21
a2bab03205
fix aft_obj.hip
2023-03-13 23:19:59 +01:00
Jiaming Yuan
8be6095ece
Implement NDCG cache. ( #8893 )
2023-03-13 22:16:31 +08:00
Jiaming Yuan
9bade7203a
Remove public access to tree model param. ( #8902 )
...
* Make tree model param a private member.
* Number of features and targets are immutable after construction.
This is to reduce the number of places where we can run configuration.
2023-03-13 20:55:10 +08:00
Jiaming Yuan
5ba3509dd3
Define multi expand entry. ( #8895 )
2023-03-13 19:31:05 +08:00
Jiaming Yuan
bbee355b45
[doc][dask] Note on reproducible result. [skip ci] ( #8903 )
2023-03-13 19:30:35 +08:00
amdsc21
b71c1b50de
fix macro, no !
2023-03-12 23:02:28 +01:00
amdsc21
fa2336fcfd
sort bug fix
2023-03-12 07:09:10 +01:00
Jiaming Yuan
3689695d16
[CI] Run RMM gtests. ( #8900 )
...
* [CI] Run RMM gtests.
* Update test-cpp-gpu.sh
---------
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2023-03-12 03:14:31 +08:00
amdsc21
7d96758382
macro format
2023-03-11 06:57:24 +01:00
amdsc21
b0dacc5a80
fix bug
2023-03-11 03:47:23 +01:00
amdsc21
f64152bf97
add helpers.hip
2023-03-11 02:56:50 +01:00
amdsc21
b4dbe7a649
fix isnan
2023-03-11 02:39:58 +01:00
amdsc21
e5b6219a84
typo
2023-03-11 02:30:27 +01:00
amdsc21
3a07b1edf8
complete test porting
2023-03-11 02:17:05 +01:00
amdsc21
9bf16a2ca6
testing porting
2023-03-11 01:38:54 +01:00
amdsc21
332f6a89a9
more tests
2023-03-11 01:33:48 +01:00
amdsc21
204d0c9a53
add hip tests
2023-03-11 00:38:16 +01:00
Jiaming Yuan
36a7396658
Replace dmlc any with std any. ( #8892 )
2023-03-11 06:11:04 +08:00
amdsc21
e961016e71
rm HIPCUB
2023-03-10 22:21:37 +01:00
amdsc21
f0b8c02f15
merge latest changes
2023-03-10 22:10:20 +01:00
Rong Ou
79efcd37f5
Pick up dmlc-core fix for CSV parser ( #8897 )
2023-03-11 04:51:43 +08:00
Jiaming Yuan
2aa838c75e
Define multi-strategy parameter. ( #8890 )
2023-03-11 02:58:01 +08:00
amdsc21
5e8b1842b9
fix Pointer Attr
2023-03-10 19:06:02 +01:00
Jiaming Yuan
6deaec8027
Pass obj info by reference instead of by value. ( #8889 )
...
- Pass obj info into tree updater as const pointer.
This way we don't have to initialize the learner model param before configuring gbm, hence
breaking up the dependency of configurations.
2023-03-11 01:38:28 +08:00
amdsc21
9f072b50ba
fix __popc
2023-03-10 17:14:31 +01:00
amdsc21
e1ddb5ae58
fix macro XGBOOST_USE_HIP
2023-03-10 07:11:05 +01:00
amdsc21
643e2a7b39
fix macro XGBOOST_USE_HIP
2023-03-10 07:09:41 +01:00
amdsc21
bde3107c3e
fix macro XGBOOST_USE_HIP
2023-03-10 07:01:25 +01:00
amdsc21
5edfc1e2e9
finish ellpack_page.cc
2023-03-10 06:41:25 +01:00
amdsc21
c073417d0c
finish aft_obj.cu
2023-03-10 06:39:03 +01:00
amdsc21
9bbbeb3f03
finish multiclass_obj.cu
2023-03-10 06:35:46 +01:00
amdsc21
4bde2e3412
finish multiclass_obj.cu
2023-03-10 06:35:21 +01:00
amdsc21
58a9fe07b6
finish multiclass_obj.cu
2023-03-10 06:35:06 +01:00
amdsc21
41407850d5
finish rank_obj.cu
2023-03-10 06:29:08 +01:00
amdsc21
968a1db4c0
finish regression_obj.cu
2023-03-10 06:07:53 +01:00
amdsc21
ad710e4888
finish hinge.cu
2023-03-10 06:04:59 +01:00
amdsc21
4e3c699814
finish adaptive.cu
2023-03-10 06:02:48 +01:00
amdsc21
757de84398
finish quantile.cu
2023-03-10 05:55:51 +01:00
amdsc21
d27f9dfdce
finish host_device_vector.cu
2023-03-10 05:45:38 +01:00
amdsc21
14cc438a64
finish stats.cu
2023-03-10 05:38:16 +01:00
amdsc21
911a5d8a60
finish hist_util.cu
2023-03-10 05:32:38 +01:00
amdsc21
54b076b40f
finish common.cu
2023-03-10 05:20:29 +01:00
amdsc21
91a5ef762e
finish common.cu
2023-03-10 05:19:41 +01:00
amdsc21
8fd2af1c8b
finish numeric.cu
2023-03-10 05:16:23 +01:00
amdsc21
bb6adda8a3
finish c_api.cu
2023-03-10 05:12:51 +01:00
amdsc21
a76ccff390
finish c_api.cu
2023-03-10 05:11:20 +01:00
amdsc21
61c0b19331
finish ellpack_page_source.cu
2023-03-10 05:06:36 +01:00
amdsc21
fa9f69dd85
finish sparse_page_dmatrix.cu
2023-03-10 05:04:57 +01:00
Jiaming Yuan
54e001bbf4
[doc][dask] Reference examples from coiled. [skip ci] ( #8891 )
2023-03-09 20:03:24 -08:00
amdsc21
080fc35c4b
finish ellpack_page_raw_format.cu
2023-03-10 05:02:35 +01:00
amdsc21
ccce4cf7e1
finish data.cu
2023-03-10 05:00:57 +01:00
Jiaming Yuan
c5c8f643f2
Remove the cub submodule. ( #8888 )
...
XGBoost now uses CTK-11.8 for binary packages, there's no need to maintain a cub
submodule anymore.
2023-03-09 19:43:02 -08:00
amdsc21
713ab9e1a0
finish sparse_page_source.cu
2023-03-10 04:42:56 +01:00
amdsc21
134cbfddbe
finish gradient_index.cu
2023-03-10 04:40:33 +01:00
amdsc21
6e2c5be83e
finish array_interface.cu
2023-03-10 04:36:04 +01:00
amdsc21
185dbce21f
finish ellpack_page.cu
2023-03-10 04:26:09 +01:00
amdsc21
49732359ef
finish iterative_dmatrix.cu
2023-03-10 03:47:00 +01:00
amdsc21
ec9f500a49
finish proxy_dmatrix.cu
2023-03-10 03:40:07 +01:00
amdsc21
53244bef6f
finish simple_dmatrix.cu
2023-03-10 03:38:09 +01:00
amdsc21
f0febfbcac
finish gpu_predictor.cu
2023-03-10 01:29:54 +01:00
amdsc21
1c58ff61d1
finish fit_stump.cu
2023-03-10 00:46:29 +01:00
amdsc21
1530c03f7d
finish constraints.cu
2023-03-09 22:43:51 +01:00
amdsc21
309268de02
finish updater_gpu_hist.cu
2023-03-09 22:40:44 +01:00
amdsc21
500428cc0f
finish row_partitioner.cu
2023-03-09 22:31:11 +01:00
amdsc21
495816f694
finished gradient_based_sampler.cu
2023-03-09 22:26:08 +01:00
amdsc21
df42dd2c53
finished evaluator.cu
2023-03-09 22:22:05 +01:00
amdsc21
f55243fda0
finish evaluate_splits.cu
2023-03-09 22:15:10 +01:00
amdsc21
1e09c21456
finished feature_groups.cu
2023-03-09 21:31:00 +01:00
amdsc21
0ed5d3c849
finished histogram.cu
2023-03-09 21:28:37 +01:00
amdsc21
f67e7de7ef
finished communicator.cu
2023-03-09 21:02:48 +01:00
amdsc21
5044713388
finished updater_gpu_coordinate.cu
2023-03-09 20:53:54 +01:00
amdsc21
c875f0425f
finished rank_metric.cu
2023-03-09 20:48:31 +01:00
amdsc21
4fd08b6c32
finished survival_metric.cu
2023-03-09 20:41:52 +01:00
amdsc21
b9d86d44d6
finish multiclass_metric.cu
2023-03-09 20:37:16 +01:00
amdsc21
a56055225a
fix auc.cu
2023-03-09 20:29:38 +01:00
amdsc21
6eba0a56ec
fix CMakeLists.txt
2023-03-09 18:57:14 +01:00
Jiaming Yuan
5feee8d4a9
Define core multi-target regression tree structure. ( #8884 )
...
- Define a new tree struct embedded in the `RegTree`.
- Provide dispatching functions in `RegTree`.
- Fix some c++-17 warnings about the use of nodiscard (currently we disable the warning on
the CI).
- Use uint32_t instead of size_t for `bst_target_t` as it has a defined size and can be used
as part of dmlc parameter.
- Hide the `Segment` struct inside the categorical split matrix.
2023-03-09 19:03:06 +08:00
Jiaming Yuan
46dfcc7d22
Define a new ranking parameter. ( #8887 )
2023-03-09 17:46:24 +08:00
Krzysztof Dyba
e8a69013e6
[R] update predict docs ( #8886 )
2023-03-09 05:58:39 +08:00
amdsc21
00c24a58b1
finish elementwise_metric.cu
2023-03-08 22:50:07 +01:00
amdsc21
6fa248b75f
try elementwise_metric.cu
2023-03-08 22:42:48 +01:00
amdsc21
946f9e9802
fix gbtree.cc
2023-03-08 21:44:20 +01:00
amdsc21
4c4e5af29c
port elementwise_metric.cu
2023-03-08 21:39:56 +01:00
amdsc21
7e1b06417b
finish gbtree.cu porting
2023-03-08 21:09:56 +01:00
amdsc21
cdd7794641
add unused option
2023-03-08 20:37:53 +01:00
amdsc21
cd743a1ae9
fix DispatchRadixSort
2023-03-08 20:31:23 +01:00
amdsc21
a45005863b
fix DispatchScan
2023-03-08 20:15:33 +01:00
Jiaming Yuan
8c16da8863
[doc] Add note for rabit port. [skip ci] ( #8879 )
2023-03-08 19:00:10 +08:00
amdsc21
bdcb036592
add context.hip
2023-03-08 07:34:19 +01:00
amdsc21
7a3a9b682a
add device_helpers.hip.h
2023-03-08 07:18:33 +01:00
amdsc21
0a711662c3
add device_helpers.hip.h
2023-03-08 07:10:32 +01:00
amdsc21
312e58ec99
enable rocm, fix common.h
2023-03-08 06:45:03 +01:00
amdsc21
ca8f4e7993
enable rocm, fix stats.cuh
2023-03-08 06:43:06 +01:00
amdsc21
60795f22de
enable rocm, fix linalg_op.cuh
2023-03-08 06:42:20 +01:00
amdsc21
05fdca893f
enable rocm, fix cuda_pinned_allocator.h
2023-03-08 06:39:40 +01:00
amdsc21
d8cc93f3f2
enable rocm, fix algorithm.cuh
2023-03-08 06:38:35 +01:00
amdsc21
62c4efac51
enable rocm, fix transform.h
2023-03-08 06:37:34 +01:00
amdsc21
ba9e00d911
enable rocm, fix hist_util.cuh
2023-03-08 06:36:15 +01:00
amdsc21
d3be67ad8e
enable rocm, fix quantile.cuh
2023-03-08 06:32:09 +01:00
amdsc21
2eb0b6aae4
enable rocm, fix threading_utils.cuh
2023-03-08 06:30:52 +01:00
amdsc21
327f1494f1
enable rocm, fix cuda_context.cuh
2023-03-08 06:29:45 +01:00
amdsc21
fa92aa56ee
enable rocm, fix device_adapter.cuh
2023-03-08 06:26:31 +01:00
amdsc21
427f6c2a1a
enable rocm, fix simple_dmatrix.cuh
2023-03-08 06:24:34 +01:00
amdsc21
270c7b4802
enable rocm, fix row_partitioner.cuh
2023-03-08 06:22:25 +01:00
amdsc21
0fc1f640a9
enable rocm, fix nccl_device_communicator.cuh
2023-03-08 06:18:13 +01:00
dependabot[bot]
85c3334c2b
Bump hadoop-common from 3.2.4 to 3.3.4 in /jvm-packages ( #8882 )
...
Bumps hadoop-common from 3.2.4 to 3.3.4.
---
updated-dependencies:
- dependency-name: org.apache.hadoop:hadoop-common
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-08 13:15:39 +08:00
amdsc21
762fd9028d
enable rocm, fix device_communicator_adapter.cuh
2023-03-08 06:13:29 +01:00
amdsc21
f2009533e1
rm hip.h
2023-03-08 06:04:01 +01:00
amdsc21
53b5cd73f2
add hip flags
2023-03-08 03:42:51 +01:00
amdsc21
52b05d934e
add hip
2023-03-08 03:32:19 +01:00
amdsc21
840f15209c
add HIP flags, common
2023-03-08 03:11:49 +01:00
amdsc21
1e1c7fd8d5
add HIP flags, c_api
2023-03-08 01:34:37 +01:00
amdsc21
f5f800c80d
add HIP flags
2023-03-08 01:33:38 +01:00
amdsc21
6b7be96373
add HIP flags
2023-03-08 01:22:25 +01:00
amdsc21
75712b9c3c
enable HIP flags
2023-03-08 01:10:07 +01:00
amdsc21
ed45aa2816
Merge branch 'master' into dev-hui
2023-03-08 00:39:33 +01:00
Jiaming Yuan
f236640427
Support F order for the tensor type. ( #8872 )
...
- Add F order support for tensor and view.
- Use parameter pack for automatic type cast. (avoid excessive static cast for shape).
2023-03-08 03:27:49 +08:00
dependabot[bot]
f53055f75e
Bump maven-assembly-plugin from 3.4.2 to 3.5.0 in /jvm-packages ( #8837 )
...
Bumps [maven-assembly-plugin](https://github.com/apache/maven-assembly-plugin ) from 3.4.2 to 3.5.0.
- [Release notes](https://github.com/apache/maven-assembly-plugin/releases )
- [Commits](https://github.com/apache/maven-assembly-plugin/compare/maven-assembly-plugin-3.4.2...maven-assembly-plugin-3.5.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-assembly-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-08 02:20:40 +08:00
Jiaming Yuan
f7ce0ec0df
Upgrade gcc toolchain to 9.x. ( #8878 )
...
* Use new tool chain.
* Use gcc-9.
* Use cmake from system.
* DOn't link leak.
2023-03-07 08:25:23 -08:00
dependabot[bot]
2b2eb0d0f1
Bump scala-maven-plugin in /jvm-packages/xgboost4j-spark-gpu ( #8877 )
...
Bumps scala-maven-plugin from 4.8.0 to 4.8.1.
---
updated-dependencies:
- dependency-name: net.alchim31.maven:scala-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-07 19:33:33 +08:00
dependabot[bot]
5eabcae27b
Bump scala-maven-plugin from 4.8.0 to 4.8.1 in /jvm-packages/xgboost4j ( #8876 )
...
Bumps scala-maven-plugin from 4.8.0 to 4.8.1.
---
updated-dependencies:
- dependency-name: net.alchim31.maven:scala-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-07 19:33:16 +08:00
dependabot[bot]
d06b1fc26e
Bump scala-maven-plugin in /jvm-packages/xgboost4j-example ( #8875 )
...
Bumps scala-maven-plugin from 4.8.0 to 4.8.1.
---
updated-dependencies:
- dependency-name: net.alchim31.maven:scala-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-07 19:32:06 +08:00
dependabot[bot]
ffa5eb2aa4
Bump scala-maven-plugin in /jvm-packages/xgboost4j-gpu ( #8874 )
...
Bumps scala-maven-plugin from 4.8.0 to 4.8.1.
---
updated-dependencies:
- dependency-name: net.alchim31.maven:scala-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-07 19:31:50 +08:00
dependabot[bot]
0f6c502d36
Bump scala-maven-plugin in /jvm-packages/xgboost4j-spark ( #8873 )
...
Bumps scala-maven-plugin from 4.8.0 to 4.8.1.
---
updated-dependencies:
- dependency-name: net.alchim31.maven:scala-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-03-07 19:31:23 +08:00
amdsc21
f286ae5bfa
add hip rocthrust hipcub
2023-03-07 06:35:00 +01:00
amdsc21
f13a7f8d91
add submodules
2023-03-07 05:44:24 +01:00
amdsc21
c51a1c9aae
rename hip.cc to hip
2023-03-07 05:39:53 +01:00
amdsc21
30de728631
fix hip.cc
2023-03-07 05:11:42 +01:00
amdsc21
75fa15b36d
add hip support
2023-03-07 04:02:49 +01:00
amdsc21
eb30cb6293
add hip support
2023-03-07 03:49:52 +01:00
amdsc21
cafbfce51f
add hip.h
2023-03-07 03:46:26 +01:00
amdsc21
6039a71e6c
add hip structure
2023-03-07 02:17:19 +01:00
Jiaming Yuan
7eba285a1e
Support sklearn cross validation for ranker. ( #8859 )
...
* Support sklearn cross validation for ranker.
- Add a convention for X to include a special `qid` column.
sklearn utilities consider only `X`, `y` and `sample_weight` for supervised learning
algorithms, but we need an additional qid array for ranking.
It's important to be able to support the cross validation function in sklearn since all
other tuning functions like grid search are based on cross validation.
2023-03-07 00:22:08 +08:00
Jiaming Yuan
cad7401783
Disable gcc parallel extension if openmp is not available. ( #8871 )
...
`<parallel/algorithm>` internally includes the <omp.h> header, which leads to an error
when openmp is not available.
2023-03-06 22:51:06 +08:00
Jiaming Yuan
228a46e8ad
Support learning rate for zero-hessian objectives. ( #8866 )
2023-03-06 20:33:28 +08:00
Jiaming Yuan
173096a6a7
Discover libasan.so.6. ( #8864 )
2023-03-06 18:56:54 +08:00
Jiaming Yuan
6a892ce281
Specify src path for isort. ( #8867 )
2023-03-06 17:30:27 +08:00
Jiaming Yuan
4d665b3fb0
Restore clang tidy test. ( #8861 )
2023-03-03 13:47:04 -08:00
Rong Ou
2dc22e7aad
Take advantage of C++17 features ( #8858 )
...
---------
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-03-04 00:24:13 +08:00
Rory Mitchell
69a50248b7
Fix scope of feature set pointers ( #8850 )
...
---------
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2023-03-02 12:37:14 +08:00
mzzhang95
6cef9a08e9
[pyspark] Update eval_metric validation to support list of strings ( #8826 )
2023-03-02 08:24:12 +08:00
Jiaming Yuan
803d5e3c4c
Update c++ requirement to 17 for the R package. ( #8860 )
2023-03-01 14:49:39 -08:00
Rong Ou
a5852365fd
Update dmlc-core to get C++17 deprecation warning ( #8855 )
2023-03-01 12:30:59 -08:00
Rong Ou
7cbaee9916
Support column split in approx tree method ( #8847 )
2023-03-02 03:59:07 +08:00
Philip Hyunsu Cho
6d8afb2218
[CI] Require C++17 + CMake 3.18; Use CUDA 11.8 in CI ( #8853 )
...
* Update to C++17
* Turn off unity build
* Update CMake to 3.18
* Use MSVC 2022 + CUDA 11.8
* Re-create stack for worker images
* Allocate more disk space for Windows
* Tempiorarily disable clang-tidy
* RAPIDS now requires Python 3.10+
* Unpin cuda-python
* Use latest NCCL
* Use Ubuntu 20.04 in RMM image
* Mark failing mgpu test as xfail
2023-03-01 09:22:24 -08:00
Jiaming Yuan
d54ef56f6f
Fix cache with gc ( #8851 )
...
- Make DMatrixCache thread-safe.
- Remove the use of thread-local memory.
2023-03-01 00:39:06 +08:00
Rong Ou
d9688f93c7
Support column-split in row partitioner ( #8828 )
2023-02-26 04:43:35 +08:00
Mauro Leggieri
90c0633a28
Fixes compilation errors on MSVC x86 targets ( #8823 )
2023-02-26 03:20:28 +08:00
Rong Ou
a65ad0bd9c
Support column split in histogram builder ( #8811 )
2023-02-17 22:37:01 +08:00
dependabot[bot]
40fd3d6d5f
Bump maven-javadoc-plugin in /jvm-packages/xgboost4j-gpu ( #8815 )
...
Bumps [maven-javadoc-plugin](https://github.com/apache/maven-javadoc-plugin ) from 3.4.1 to 3.5.0.
- [Release notes](https://github.com/apache/maven-javadoc-plugin/releases )
- [Commits](https://github.com/apache/maven-javadoc-plugin/compare/maven-javadoc-plugin-3.4.1...maven-javadoc-plugin-3.5.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-javadoc-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-17 16:39:16 +08:00
dependabot[bot]
6ce9a35f55
Bump maven-javadoc-plugin from 3.4.1 to 3.5.0 in /jvm-packages/xgboost4j ( #8813 )
...
Bumps [maven-javadoc-plugin](https://github.com/apache/maven-javadoc-plugin ) from 3.4.1 to 3.5.0.
- [Release notes](https://github.com/apache/maven-javadoc-plugin/releases )
- [Commits](https://github.com/apache/maven-javadoc-plugin/compare/maven-javadoc-plugin-3.4.1...maven-javadoc-plugin-3.5.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-javadoc-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-17 15:04:06 +08:00
dependabot[bot]
d62daa0b32
Bump maven-javadoc-plugin from 3.4.1 to 3.5.0 in /jvm-packages ( #8814 )
...
Bumps [maven-javadoc-plugin](https://github.com/apache/maven-javadoc-plugin ) from 3.4.1 to 3.5.0.
- [Release notes](https://github.com/apache/maven-javadoc-plugin/releases )
- [Commits](https://github.com/apache/maven-javadoc-plugin/compare/maven-javadoc-plugin-3.4.1...maven-javadoc-plugin-3.5.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-javadoc-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-16 23:16:11 +08:00
Jiaming Yuan
c0afdb6786
Fix CPU bin compression with categorical data. ( #8809 )
...
* Fix CPU bin compression with categorical data.
* The bug causes the maximum category to be lesser than 256 or the maximum number of bins when
the input data is dense.
2023-02-16 04:20:34 +08:00
Jiaming Yuan
cce4af4acf
Initial support for quantile loss. ( #8750 )
...
- Add support for Python.
- Add objective.
2023-02-16 02:30:18 +08:00
Jiaming Yuan
282b1729da
Specify the number of threads for parallel sort. ( #8735 )
...
* Specify the number of threads for parallel sort.
- Pass context object into argsort.
- Replace macros with inline functions.
2023-02-16 00:20:19 +08:00
Jiaming Yuan
c7c485d052
Extract fit intercept. ( #8793 )
2023-02-15 22:41:31 +08:00
Jiaming Yuan
594371e35b
Fix CPP lint. ( #8807 )
2023-02-15 20:16:35 +08:00
Jiaming Yuan
e62167937b
[CI] Update action cache for jvm tests. ( #8806 )
2023-02-15 18:43:48 +08:00
Rong Ou
74572b5d45
Add convenience method for allgather ( #8804 )
2023-02-15 11:37:11 +08:00
WeichenXu
f27a7258c6
Fix feature types param ( #8772 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2023-02-14 02:16:42 +08:00
Jiaming Yuan
52d0230b58
Fix merge conflict. ( #8791 )
2023-02-13 23:43:42 +08:00
Jiaming Yuan
81b2ee1153
Pass DMatrix into metric for caching. ( #8790 )
2023-02-13 22:15:05 +08:00
Jiaming Yuan
31d3ec07af
Extract device algorithms. ( #8789 )
2023-02-13 20:53:53 +08:00
Jiaming Yuan
457f704e3d
Add quantile metric. ( #8761 )
2023-02-13 19:07:40 +08:00
Jiaming Yuan
d11a0044cf
Generalize prediction cache. ( #8783 )
...
* Extract most of the functionality into `DMatrixCache`.
* Move API entry to independent file to reduce dependency on `predictor.h` file.
* Add test.
---------
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2023-02-13 12:36:43 +08:00
Rong Ou
ed91e775ec
Fix quantile tests running on multi-gpus ( #8775 )
...
* Fix quantile tests running on multi-gpus
* Run some gtests with multiple GPUs
* fix mgpu test naming
* Instruct NCCL to print extra logs
* Allocate extra space in /dev/shm to enable NCCL
* use gtest_skip to skip mgpu tests
---------
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2023-02-12 17:00:26 -08:00
Jiaming Yuan
225b3158f6
Support custom metric in sklearn ranker. ( #8786 )
2023-02-12 13:14:07 +08:00
Jiaming Yuan
17b709acb9
Rename ranking utils to threading utils. ( #8785 )
2023-02-12 05:41:18 +08:00
Jiaming Yuan
70c9b885ef
Extract floating point rounding routines. ( #8771 )
2023-02-12 04:26:41 +08:00
Jiaming Yuan
e9c178f402
[doc] Document update [skip ci] ( #8784 )
...
- Remove version specifics in cat demo.
- Remove aws yarn.
- Update faq.
- Stop mentioning MPI.
- Update sphinx inventory links.
- Fix typo.
2023-02-12 04:25:22 +08:00
Jiaming Yuan
8a16944664
Fix ranking with quantile dmatrix and group weight. ( #8762 )
2023-02-10 20:32:35 +08:00
Dai-Jie (Jay) Wu
ad0ccc6e4f
[doc] fix inconsistent doc and minor typo for external memory ( #8773 )
2023-02-10 01:05:34 +08:00
Jiaming Yuan
199c421d60
Send default configuration from metric to objective. ( #8760 )
2023-02-09 20:18:07 +08:00
Jiaming Yuan
5f76edd296
Extract make metric name from ranking metric. ( #8768 )
...
- Extract the metric parsing routine from ranking.
- Add a test.
- Accept null for string view.
2023-02-09 18:30:21 +08:00
Jiaming Yuan
4ead65a28c
Increase timeout limit for linear. ( #8767 )
2023-02-09 18:20:12 +08:00
Rong Ou
cbf98cb9c6
Add Allgather to collective communicator ( #8765 )
...
* Add Allgather to collective communicator
2023-02-09 11:31:22 +08:00
Jiaming Yuan
48cefa012e
Support multiple alphas for segmented quantile. ( #8758 )
2023-02-07 17:17:59 +08:00
Jiaming Yuan
c4802bfcd0
Cleanup booster param types. ( #8756 )
2023-02-07 15:52:19 +08:00
Jiaming Yuan
7b3d473593
[doc] Add demo for inference using individual tree. ( #8752 )
2023-02-07 04:40:18 +08:00
Jiaming Yuan
28bb01aa22
Extract optional weight. ( #8747 )
...
- Extract optional weight from coommon.h to reduce dependency on this header.
- Add test.
2023-02-07 03:11:53 +08:00
Jiaming Yuan
0f37a01dd9
Require black formatter for the python package. ( #8748 )
2023-02-07 01:53:33 +08:00
Jiaming Yuan
a2e433a089
Fix empty DMatrix with categorical features. ( #8739 )
2023-02-07 00:40:11 +08:00
Rory Mitchell
7214a45e83
Fix different number of features in gpu_hist evaluator. ( #8754 )
2023-02-06 23:15:16 +08:00
Rong Ou
66191e9926
Support cpu quantile sketch with column-wise data split ( #8742 )
2023-02-05 14:26:24 +08:00
Jiaming Yuan
c1786849e3
Use array interface for CSC matrix. ( #8672 )
...
* Use array interface for CSC matrix.
Use array interface for CSC matrix and align the interface with CSR and dense.
- Fix nthread issue in the R package DMatrix.
- Unify the behavior of handling `missing` with other inputs.
- Unify the behavior of handling `missing` around R, Python, Java, and Scala DMatrix.
- Expose `num_non_missing` to the JVM interface.
- Deprecate old CSR and CSC constructors.
2023-02-05 01:59:46 +08:00
BenEfrati
213b5602d9
Add sample_weight to eval_metric ( #8706 )
2023-02-05 00:06:38 +08:00
Philip Hyunsu Cho
dd79ab846f
[CI] Fix failing arm build ( #8751 )
...
* Always install Conda env into /opt/python; use Mamba
* Change ownership of Conda env to buildkite-agent user
* Use unique name
* Fix
2023-02-03 22:32:48 -08:00
Jiaming Yuan
0e61ba57d6
Fix GPU L1 error. ( #8749 )
2023-02-04 03:02:00 +08:00
Hamel Husain
16ef016ba7
[CI] Use bash -l {0} as the default in GitHub Actions ( #8741 )
2023-01-31 15:00:29 +08:00
James Lamb
0d8248ddcd
[R] discourage use of regex for fixed string comparisons ( #8736 )
2023-01-30 18:47:21 +08:00
Jiaming Yuan
1325ba9251
Support primitive types of pyarrow-backed pandas dataframe. ( #8653 )
...
Categorical data (dictionary) is not supported at the moment.
2023-01-30 17:53:29 +08:00
Jiaming Yuan
3760cede0f
Consistent use of context to specify number of threads. ( #8733 )
...
- Use context in all tests.
- Use context in R.
- Use context in C API DMatrix initialization. (0 threads is used as dft).
2023-01-30 15:25:31 +08:00
Jiaming Yuan
21a28f2cc5
Small refactor for hist builder. ( #8698 )
...
- Use span instead of vector as parameter. No perf change as the builder work on pointer.
- Use const pointer for reg tree.
2023-01-30 14:06:41 +08:00
Rong Ou
8af98e30fc
Use in-memory communicator to test quantile ( #8710 )
2023-01-27 23:28:28 +08:00
James Lamb
96e6b6beba
[ci] remove unused imports in tests ( #8707 )
2023-01-25 14:10:29 +08:00
Philip Hyunsu Cho
d29e45371f
[R-package] Alter xgb.train() to accept multiple eval metrics as a list ( #8657 )
2023-01-24 17:14:14 -08:00
James Lamb
0f4d52a864
[R] add tests on print.xgb.DMatrix() ( #8704 )
2023-01-22 06:44:14 +08:00
Jiaming Yuan
9fb12b20a4
Cleanup the callback module. ( #8702 )
...
- Cleanup pylint markers.
- run formatter.
- Update examples of using callback.
2023-01-22 00:13:49 +08:00
Jiaming Yuan
34eee56256
Fix compiler warnings. ( #8703 )
...
Fix warnings about signed/unsigned comparisons.
2023-01-21 15:16:23 +08:00
Jiaming Yuan
e49e0998c0
Extract CPU sampling routines. ( #8697 )
2023-01-19 23:28:18 +08:00
Jiaming Yuan
7a068af1a3
Workaround CUDA warning. ( #8696 )
2023-01-19 09:16:08 +08:00
James Lamb
6933240837
[python-package] remove unused functions in xgboost.data ( #8695 )
2023-01-19 08:02:54 +08:00
Jiaming Yuan
4416452f94
Return single thread from context when called inside omp region. ( #8693 )
2023-01-18 09:23:37 +08:00
Jiaming Yuan
31b9cbab3d
Make sure input numpy array is aligned. ( #8690 )
...
- use `np.require` to specify that the alignment is required.
- scipy csr as well.
- validate input pointer in `ArrayInterface`.
2023-01-18 08:12:13 +08:00
Jiaming Yuan
175986b739
[doc] Add missing document for pyspark ranker. [skip ci] ( #8692 )
2023-01-18 07:52:18 +08:00
Rong Ou
78396f8a6e
Initial support for column-split cpu predictor ( #8676 )
2023-01-18 06:33:13 +08:00
James Lamb
980233e648
[R] remove XGBoosterPredict_R ( fixes #8687 ) ( #8689 )
2023-01-17 14:19:01 +08:00
Jiaming Yuan
247946a875
Cache transformed in QuantileDMatrix for efficiency. ( #8666 )
2023-01-17 06:02:40 +08:00
James Lamb
06ba285f71
[R] fix OpenMP detection on macOS ( #8684 )
2023-01-17 05:01:26 +08:00
Jiaming Yuan
43152657d4
Extract JSON type check. ( #8677 )
...
- Reuse it in `GetMissing`.
- Add test.
2023-01-17 03:11:07 +08:00
Jiaming Yuan
9f598efc3e
Rename context in Metric. ( #8686 )
2023-01-17 01:10:13 +08:00
Jiaming Yuan
d6018eb4b9
Remove all use of DeviceQuantileDMatrix. ( #8665 )
2023-01-17 00:04:10 +08:00
Jiaming Yuan
0ae8df9a65
Define default ctors for gpair. ( #8660 )
...
* Define default ctors for gpair.
Fix clang warning:
Definition of implicit copy assignment operator for 'GradientPairInternal<float>' is
deprecated because it has a user-declared copy constructor
2023-01-16 22:52:13 +08:00
dependabot[bot]
a9c6199723
Bump maven-project-info-reports-plugin in /jvm-packages ( #8662 )
...
Bumps [maven-project-info-reports-plugin](https://github.com/apache/maven-project-info-reports-plugin ) from 3.4.1 to 3.4.2.
- [Release notes](https://github.com/apache/maven-project-info-reports-plugin/releases )
- [Commits](https://github.com/apache/maven-project-info-reports-plugin/compare/maven-project-info-reports-plugin-3.4.1...maven-project-info-reports-plugin-3.4.2 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-project-info-reports-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-16 04:57:28 +08:00
dependabot[bot]
37d4482e3e
Bump maven-checkstyle-plugin from 3.2.0 to 3.2.1 in /jvm-packages ( #8661 )
...
Bumps [maven-checkstyle-plugin](https://github.com/apache/maven-checkstyle-plugin ) from 3.2.0 to 3.2.1.
- [Release notes](https://github.com/apache/maven-checkstyle-plugin/releases )
- [Commits](https://github.com/apache/maven-checkstyle-plugin/compare/maven-checkstyle-plugin-3.2.0...maven-checkstyle-plugin-3.2.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-checkstyle-plugin
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-16 02:02:03 +08:00
James Lamb
e227abc57a
[R] avoid leaving test files behind ( #8685 )
2023-01-15 23:34:54 +08:00
Jiaming Yuan
e7d612d22c
[R] Fix threads used to create DMatrix in predict. ( #8681 )
2023-01-15 03:09:08 +08:00
James Lamb
292df67824
[R] remove unused define XGBOOST_CUSTOMIZE_LOGGER ( #8647 )
2023-01-15 02:29:25 +08:00
Jiaming Yuan
f7a2f52136
[R] Get CXX flags from R CMD config. ( #8669 )
2023-01-14 16:48:21 +08:00
Jiaming Yuan
07cf3d3e53
Fix threads in DMatrix slice. ( #8667 )
2023-01-14 07:16:57 +08:00
Jiaming Yuan
e27cda7626
[CI] Skip pyspark sparse tests. ( #8675 )
2023-01-14 05:37:00 +08:00
Jiaming Yuan
b2b6a8aa39
[R] fix CSR input. ( #8673 )
2023-01-14 01:32:41 +08:00
Bobby Wang
72ec0c5484
[pyspark] support pred_contribs ( #8633 )
2023-01-11 16:51:12 +08:00
Jiaming Yuan
cfa994d57f
Multi-target support for L1 error. ( #8652 )
...
- Add matrix support to the median function.
- Iterate through each target for quantile computation.
2023-01-11 05:51:14 +08:00
Jiaming Yuan
badeff1d74
Init estimation for regression. ( #8272 )
2023-01-11 02:04:56 +08:00
Jiaming Yuan
1b58d81315
[doc] Document Python inputs. ( #8643 )
2023-01-10 15:39:32 +08:00
Bobby Wang
4e12f3e1bc
[Breaking][jvm-packages] Bump rapids version to 22.12.0 ( #8648 )
...
* [jvm-packages] Bump rapids version to 22.12.0
This PR bumps spark version to 3.1.1 and the rapids version
to 22.12.0, which results in the latest xgboost can't run
with the old rapids packages.
2023-01-07 18:59:17 +08:00
Jiaming Yuan
06a1cb6e03
Release news for patch releases including upcoming 1.7.3. [skip ci] ( #8645 )
2023-01-06 16:19:16 +08:00
Emre Batuhan Baloğlu
2b88099c74
[doc] Update custom_metric_obj.rst ( #8626 )
2023-01-06 05:08:25 +08:00
Jiaming Yuan
e68a152d9e
Do not return internal value for get_params. ( #8634 )
2023-01-05 17:48:26 +08:00
Jiaming Yuan
26c9882e23
Fix loading GPU pickle with a CPU-only xgboost distribution. ( #8632 )
...
We can handle loading the pickle on a CPU-only machine if the XGBoost is built with CUDA
enabled (Linux and Windows PyPI package), but not if the distribution is CPU-only (macOS
PyPI package).
2023-01-05 02:14:30 +08:00
Bobby Wang
d3ad0524e7
[pyspark] Re-work _fit function ( #8630 )
2023-01-04 18:21:57 +08:00
Jiaming Yuan
beefd28471
Split up SHAP from RegTree. ( #8612 )
...
* Split up SHAP from `RegTree`.
Simplify the tree interface.
2023-01-04 18:17:47 +08:00
Jiaming Yuan
d308124910
Refactor PySpark tests. ( #8605 )
...
- Convert classifier tests to pytest tests.
- Replace hardcoded tests.
2023-01-04 17:05:16 +08:00
James Lamb
fa44a33ee6
remove unused variables in JSON-parsing code ( #8627 )
2023-01-04 15:50:33 +08:00
Jiaming Yuan
6eaddaa9c3
[CI] Fix CI with updated dependencies. ( #8631 )
...
* [CI] Fix CI with updated dependencies.
- Fix jvm package get iris.
* Skip SHAP test for now.
* Revert "Skip SHAP test for now."
This reverts commit 9aa28b4d8aee53fa95d92d2a879c6783ff4b2faa.
* Catch all exceptions.
2023-01-03 21:04:04 -08:00
Jiaming Yuan
8d545ab2a2
Implement fit stump. ( #8607 )
2023-01-04 04:14:51 +08:00
dependabot[bot]
20e6087579
Bump kryo from 5.3.0 to 5.4.0 in /jvm-packages ( #8629 )
...
Bumps [kryo](https://github.com/EsotericSoftware/kryo ) from 5.3.0 to 5.4.0.
- [Release notes](https://github.com/EsotericSoftware/kryo/releases )
- [Commits](https://github.com/EsotericSoftware/kryo/compare/kryo-parent-5.3.0...kryo-parent-5.4.0 )
---
updated-dependencies:
- dependency-name: com.esotericsoftware:kryo
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-01-03 18:44:39 +08:00
James Lamb
dd72af2620
[CI] fix git errors related to directory ownership ( #8628 )
2023-01-01 16:05:44 -08:00
James Lamb
9a98c3726c
[R] [CI] add more linting checks ( #8624 )
2022-12-29 18:20:36 +08:00
James Lamb
b05abfc494
[CI] remove unused cpp test helper function ( #8625 )
2022-12-28 02:47:52 +08:00
Rong Ou
3ceeb8c61c
Add data split mode to DMatrix MetaInfo ( #8568 )
2022-12-25 20:37:37 +08:00
Rong Ou
77b069c25d
Support bitwise allreduce operations in the communicator ( #8623 )
2022-12-25 06:40:05 +08:00
James Lamb
c7e82b5914
[R] enforce lintr checks ( fixes #8012 ) ( #8613 )
2022-12-25 05:02:56 +08:00
James Lamb
f489d824ca
[R] remove unused imports in tests ( #8614 )
2022-12-25 03:45:47 +08:00
Jiaming Yuan
c430ae52f3
Fix mypy errors with the latest numpy. ( #8617 )
2022-12-21 01:42:05 -08:00
Philip Hyunsu Cho
5bf9e79413
[CI] Disable gtest with RMM ( #8620 )
2022-12-21 01:41:34 -08:00
Jiaming Yuan
c6a8754c62
Define CUDA Context. ( #8604 )
...
We will transition to non-default and non-blocking CUDA stream.
2022-12-20 15:15:07 +08:00
James Lamb
e01639548a
[R] remove unused compiler flag RABIT_CUSTOMIZE_MSG_ ( #8610 )
2022-12-17 19:36:35 +08:00
James Lamb
17ce1f26c8
[R] address some lintr warnings ( #8609 )
2022-12-17 18:36:14 +08:00
James Lamb
53e6e32718
[R] resolve assignment_linter warnings ( #8599 )
2022-12-17 01:22:41 +08:00
Jiaming Yuan
f6effa1734
Support Series and Python primitives in inplace_predict and QDM ( #8547 )
2022-12-17 00:15:15 +08:00
Jiaming Yuan
a10e4cba4e
Fix linalg iterator. ( #8603 )
2022-12-16 23:05:03 +08:00
Jiaming Yuan
38887a1876
Fix windows build on buildkite. ( #8602 )
2022-12-16 21:12:24 +08:00
Jiaming Yuan
43a647a4dd
Fix inference with categorical feature. ( #8591 )
2022-12-15 17:57:26 +08:00
Esteban Djeordjian
7dc3e95a77
Added ranges for alpha and lambda in docs ( #8597 )
2022-12-15 16:51:04 +08:00
dependabot[bot]
0c38ca7f6e
Bump nexus-staging-maven-plugin from 1.6.7 to 1.6.13 in /jvm-packages ( #8600 )
2022-12-15 08:44:05 +00:00
Jiaming Yuan
001e663d42
Set enable_categorical to True in predict. ( #8592 )
2022-12-15 05:27:06 +08:00
James Lamb
7a07dcf651
[R] resolve line_length_linter warnings ( #8565 )
...
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2022-12-14 21:04:24 +08:00
dependabot[bot]
eac980fbfc
Bump maven-checkstyle-plugin from 3.1.2 to 3.2.0 in /jvm-packages ( #8594 )
...
Bumps [maven-checkstyle-plugin](https://github.com/apache/maven-checkstyle-plugin ) from 3.1.2 to 3.2.0.
- [Release notes](https://github.com/apache/maven-checkstyle-plugin/releases )
- [Commits](https://github.com/apache/maven-checkstyle-plugin/compare/maven-checkstyle-plugin-3.1.2...maven-checkstyle-plugin-3.2.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-checkstyle-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-14 19:46:03 +08:00
James Lamb
06ea6c7e79
[python] remove unnecessary conversions between data structures ( #8546 )
...
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2022-12-14 18:32:02 +08:00
dependabot[bot]
f64871c74a
Bump spark.version from 3.0.1 to 3.0.3 in /jvm-packages ( #8593 )
...
Bumps `spark.version` from 3.0.1 to 3.0.3.
Updates `spark-mllib_2.12` from 3.0.1 to 3.0.3
Updates `spark-core_2.12` from 3.0.1 to 3.0.3
Updates `spark-sql_2.12` from 3.0.1 to 3.0.3
---
updated-dependencies:
- dependency-name: org.apache.spark:spark-mllib_2.12
dependency-type: direct:production
update-type: version-update:semver-patch
- dependency-name: org.apache.spark:spark-core_2.12
dependency-type: direct:production
update-type: version-update:semver-patch
- dependency-name: org.apache.spark:spark-sql_2.12
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-14 17:23:48 +08:00
Jiaming Yuan
40343c8ee1
Test dask demos. ( #8557 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-12-13 18:37:31 +08:00
Rong Ou
15a88ceef0
Fix deprecated CUB calls in CUDA 12.0 ( #8578 )
2022-12-12 17:02:30 +08:00
Philip Hyunsu Cho
35d8447282
[CI] Use conda-forge channel in conda ( #8583 )
2022-12-11 23:25:29 -08:00
Rong Ou
42e6fbb0db
Fix sklearn test that calls a removed field ( #8579 )
2022-12-09 13:06:44 -08:00
Jiaming Yuan
deb3edf562
Support list and tuple for QDM. ( #8542 )
2022-12-10 01:14:44 +08:00
Jiaming Yuan
8824b40961
Update date in release script. [skip ci] ( #8574 )
2022-12-09 23:16:10 +08:00
Rong Ou
0caf2be684
Update NVFlare demo to work with the latest release ( #8576 )
2022-12-09 02:48:20 +08:00
James Lamb
ffee35e0f0
[R] [ci] remove dependency on {devtools} ( #8563 )
2022-12-09 01:21:28 +08:00
James Lamb
fbe40d00d8
[R] resolve brace_linter warnings ( #8564 )
2022-12-08 23:01:00 +08:00
Bobby Wang
40a1a2ffa8
[pyspark] check use_qdm across all the workers ( #8496 )
2022-12-08 18:09:17 +08:00
dependabot[bot]
5aeb8f7009
Bump maven-gpg-plugin from 1.5 to 3.0.1 in /jvm-packages ( #8571 )
2022-12-08 06:59:11 +00:00
dependabot[bot]
f592a5125b
Bump flink.version from 1.7.2 to 1.8.3 in /jvm-packages ( #8561 )
2022-12-07 20:53:22 +00:00
dependabot[bot]
27aea6c7b5
Bump maven-surefire-plugin from 2.19.1 to 2.22.2 in /jvm-packages ( #8562 )
2022-12-07 17:56:05 +00:00
Gianfrancesco Angelini
5540019373
feat(py, plot_importance): + values_format as arg ( #8540 )
2022-12-08 00:47:28 +08:00
François Bobot
8c6630c310
Typo in model schema ( #8543 )
...
categorical -> categories
2022-12-07 22:56:59 +08:00
Matthew Rocklin
b7ffdcdbb9
Properly await async method client.wait_for_workers ( #8558 )
...
* Properly await async method client.wait_for_workers
* ignore mypy error.
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2022-12-07 21:49:30 +08:00
dependabot[bot]
4f1e453ff5
Bump maven-project-info-reports-plugin in /jvm-packages ( #8560 )
...
Bumps [maven-project-info-reports-plugin](https://github.com/apache/maven-project-info-reports-plugin ) from 2.2 to 3.4.1.
- [Release notes](https://github.com/apache/maven-project-info-reports-plugin/releases )
- [Commits](https://github.com/apache/maven-project-info-reports-plugin/compare/maven-project-info-reports-plugin-2.2...maven-project-info-reports-plugin-3.4.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-project-info-reports-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-07 14:33:29 +08:00
Jiaming Yuan
3e26107a9c
Rename and extract Context. ( #8528 )
...
* Rename `GenericParameter` to `Context`.
* Rename header file to reflect the change.
* Rename all references.
2022-12-07 04:58:54 +08:00
James Lamb
05fc6f3ca9
[R] [ci] move linting code out of package ( #8545 )
2022-12-07 03:18:17 +08:00
Jiaming Yuan
e38fe21e0d
Cleanup regression objectives. ( #8539 )
2022-12-07 01:05:42 +08:00
dependabot[bot]
7774bf628e
Bump scalatest-maven-plugin from 1.0 to 2.2.0 in /jvm-packages ( #8509 )
...
Bumps [scalatest-maven-plugin](https://github.com/scalatest/scalatest-maven-plugin ) from 1.0 to 2.2.0.
- [Release notes](https://github.com/scalatest/scalatest-maven-plugin/releases )
- [Commits](https://github.com/scalatest/scalatest-maven-plugin/commits/release-2.2.0 )
---
updated-dependencies:
- dependency-name: org.scalatest:scalatest-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 21:34:22 +08:00
dependabot[bot]
4a99c9bdb8
Bump commons-lang3 from 3.9 to 3.12.0 in /jvm-packages ( #8548 )
...
Bumps commons-lang3 from 3.9 to 3.12.0.
---
updated-dependencies:
- dependency-name: org.apache.commons:commons-lang3
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 20:13:46 +08:00
Jiaming Yuan
d99bdd1b1e
[CI] Fix github action mismatched glibcxx. ( #8551 )
...
* [CI] Fix github action mismatched glibcxx.
Split up the Linux test to use the toolchain from conda forge.
2022-12-06 17:42:15 +08:00
dependabot[bot]
ed1a4f3205
Bump maven-source-plugin from 2.2.1 to 3.2.1 in /jvm-packages ( #8549 )
...
Bumps [maven-source-plugin](https://github.com/apache/maven-source-plugin ) from 2.2.1 to 3.2.1.
- [Release notes](https://github.com/apache/maven-source-plugin/releases )
- [Commits](https://github.com/apache/maven-source-plugin/compare/maven-source-plugin-2.2.1...maven-source-plugin-3.2.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-source-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 14:10:19 +08:00
dependabot[bot]
e85c9b987b
Bump maven-site-plugin from 3.0 to 3.12.1 in /jvm-packages ( #8533 )
...
Bumps [maven-site-plugin](https://github.com/apache/maven-site-plugin ) from 3.0 to 3.12.1.
- [Release notes](https://github.com/apache/maven-site-plugin/releases )
- [Commits](https://github.com/apache/maven-site-plugin/compare/maven-site-plugin-3.0...maven-site-plugin-3.12.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-site-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 11:35:53 +08:00
Jiaming Yuan
7ac52e674f
[doc] Update model schema. ( #8538 )
...
* Update model schema with `num_target`.
2022-12-06 11:35:07 +08:00
dependabot[bot]
2790e3091f
Bump maven-assembly-plugin from 2.6 to 3.4.2 in /jvm-packages ( #8521 )
...
Bumps [maven-assembly-plugin](https://github.com/apache/maven-assembly-plugin ) from 2.6 to 3.4.2.
- [Release notes](https://github.com/apache/maven-assembly-plugin/releases )
- [Commits](https://github.com/apache/maven-assembly-plugin/compare/maven-assembly-plugin-2.6...maven-assembly-plugin-3.4.2 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-assembly-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 04:07:04 +08:00
dependabot[bot]
0c1769b3a5
Bump maven-javadoc-plugin in /jvm-packages/xgboost4j ( #8534 )
...
Bumps [maven-javadoc-plugin](https://github.com/apache/maven-javadoc-plugin ) from 2.10.3 to 3.4.1.
- [Release notes](https://github.com/apache/maven-javadoc-plugin/releases )
- [Commits](https://github.com/apache/maven-javadoc-plugin/compare/maven-javadoc-plugin-2.10.3...maven-javadoc-plugin-3.4.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-javadoc-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 02:30:46 +08:00
dependabot[bot]
67752e3967
Bump scala-maven-plugin from 3.2.2 to 4.8.0 in /jvm-packages ( #8532 )
...
Bumps scala-maven-plugin from 3.2.2 to 4.8.0.
---
updated-dependencies:
- dependency-name: net.alchim31.maven:scala-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-06 00:58:30 +08:00
Jiaming Yuan
8afcecc025
[doc] Fix outdated document [skip ci] ( #8527 )
...
* [doc] Fix document around categorical parameters. [skip ci]
* note on validate parameter [skip ci]
* Fix dask doc as well [skip ci]
2022-12-06 00:56:17 +08:00
Jiaming Yuan
e143a4dd7e
[pyspark] Refactor local tests. ( #8525 )
...
- Use pytest fixture for spark session.
- Replace hardcoded results.
2022-12-05 23:49:54 +08:00
Philip Hyunsu Cho
42c5ee5588
[jvm-packages] Bump version of akka packages ( #8524 )
2022-12-05 22:45:00 +08:00
Jiaming Yuan
e3bf5565ab
Extract transform iterator. ( #8498 )
2022-12-05 21:37:07 +08:00
Jiaming Yuan
d8544e4d9e
[R] Remove unused assert definition. ( #8526 )
2022-12-05 20:29:03 +08:00
dependabot[bot]
d8d2eefa63
Bump junit from 4.13.1 to 4.13.2 in /jvm-packages/xgboost4j-gpu ( #8516 )
...
Bumps [junit](https://github.com/junit-team/junit4 ) from 4.13.1 to 4.13.2.
- [Release notes](https://github.com/junit-team/junit4/releases )
- [Changelog](https://github.com/junit-team/junit4/blob/main/doc/ReleaseNotes4.13.1.md )
- [Commits](https://github.com/junit-team/junit4/compare/r4.13.1...r4.13.2 )
---
updated-dependencies:
- dependency-name: junit:junit
dependency-type: direct:development
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-05 19:11:33 +08:00
dependabot[bot]
8e8d3ac708
Bump kryo from 4.0.2 to 5.3.0 in /jvm-packages ( #8503 )
...
Bumps [kryo](https://github.com/EsotericSoftware/kryo ) from 4.0.2 to 5.3.0.
- [Release notes](https://github.com/EsotericSoftware/kryo/releases )
- [Commits](https://github.com/EsotericSoftware/kryo/compare/kryo-parent-4.0.2...kryo-parent-5.3.0 )
---
updated-dependencies:
- dependency-name: com.esotericsoftware:kryo
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-05 18:01:59 +08:00
dependabot[bot]
3bfe90c183
Bump exec-maven-plugin in /jvm-packages/xgboost4j-gpu ( #8531 )
...
Bumps [exec-maven-plugin](https://github.com/mojohaus/exec-maven-plugin ) from 1.6.0 to 3.1.0.
- [Release notes](https://github.com/mojohaus/exec-maven-plugin/releases )
- [Commits](https://github.com/mojohaus/exec-maven-plugin/compare/exec-maven-plugin-1.6.0...exec-maven-plugin-3.1.0 )
---
updated-dependencies:
- dependency-name: org.codehaus.mojo:exec-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-05 17:41:41 +08:00
dependabot[bot]
a903241fbf
Bump maven-javadoc-plugin in /jvm-packages/xgboost4j-gpu ( #8530 )
...
Bumps [maven-javadoc-plugin](https://github.com/apache/maven-javadoc-plugin ) from 2.10.3 to 3.4.1.
- [Release notes](https://github.com/apache/maven-javadoc-plugin/releases )
- [Commits](https://github.com/apache/maven-javadoc-plugin/compare/maven-javadoc-plugin-2.10.3...maven-javadoc-plugin-3.4.1 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-javadoc-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-05 16:32:08 +08:00
Bobby Wang
f1e9bbcee5
[breakinig] [jvm-packages] change DeviceQuantileDmatrix into QuantileDMatrix ( #8461 )
2022-12-05 12:23:21 +08:00
Rong Ou
78d65a1928
Initial support for column-wise data split ( #8468 )
2022-12-04 01:37:51 +08:00
dependabot[bot]
c0609b98f1
Bump exec-maven-plugin from 1.6.0 to 3.1.0 in /jvm-packages/xgboost4j ( #8518 )
...
Bumps [exec-maven-plugin](https://github.com/mojohaus/exec-maven-plugin ) from 1.6.0 to 3.1.0.
- [Release notes](https://github.com/mojohaus/exec-maven-plugin/releases )
- [Commits](https://github.com/mojohaus/exec-maven-plugin/compare/exec-maven-plugin-1.6.0...exec-maven-plugin-3.1.0 )
---
updated-dependencies:
- dependency-name: org.codehaus.mojo:exec-maven-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-02 17:24:22 -08:00
dependabot[bot]
ba0ed255ef
Bump maven-jar-plugin from 3.0.2 to 3.3.0 in /jvm-packages/xgboost4j-gpu ( #8512 )
...
Bumps [maven-jar-plugin](https://github.com/apache/maven-jar-plugin ) from 3.0.2 to 3.3.0.
- [Release notes](https://github.com/apache/maven-jar-plugin/releases )
- [Commits](https://github.com/apache/maven-jar-plugin/compare/maven-jar-plugin-3.0.2...maven-jar-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-jar-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-02 17:23:03 -08:00
dependabot[bot]
1d8bb7332f
Bump maven-resources-plugin in /jvm-packages/xgboost4j ( #8515 )
...
Bumps [maven-resources-plugin](https://github.com/apache/maven-resources-plugin ) from 3.1.0 to 3.3.0.
- [Release notes](https://github.com/apache/maven-resources-plugin/releases )
- [Commits](https://github.com/apache/maven-resources-plugin/compare/maven-resources-plugin-3.1.0...maven-resources-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-resources-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-02 17:22:26 -08:00
dependabot[bot]
dcc92a6703
Bump maven-jar-plugin from 3.0.2 to 3.3.0 in /jvm-packages/xgboost4j ( #8517 )
...
Bumps [maven-jar-plugin](https://github.com/apache/maven-jar-plugin ) from 3.0.2 to 3.3.0.
- [Release notes](https://github.com/apache/maven-jar-plugin/releases )
- [Commits](https://github.com/apache/maven-jar-plugin/compare/maven-jar-plugin-3.0.2...maven-jar-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-jar-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-02 17:21:42 -08:00
dependabot[bot]
fcafd3a777
Bump maven-jar-plugin from 3.0.2 to 3.3.0 in /jvm-packages ( #8506 )
...
Bumps [maven-jar-plugin](https://github.com/apache/maven-jar-plugin ) from 3.0.2 to 3.3.0.
- [Release notes](https://github.com/apache/maven-jar-plugin/releases )
- [Commits](https://github.com/apache/maven-jar-plugin/compare/maven-jar-plugin-3.0.2...maven-jar-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-jar-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-02 17:21:01 -08:00
dependabot[bot]
b23e97f8b0
Bump maven-resources-plugin from 3.1.0 to 3.3.0 in /jvm-packages ( #8504 )
...
Bumps [maven-resources-plugin](https://github.com/apache/maven-resources-plugin ) from 3.1.0 to 3.3.0.
- [Release notes](https://github.com/apache/maven-resources-plugin/releases )
- [Commits](https://github.com/apache/maven-resources-plugin/compare/maven-resources-plugin-3.1.0...maven-resources-plugin-3.3.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-resources-plugin
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-02 17:20:37 -08:00
Bobby Wang
8e41ad24f5
[pyspark] sort qid for SparkRanker ( #8497 )
...
* [pyspark] sort qid for SparkRandker
* resolve comments
2022-12-01 16:40:35 -08:00
dependabot[bot]
f747e05eac
Bump maven-deploy-plugin from 2.8.2 to 3.0.0 in /jvm-packages ( #8502 )
...
Bumps [maven-deploy-plugin](https://github.com/apache/maven-deploy-plugin ) from 2.8.2 to 3.0.0.
- [Release notes](https://github.com/apache/maven-deploy-plugin/releases )
- [Commits](https://github.com/apache/maven-deploy-plugin/compare/maven-deploy-plugin-2.8.2...maven-deploy-plugin-3.0.0 )
---
updated-dependencies:
- dependency-name: org.apache.maven.plugins:maven-deploy-plugin
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-01 16:39:59 -08:00
Philip Hyunsu Cho
2546d139d6
[jvm-packages] Add missing commons-lang3 dependency to xgboost4j-gpu ( #8508 )
...
* [jvm-packages] Add missing commons-lang3 dependency to xgboost4j-gpu
* Update commons-lang3
2022-12-01 16:27:11 -08:00
Philip Hyunsu Cho
7c6f2346d3
[jvm-packages] Configure dependabot properly ( #8507 )
...
* [jvm-packages] Configure dependabot properly
* Allow automatic updates for Scala and Spark within the same major version
2022-12-01 16:26:47 -08:00
Philip Hyunsu Cho
f550109641
Bump some old dependencies of JVM packages ( #8456 )
2022-11-30 23:04:08 -08:00
Philip Hyunsu Cho
9a98e79649
[jvm-packages] Set up dependabot ( #8501 )
2022-11-30 22:46:17 -08:00
Rong Ou
a8255ea678
Add an in-memory collective communicator ( #8494 )
2022-12-01 00:24:12 +08:00
Jiaming Yuan
157e98edf7
Support half type from cupy. ( #8487 )
2022-11-30 17:56:42 +08:00
Jiaming Yuan
addaa63732
Support null value in CUDA array interface. ( #8486 )
...
* Support null value in CUDA array interface.
- Fix for potential null value in array interface.
- Fix incorrect check on mask stride.
* Simple tests.
* Extract mask.
2022-11-28 17:48:25 -08:00
Jiaming Yuan
3fc1046fd3
Reduce compiler warnings on CPU-only build. ( #8483 )
2022-11-29 00:04:16 +08:00
Jiaming Yuan
d666ba775e
Support all pandas nullable integer types. ( #8480 )
...
- Enumerate all pandas integer types.
- Tests for `None`, `nan`, and `pd.NA`
2022-11-28 22:38:16 +08:00
Jiaming Yuan
f2209c1fe4
Don't shuffle columns in categorical tests. ( #8446 )
2022-11-28 20:28:06 +08:00
WeichenXu
67ea1c3435
[pyspark] Make QDM optional based on cuDF check ( #8471 )
2022-11-27 14:58:54 +08:00
Jiaming Yuan
8f97c92541
Support half type for pandas. ( #8481 )
2022-11-24 12:47:40 +08:00
Jiaming Yuan
e07245f110
Take datatable as row major input. ( #8472 )
...
* Take datatable as row major input.
Try to avoid a transform with dense table.
2022-11-24 09:20:13 +08:00
Jiaming Yuan
284dcf8d22
Add script for change version. ( #8443 )
...
- Replace jvm regex replacement script with mvn command.
- Replace cmake script for python version with python script.
- Automate rest of the manual steps.
The script can handle dev branch, rc release, and formal release version.
2022-11-24 00:06:39 +08:00
Jiaming Yuan
5f1a6fca0d
[R] Use new interface for creating DMatrix from CSR. ( #8455 )
...
* [R] Use new interface for creating DMatrix from CSR.
- CSC is still using the old API.
The old API is not aware of `nthread` parameter, which makes DMatrix to use all available
thread during construction and during transformation lie `SparsePage` -> `CSCPage`.
2022-11-23 21:36:43 +08:00
Nick Becker
58d211545f
explain cpu/gpu interop and link to model IO tutorial ( #8450 )
2022-11-23 20:58:28 +08:00
Bobby Wang
2dde65f807
[ci] reduce pyspark test time ( #8324 )
2022-11-21 16:58:00 +08:00
Joyce
3b8a0e08f7
feat: use commit hash instead of version to actions workflows ( #8460 )
...
Signed-off-by: Joyce Brum <joycebrum@google.com >
Signed-off-by: Joyce Brum <joycebrum@google.com >
2022-11-17 22:04:11 +08:00
Rong Ou
30b1a26fc0
Remove unused page size constant ( #8457 )
2022-11-17 11:41:39 +08:00
Otto von Sperling
812d577597
Fix inline code blocks in 'spark_estimator.rst' ( #8465 )
2022-11-15 05:47:58 +08:00
Robert Maynard
16f96b6cfb
Work with newer thrust and libcudacxx ( #8454 )
...
* Thrust 1.17 removes the experimental/pinned_allocator.
When xgboost is brought into a large project it can
be compiled against Thrust 1.17+ which don't offer
this experimental allocator.
To ensure that going forward xgboost works in all environments we provide a xgboost namespaced version of
the pinned_allocator that previously was in Thrust.
2022-11-11 04:22:53 +08:00
Gavin Zhang
0c6266bc4a
SO_DOMAIN do not support on IBM i, using getsockname instead ( #8437 )
...
Co-authored-by: GavinZhang <zhanggan@cn.ibm.com >
2022-11-10 23:54:57 +08:00
Jiaming Yuan
9dd8d70f0e
Fix mypy errors. ( #8444 )
2022-11-09 13:19:11 +08:00
Jiaming Yuan
0252d504d8
Fix R package build on CI. ( #8445 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2022-11-09 12:18:36 +08:00
Jiaming Yuan
a83748eb45
[CI] Revise R tests. ( #8430 )
...
- Use the standard package check (check on the tarball instead of the source tree).
- Run commands in parallel.
- Cleanup dependencies installation.
- Replace makefile.
- Documentation.
- Test using the image from rhub.
2022-11-09 09:12:13 +08:00
Rong Ou
4449e30184
Always link federated proto statically ( #8442 )
2022-11-09 07:47:38 +08:00
Jiaming Yuan
ca0f7f2714
[doc] Update C tutorial. [skip ci] ( #8436 )
...
- Use rst references instead of doxygen links.
- Replace deprecated functions.
- Add SaveModel; put free step last [skip ci]
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2022-11-09 07:14:12 +08:00
Jiaming Yuan
0b36f8fba1
[R] Fix CRAN test notes. ( #8428 )
...
- Limit the number of used CPU cores in examples.
- Add a note for the constraint.
- Bring back the cleanup script.
2022-11-09 02:03:30 +08:00
Rong Ou
8e76f5f595
Use DataSplitMode to configure data loading ( #8434 )
...
* Use `DataSplitMode` to configure data loading
2022-11-08 16:21:50 +08:00
Jiaming Yuan
0d3da9869c
Require isort on all Python files. ( #8420 )
2022-11-08 12:59:06 +08:00
James Lamb
bf8de227a9
[CI] remove unused import in python tests ( #8409 )
2022-11-03 22:27:25 +08:00
James Lamb
b1b2524dbb
add files from python tests to .gitignore ( #8410 )
2022-11-03 07:57:45 +08:00
Rong Ou
99fa8dad2d
Add back xgboost.rabit for backwards compatibility ( #8408 )
...
* Add back xgboost.rabit for backwards compatibility
* fix my errors
* Fix lint
* Use FutureWarning
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-11-01 21:47:41 -07:00
Philip Hyunsu Cho
0db903b471
Fix formatting in NEWS.md [skip ci]
2022-10-31 15:42:31 -07:00
Jiaming Yuan
917cbc0699
1.7 release note. [skip ci] ( #8374 )
...
* Draft for 1.7 release note. [skip ci]
* Wording [skip ci]
* Update with backports [skip ci]
* Apply suggestions from code review [skip ci]
* Apply suggestions from code review [skip ci]
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
* Update NEWS.md [skip ci]
Co-authored-by: Rory Mitchell <r.a.mitchell.nz@gmail.com >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
Co-authored-by: Rory Mitchell <r.a.mitchell.nz@gmail.com >
2022-10-31 09:32:33 -07:00
Jiaming Yuan
2ed3c29c8a
[CI] Cleanup github action tests. ( #8397 )
...
- Merge doxygen build with sphinx.
- Use mamba on non-windows Github Action.
2022-10-29 06:04:27 +08:00
Joyce
7174d60ed2
Fix Scorecard Github Action not working ( #8402 )
...
* chore: create security policy
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* chore: only latest release on security police
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* chore: security policy support on effort base
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* Use dedicated e-mail address for security reporting
* fix: upgrade scorecard action version
Signed-off-by: Joyce Brum <joycebrum@google.com >
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
Signed-off-by: Joyce Brum <joycebrum@google.com >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-10-28 16:25:43 -04:00
Jiaming Yuan
a408c34558
Update JSON parser demo with categorical feature. ( #8401 )
...
- Parse categorical features in the Python example.
- Add tests.
- Update document.
2022-10-28 20:57:43 +08:00
Jiaming Yuan
cfd2a9f872
Extract dask and spark test into distributed test. ( #8395 )
...
- Move test files.
- Run spark and dask separately to prevent conflicts.
- Gather common code into the testing module.
2022-10-28 16:24:32 +08:00
Jiaming Yuan
f73520bfff
Bump development version to 2.0. ( #8390 )
2022-10-28 15:21:19 +08:00
Christian Clauss
ae27e228c4
xrange() was removed in Python 3 in favor or range() ( #8371 )
2022-10-27 16:36:14 +08:00
Yizhi Liu
5699f60a88
Type fix for WebAssembly: use bst_ulong instead of size_t for ncol in CSR conversion. ( #8369 )
2022-10-26 19:21:45 +08:00
Jiaming Yuan
a2593e60bf
Speedup R test on github. ( #8388 )
2022-10-26 18:02:27 +08:00
Jiaming Yuan
786aa27134
[doc] Additional notes for release [skip ci] ( #8367 )
2022-10-26 17:55:15 +08:00
Jiaming Yuan
cf70864fa3
Move Python testing utilities into xgboost module. ( #8379 )
...
- Add typehints.
- Fixes for pylint.
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-10-26 16:56:11 +08:00
Jiaming Yuan
7e53189e7c
[pyspark] Improve tutorial on enabling GPU support. ( #8385 )
...
- Quote the databricks doc on how to manage dependencies.
- Some wording changes.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-10-26 15:45:54 +08:00
Thomas Stanley
ba9cc43464
Fix acronym ( #8386 )
2022-10-26 06:22:30 +08:00
Philip Hyunsu Cho
8bb55949ef
Fix building XGBoost with libomp 15 ( #8384 )
2022-10-25 12:01:11 -07:00
Jiaming Yuan
d0b99bdd95
[pyspark] Add type hint to basic utilities. ( #8375 )
2022-10-25 17:26:25 +08:00
Jiaming Yuan
1d2f6de573
remove travis status [skip ci] ( #8382 )
2022-10-24 14:37:33 +08:00
Jiaming Yuan
a3b8bca46a
Remove travis configuration file. [skip ci] ( #8381 )
2022-10-23 02:49:29 +08:00
Jiaming Yuan
bb5e18c29c
Fix CUDA async stream. ( #8380 )
2022-10-22 23:13:28 +08:00
Christian Clauss
5761f27e5e
Use ==/!= to compare constant literals (str, bytes, int, float, tuple) ( #8372 )
2022-10-22 21:53:03 +08:00
Jiaming Yuan
99467f3999
[doc] Cleanup outdated documents for GPU. [skip ci] ( #8378 )
2022-10-21 20:13:31 +08:00
Jiaming Yuan
28a466ab51
Fixes for R checks. ( #8330 )
...
- Bump configure.ac version.
- Remove amalgamation to reduce the build time for a single object with the added benefit that we can use parallel build during development.
- Fix c function prototype warning.
- Remove Windows automake file generation step to make the build script easier to understand.
2022-10-20 02:52:54 +08:00
Dmitry Razdoburdin
5bd849f1b5
Unify the partitioner for hist and approx.
...
Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com >
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2022-10-20 02:49:20 +08:00
Jiaming Yuan
c69af90319
Fix github action r tests. ( #8364 )
2022-10-20 01:07:18 +08:00
Jiaming Yuan
c884b9e888
Validate features for inplace predict. ( #8359 )
2022-10-19 23:05:36 +08:00
Joyce
52977f0cdf
Create Security Police ( #8360 )
...
* chore: create security policy
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* chore: only latest release on security police
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* chore: security policy support on effort base
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* Use dedicated e-mail address for security reporting
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-10-18 17:15:30 -07:00
luca-s
c47c71e34f
XGBRanker documentation: few clarifications ( #8356 )
2022-10-19 01:54:14 +08:00
Bobby Wang
76f95a6667
[pyspark] Filter out the unsupported train parameters ( #8355 )
2022-10-18 23:26:02 +08:00
Jiaming Yuan
3901f5d9db
[pyspark] Cleanup data processing. ( #8344 )
...
* Enable additional combinations of ctor parameters.
* Unify procedures for QuantileDMatrix and DMatrix.
2022-10-18 14:56:23 +08:00
Rong Ou
521086d56b
Make federated client more robust ( #8351 )
2022-10-18 13:52:44 +08:00
luca-s
5647fc6542
XGBRanker documentation: missing default objective ( #8347 )
2022-10-18 10:43:29 +08:00
Rong Ou
8f3dee58be
Speed up tests with federated learning enabled ( #8350 )
...
* Speed up tests with federated learning enabled
* Re-enable timeouts
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-10-17 15:17:04 -07:00
Jiaming Yuan
031d66ec27
Configuration for init estimation. ( #8343 )
...
* Configuration for init estimation.
* Check whether the model needs configuration based on const attribute `ModelFitted`
instead of a mutable state.
* Add parameter `boost_from_average` to tell whether the user has specified base score.
* Add tests.
2022-10-18 01:52:24 +08:00
Jiaming Yuan
2176e511fc
Disable pytest-timeout for now. ( #8348 )
2022-10-17 23:06:10 +08:00
Jiaming Yuan
fcddbc9264
FIx incorrect function name. ( #8346 )
2022-10-17 19:28:20 +08:00
Rong Ou
80e10e02ab
Avoid blank lines with federated training ( #8342 )
2022-10-14 14:55:01 +08:00
Rong Ou
b3208aac4e
Fix NVFLARE demo ( #8340 )
2022-10-14 12:18:34 +08:00
Jiaming Yuan
748d516c50
[pyspark] Enable running GPU tests on variable number of GPUs. ( #8335 )
2022-10-13 21:03:45 +08:00
Jiaming Yuan
4633b476e9
[doc] Display survival demos in sphinx doc. [skip ci] ( #8328 )
2022-10-13 20:51:23 +08:00
Jiaming Yuan
3ef1703553
Allow using string view to find JSON value. ( #8332 )
...
- Allow comparison between string and string view.
- Fix compiler warnings.
2022-10-13 17:10:13 +08:00
Philip Hyunsu Cho
29595102b9
[CI] Set up test analytics for CPU Python tests ( #8333 )
...
* [CI] Set up test analytics for CPU Python tests
* Install test collector
2022-10-12 23:15:50 -07:00
Philip Hyunsu Cho
2faa744aba
[CI] Test federated learning plugin in the CI ( #8325 )
2022-10-12 13:57:39 -07:00
Jiaming Yuan
97a5b088a5
[pyspark] Use quantile dmatrix. ( #8284 )
2022-10-12 20:38:53 +08:00
Rory Mitchell
ce0382dcb0
[CI] Refactor tests to reduce CI time. ( #8312 )
2022-10-12 11:32:06 +02:00
Rong Ou
39afdac3be
Better error message when world size and rank are set as strings ( #8316 )
...
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2022-10-12 15:53:25 +08:00
Rory Mitchell
210915c985
Use integer gradients in gpu_hist split evaluation ( #8274 )
2022-10-11 12:16:27 +02:00
Jiaming Yuan
c68684ff4c
Update parameter for categorical feature. ( #8285 )
2022-10-10 19:48:29 +08:00
Jiaming Yuan
5545c49cfc
Require keyword args for data iterator. ( #8327 )
2022-10-10 17:47:13 +08:00
Jiaming Yuan
e1f9f80df2
Use gpu predictor for get csr test. ( #8323 )
2022-10-10 16:12:37 +08:00
Philip Hyunsu Cho
a71421e825
[CI] Update GitHub Actions to use macos-11 ( #8321 )
2022-10-08 00:40:43 -07:00
Philip Hyunsu Cho
d70e59fefc
Fix Intel's link [skip ci]
2022-10-06 16:55:42 -07:00
Philip Hyunsu Cho
50ff8a2623
More CI improvements ( #8313 )
...
* Reduce clutter in log of Python test
* Set up BuildKite test analytics
* Add separate step for building containers
* Enable incremental update of CI stack; custom agent IAM policy
2022-10-06 06:33:46 -08:00
Philip Hyunsu Cho
bc7a6ec603
Fix clang tidy ( #8314 )
...
* Fix clang-tidy
* Exempt clang-tidy from budget check
* Move clang-tidy
2022-10-06 05:16:06 -08:00
Dmitry Razdoburdin
c24e9d712c
Dispatcher for template parameters of BuildHist Kernels ( #8259 )
...
* Intoducing Column Wise Hist Building
* linting
* more linting
* bug fixing
* Removing column samping optimization for a while to simplify the review process.
* linting
* Removing unnecessary changes
* Use DispatchBinType in hist_util.cc
* Adding force_read_by column flag to buildhist. Adding tests for column wise buiilhist.
* Introducing new dispatcher for compile time flags in hist building
* fixing bug with using of DispatchBinType
* Fixing building
* Merging with master branch
Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com >
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2022-10-06 03:02:29 -08:00
Rong Ou
8d4038da57
Don't split input data in federated mode ( #8279 )
...
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-10-05 18:19:28 -08:00
Philip Hyunsu Cho
66fd9f5207
Update sponsors list [skip ci] ( #8309 )
2022-10-05 16:40:46 -08:00
Rory Mitchell
909e49e214
Reduce docker image size. ( #8306 )
2022-10-05 15:55:51 -08:00
Rong Ou
668b8a0ea4
[Breaking] Switch from rabit to the collective communicator ( #8257 )
...
* Switch from rabit to the collective communicator
* fix size_t specialization
* really fix size_t
* try again
* add include
* more include
* fix lint errors
* remove rabit includes
* fix pylint error
* return dict from communicator context
* fix communicator shutdown
* fix dask test
* reset communicator mocklist
* fix distributed tests
* do not save device communicator
* fix jvm gpu tests
* add python test for federated communicator
* Update gputreeshap submodule
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-10-05 14:39:01 -08:00
Jiaming Yuan
e47b3a3da3
Upgrade mypy. ( #8302 )
...
Some breaking changes were made in mypy.
2022-10-05 14:31:59 +08:00
Jiaming Yuan
97c3a80a34
Add C document to sphinx, fix arrow. ( #8300 )
...
- Group C API.
- Add C API sphinx doc.
- Consistent use of `OptionalArg` and the parameter name `config`.
- Remove call to deprecated functions in demo.
- Fix some formatting errors.
- Add links to c examples in the document (only visible with doxygen pages)
- Fix arrow.
2022-10-05 09:52:15 +08:00
Philip Hyunsu Cho
b2bbf49015
Additional improvements to CI ( #8303 )
...
* Wait until budget check is complete
* Ensure that multi-GPU tests run for the master branch
* Fix
2022-10-04 03:03:38 -08:00
Rory Mitchell
d686bf52a6
Reduce time for some multi-gpu tests ( #8288 )
...
* Faster dask tests
* Reuse AllReducer objects in tests.
* Faster boost from prediction tests.
* Use rmm dask fixture.
* Speed up dask demo.
* mypy
* Format with black.
* mypy
* Clang-tidy
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-10-04 02:49:33 -08:00
Philip Hyunsu Cho
ca0547bb65
[CI] Use RAPIDS 22.10 ( #8298 )
...
* [CI] Use RAPIDS 22.10
* Store CUDA and RAPIDS versions in one place
* Fix
* Add missing #include
* Update gputreeshap submodule
* Fix
* Remove outdated distributed tests
2022-10-03 23:18:07 -08:00
Philip Hyunsu Cho
37886a5dff
[CI] Document the use of Docker wrapper script ( #8297 )
...
* [CI] Document the use of Docker wrapper script
* Grammer fixes
* Document buildkite pipeline defs
* tests/buildkite/*.sh isn't meant to run locally
2022-10-02 12:45:00 -07:00
Philip Hyunsu Cho
9af99760d4
Various CI savings ( #8291 )
2022-09-30 05:42:56 -07:00
Jiaming Yuan
299e5000a4
Fix buildkite label. ( #8287 )
2022-09-29 17:33:19 -07:00
Jiaming Yuan
55cf24cc32
Obtain CSR matrix from DMatrix. ( #8269 )
2022-09-29 20:41:43 +08:00
Philip Hyunsu Cho
b14c44ee5e
[CI] Put Multi-GPU test suites in separate pipeline ( #8286 )
...
* [CI] Put Multi-GPU test suites in separate pipeline
* Avoid unset var error in Bash
2022-09-29 00:41:48 -08:00
Bobby Wang
cbf3a5f918
[pyspark][doc] add more doc for pyspark ( #8271 )
...
Co-authored-by: fis <jm.yuan@outlook.com >
2022-09-29 11:58:18 +08:00
Bobby Wang
c91fed083d
[pyspark] disable repartition_random_shuffle by default ( #8283 )
2022-09-29 10:50:51 +08:00
Jiaming Yuan
6925b222e0
Fix mixed types with cuDF. ( #8280 )
2022-09-29 00:57:52 +08:00
Jiaming Yuan
f835368bcf
Mark next release as 1.7 instead of 2.0 ( #8281 )
2022-09-28 14:33:37 +08:00
Jiaming Yuan
6d1452074a
Remove MGPU cpp tests. ( #8276 )
...
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-09-27 21:18:23 +08:00
Jiaming Yuan
fcab51aa82
Support more pandas nullable types ( #8262 )
...
- Float32/64
- Category.
2022-09-27 01:59:50 +08:00
Alex
1082ccd3cc
GitHub Workflows security hardening ( #8267 )
...
Signed-off-by: Alex <aleksandrosansan@gmail.com >
2022-09-27 00:54:27 +08:00
Rory Mitchell
8f77677193
Use quantised gradients in gpu_hist histograms ( #8246 )
2022-09-26 17:35:35 +02:00
Jiaming Yuan
4056974e37
Fix sparse threshold warning. ( #8268 )
2022-09-26 22:22:11 +08:00
WeichenXu
ff71c69adf
[pyspark] Add validation for param 'early_stopping_rounds' and 'validation_indicator_col' ( #8250 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2022-09-26 17:43:03 +08:00
Jiaming Yuan
0cd11b893a
[doc] Fix sphinx build. ( #8270 )
2022-09-26 12:33:31 +08:00
Joyce
be5b95e743
Enable OpenSSF Scorecard Github Action ( #8263 )
...
* chore: enable scorecard github action
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
* docs: add scorecard badge to the README file
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
Signed-off-by: Joyce Brum <joycebrumu.u@gmail.com >
2022-09-25 13:02:36 -07:00
Bobby Wang
8d247f0d64
[jvm-packages] fix spark-rapids compatibility issue ( #8240 )
...
* [jvm-packages] fix spark-rapids compatibility issue
spark-rapids (from 22.10) has shimmed GpuColumnVector, which means
we can't call it directly. So this PR call the UnshimmedGpuColumnVector
2022-09-22 23:31:29 +08:00
WeichenXu
ab342af242
[pyspark] Fix xgboost spark estimator dataset repartition issues ( #8231 )
2022-09-22 21:31:41 +08:00
Jiaming Yuan
3fd331f8f2
Add checks to C pointer arguments. ( #8254 )
2022-09-22 19:02:22 +08:00
Dmitry Razdoburdin
eb7bbee2c9
Optional by-column histogram build. ( #8233 )
...
Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com >
2022-09-22 05:16:13 +08:00
Jiaming Yuan
b791446623
Initial support for IPv6 ( #8225 )
...
- Merge rabit socket into XGBoost.
- Dask interface support.
- Add test to the socket.
2022-09-21 18:06:50 +08:00
Rong Ou
7d43e74e71
JNI wrapper for the collective communicator ( #8242 )
2022-09-21 04:20:25 +08:00
Jiaming Yuan
fffb1fca52
Calculate base_score based on input labels for mae. ( #8107 )
...
Fit an intercept as base score for abs loss.
2022-09-20 20:53:54 +08:00
Bobby Wang
4f42aa5f12
[pyspark] make the model saved by pyspark compatible ( #8219 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2022-09-20 16:43:49 +08:00
Bobby Wang
520586ffa7
[pyspark] fix empty data issue when constructing DMatrix ( #8245 )
...
Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu >
2022-09-20 16:43:20 +08:00
Philip Hyunsu Cho
70df36c99c
[CI] Retire Jenkins server ( #8243 )
2022-09-14 08:46:23 -07:00
Jiaming Yuan
2e63af6117
Mitigate flaky data iter test. ( #8244 )
...
- Reduce the number of batches.
- Verify labels.
2022-09-14 17:54:14 +08:00
Jiaming Yuan
bdf265076d
Make QuantileDMatrix default to sklearn esitmators. ( #8220 )
2022-09-13 13:52:19 +08:00
Rong Ou
a2686543a9
Common interface for collective communication ( #8057 )
...
* implement broadcast for federated communicator
* implement allreduce
* add communicator factory
* add device adapter
* add device communicator to factory
* add rabit communicator
* add rabit communicator to the factory
* add nccl device communicator
* add synchronize to device communicator
* add back print and getprocessorname
* add python wrapper and c api
* clean up types
* fix non-gpu build
* try to fix ci
* fix std::size_t
* portable string compare ignore case
* c style size_t
* fix lint errors
* cross platform setenv
* fix memory leak
* fix lint errors
* address review feedback
* add python test for rabit communicator
* fix failing gtest
* use json to configure communicators
* fix lint error
* get rid of factories
* fix cpu build
* fix include
* fix python import
* don't export collective.py yet
* skip collective communicator pytest on windows
* add review feedback
* update documentation
* remove mpi communicator type
* fix tests
* shutdown the communicator separately
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2022-09-12 15:21:12 -07:00
Jiaming Yuan
bc818316f2
Prepare for improving Windows networking compatibility. ( #8234 )
...
* Prepare for improving Windows networking compatibility.
* Include dmlc filesystem indirectly as dmlc/filesystem.h includes windows.h, which
conflicts with winsock2.h
* Define `NOMINMAX` conditionally.
* Link the winsock library when mysys32 is used.
* Add config file for read the doc.
2022-09-10 15:16:49 +08:00
Jiaming Yuan
dd44ac91b8
[CI] Use binary R dependencies on Windows. ( #8241 )
2022-09-09 19:51:15 -07:00
Philip Hyunsu Cho
23faf656ad
[CI] Don't require manual approval for master branch ( #8235 )
2022-09-08 09:26:22 -08:00
Philip Hyunsu Cho
e888eb2fa9
[CI] Migrate CI pipelines from Jenkins to BuildKite ( #8142 )
...
* [CI] Migrate CI pipelines from Jenkins to BuildKite
* Require manual approval
* Less verbose output when pulling Docker
* Remove us-east-2 from metadata.py
* Add documentation
* Add missing underscore
* Add missing punctuation
* More specific instruction
* Better paragraph structure
2022-09-07 16:29:25 -08:00
Philip Hyunsu Cho
b397d64c96
Drop use of deleted virtual function to support older MacOS ( #8226 )
...
* Support older MacOS
* Update json.h
2022-09-07 11:25:59 -08:00
Rehan Guha
dc07137a2c
Updated dart.rst with correct links ( #8229 )
...
Updated the DART paper link as it was invalid and link was broken.
2022-09-08 00:57:09 +08:00
Jiaming Yuan
b5eb36f1af
Add max_cat_threshold to GPU and handle missing cat values. ( #8212 )
2022-09-07 00:57:51 +08:00
Jiaming Yuan
441ffc017a
Copy data from Ellpack to GHist. ( #8215 )
2022-09-06 23:05:49 +08:00
Bobby Wang
7ee10e3dbd
[pyspark] Cleanup the comments ( #8217 )
2022-09-05 16:20:12 +08:00
Jiaming Yuan
ada4a86d1c
Fix dask interface with latest cupy. ( #8210 )
2022-09-03 03:10:43 +08:00
Dmitry Razdoburdin
deae99e662
Optimization/buildhist/hist util ( #8218 )
...
* BuildHistKernel optimization
Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com >
2022-09-02 19:39:45 +08:00
Rong Ou
b78bc734d9
Fix dask.py lint error ( #8216 )
2022-09-02 16:30:01 +08:00
Philip Hyunsu Cho
56395d120b
Work around MSVC behavior wrt constexpr capture ( #8211 )
...
* Work around MSVC behavior wrt constexpr capture
* Fix lint
2022-08-31 11:42:08 -08:00
CW
a868498c18
[doc] Update prediction.rst ( #8214 )
2022-08-31 21:00:12 +08:00
Jiaming Yuan
8dac90a593
Mark parameter validation non-experimental. ( #8206 )
2022-08-30 15:49:43 +08:00
Rong Ou
d6e2013c5f
Set max message size in insecure gRPC ( #8203 )
2022-08-26 16:33:51 +08:00
WeichenXu
651f0a8889
[pyspark] Fixing xgboost.spark python doc ( #8200 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2022-08-25 14:41:48 +08:00
WeichenXu
d03794ce7a
[pyspark] Add param validation for "objective" and "eval_metric" param, and remove invalid booster params ( #8173 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2022-08-24 15:29:43 +08:00
Jiaming Yuan
9b32e6e2dc
Fix release script. ( #8187 ) ( #8195 )
2022-08-23 15:08:30 +08:00
WeichenXu
f4628c22a4
[pyspark] Implement SparkXGBRanker estimator ( #8172 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2022-08-23 02:35:19 +08:00
Philip Hyunsu Cho
35ef8abc27
[CI] Prune unused archs from libnccl ( #8179 )
...
* [CI] Prune unused archs from libnccl
* Put pruning logic in CI directory
* Don't use --color in grep
2022-08-21 00:46:16 -08:00
Rong Ou
ad3bc0edee
Allow insecure gRPC connections for federated learning ( #8181 )
...
* Allow insecure gRPC connections for federated learning
* format
2022-08-19 12:16:14 +08:00
WeichenXu
53d2a733b0
[pyspark] Make Xgboost estimator support using sparse matrix as optimization ( #8145 )
...
Signed-off-by: Weichen Xu <weichen.xu@databricks.com >
2022-08-19 01:57:28 +08:00
Rory Mitchell
1703dc330f
Optimise histogram kernels ( #8118 )
2022-08-18 14:07:26 +02:00
Gavin Zhang
40a10c217d
Use make on i system ( #8178 )
...
Co-authored-by: GavinZhang <zhanggan@cn.ibm.com >
2022-08-18 12:55:32 +08:00
dependabot[bot]
93966b0d19
Bump hadoop-common from 3.2.3 to 3.2.4 in /jvm-packages/xgboost4j-flink ( #8157 )
...
Bumps hadoop-common from 3.2.3 to 3.2.4.
---
updated-dependencies:
- dependency-name: org.apache.hadoop:hadoop-common
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-15 06:47:27 -08:00
Andy Kattine
a9458fd844
Grammar Fix in Introduction to Boosted Trees ( #8166 )
...
Added "of" to "objective functions is that they consist of two parts" in line 32 of ./doc/tutorials/model.rst
2022-08-15 15:19:47 +08:00
Ravi Makhija
fa869eebd9
Edit grammar in custom metric tutorial ( #8163 )
2022-08-13 01:02:25 +08:00
Rory Mitchell
f421c26d35
Tune cuda architectures ( #8152 )
2022-08-11 13:36:47 -07:00
Jiaming Yuan
16bca5d4a1
Support CPU input for device QuantileDMatrix. ( #8136 )
...
- Copy `GHistIndexMatrix` to `Ellpack` when needed.
2022-08-11 21:21:26 +08:00
Jiaming Yuan
36e7c5364d
[dask] Deterministic rank assignment. ( #8018 )
2022-08-11 19:17:58 +08:00
Ravi Makhija
20d1bba1bb
Simplify Python getting started example ( #8153 )
...
Load data set via `sklearn` rather than a local file path.
2022-08-11 16:42:09 +08:00
Jiaming Yuan
d868126c39
[CI] Fix R build on Jenkins. ( #8154 )
2022-08-11 14:50:03 +08:00
Jiaming Yuan
570f8ae4ba
Use black on more Python files. ( #8137 )
2022-08-11 01:38:11 +08:00
Jiaming Yuan
bdb291f1c2
[doc] Clarification for feature importance. ( #8151 )
2022-08-11 00:30:42 +08:00
Jiaming Yuan
446d536c23
Fix loading DMatrix binary in distributed env. ( #8149 )
...
- Try to load DMatrix binary before trying to parse text input.
- Remove some unmaintained code.
2022-08-10 22:53:16 +08:00
Jiaming Yuan
8fc60b31bc
Update PyPi wheel size limit. ( #8150 )
2022-08-10 18:49:57 +08:00
Jiaming Yuan
9ae547f994
Use config_context in sklearn interface. ( #8141 )
2022-08-09 14:48:54 +08:00
Bobby Wang
03cc3b359c
[pyspark] support a list of feature column names ( #8117 )
2022-08-08 17:05:27 +08:00
Jiaming Yuan
bcc8679a05
Update CUDA docker image and NCCL. ( #8139 )
2022-08-07 16:32:41 +08:00
Praateek Mahajan
ff471b3fab
In PySpark Estimator example use the model with validation_indicator ( #8131 )
...
* use the validation_indicator model
* use the validation_indicator model for regression
2022-08-03 13:57:41 +08:00
Jiaming Yuan
d87f69215e
Quantile DMatrix for CPU. ( #8130 )
...
- Add a new `QuantileDMatrix` that works for both CPU and GPU.
- Deprecate `DeviceQuantileDMatrix`.
2022-08-02 15:51:23 +08:00
Jiaming Yuan
2cba1d9fcc
Fix compatibility with latest cupy. ( #8129 )
...
* Fix compatibility with latest cupy.
* Freeze mypy.
2022-08-01 15:24:42 +08:00
Philip Hyunsu Cho
24c2373080
[Doc] Indicate lack of py-xgboost-gpu on Windows ( #8127 )
2022-07-28 12:57:16 -07:00
Jiaming Yuan
2c70751d1e
Implement iterative DMatrix for CPU. ( #8116 )
2022-07-26 22:34:21 +08:00
Jiaming Yuan
546de5efd2
[pyspark] Cleanup data processing. ( #8088 )
...
- Use numpy stack for handling list of arrays.
- Reuse concat function from dask.
- Prepare for `QuantileDMatrix`.
- Remove unused code.
- Use iterator for prediction to avoid initializing xgboost model
2022-07-26 15:00:52 +08:00
Jiaming Yuan
3970e4e6bb
Move pylint helper from dmlc-core. ( #8101 )
...
* Move pylint helper from dmlc-core.
- Move the helper into the XGBoost ci_build.
- Run it with multiprocessing.
* Fix original test.
2022-07-23 08:12:37 +08:00
Jiaming Yuan
7785d65c8a
Fix feature weights with multiple column sampling. ( #8100 )
2022-07-22 20:23:05 +08:00
Jiaming Yuan
4a4e5c7c18
Prepare gradient index for Quantile DMatrix. ( #8103 )
...
* Prepare gradient index for Quantile DMatrix.
- Implement push batch with adapter batch.
- Implement `GetFvalue` for prediction.
2022-07-22 17:26:33 +08:00
Rory Mitchell
1be09848a7
Refactor split valuation kernel ( #8073 )
2022-07-21 15:41:50 +02:00
Tim Gates
cb40bbdadd
docs: fix simple typo, cannonical -> canonical ( #8099 )
...
There is a small typo in src/common/partition_builder.h.
Should read `canonical` rather than `cannonical`.
Signed-off-by: Tim Gates <tim.gates@iress.com >
2022-07-20 21:04:50 +08:00
QuellaZhang
703261e78f
[MSVC][std:c++latest] Fix compiler error ( #8093 )
...
Co-authored-by: QuellaZhang <zhangyi2090@163.com >
2022-07-20 15:15:39 +08:00
Jiaming Yuan
ef11b024e8
Cleanup data generator. ( #8094 )
...
- Avoid duplicated definition of data shape.
- Explicitly define numpy iterator for CPU data.
2022-07-20 13:48:52 +08:00
Jiaming Yuan
5156be0f49
Limit max_depth to 30 for GPU. ( #8098 )
2022-07-20 12:28:49 +08:00
Jiaming Yuan
8bdea72688
[Python] Require black and isort for new Python files. ( #8096 )
...
* [Python] Require black and isort for new Python files.
- Require black and isort for spark and dask module.
These files are relatively new and are more conform to the black formatter. We will
convert the rest of the library as we move forward.
Other libraries including dask/distributed and optuna use the same formatting style and
have a more strict standard. The black formatter is indeed quite nice, automating it can
help us unify the code style.
- Gather Python checks into a single script.
2022-07-20 10:25:24 +08:00
WeichenXu
f23cc92130
[pyspark] User guide doc and tutorials ( #8082 )
...
Co-authored-by: Bobby Wang <wbo4958@gmail.com >
2022-07-19 22:25:14 +08:00
Bobby Wang
f801d3cf15
[PySpark] change the returning model type to string from binary ( #8085 )
...
* [PySpark] change the returning model type to string from binary
XGBoost pyspark can be can be accelerated by RAPIDS Accelerator seamlessly by
changing the returning model type from binary to string.
2022-07-19 18:39:20 +08:00
Jiaming Yuan
2365f82750
[dask] Mitigate non-deterministic test. ( #8077 )
2022-07-19 16:55:59 +08:00
Rong Ou
7a6b711eb8
Remove unused updater basemaker ( #8091 )
2022-07-19 15:41:27 +08:00
Philip Hyunsu Cho
4325178822
[CI] Clear workspace after budget check ( #8092 )
...
* [CI] Clear workspace after budget check
* Windows too
2022-07-18 19:17:33 -07:00
Jiaming Yuan
4083440690
Small cleanups to various data types. ( #8086 )
...
- Use `bst_bin_t` in batch param constructor.
- Use `StringView` to avoid `std::string` when appropriate.
- Avoid using `MetaInfo` in quantile constructor to limit the scope of parameter.
2022-07-18 22:39:36 +08:00
Jiaming Yuan
e28f6f6657
[doc] Integrate pyspark module into sphinx doc [skip ci] ( #8066 )
2022-07-17 10:46:09 +08:00
Rafail Giavrimis
579ab23b10
Check cudf lazily ( #8084 )
2022-07-17 09:27:43 +08:00
Bobby Wang
a33f35eecf
[PySpark] add gpu support for spark local mode ( #8068 )
2022-07-17 07:59:06 +08:00
Bobby Wang
91bb9e2cb3
[PySpark] fix raw_prediction_col parameter and minor cleanup ( #8067 )
2022-07-16 17:58:57 +08:00
Jiaming Yuan
0ce80b7bcf
Mitigate flaky GPU test. ( #8078 )
...
The flakiness is caused by the global random engine, which will take some time to fix.
2022-07-16 13:45:32 +08:00
Jiaming Yuan
7a5586f3db
Fix GPU quantile distributed test. ( #8076 )
2022-07-16 11:40:53 +08:00
Jiaming Yuan
8fccc3c4ad
[dask] Fix potential error in demo. ( #8079 )
...
* Use dask_cudf instead.
2022-07-15 18:42:29 +08:00
Jiaming Yuan
647d3844dd
Make test for categorical data deterministic. ( #8080 )
2022-07-15 14:48:39 +08:00
Jiaming Yuan
dae7a41baa
Update Python requirement to >=3.8. ( #8071 )
...
Additional changes:
- Use mamba for CPU test on Jenkins.
- Cleanup CPU test dependencies.
- Restore some of the modin tests
2022-07-14 18:01:47 +08:00
Jiaming Yuan
8dd96013f1
Split up column matrix initialization. ( #8060 )
...
* Split up column matrix initialization.
This PR splits the column matrix initialization into 2 steps, the first one initializes
the storage while the second one does the transpose. By doing so, we can reuse the code
for Quantile DMatrix.
2022-07-14 10:34:47 +08:00
Philip Hyunsu Cho
36cf979b82
[CI] Fix S3 uploads ( #8069 )
...
* [CI] Fix S3 upload issues
* Don't launch Docker containers when uploading to S3
2022-07-13 16:23:00 -07:00
Jiaming Yuan
abaa593aa0
Fix compiler warnings. ( #8059 )
...
- Remove unused parameters.
- Avoid comparison of different signedness.
2022-07-14 05:29:56 +08:00
Jiaming Yuan
937352c78f
Fix R package Windows build. ( #8065 )
2022-07-14 05:27:38 +08:00
WeichenXu
176fec8789
PySpark XGBoost integration ( #8020 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2022-07-13 13:11:18 +08:00
Jiaming Yuan
8959622836
[dask] Use an invalid port for test. ( #8064 )
2022-07-13 11:59:02 +08:00
Rory Mitchell
0bdaca25ca
Use single precision in gain calculation, use pointers instead of span. ( #8051 )
2022-07-12 21:56:27 +02:00
Jiaming Yuan
a5bc8e2c6a
Fix mypy error with the latest dask. ( #8052 )
...
* Fix mypy error with latest dask.
Dask is adding type hints to its codebase and as the result, checks in XGBoost can be
performed more rigorously.
- Remove compatibility with old dask version where multi lock was missing.
- Restrict input of `X` to be non-series.
- Adopt latest definition of `Delayed`.
- Avoid passing optional `host_ip`.
- Avoid deprecated `worker.nthreads`.
2022-07-09 08:02:42 +08:00
Jiaming Yuan
210eb471e9
[R] Implement feature info for DMatrix. ( #8048 )
2022-07-09 05:57:39 +08:00
Jiaming Yuan
701f32b227
[py-sckl] Raise import error if skl is not installed. ( #8049 )
2022-07-09 05:56:46 +08:00
Rory Mitchell
794cbaa60a
Fuse split evaluation kernels ( #8026 )
2022-07-05 10:24:31 +02:00
Jiaming Yuan
ff1c559084
Remove unused variable. ( #8046 )
2022-07-05 01:59:22 +08:00
Jiaming Yuan
8746f9cddf
Rename IterativeDMatrix. ( #8045 )
2022-07-04 18:52:31 +08:00
Jiaming Yuan
f24bfc7684
Bump R cache version. ( #8044 )
2022-07-03 03:53:05 +08:00
Michael Chirico
3af02584c1
error early if missing DiagrammeR ( #8037 )
2022-07-02 19:37:53 +08:00
Rory Mitchell
bc4f802b17
Batch UpdatePosition using cudaMemcpy ( #7964 )
2022-06-30 17:52:40 +02:00
kiwiwarmnfuzzy
2407381c3d
Force auc.cc to be statically linked ( #8039 )
2022-06-30 19:24:22 +08:00
Jiaming Yuan
e88d6e071d
Fix compiler warning in JSON IO. ( #8031 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2022-06-30 01:13:22 +08:00
Jiaming Yuan
dcaf580476
Fix Python package source install. ( #8036 )
...
* Copy gputreeshap.
2022-06-29 21:45:09 +08:00
Rong Ou
6eb23353d7
Update nvflare demo for release 2.1.2 ( #8038 )
2022-06-29 17:58:06 +08:00
Joris LIMONIER
f470ad3af9
Fix multiple typos ( #8028 )
...
Fix 4 "graphiz" instead of "graphviz".
2022-06-27 19:21:58 +08:00
Rong Ou
45dc1f818a
Make federated plugin work with cmake 3.16.3 ( #8029 )
2022-06-27 17:26:41 +08:00
Rong Ou
0725fd6081
fix federated learning plugin ( #8027 )
2022-06-24 08:41:07 +08:00
Bobby Wang
a68580e2a7
[jvm-packages] fix executor crashing issue when transforming on xgboost4j-spark-gpu ( #8025 )
...
* [jvm-packages] fix executor crashing issue when transforming on xgboost4j-spark-gpu
the API XGBoosterSetParam is not thread-safe. Dring the phase of transforming,
XGBoost runs several transforming tasks at a time, and each of them will set
the "gpu_id" and "predictor" parameters, so if several tasks (multi-threads)
all XGBoosterSetParam simultaneously, it may cause the memory to be corrupted
and cause SIGSEGV.
This PR first get the booster from broadcast and set to the correct gpu_id
and predictor, and then all transforming taskes will use the same booster to
do the transforming.
2022-06-24 01:18:41 +08:00
Jiaming Yuan
f0c1b842bf
Implement sketching with adapter. ( #8019 )
2022-06-23 00:03:02 +08:00
Jiaming Yuan
142a208a90
Fix compiler warnings. ( #8022 )
...
- Remove/fix unused parameters
- Remove deprecated code in rabit.
- Update dmlc-core.
2022-06-22 21:29:10 +08:00
Bobby Wang
e44a082620
[jvm-packages] update nccl version to 2.12.12-1 ( #8015 )
2022-06-21 17:34:09 +08:00
Rong Ou
e5ec546da5
[Breaking] Remove rabit support for custom reductions and grow_local_histmaker updater ( #7992 )
2022-06-21 15:08:23 +08:00
Jiaming Yuan
4a87ea49b8
Reduce regularization for CPU gblinear. ( #8013 )
2022-06-21 01:05:27 +08:00
Jiaming Yuan
d285d6ba2a
Reduce regularization in GPU gblinear test. ( #8010 )
2022-06-20 23:55:12 +08:00
Jiaming Yuan
e58e417603
[CI] Fix lintr error. ( #8011 )
2022-06-20 22:17:14 +08:00
Jiaming Yuan
9b0eb66b78
Fix GPU driver test. ( #8008 )
...
* Initialize the training parameter.
2022-06-20 19:37:31 +08:00
Jiaming Yuan
637e42a0c0
Use 22.04 for RMM. ( #8001 )
...
22.06 is not released yet.
2022-06-17 04:07:31 +08:00
Jiaming Yuan
bb47fd8c49
[jvm-packages] Change log level for tracker message. ( #7968 )
2022-06-09 18:15:08 +08:00
Jiaming Yuan
8f8bd8147a
Fix LTR with weighted Quantile DMatrix. ( #7975 )
...
* Fix LTR with weighted Quantile DMatrix.
* Better tests.
2022-06-09 01:33:41 +08:00
Jiaming Yuan
1a33b50a0d
Fix compiler warnings. ( #7974 )
...
- Remove unused parameters. There are still many warnings that are not yet
addressed. Currently, the warnings in dmlc-core dominate the error log.
- Remove `distributed` parameter from metric.
- Fixes some warnings about signed comparison.
2022-06-06 22:56:25 +08:00
Jiaming Yuan
d48123d23b
Fix rmm build ( #7973 )
...
- Optionally switch to c++17
- Use rmm CMake target.
- Workaround compiler errors.
- Fix GPUMetric inheritance.
- Run death tests even if it's built with RMM support.
Co-authored-by: jakirkham <jakirkham@gmail.com >
2022-06-06 20:18:32 +08:00
Philip Hyunsu Cho
1ced638165
Document how to reproduce Docker environment from Jenkins ( #7971 )
2022-06-04 20:56:53 +09:00
Jiaming Yuan
b90c6d25e8
Implement max_cat_threshold for CPU. ( #7957 )
2022-06-04 11:02:46 +08:00
Bobby Wang
78694405a6
[jvm-packages] add jni for setting feature name and type ( #7966 )
2022-06-03 11:09:48 +08:00
Gavin Zhang
6426449c8b
Support IBM i OS ( #7920 )
2022-06-02 23:38:35 +08:00
Rong Ou
31e6902e43
Support GPU training in the NVFlare demo ( #7965 )
2022-06-02 21:52:36 +08:00
Jiaming Yuan
6b55150e80
Fix pylint errors. ( #7967 )
2022-06-02 18:04:46 +08:00
Jiaming Yuan
13b15e07e8
Handle formatted JSON input. ( #7953 )
2022-06-01 16:20:58 +08:00
Rong Ou
d3429f2ff6
Increase gRPC max receive message size for federated learning ( #7958 )
2022-06-01 13:21:54 +08:00
Bobby Wang
545fd4548e
[jvm-packages] refactor xgboost read/write ( #7956 )
...
1. Removed the duplicated Default XGBoost read/write which is copied from
spark 2.3.x
2. Put some utils into util package
2022-06-01 11:38:49 +08:00
Yang Jiandan
27c66f12d1
set log level as ERROR for trackerProcess has some stderr output ( #7952 )
2022-05-31 22:54:38 +08:00
Bobby Wang
5a7dc41351
[doc] update doc for dumping model to be json or ubj for jvm packages ( #7955 )
2022-05-31 14:43:13 +08:00
Rong Ou
80339c3427
Enable distributed GPU training over Rabit ( #7930 )
2022-05-31 04:09:45 +08:00
Bobby Wang
6275cdc486
[jvm-packages] add format option when saving a model ( #7940 )
2022-05-30 15:49:59 +08:00
Gyeongjae Choi
cc6d57aa0d
Add minimal emscripten build support ( #7954 )
2022-05-30 14:11:40 +08:00
Tim Sabsch
7a039e03fe
Fix incomplete type hints for verbose ( #7945 )
2022-05-30 12:08:24 +08:00
Bobby Wang
fbc3d861bb
[jvm-packages] remove default parameters ( #7938 )
2022-05-28 10:31:19 +08:00
Philip Hyunsu Cho
47224dd6d3
Use private mirror to host llvm-openmp tarballs ( #7950 )
2022-05-27 14:56:59 -07:00
Jiaming Yuan
bde4f25794
Handle missing categorical value in CPU evaluator. ( #7948 )
2022-05-27 14:15:47 +08:00
Philip Hyunsu Cho
2070afea02
[CI] Rotate package repository keys ( #7943 )
2022-05-26 17:06:46 -07:00
Jiaming Yuan
18cbebaeb9
Unify the cat split storage for CPU. ( #7937 )
...
* Unify the cat split storage for CPU.
* Cleanup.
* Workaround.
2022-05-26 04:14:40 -07:00
Daniel Clausen
755d9d4609
[JVM-Packages] Auto-detection of MUSL is replaced by system properties ( #7921 )
...
This PR removes auto-detection of MUSL-based Linux systems in favor of system properties the user can set to configure a specific path for a native library.
2022-05-26 10:53:15 +08:00
Jiaming Yuan
606be9e663
Handle missing values in one hot splits. ( #7934 )
2022-05-24 20:48:41 +08:00
Jiaming Yuan
18a38f7ca0
Refactor for GHistIndex. ( #7923 )
...
* Pass sparse page as adapter, which prepares for quantile dmatrix.
* Remove old external memory code like `rbegin` and extra `Init` function.
* Simplify type dispatch.
2022-05-23 23:04:53 +08:00
Jiaming Yuan
d314680a15
Verify shared object version at load. ( #7928 )
2022-05-23 20:53:30 +08:00
Jiaming Yuan
474366c020
Add convergence test for sparse datasets. ( #7922 )
2022-05-23 18:07:26 +08:00
Rory Mitchell
f6babc814c
Do not initialise data structures to maximum possible tree size. ( #7919 )
2022-05-19 19:45:53 +02:00
Philip Hyunsu Cho
6f424d8d6c
[Doc] Warn against loading JSON from external source ( #7918 )
2022-05-18 17:02:36 -07:00
Jiaming Yuan
f93a727869
Address remaining mypy errors in python package. ( #7914 )
2022-05-18 22:46:15 +08:00
Jiaming Yuan
edf9a9608e
Fix type conversion warning. ( #7916 )
2022-05-18 20:14:14 +08:00
Jiaming Yuan
765097d514
Simplify inplace-predict. ( #7910 )
...
Pass the `X` as part of Proxy DMatrix instead of an independent `dmlc::any`.
2022-05-18 17:52:00 +08:00
Jiaming Yuan
19775ffe15
Use adapter to initialize column matrix. ( #7912 )
2022-05-18 16:15:12 +08:00
Bobby Wang
5ef33adf68
[jvm-packges] set the correct objective if user doesn't explicitly set it ( #7781 )
2022-05-18 14:05:18 +08:00
Chengyang
806c92c80b
Add Type Hints for Python Package ( #7742 )
...
Co-authored-by: Chengyang Gu <bridgream@gmail.com >
Co-authored-by: Jiamingy <jm.yuan@outlook.com >
2022-05-17 22:14:09 +08:00
Rory Mitchell
71d3b2e036
Fuse gpu_hist all-reduce calls where possible ( #7867 )
2022-05-17 13:27:50 +02:00
Bobby Wang
b41cf92dc2
[jvm-packages] move dmatrix building into rabit context for cpu pipeline ( #7908 )
2022-05-17 14:52:25 +08:00
Rong Ou
77d4a53c32
use RabitContext intead of init/finalize ( #7911 )
2022-05-17 12:15:41 +08:00
Jiaming Yuan
4fcfd9c96e
Fix and cleanup for column matrix. ( #7901 )
...
* Fix missed type dispatching for dense columns with missing values.
* Code cleanup to reduce special cases.
* Reduce memory usage.
2022-05-16 21:11:50 +08:00
Bobby Wang
1496789561
[doc] update the doc for jvm model compatibility ( #7907 )
2022-05-16 14:05:26 +08:00
Sze Yeung
a06d53688c
Correct a mistake in Setting Parameters section ( #7905 )
2022-05-15 18:56:31 -07:00
Philip Hyunsu Cho
4cd14aee5a
Rename misspelled config parameter for pseudo-Huber ( #7904 )
2022-05-15 06:38:33 -07:00
Jiaming Yuan
1baad8650c
Small cleanup to Column. ( #7898 )
...
* Define forward iterator to hide the internal state.
2022-05-15 12:39:10 +08:00
Jiaming Yuan
ee382c4153
Update news for 1.6.1 ( #7877 )
2022-05-14 15:38:18 -07:00
Rong Ou
af907e2d0d
Demo of federated learning using NVFlare ( #7879 )
...
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2022-05-14 22:45:41 +08:00
Bobby Wang
11e46e4bc0
[Breaking][jvm-packages] make classification model be xgboost-compatible ( #7896 )
2022-05-14 15:43:05 +08:00
Jiaming Yuan
1b6538b4e5
[breaking] Drop single precision histogram ( #7892 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-05-13 19:54:55 +08:00
Jiaming Yuan
c8f9d4b6e6
Show libxgboost.so path in build info. ( #7893 )
2022-05-13 18:08:56 +08:00
Bobby Wang
9fa7ed1743
[Breaking][jvm-packages] remove timeoutRequestWorkers parameter ( #7839 )
2022-05-13 16:26:25 +08:00
Jiaming Yuan
11d65fcb21
Extract partial sum into an independent function. ( #7889 )
2022-05-13 14:30:35 +08:00
Jiaming Yuan
db80671d6b
Fix monotone constraint with tuple input. ( #7891 )
2022-05-13 04:00:03 +08:00
Jiaming Yuan
94ca52b7b7
Fix overflow in prediction size. ( #7885 )
2022-05-12 02:44:03 +08:00
Jiaming Yuan
8ba4722d04
Remove pyarrow workaround. ( #7884 )
2022-05-11 20:54:48 +08:00
Philip Hyunsu Cho
65e6d73b95
[CI] Automate artifact fetch step in JVM release process ( #7882 )
2022-05-11 00:35:22 -07:00
Jiaming Yuan
16ba74d008
Update CUDA version requirement in CMake script. ( #7876 )
2022-05-09 04:16:22 +08:00
Philip Hyunsu Cho
d2bc0f0f08
Allow loading old models from RDS ( #7864 )
2022-05-06 22:49:38 -07:00
Amit Bera
1823db53f2
updated winning solution under readme.md ( #7862 )
2022-05-06 17:38:07 +08:00
Rory Mitchell
7ef54e39ec
Small refactor to categoricals ( #7858 )
2022-05-05 17:47:02 +02:00
Rong Ou
14ef38b834
Initial support for federated learning ( #7831 )
...
Federated learning plugin for xgboost:
* A gRPC server to aggregate MPI-style requests (allgather, allreduce, broadcast) from federated workers.
* A Rabit engine for the federated environment.
* Integration test to simulate federated learning.
Additional followups are needed to address GPU support, better security, and privacy, etc.
2022-05-05 21:49:22 +08:00
Jiaming Yuan
46e0bce212
Use maximum category in sketch. ( #7853 )
2022-05-05 19:56:49 +08:00
Jiaming Yuan
8ab5e13b5d
Fix typo [skip ci] ( #7861 )
2022-05-04 18:34:45 +08:00
Jiaming Yuan
317d7be6ee
Always use partition based categorical splits. ( #7857 )
2022-05-03 22:30:32 +08:00
Rory Mitchell
90cce38236
Remove single_precision_histogram for gpu_hist ( #7828 )
2022-05-03 14:53:19 +02:00
Jiaming Yuan
50d854e02e
[CI] Test with latest RAPIDS. ( #7816 )
2022-04-30 11:55:10 -07:00
Bobby Wang
1b103e1f5f
[CI] make container be able to re-attached ( #7848 )
...
When re-starting the container, it will fail in entrypoint.sh which
will exit when adding an existing group or user
2022-04-29 19:00:35 -07:00
Jiaming Yuan
288c52596c
Define bin type. ( #7850 )
2022-04-29 19:41:39 +08:00
Michael Allman
f7db16add1
Ignore all Java exceptions when looking for Linux musl support ( #7844 )
2022-04-28 15:44:30 +08:00
Bobby Wang
a94e1b172e
[jvm-packages] Fix model compatibility ( #7845 )
2022-04-28 02:05:38 +08:00
Bobby Wang
686caad40c
[jvm-package] remove the coalesce in barrier mode ( #7846 )
2022-04-27 23:34:22 +08:00
Jiaming Yuan
fdf533f2b9
[POC] Experimental support for l1 error. ( #7812 )
...
Support adaptive tree, a feature supported by both sklearn and lightgbm. The tree leaf is recomputed based on residue of labels and predictions after construction.
For l1 error, the optimal value is the median (50 percentile).
This is marked as experimental support for the following reasons:
- The value is not well defined for distributed training, where we might have empty leaves for local workers. Right now I just use the original leaf value for computing the average with other workers, which might cause significant errors.
- Some follow-ups are required, for exact, pruner, and optimization for quantile function. Also, we need to calculate the initial estimation.
2022-04-26 21:41:55 +08:00
Jiaming Yuan
ad06172c6b
Refactor pandas dataframe handling. ( #7843 )
2022-04-26 18:53:43 +08:00
Bobby Wang
bef1f939ce
[doc] remove the doc about killing SparkContext [skip ci] ( #7840 )
2022-04-25 19:29:16 +08:00
Bobby Wang
dc2e699656
[Breaking][jvm-packages] Use barrier execution mode ( #7836 )
...
With the introduction of the barrier execution mode. we don't need to kill SparkContext when some xgboost tasks failed. Instead, Spark will handle the errors for us. So in this PR, `killSparkContextOnWorkerFailure` parameter is deleted.
2022-04-25 17:09:52 +08:00
Bobby Wang
6ece549a90
[doc] update the jvm tutorial to 1.6.1 [skip ci] ( #7834 )
2022-04-24 14:25:22 +08:00
Jiaming Yuan
332380479b
Avoid warning in np primitive type tests. ( #7833 )
2022-04-23 02:07:01 +08:00
Bobby Wang
c45665a55a
[jvm-packages] move the dmatrix building into rabit context ( #7823 )
...
This fixes the QuantileDeviceDMatrix in distributed environment.
2022-04-23 00:06:50 +08:00
Jiaming Yuan
f0f76259c9
Remove STRING_TYPES. ( #7827 )
2022-04-22 19:07:51 +08:00
forestkey
c13a2a3114
[doc] "irrevelant" to "irrelevant" ( #7832 )
2022-04-22 16:54:30 +08:00
Jiaming Yuan
c70fa502a5
Expose feature_types to sklearn interface. ( #7821 )
2022-04-21 20:23:35 +08:00
Jiaming Yuan
401d451569
Clear configuration cache. ( #7826 )
2022-04-21 19:09:54 +08:00
Jiaming Yuan
52d4eda786
Deprecate use_label_encoder in XGBClassifier. ( #7822 )
...
* Deprecate `use_label_encoder` in XGBClassifier.
* We have removed the encoder, now prepare to remove the indicator.
2022-04-21 13:14:02 +08:00
Jiaming Yuan
5815df4c46
Remove warning in 1.4. ( #7815 )
2022-04-20 01:19:09 +08:00
Jiaming Yuan
d0de954af2
v1.6.0 release note. [skip ci] ( #7746 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-04-16 16:27:54 +08:00
Jiaming Yuan
5dea21273a
Fix training continuation with categorical model. ( #7810 )
...
* Make sure the task is initialized before construction of tree updater.
This is a quick fix meant to be backported to 1.6, for a full fix we should pass the model
param into tree updater by reference instead.
2022-04-15 18:21:02 +08:00
Bobby Wang
2d83b2ad8f
[jvm-packages] add hostIp and python exec for rabit tracker ( #7808 )
2022-04-15 16:28:43 +08:00
Bobby Wang
6f032b7152
[doc] fix a typo in jvm/index.rst ( #7806 )
2022-04-13 17:02:42 -07:00
dependabot[bot]
1bb1913811
Bump hadoop-common from 2.10.1 to 3.2.3 in /jvm-packages/xgboost4j-flink ( #7801 )
...
Bumps hadoop-common from 2.10.1 to 3.2.3.
---
updated-dependencies:
- dependency-name: org.apache.hadoop:hadoop-common
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-13 22:24:44 +08:00
Ikko Ashimine
56e4baff7c
[doc] Fix typo in build.rst ( #7800 )
...
avaiable -> available
2022-04-13 16:45:26 +08:00
Bobby Wang
3f536b5308
[jvm-packages] fix evaluation when featuresCols is used ( #7798 )
2022-04-13 12:52:50 +08:00
Bobby Wang
4b00c64d96
[doc] improve xgboost4j-spark-gpu doc [skip ci] ( #7793 )
...
Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com >
2022-04-12 12:02:16 +08:00
Bobby Wang
118192f116
[jvm-packages] xgboost4j-spark should work when featuresCols is specified ( #7789 )
2022-04-08 13:21:04 +08:00
Bobby Wang
729d227b89
[jvm-packages] remove the dep of com.fasterxml.jackson ( #7791 )
2022-04-08 13:04:34 +08:00
Bobby Wang
89d6419fd5
[jvm-packages] add doc for xgboost4j-spark-gpu ( #7779 )
...
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2022-04-07 11:35:01 +08:00
Bobby Wang
2454407f3a
[jvm-packages] unify setFeaturesCol API for XGBoostRegressor ( #7784 )
2022-04-05 13:35:33 +08:00
Philip Hyunsu Cho
e5ab8f3ebe
[CI] Speed up CPU test pipeline ( #7772 )
2022-04-01 02:39:04 +08:00
Jiaming Yuan
bcce17e688
Remove text loading in basic walk through demo. ( #7753 )
2022-04-01 00:59:42 +08:00
giuliohome
c467e90ac1
[doc] Update doc for Kubernetes Operator ( #7777 )
2022-03-31 23:10:49 +08:00
Jiaming Yuan
fd78af404b
Drop support for deprecated CUDA architectures. ( #7774 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-03-31 21:42:23 +08:00
Jiaming Yuan
02dd7b6913
Remove use of distutils. ( #7770 )
...
distutils is deprecated and replaced by other stdlib constructs.
2022-03-31 19:03:10 +08:00
Philip Hyunsu Cho
e8eff3581b
[CI] Enable faulthandler to show details when 0xC0000005 error occurs ( #7771 ) ( #7775 )
2022-03-31 17:40:06 +08:00
Jiaming Yuan
6fa1afdffc
Avoid compiler warning about comparison. ( #7768 )
2022-03-31 08:52:14 +08:00
Jiaming Yuan
522636cb52
Bump version. ( #7769 )
2022-03-31 06:33:22 +08:00
Jiaming Yuan
9150fdbd4d
Support pandas nullable types. ( #7760 )
2022-03-30 08:51:52 +08:00
Jiaming Yuan
d4796482b5
Fix failures on R hub and Win builder. ( #7763 )
...
* Update date.
* Workaround amalgamation build with clang. (SimpleDMatrix instantiation)
* Workaround compiler error with driver push.
* Revert autoconf requirement.
* Fix model IO on 32-bit environment. (i386)
* Clarify the function name.
2022-03-30 07:14:33 +08:00
Jiaming Yuan
a50b84244e
Cleanup configuration for constraints. ( #7758 )
2022-03-29 04:22:46 +08:00
Jiaming Yuan
3c9b04460a
Move num_parallel_tree to model parameter. ( #7751 )
...
The size of forest should be a property of model itself instead of a training
hyper-parameter.
2022-03-29 02:32:42 +08:00
Jiaming Yuan
8b3ecfca25
Mitigate flaky tests. ( #7749 )
...
* Skip non-increasing test with external memory when subsample is used.
* Increase bin numbers for boost from prediction test. This mitigates the effect of
non-deterministic partitioning.
2022-03-28 21:20:50 +08:00
Christian Marquardt
39c5616af2
Added CPPFLAGS and LDFLAGS to the testing for OpenMP during R installation from source. ( #7759 )
2022-03-28 19:14:07 +08:00
Haoming Chen
b37ff3d492
Fix cox objective test by using XGBOOST_PARALLEL_STABLE_SORT ( #7756 )
2022-03-26 17:58:30 +08:00
Jiaming Yuan
b3ba0e8708
Check cupy lazily. ( #7752 )
2022-03-26 06:09:58 +08:00
Jiaming Yuan
af0cf88921
Workaround compiler error. ( #7745 )
2022-03-25 17:05:14 +08:00
Jiaming Yuan
64575591d8
Use context in SetInfo. ( #7687 )
...
* Use the name `Context`.
* Pass a context object into `SetInfo`.
* Add context to proxy matrix.
* Add context to iterative DMatrix.
This is to remove the use of the default number of threads during `SetInfo` as a follow-up on
removing the global omp variable while preparing for CUDA stream semantic. Currently, XGBoost
uses the legacy CUDA stream, we will gradually remove them in the future in favor of non-blocking streams.
2022-03-24 22:16:26 +08:00
Oleksandr Pryimak
f5b20286e2
[jvm-packages] Launch dev jvm image under my user ( #4676 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2022-03-23 10:39:51 -07:00
Chengyang
c92ab2ce49
Add type hints to core.py ( #7707 )
...
Co-authored-by: Chengyang Gu <bridgream@gmail.com >
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2022-03-23 21:12:14 +08:00
Philip Hyunsu Cho
66cb4afc6c
Update install doc ( #7747 )
2022-03-23 17:20:01 +08:00
Aging
f20ffa8db3
Update JVM dev build Dockerfile and shell script ( #6792 )
...
Co-authored-by: Zhuo Yuzhen <yuzhuo@paypal.com >
2022-03-22 16:39:10 -07:00
Jiaming Yuan
4d81c741e9
External memory support for hist ( #7531 )
...
* Generate column matrix from gHistIndex.
* Avoid synchronization with the sparse page once the cache is written.
* Cleanups: Remove member variables/functions, change the update routine to look like approx and gpu_hist.
* Remove pruner.
2022-03-22 00:13:20 +08:00
Jiaming Yuan
cd55823112
Demo for using custom objective with multi-target regression. ( #7736 )
2022-03-20 17:44:25 +08:00
Jiaming Yuan
996cc705af
Small cleanup to hist tree method. ( #7735 )
...
* Remove special optimization using number of bins.
* Remove 1-based index for column sampling.
* Remove data layout.
* Unify update prediction cache.
2022-03-20 03:44:55 +08:00
Jiaming Yuan
718472dbe2
[CI] Upgrade GitHub action Windows workers. ( #7739 )
2022-03-20 01:44:33 +08:00
Jiaming Yuan
9a400731d9
Replace device sync with stream sync. ( #7737 )
2022-03-19 23:22:23 +08:00
Jiaming Yuan
da351621a1
[R] Fix parsing decision stump. ( #7689 )
2022-03-17 01:08:22 +08:00
Jiaming Yuan
e78a38b837
Sort sparse page index when constructing DMatrix. ( #7731 )
2022-03-16 18:01:05 +08:00
Xiaochang Wu
613ec36c5a
Support building SimpleDMatrix from Arrow data format ( #7512 )
...
* Integrate with Arrow C data API.
* Support Arrow dataset.
* Support Arrow table.
Co-authored-by: Xiaochang Wu <xiaochang.wu@intel.com >
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
Co-authored-by: Zhang Zhang <zhang.zhang@intel.com >
2022-03-15 13:25:19 +08:00
William Hicks
6b6849b001
Correct xgboost-config directory for inclusion in other projects ( #7730 )
2022-03-15 03:18:44 +08:00
Jiaming Yuan
98d6faefd6
Implement slope for Pseduo-Huber. ( #7727 )
...
* Add objective and metric.
* Some refactoring for CPU/GPU dispatching using linalg module.
2022-03-14 21:42:38 +08:00
Daniel Clausen
4dafb5fac8
[JVM-Packages] Add support for detecting musl-based Linux ( #7624 )
...
Co-authored-by: Marc Philipp <marc@gradle.com >
2022-03-14 00:37:27 +08:00
Haoming Chen
04fc575c0e
Run tests in a temporary directory ( #7723 )
...
Fix some tests to run in a temporary directory in case the root
directory is not writable. Note that most of tests are already
running in the temporary directory, so this PR just make them
consistent.
2022-03-12 21:24:36 +08:00
Haoming Chen
55463b76c1
Initialize TreeUpdater ctx_ with nullptr ( #7722 )
2022-03-10 22:33:32 +08:00
Jiaming Yuan
a62a3d991d
[dask] prediction with categorical data. ( #7708 )
2022-03-10 00:21:48 +08:00
Pradipta Ghosh
68b6d6bbe2
Fix for Feature shape mismatch error ( #7715 )
2022-03-03 21:36:29 +08:00
Cheng Li
a92e0f6240
multi groups in the constraints ( #7711 )
2022-03-01 18:10:15 +08:00
Jiaming Yuan
1d468e20a4
Optimize GPU evaluation function for categorical data. ( #7705 )
...
* Use transform and cache.
2022-02-28 17:46:29 +08:00
Jiaming Yuan
18a4af63aa
Update documents and tests. ( #7659 )
...
* Revise documents after recent refactoring and cat support.
* Add tests for behavior of max_depth and max_leaves.
2022-02-26 03:57:47 +08:00
Jiaming Yuan
5eed2990ad
Fix file descriptor leak. ( #7704 )
2022-02-25 17:49:33 +08:00
Philip Hyunsu Cho
1b25dd59f9
Use CUDA 11 in clang-tidy ( #7701 )
...
* Show command args when clang-tidy fails
* Add option to specify CUDA args
* Use clang-tidy 11
* [CI] Use CUDA 11
2022-02-24 15:15:07 -08:00
Jiaming Yuan
83a66b4994
Support categorical data for hist. ( #7695 )
...
* Extract partitioner from hist.
* Implement categorical data support by passing the gradient index directly into the partitioner.
* Organize/update document.
* Remove code for negative hessian.
2022-02-25 03:47:14 +08:00
Jiaming Yuan
f60d95b0ba
[R] Construct booster object in load.raw. ( #7686 )
2022-02-24 10:06:18 +08:00
Bobby Wang
89aa8ddf52
[jvm-packages] fix the prediction issue for multi:softmax ( #7694 )
2022-02-24 01:09:45 +08:00
Jiaming Yuan
6762c45494
Small cleanup to gradient index and hist. ( #7668 )
...
* Code comments.
* Const accessor to index.
* Remove some weird variables in the `Index` class.
* Simplify the `MemStackAllocator`.
2022-02-23 11:37:21 +08:00
Jiaming Yuan
49c74a5369
Update R package description. ( #7691 )
...
* Change role.
* Remove cmake file when building the package.
2022-02-23 08:36:37 +08:00
Bobby Wang
e3e6de5ed9
[jvm-packages] unify the set features API ( #7692 )
...
xgboost4j-spark provides 2 sets of API for setting features, one for CPU, another for GPU, which may cause confusion.
This PR removes the GPU API and adds an override CPU function setFeaturesCol to accept Array[String] parameters.
2022-02-23 03:37:25 +08:00
Jiaming Yuan
c859764d29
[doc] Clarify that states in callbacks are mutated. ( #7685 )
...
* Fix copy for cv. This prevents inserting default callbacks into the input list.
* Clarify the behavior of callbacks in training/cv.
* Fix typos in doc.
2022-02-22 11:45:00 +08:00
Jiaming Yuan
584bae1fc6
Fix document build with scikit-learn ( #7684 )
...
* Require sphinx >= 4.4 for RTD.
* Install sklearn.
2022-02-22 08:58:54 +08:00
Jiaming Yuan
e56d1779e1
Require Python 3.7. ( #7682 )
...
* Update setup.py.
2022-02-21 05:46:48 +08:00
Jiaming Yuan
549f3bd781
Honor CPU counts from CFS. ( #7654 )
2022-02-21 03:13:26 +08:00
Jiaming Yuan
671b3c8d8e
Fix typo. ( #7680 )
2022-02-20 03:42:47 +08:00
Jiaming Yuan
b2341eab0c
[R] Fix broken links. ( #7670 )
2022-02-20 00:55:48 +08:00
Bobby Wang
131858e7cb
[jvm-packages] Do not repartition when nWorker = 1 ( #7676 )
2022-02-19 21:45:54 +08:00
Jiaming Yuan
f08c5dcb06
Cleanup some pylint errors. ( #7667 )
...
* Cleanup some pylint errors.
* Cleanup pylint errors in rabit modules.
* Make data iter an abstract class and cleanup private access.
* Cleanup no-self-use for booster.
2022-02-19 18:53:12 +08:00
Jiaming Yuan
b76c5d54bf
Define export symbols in callback module. ( #7665 )
2022-02-19 18:52:41 +08:00
Jiaming Yuan
7366d3b20c
Ensure models with categorical splits don't use old binary format. ( #7666 )
2022-02-19 08:05:28 +08:00
Jiaming Yuan
14d61b0141
[doc] Update document for building from source. ( #7664 )
...
- Mention standard install command for R package.
- Remove repeated "get source" step.
- Remove troubleshooting on Windows. It's outdated considering VS 2022 is already out.
2022-02-19 04:57:03 +08:00
Jiaming Yuan
d625dc2047
Work around nvcc error. ( #7673 )
2022-02-19 01:41:46 +08:00
Jiaming Yuan
3877043d41
Avoid print for R package. ( #7672 )
2022-02-18 08:06:24 +08:00
Jiaming Yuan
711f7f3851
Avoid std::terminate for R package. ( #7661 )
...
This is part of CRAN policies.
2022-02-17 01:27:20 +08:00
Jiaming Yuan
12949c6b31
[R] Implement feature weights. ( #7660 )
2022-02-16 22:20:52 +08:00
Philip Hyunsu Cho
0149f81a5a
[CI] Fix S3 upload ( #7662 )
2022-02-16 01:35:27 -08:00
Jiaming Yuan
93eebe8664
[doc] Fix broken link. [skip ci] ( #7655 )
2022-02-15 14:07:34 +08:00
Jiaming Yuan
0da7d872ef
[doc] Update for prediction. ( #7648 )
2022-02-15 05:01:55 +08:00
Jiaming Yuan
0d0abe1845
Support optimal partitioning for GPU hist. ( #7652 )
...
* Implement `MaxCategory` in quantile.
* Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function.
* Extract an evaluator from GPU Hist to store the needed states.
* Added some CUDA stream/event utilities.
* Update document with references.
* Fixed a bug in approx evaluator where the number of data points is less than the number of categories.
2022-02-15 03:03:12 +08:00
Jiaming Yuan
2369d55e9a
Add tests for prediction cache. ( #7650 )
...
* Extract the test from approx for other tree methods.
* Add note on how it works.
2022-02-15 00:28:00 +08:00
Jiaming Yuan
5cd1f71b51
[dask] Improve configuration for port. ( #7645 )
...
- Try port 0 to let the OS return the available port.
- Add port configuration.
2022-02-14 21:34:34 +08:00
Jiaming Yuan
b52c4e13b0
[dask] Fix empty partition with pandas input. ( #7644 )
...
Empty partition is different from empty dataset. For the former case, each worker has
non-empty dask collections, but each collection might contain empty partition.
2022-02-14 19:35:51 +08:00
Jiaming Yuan
1f020a6097
Add maintainer for R package. ( #7649 )
2022-02-12 23:45:30 +08:00
Jiaming Yuan
1441a6cd27
[CI] Update R cache. ( #7646 )
2022-02-11 19:50:11 +08:00
Jiaming Yuan
2775c2a1ab
Prepare external memory support for hist. ( #7638 )
...
This PR prepares the GHistIndexMatrix to host the column matrix which is used by the hist tree method by accepting sparse_threshold parameter.
Some cleanups are made to ensure the correct batch param is being passed into DMatrix along with some additional tests for correctness of SimpleDMatrix.
2022-02-10 16:58:02 +08:00
dependabot[bot]
87c01f49d8
Bump hadoop-common from 2.7.3 to 2.10.1 in /jvm-packages/xgboost4j-flink ( #7641 )
...
Bumps hadoop-common from 2.7.3 to 2.10.1.
---
updated-dependencies:
- dependency-name: org.apache.hadoop:hadoop-common
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-02-09 17:07:35 -08:00
Jiaming Yuan
fe4ce920b2
[dask] Cleanup dask module. ( #7634 )
...
* Add a new utility for mapping function onto workers.
* Unify the type for feature names.
* Clean up the iterator.
* Fix prediction with DaskDMatrix worker specification.
* Fix base margin with DeviceQuantileDMatrix.
* Support vs 2022 in setup.py.
2022-02-08 20:41:46 +08:00
Jiaming Yuan
926af9951e
Add missing train parameter for sklearn interface. ( #7629 )
...
Some other parameters are still missing and rely on **kwargs, for instance parameters from
dart.
2022-02-08 13:20:19 +08:00
Jiaming Yuan
3e693e4f97
[dask] Fix nthread config with dask sklearn wrapper. ( #7633 )
2022-02-08 06:38:32 +08:00
Ed Shee
d152c59a9c
fixed broken link to Seldon XGBoost server ( #7628 )
2022-02-05 01:03:29 +08:00
Philip Hyunsu Cho
34a238ca98
[CI] Clean up Python wheel build pipeline ( #7626 )
...
* [CI] Always upload artifacts to [branch_name]/
* [CI] Move detailed setup inside build_python_wheels.sh
* Fix typo
2022-02-03 00:55:44 -08:00
Philip Hyunsu Cho
f6e6d0b2c0
[CI] Build Python wheels for MacOS (x86_64 and arm64) ( #7621 )
...
* Build Python wheels for OSX (x86_64 and arm64)
* Use Conda's libomp when running Python tests
* fix
* Add comment to explain CIBW_TARGET_OSX_ARM64
* Update release script
* Add comments in build_python_wheels.sh
* Document wheel pipeline
2022-02-02 17:35:48 -08:00
Philip Hyunsu Cho
271a7c5d43
[Doc] fix typo in install doc ( #7623 )
2022-01-31 13:35:56 -08:00
Philip Hyunsu Cho
c621775f34
Replace all uses of deprecated function sklearn.datasets.load_boston ( #7373 )
...
* Replace all uses of deprecated function sklearn.datasets.load_boston
* More renaming
* Fix bad name
* Update assertion
* Fix n boosted rounds.
* Avoid over regularization.
* Rebase.
* Avoid over regularization.
* Whac-a-mole
Co-authored-by: fis <jm.yuan@outlook.com >
2022-01-30 04:27:57 -08:00
Philip Hyunsu Cho
b4340abf56
Add special handling for multi:softmax in sklearn predict ( #7607 )
...
* Add special handling for multi:softmax in sklearn predict
* Add test coverage
2022-01-29 15:54:49 -08:00
david-cortes
7f738e7f6f
[R] Accept CSR data for predictions ( #7615 )
2022-01-30 00:54:57 +08:00
Michael Chirico
549bd419bb
use exit hook to remove temp file ( #7611 )
...
This guarantees the removal will trigger for unexpected early exits
2022-01-29 16:06:52 +08:00
Philip Hyunsu Cho
f21301c749
[Doc] Add instruction to install XGBoost for Apple Silicon using Conda ( #7612 )
2022-01-28 01:06:39 -08:00
Jiaming Yuan
81210420c6
Remove omp_get_max_threads ( #7608 )
...
This is the one last PR for removing omp global variable.
* Add context object to the `DMatrix`. This bridges `DMatrix` with https://github.com/dmlc/xgboost/issues/7308 .
* Require context to be available at the construction time of booster.
* Add `n_threads` support for R csc DMatrix constructor.
* Remove `omp_get_max_threads` in R glue code.
* Remove threading utilities that rely on omp global variable.
2022-01-28 16:09:22 +08:00
Philip Hyunsu Cho
028bdc1740
[R] Fix typo in docstring ( #7606 )
2022-01-26 23:33:25 +08:00
Jiaming Yuan
e060519d4f
Avoid regenerating the gradient index for approx. ( #7591 )
2022-01-26 21:41:30 +08:00
Jiaming Yuan
5d7818e75d
Remove omp_get_max_threads in tree updaters. ( #7590 )
2022-01-26 19:55:47 +08:00
Jiaming Yuan
24789429fd
Support latest pandas Index type. ( #7595 )
2022-01-26 18:20:10 +08:00
AJ Schmidt
511805c981
Compress fatbins ( #7601 )
...
* compress CUDA device code
Co-authored-by: ptaylor <paul.e.taylor@me.com >
2022-01-25 18:30:59 +08:00
Jiaming Yuan
6967ef7267
Remove omp_get_max_threads in objective. ( #7589 )
2022-01-24 04:35:49 +08:00
Jiaming Yuan
5817840858
Remove omp_get_max_threads in data. ( #7588 )
2022-01-24 02:44:07 +08:00
Jiaming Yuan
f84291c1e1
Fix max_cat_to_onehot doc annotation [skip ci] ( #7592 )
2022-01-23 16:33:23 +08:00
Jiaming Yuan
d262503781
[R] Implement new save raw in R. ( #7571 )
2022-01-22 20:55:47 +08:00
Jiaming Yuan
ef4dae4c0e
[dask] Add scheduler address to dask config. ( #7581 )
...
- Add user configuration.
- Bring back to the logic of using scheduler address from dask. This was removed when we were trying to support GKE, now we bring it back and let xgboost try it if direct guess or host IP from user config failed.
2022-01-22 01:56:32 +08:00
Jiaming Yuan
5ddd4a9d06
Small cleanup to tests. ( #7585 )
...
* Use random port in dask tests to avoid warnings for occupied port.
* Increase the difficulty of AUC tests.
2022-01-21 06:26:57 +00:00
Philip Hyunsu Cho
9fd510faa5
[CI] Clarify steps for publishing artifacts to Maven Central ( #7582 )
2022-01-20 14:23:07 -08:00
Jiaming Yuan
529cf8a54a
Configure cub version automatically. ( #7579 )
...
Note that when cub inside CUDA is being used, XGBoost performs checks on input size
instead of using internal cub function to accept inputs larger than maximum integer.
2022-01-20 19:49:26 +08:00
Jiaming Yuan
ac7a36367c
[jvm-packages] Implement new save_raw in jvm-packages. ( #7570 )
...
* New `toByteArray` that accepts a parameter for format.
2022-01-19 16:00:14 +08:00
Jiaming Yuan
b4ec1682c6
Update document for multi output and categorical. ( #7574 )
...
* Group together categorical related parameters.
* Update documents about multioutput and categorical.
2022-01-19 04:35:17 +08:00
Jiaming Yuan
dac9eb13bd
Implement new save_raw in Python. ( #7572 )
...
* Expose the new C API function to Python.
* Remove old document and helper script.
* Small optimization to the `save_raw` and Json ctors.
2022-01-19 02:27:51 +08:00
Jiaming Yuan
9f20a3315e
Test with latest numpy. ( #7573 )
2022-01-19 00:46:23 +08:00
Jiaming Yuan
bb56bb9a13
Fix merge conflict. ( #7577 )
2022-01-18 23:01:34 +08:00
Jiaming Yuan
cc06fab9a7
Support distributed CPU env for categorical data. ( #7575 )
...
* Add support for cat data in sketch allreduce.
* Share tests between CPU and GPU.
2022-01-18 21:56:07 +08:00
Jiaming Yuan
deab0e32ba
Validate out of range categorical value. ( #7576 )
...
* Use float in CPU categorical set to preserve the input value.
* Check out of range values.
2022-01-18 20:16:19 +08:00
Jiaming Yuan
d6ea5cc1ed
Cover approx tree method for categorical data tests. ( #7569 )
...
* Add tree to df tests.
* Add plotting tests.
* Add histogram tests.
2022-01-16 11:31:40 +08:00
Jiaming Yuan
465dc63833
Fix tree param feature type. ( #7565 )
2022-01-16 04:46:29 +08:00
Jiaming Yuan
a1bcd33a3b
[breaking] Change internal model serialization to UBJSON. ( #7556 )
...
* Use typed array for models.
* Change the memory snapshot format.
* Add new C API for saving to raw format.
2022-01-16 02:11:53 +08:00
Jiaming Yuan
13b0fa4b97
Implement get_group. ( #7564 )
2022-01-16 02:07:42 +08:00
Jiaming Yuan
52277cc3da
Rename build info function to be consistent with rest of the API. ( #7553 )
2022-01-14 00:39:28 +08:00
Jiaming Yuan
e94b766310
Fix early stopping with linear model. ( #7554 )
2022-01-13 21:53:06 +08:00
Jiaming Yuan
e5e47c3c99
Clarify the behavior of invalid categorical value handling. ( #7529 )
2022-01-13 16:11:52 +08:00
Philip Hyunsu Cho
20c0d60ac7
Restore functionality of max_depth=0 in hist ( #7551 )
...
* Restore functionality of max_depth=0 in hist
* Add test case
2022-01-11 01:37:44 +08:00
Jiaming Yuan
2db808021d
Silent some warnings for unused variable. ( #7548 )
2022-01-11 01:16:26 +08:00
Jiaming Yuan
c635d4c46a
Implement ubjson. ( #7549 )
...
* Implement ubjson.
This is a partial implementation of UBJSON with support for typed arrays. Some missing
features are `f64`, typed object, and the no-op.
2022-01-10 23:24:23 +08:00
Jiaming Yuan
001503186c
Rewrite approx ( #7214 )
...
This PR rewrites the approx tree method to use codebase from hist for better performance and code sharing.
The rewrite has many benefits:
- Support for both `max_leaves` and `max_depth`.
- Support for `grow_policy`.
- Support for mono constraint.
- Support for feature weights.
- Support for easier bin configuration (`max_bin`).
- Support for categorical data.
- Faster performance for most of the datasets. (many times faster)
- Support for prediction cache.
- Significantly better performance for external memory.
- Unites the code base between approx and hist.
2022-01-10 21:15:05 +08:00
Jiaming Yuan
ed95e77752
[jvm-packages] Update JNI header. ( #7550 )
2022-01-10 14:59:40 +08:00
Jiaming Yuan
91c1a1c52f
Fix index type for bitfield. ( #7541 )
2022-01-05 19:23:29 +08:00
Jiaming Yuan
0df2ae63c7
Fix num_boosted_rounds for linear model. ( #7538 )
...
* Add note.
* Fix n boosted rounds.
2022-01-05 03:29:33 +08:00
Jiaming Yuan
28af6f9abb
Remove omp_get_max_threads in gbm and linear. ( #7537 )
...
* Use ctx in gbm.
* Use ctx threads in gbm and linear.
2022-01-05 03:28:52 +08:00
Jiaming Yuan
eea094e1bc
Remove some warnings from clang. ( #7533 )
...
* Unused variable.
* Unnecessary virtual function.
2022-01-05 03:28:21 +08:00
Jiaming Yuan
ec56d5869b
[doc] Include dask examples into doc. ( #7530 )
2022-01-05 03:27:22 +08:00
Jiaming Yuan
54582f641a
[doc] Use cross references in sphinx doc. ( #7522 )
...
* Use cross references instead of URL.
* Fix auto doc for callback.
2022-01-05 03:21:25 +08:00
Jiaming Yuan
eb1efb54b5
Define feature_names_in_. ( #7526 )
...
* Define `feature_names_in_`.
* Raise attribute error if it's not defined.
2022-01-05 01:35:34 +08:00
Jiaming Yuan
8f0a42a266
Initial support for multi-label classification. ( #7521 )
...
* Add support in sklearn classifier.
2022-01-04 23:58:21 +08:00
Jiaming Yuan
68cdbc9c16
Remove omp_get_max_threads in CPU predictor. ( #7519 )
...
This is part of the on going effort to remove the dependency on global omp variables.
2022-01-04 22:12:15 +08:00
Ikko Ashimine
5516281881
Fix typo in tree_model.cc ( #7539 )
...
occurance -> occurrence
2021-12-30 20:12:25 +08:00
Randall Britten
a4a0ebb85d
[doc] Lowercase omega for per tree complexity ( #7532 )
...
As suggested on issue #7480
2021-12-29 23:05:54 +08:00
Louis Desreumaux
3886c3dd8f
Remove macro definitions of snprintf and vsnprintf ( #7536 )
2021-12-26 08:05:59 +08:00
Ginko Balboa
29bfa94bb6
Fix external memory with gpu_hist and subsampling combination bug. ( #7481 )
...
Instead of accessing data from the `original_page_`, access the data from the first page of the available batch.
fix #7476
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2021-12-24 11:15:35 +08:00
Jiaming Yuan
7f399eac8b
Use double for GPU Hist node sum. ( #7507 )
2021-12-22 08:41:35 +08:00
Jiaming Yuan
eabec370e4
[R] Fix single sample prediction. ( #7524 )
2021-12-21 14:11:07 +08:00
Bobby Wang
e8c1eb99e4
[jvm-package] Clean up the legacy gpu support tests ( #7523 )
2021-12-21 09:15:51 +08:00
Xiaochang Wu
59bd1ab17e
Skip callback demo test if matplotlib is not installed ( #7520 )
2021-12-19 08:20:38 +08:00
Jiaming Yuan
58a6723eb1
Initial support for multioutput regression. ( #7514 )
...
* Add num target model parameter, which is configured from input labels.
* Change elementwise metric and indexing for weights.
* Add demo.
* Add tests.
2021-12-18 09:28:38 +08:00
Jiaming Yuan
9ab73f737e
Extract Sketch Entry from hist maker. ( #7503 )
...
* Extract Sketch Entry from hist maker.
* Add a new sketch container for sorted inputs.
* Optimize bin search.
2021-12-18 05:36:56 +08:00
Qingyun Wu
b4a1236cfc
[doc] Update the link to the tuning example in FLAML
2021-12-17 14:31:00 +08:00
Bobby Wang
24e25802a7
[jvm-packages] Add Rapids plugin support ( #7491 )
...
* Add GPU pre-processing pipeline.
2021-12-17 13:11:12 +08:00
Jiaming Yuan
5b1161bb64
Convert labels into tensor. ( #7456 )
...
* Add a new ctor to tensor for `initilizer_list`.
* Change labels from host device vector to tensor.
* Rename the field from `labels_` to `labels` since it's a public member.
2021-12-17 00:58:35 +08:00
Jiaming Yuan
6f8a4633b7
Fix Python typehint with upgraded mypy. ( #7513 )
2021-12-16 23:08:08 +08:00
Jiaming Yuan
70b12d898a
[dask] Fix ddqdm with empty partition. ( #7510 )
...
* Fix empty partition.
* war.
2021-12-16 20:37:29 +08:00
Jiaming Yuan
a512b4b394
[doc] Promote dask from experimental. [skip ci] ( #7509 )
2021-12-16 14:17:06 +08:00
Jiaming Yuan
05497a9141
[dask] Fix asyncio. ( #7508 )
2021-12-13 01:48:25 +08:00
Jiaming Yuan
01152f89ee
Remove unused parameters. ( #7499 )
2021-12-09 14:24:51 +08:00
Harvey
1864fab592
Minor edits to Parameters doc page. ( #7500 )
...
* bost -> both
* doc improvement
* use original filename
* syntax highlight false
* missed a few highlights
2021-12-07 15:46:44 +08:00
Jiaming Yuan
021f8bf28b
Fix pylint. ( #7498 )
2021-12-07 13:23:30 +08:00
Jiaming Yuan
eee527d264
Add approx partitioner. ( #7467 )
2021-11-27 15:22:06 +08:00
Jiaming Yuan
85cbd32c5a
Add range-based slicing to tensor view. ( #7453 )
2021-11-27 13:42:36 +08:00
danmarinescu
6f38f5affa
Updated CMake version requirement in build.rst ( #7487 )
...
The documentation states that to build from source you need CMake 3.13 or higher. However, according to https://github.com/dmlc/xgboost/blob/master/CMakeLists.txt#L1 CMake 3.14 or higher is required.
2021-11-27 09:58:01 +08:00
Jiaming Yuan
557ffc4bf5
Reduce base margin to 2 dim for now. ( #7455 )
2021-11-27 00:46:13 +08:00
Jiaming Yuan
bf7bb575b4
Test CPU histogram with cat data. ( #7465 )
2021-11-27 00:43:28 +08:00
Bobby Wang
24be04e848
[jvm-packages] Add DeviceQuantileDMatrix to Scala binding ( #7459 )
2021-11-24 20:23:18 +08:00
Philip Hyunsu Cho
619c450a49
[CI] Add missing step extract_branch ( #7479 )
2021-11-24 17:35:59 +08:00
Jiaming Yuan
820e1c01ef
Fix macos package upload. ( #7475 )
...
* Split up the tests.
2021-11-24 03:43:49 +08:00
Jiaming Yuan
488f12a996
Fix github macos package upload. ( #7474 )
2021-11-24 00:29:11 +08:00
Jiaming Yuan
c024c42dce
Modernize XGBoost Python document. ( #7468 )
...
* Use sphinx gallery to integrate examples.
* Remove mock objects.
* Add dask doc inventory.
2021-11-23 23:24:52 +08:00
Philip Hyunsu Cho
96a9848c9e
[CI] Fix continuous delivery pipeline for MacOS ( #7472 )
2021-11-23 22:22:08 +08:00
Jiaming Yuan
b124a27f57
Support scipy sparse in dask. ( #7457 )
2021-11-23 16:45:36 +08:00
Jiaming Yuan
5262e933f7
Remove unnecessary constexpr. ( #7466 )
2021-11-23 16:42:08 +08:00
Philip Hyunsu Cho
0c67685e43
[CI] Add a helper script to aid Maven release ( #7470 )
...
* [CI] Add a helper script to aid Maven release
* Move script to dev/ [skip ci]
* Update command [skip ci]
2021-11-23 00:11:07 -08:00
Harvey
0552ca8021
Fix typo ( #7469 )
2021-11-23 08:58:45 +08:00
Jiaming Yuan
176110a22d
Support external memory in CPU histogram building. ( #7372 )
2021-11-23 01:13:33 +08:00
Jiaming Yuan
d33854af1b
[Breaking] Accept multi-dim meta info. ( #7405 )
...
This PR changes base_margin into a 3-dim array, with one of them being reserved for multi-target classification. Also, a breaking change is made for binary serialization due to extra dimension along with a fix for saving the feature weights. Lastly, it unifies the prediction initialization between CPU and GPU. After this PR, the meta info setter in Python will be based on array interface.
2021-11-18 23:02:54 +08:00
Jiaming Yuan
9fb4338964
Add test for eta and mitigate float error. ( #7446 )
...
* Add eta test.
* Don't skip test.
2021-11-18 20:42:48 +08:00
Bobby Wang
7cfb310eb4
Rework transform ( #7440 )
...
extract the common part of transform code from XGBoostClassifier
and XGBoostRegressor
2021-11-18 15:48:57 +08:00
Philip Hyunsu Cho
2adf222fb2
[CI] CI cost saving ( #7407 )
...
* [CI] Drop CUDA 10.1; Require 11.0
* Change NCCL version
* Use CUDA 10.1 for clang-tidy, for now
* Remove JDK 11 and 12
* Fix NCCL version
* Don't require 11.0 just yet, until clang-tidy is fixed
* Skip MultiClassesSerializationTest.GpuHist
2021-11-17 21:02:20 -08:00
Jiaming Yuan
b0015fda96
Fix R CRAN failures. ( #7404 )
...
* Remove hist builder dtor.
* Initialize values.
* Tolerance.
* Remove the use of nthread in col maker.
2021-11-16 10:51:12 +08:00
Jiaming Yuan
55ee272ea8
Extend array interface to handle ndarray. ( #7434 )
...
* Extend array interface to handle ndarray.
The `ArrayInterface` class is extended to support multi-dim array inputs. Previously this
class handles only 2-dim (vector is also matrix). This PR specifies the expected
dimension at compile-time and the array interface can perform various checks automatically
for input data. Also, adapters like CSR are more rigorous about their input. Lastly, row
vector and column vector are handled without intervention from the caller.
2021-11-16 09:52:15 +08:00
Jiaming Yuan
e27f543deb
Set use_logger in tracker to false. ( #7438 )
2021-11-16 05:12:42 +08:00
Jiaming Yuan
d4274bc556
Fix typo. ( #7433 )
2021-11-15 01:28:11 +08:00
Jiaming Yuan
a7057fa64c
Implement typed storage for tensor. ( #7429 )
...
* Add `Tensor` class.
* Add elementwise kernel for CPU and GPU.
* Add unravel index.
* Move some computation to compile time.
2021-11-14 18:53:13 +08:00
Kian Meng Ang
d27a11ff87
Fix typos in python package ( #7432 )
2021-11-14 17:20:19 +08:00
Jiaming Yuan
8cc75f1576
Cleanup Python tests. ( #7426 )
2021-11-14 15:47:05 +08:00
Jiaming Yuan
38ca96c9fc
[CI] Install igraph as binary. ( #7417 )
2021-11-12 19:04:28 +08:00
Jiaming Yuan
46726ec176
Expose build info ( #7399 )
2021-11-12 18:22:46 +08:00
Jiaming Yuan
937fa282b5
Extract string view. ( #7416 )
...
* Add equality operators.
* Return a view in substr.
* Add proper iterator types.
2021-11-12 18:22:30 +08:00
Jiaming Yuan
ca6f980932
Check number of trees in inplace predict. ( #7409 )
2021-11-12 18:20:23 +08:00
Jiaming Yuan
97d7582457
Delay breaking changes to 1.6. ( #7420 )
...
The patch is too big to be backported.
2021-11-12 16:46:03 +08:00
Bobby Wang
cb685607b2
[jvm-packages] Rework the train pipeline ( #7401 )
...
1. Add PreXGBoost to build RDD[Watches] from Dataset
2. Feed RDD[Watches] built from PreXGBoost to XGBoost to train
2021-11-10 17:51:38 +08:00
Jiaming Yuan
8df0a252b7
[doc] Update document for GPU. [skip ci] ( #7403 )
...
* Remove outdated workaround and description.
2021-11-09 02:05:55 +08:00
Jiaming Yuan
d7d1b6e3a6
CPU evaluation for cat data. ( #7393 )
...
* Implementation for one hot based.
* Implementation for partition based. (LightGBM)
2021-11-06 14:41:35 +08:00
Jiaming Yuan
6ede12412c
Update dmlc-core and use data iter for GPU sampling tests. ( #7398 )
...
* Update dmlc-core.
* New parquet parser in dmlc-core.
* Use data iter for GPU sampling tests.
2021-11-06 05:12:49 +08:00
Jiaming Yuan
c968217ca8
[R] Fix global feature importance and predict with 1 sample. ( #7394 )
...
* [R] Fix global feature importance.
* Add implementation for tree index. The parameter is not documented in C API since we
should work on porting the model slicing to R instead of supporting more use of tree
index.
* Fix the difference between "gain" and "total_gain".
* debug.
* Fix prediction.
2021-11-05 10:07:00 +08:00
Jiaming Yuan
48aff0eabd
[doc][jvm-packages] Update information about Python tracker. [skip ci] ( #7396 )
2021-11-05 05:55:13 +08:00
Jiaming Yuan
b06040b6d0
Implement a general array view. ( #7365 )
...
* Replace existing matrix and vector view.
This is to prepare for handling higher dimension data and prediction when we support multi-target models.
2021-11-05 04:16:11 +08:00
Jiaming Yuan
232144ca09
Add note about CRAN release [skip ci] ( #7395 )
2021-11-05 00:34:14 +08:00
Jiaming Yuan
4100827971
Pass infomation about objective to tree methods. ( #7385 )
...
* Define the `ObjInfo` and pass it down to every tree updater.
2021-11-04 01:52:44 +08:00
Jiaming Yuan
ccdabe4512
Support building gradient index with cat data. ( #7371 )
2021-11-03 22:37:37 +08:00
Jiaming Yuan
57a4b4ff64
Handle OMP_THREAD_LIMIT. ( #7390 )
2021-11-03 15:44:38 +08:00
Jiaming Yuan
e6ab594e14
Change shebang used in CLI demo. ( #7389 )
...
Change from system Python to environment python3. For Ubuntu 20.04, only `python3` is
available and there's no `python`. So at least `python3` is consistent with Python
virtual env, Ubuntu and anaconda.
2021-11-02 22:11:19 +08:00
Jiaming Yuan
a55d43ccfd
Add test for invalid categorical data values. ( #7380 )
...
* Add test for invalid categorical data values.
* Add check during sketching.
2021-11-02 18:00:52 +08:00
Jiaming Yuan
c74df31bf9
Cleanup the train function. ( #7377 )
...
* Move attribute setter to callback.
* Remove the internal train function.
* Remove unnecessary initialization.
2021-11-02 18:00:26 +08:00
Jiaming Yuan
154b15060e
Move callbacks from fit to __init__. ( #7375 )
2021-11-02 17:51:42 +08:00
Jiaming Yuan
32e673d8c4
Support building with CTK11.5. ( #7379 )
...
* Support building with CTK11.5.
* Require system cub installation for CTK11.4+.
* Check thrust version for segmented sort.
2021-11-02 16:22:26 +08:00
Jiaming Yuan
a13321148a
Support multi-class with base margin. ( #7381 )
...
This is already partially supported but never properly tested. So the only possible way to use it is calling `numpy.ndarray.flatten` with `base_margin` before passing it into XGBoost. This PR adds proper support
for most of the data types along with tests.
2021-11-02 13:38:00 +08:00
Jiaming Yuan
6295dc3b67
Fix span reverse iterator. ( #7387 )
...
* Fix span reverse iterator.
* Disable `rbegin` on device code to avoid calling host function.
* Add `trbegin` and friends.
2021-11-02 13:35:59 +08:00
Jiaming Yuan
8211e5f341
Add clang-format config. ( #7383 )
...
Generated using `clang-format -style=google -dump-config > .clang-format`, with column
width changed from 80 to 100 to be consistent with existing cpplint check.
2021-11-02 13:34:38 +08:00
Jiaming Yuan
0f7a9b42f1
Use double precision in metric calculation. ( #7364 )
2021-11-02 12:00:32 +08:00
Jiaming Yuan
239dbb3c0a
Move macos test to github action. ( #7382 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2021-10-30 14:40:32 +08:00
Bobby Wang
b81ebbef62
[jvm-packages] Fix json4s binary compatibility issue ( #7376 )
...
Spark 3.2 depends on 3.7.0-M11 which has changed some implicited functions'
signatures. And it will result the xgboost4j built against spark 3.0/3.1
failed when saving the model.
2021-10-30 03:20:57 +08:00
Jiaming Yuan
c6769488b3
Typehint for subset of core API. ( #7348 )
2021-10-28 20:47:04 +08:00
Jiaming Yuan
45aef75cca
Move skl eval_metric and early_stopping rounds to model params. ( #6751 )
...
A new parameter `custom_metric` is added to `train` and `cv` to distinguish the behaviour from the old `feval`. And `feval` is deprecated. The new `custom_metric` receives transformed prediction when the built-in objective is used. This enables XGBoost to use cost functions from other libraries like scikit-learn directly without going through the definition of the link function.
`eval_metric` and `early_stopping_rounds` in sklearn interface are moved from `fit` to `__init__` and is now saved as part of the scikit-learn model. The old ones in `fit` function are now deprecated. The new `eval_metric` in `__init__` has the same new behaviour as `custom_metric`.
Added more detailed documents for the behaviour of custom objective and metric.
2021-10-28 17:20:20 +08:00
Jiaming Yuan
6b074add66
Update setup.py. ( #7360 )
...
* Add new classifiers.
* Typehint.
2021-10-28 14:58:31 +08:00
Jiaming Yuan
3c4aa9b2ea
[breaking] Remove label encoder deprecated in 1.3. ( #7357 )
2021-10-28 13:24:29 +08:00
Jiaming Yuan
d05754f558
Avoid OMP reduction in AUC. ( #7362 )
2021-10-28 05:03:52 +08:00
Jiaming Yuan
ac9bfaa4f2
Handle missing values in dataframe with category dtype. ( #7331 )
...
* Replace -1 in pandas initializer.
* Unify `IsValid` functor.
* Mimic pandas data handling in cuDF glue code.
* Check invalid categories.
* Fix DDM sketching.
2021-10-28 03:33:54 +08:00
Jiaming Yuan
2eee87423c
Remove old custom objective demo. ( #7369 )
...
We have 2 new custom objective demos covering both regression and classification with
accompanying tutorials in documents.
2021-10-27 16:31:48 +08:00
Jiaming Yuan
b9414b6477
Update GPU doc for PR-AUC. [skip ci] ( #7368 )
2021-10-27 16:31:07 +08:00
Jiaming Yuan
d4349426d8
Re-implement PR-AUC. ( #7297 )
...
* Support binary/multi-class classification, ranking.
* Add documents.
* Handle missing data.
2021-10-26 13:07:50 +08:00
nicovdijk
a6bcd54b47
[jvm-packages] Fix for space in sys.executable path in create_jni.py ( #7358 )
2021-10-25 13:45:11 +08:00
Jiaming Yuan
fd61c61071
Avoid omp reduction in rank metric. ( #7349 )
2021-10-22 14:13:34 +08:00
Jiaming Yuan
e36b066344
[doc] Document the status of RTD hosting. [skip ci] ( #7353 )
2021-10-22 14:12:55 +08:00
Jiaming Yuan
864d236a82
[doc] Remove num_pbuffer. [skip ci] ( #7356 )
2021-10-22 14:12:32 +08:00
nicovdijk
31a307cf6b
[XGBoost4J-Spark] Serialization for custom objective and eval ( #7274 )
...
* added type hints to custom_obj and custom_eval for Spark persistence
Co-authored-by: Bobby Wang <wbo4958@gmail.com >
2021-10-21 16:22:23 +08:00
Jiaming Yuan
7593fa9982
1.5 release note. [skip ci] ( #7271 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-10-21 13:43:31 +08:00
Jiaming Yuan
d1f00fb0b7
Stricter validation for group. ( #7345 )
2021-10-21 12:13:33 +08:00
nicovdijk
74bab6e504
Control logging for early stopping using shouldPrint() ( #7326 )
2021-10-21 12:12:06 +08:00
Jiaming Yuan
8d7c6366d7
Accept histogram cut instead gradient index in evaluation. ( #7336 )
2021-10-20 18:04:46 +08:00
Jiaming Yuan
15685996fc
[doc] Small improvements for categorical data document. ( #7330 )
2021-10-20 18:04:32 +08:00
Jiaming Yuan
f999897615
[dask] Use nthread in DMatrix construction. ( #7337 )
...
This is consistent with the thread overriding behavior.
2021-10-20 15:16:40 +08:00
Philip Hyunsu Cho
b8e8f0fcd9
[doc] Use latest Sphinx RTD theme ( #7347 )
2021-10-20 00:04:43 -07:00
Jiaming Yuan
3b0b74fa94
[doc] Use RTD theme. ( #7346 )
2021-10-19 23:49:19 -07:00
Jiaming Yuan
376b448015
[doc] Fix broken links. ( #7341 )
...
* Fix most of the link checks from sphinx.
* Remove duplicate explicit target name.
2021-10-20 14:45:30 +08:00
Jiaming Yuan
f53da412aa
Add typehint to tracker. ( #7338 )
2021-10-20 12:49:36 +08:00
Jiaming Yuan
5ff210ed75
Small fix for the release doc and script. [skip ci] ( #7332 )
...
Add Philip as co-maintainer of maven packages.
2021-10-20 12:49:12 +08:00
Jiaming Yuan
c42e3fbcf3
[doc] Fix early stopping document. ( #7334 )
2021-10-18 11:21:16 -07:00
Bobby Wang
4fd149b3a2
[jvm-packages] update checkstyle ( #7335 )
...
* [jvm-packages] update scalastyle
1. bump scalastyle-maven-plugin and maven-checkstyle-plugin to latest
2. remove unused imports
* fix code style check
2021-10-18 18:42:01 +08:00
Jiaming Yuan
fbb0dc4275
Remove auto configuration of seed_per_iteration. ( #7009 )
...
* Remove auto configuration of seed_per_iteration.
This should be related to model recovery from rabit, which is removed.
* Document.
2021-10-17 15:58:57 +08:00
Jiaming Yuan
fb1a9e6bc5
Avoid omp reduction in coordinate descent and aft metrics. ( #7316 )
...
Aside from the omp issue, parameter configuration for aft metric is simplified.
2021-10-17 15:55:49 +08:00
Jiaming Yuan
f56e2e9a66
Support categorical data with pandas Dataframe in inplace prediction ( #7322 )
2021-10-17 14:32:06 +08:00
Jiaming Yuan
8e619010d0
Extract CPUExpandEntry and HistParam. ( #7321 )
...
* Remove kRootNid.
* Check for empty hessian.
2021-10-17 14:22:25 +08:00
Jiaming Yuan
6cdcfe8128
Improve external memory demo. ( #7320 )
...
* Use npy format.
* Add evaluation.
* Use make_regression.
2021-10-17 11:25:24 +08:00
Jiaming Yuan
e6a142fe70
Fix document about best_iteration ( #7324 )
2021-10-14 15:30:46 -07:00
Jiaming Yuan
4ddf8d001c
Deterministic result for element-wise/mclass metrics. ( #7303 )
...
Remove openmp reduction.
2021-10-13 14:22:40 +08:00
Jiaming Yuan
406c70ba0e
[doc] Fix typo. [skip ci] ( #7311 )
2021-10-12 19:10:18 +08:00
Jiaming Yuan
0bd8f21e4e
Add document for categorical data. ( #7307 )
2021-10-12 16:10:59 +08:00
Jiaming Yuan
a7d0c66457
Remove unused code. ( #7293 )
2021-10-12 15:04:41 +08:00
Jiaming Yuan
130df8cdda
Add tests for tree grow policy. ( #7302 )
2021-10-12 15:04:06 +08:00
Jiaming Yuan
5b17bb0031
Fix prediction with cat data in sklearn interface. ( #7306 )
...
* Specify DMatrix parameter for pre-processing dataframe.
* Add document about the behaviour of prediction.
2021-10-12 14:31:12 +08:00
Jiaming Yuan
89d87e5331
Update GPU Tree SHAP ( #7304 )
2021-10-11 21:39:50 +08:00
Jiaming Yuan
298af6f409
Fix weighted samples in multi-class AUC. ( #7300 )
2021-10-11 15:12:29 +08:00
Jiaming Yuan
69d3b1b8b4
Remove old callback deprecated in 1.3. ( #7280 )
2021-10-08 17:24:59 +08:00
Jiaming Yuan
578de9f762
Fix cv verbose_eval ( #7291 )
2021-10-08 12:28:38 +08:00
Jiaming Yuan
f7caac2563
Bump version to 1.6.0 in master. ( #7259 )
2021-10-07 16:09:26 +08:00
Jiaming Yuan
e2660ab8f3
Extend release script with R packages. [skip ci] ( #7278 )
2021-10-07 16:08:42 +08:00
Yuan Tang
cc459755be
Update affiliation ( #7289 )
2021-10-07 16:07:34 +08:00
Jiaming Yuan
d8cb395380
Fix gamma neg log likelihood. ( #7275 )
2021-10-05 16:57:08 +08:00
Jiaming Yuan
b3b03200e2
Remove old warning in 1.3 ( #7279 )
2021-10-01 08:05:50 +08:00
Philip Hyunsu Cho
2a0368b7ca
Add CMake option to use /MD runtime ( #7277 )
2021-09-30 13:13:57 +08:00
Jiaming Yuan
b2d8431aea
[R] Fix document for nthread. ( #7263 )
2021-09-28 11:46:24 +08:00
Jiaming Yuan
d8a549e6ac
Avoid thread block with sparse data. ( #7255 )
2021-09-25 13:11:34 +08:00
Jiaming Yuan
ca17f8a5fc
Dispatch thrust versions and upgrade rmm. ( #7254 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-09-25 03:43:23 +08:00
Jiaming Yuan
fbd58bf190
[jvm-packages] Create demo and test for xgboost4j early stopping. ( #7252 )
2021-09-25 03:29:27 +08:00
Bobby Wang
0ee11dac77
[jvm-packages][xgboost4j-gpu] Support GPU dataframe and DeviceQuantileDMatrix ( #7195 )
...
Following classes are added to support dataframe in java binding:
- `Column` is an abstract type for a single column in tabular data.
- `ColumnBatch` is an abstract type for dataframe.
- `CuDFColumn` is an implementaiton of `Column` that consume cuDF column
- `CudfColumnBatch` is an implementation of `ColumnBatch` that consumes cuDF dataframe.
- `DeviceQuantileDMatrix` is the interface for quantized data.
The Java implementation mimics the Python interface and uses `__cuda_array_interface__` protocol for memory indexing. One difference is on JVM package, the data batch is staged on the host as java iterators cannot be reset.
Co-authored-by: jiamingy <jm.yuan@outlook.com >
2021-09-24 14:25:00 +08:00
Philip Hyunsu Cho
d27a427dc5
[CI] Rotate access keys for uploading MacOS artifacts from Travis CI ( #7253 )
2021-09-24 10:44:00 +08:00
ShvetsKS
475fd1abec
Reduced span overheads in objective function calculate ( #7206 )
...
Co-authored-by: fis <jm.yuan@outlook.com >
2021-09-23 04:43:59 +08:00
Jiaming Yuan
9472be7d77
Fix initialization from pandas series. ( #7243 )
2021-09-23 04:43:25 +08:00
david-cortes
4f93e5586a
Improve wording for warning ( #7248 )
...
This warning sounds a bit ungrammatical. Additionally, the second part of the warning is not clear. This PR changes the wording to make it clearer.
2021-09-21 10:48:11 +08:00
Jiaming Yuan
18bd16341a
Update Python intro. [skip ci] ( #7235 )
...
* Fix the link to demo.
* Stop recommending text file inputs.
* Brief mention to scikit-learn interface.
* Fix indent warning in tree method doc.
2021-09-21 02:47:09 +00:00
david-cortes
61a619b5c3
[R] Avoid symbol naming conflicts with other packages ( #7245 )
...
* don't register all R symbols
* typo
2021-09-19 11:17:08 -07:00
Jiaming Yuan
e48e05e6e2
Add typehint to rabit module. ( #7240 )
2021-09-17 18:31:02 +08:00
Jiaming Yuan
c735c17f33
Disable callback and ES on random forest. ( #7236 )
2021-09-17 18:21:17 +08:00
Jiaming Yuan
c311a8c1d8
Enable compiling with system cub. ( #7232 )
...
- Tested with all CUDA 11.x.
- Workaround cub scan by using discard iterator in AUC.
- Limit the size of Argsort when compiled with CUDA cub.
2021-09-17 14:28:18 +08:00
Jiaming Yuan
b18f5f61b0
Fix pylint ( #7241 )
2021-09-17 11:50:36 +08:00
Jiaming Yuan
38a23f66a8
Fix typo in release script. [skip ci] ( #7238 )
2021-09-17 11:14:05 +08:00
Jiaming Yuan
8ad7e8eeb0
[doc] Fix typo. [skip ci] ( #7226 )
2021-09-17 11:13:49 +08:00
Jiaming Yuan
22d56cebf1
Encode pandas categorical data automatically. ( #7231 )
2021-09-17 11:09:55 +08:00
Jiaming Yuan
32e0858501
Fix travis. ( #7237 )
2021-09-17 10:06:23 +08:00
Jiaming Yuan
31c1e13f90
Categorical data support in CPU sketching. ( #7221 )
2021-09-17 04:37:09 +08:00
Jiaming Yuan
9f63d6fead
[jvm-packages] Deprecate constructors with implicit missing value. ( #7225 )
2021-09-17 04:35:04 +08:00
Jiaming Yuan
0ed979b096
Support more input types for categorical data. ( #7220 )
...
* Support more input types for categorical data.
* Shorten the type name from "categorical" to "c".
* Tests for np/cp array and scipy csr/csc/coo.
* Specify the type for feature info.
2021-09-16 20:39:30 +08:00
Jiaming Yuan
2942dc68e4
Fix mixed types in GPU sketching. ( #7228 )
2021-09-16 00:10:25 +08:00
Jiaming Yuan
037dd0820d
Implement __sklearn_is_fitted__. ( #7230 )
2021-09-15 19:09:04 +08:00
Jiaming Yuan
d997c967d5
Demo for experimental categorical data support. ( #7213 )
2021-09-15 08:20:12 +08:00
Jiaming Yuan
3515931305
Initial support for external memory in gradient index. ( #7183 )
...
* Add hessian to batch param in preparation of new approx impl.
* Extract a push method for gradient index matrix.
* Use span instead of vector ref for hessian in sketching.
* Create a binary format for gradient index.
2021-09-13 12:40:56 +08:00
Christian Lorentzen
a0dcf6f5c1
[DOC] Improve tutorial on feature interactions ( #7219 )
2021-09-12 21:40:02 +08:00
Jiaming Yuan
804b2ac60f
Expose DMatrix API for CUDA columnar and array. ( #7217 )
...
* Use JSON encoded configurations.
* Expose them into header file.
2021-09-09 17:55:25 +08:00
Jiaming Yuan
68a2c7b8d6
Fix memory leak in demo. ( #7216 )
2021-09-09 13:51:03 +08:00
Jiaming Yuan
b12e7f7edd
Add noexcept to JSON objects. ( #7205 )
2021-09-07 13:56:48 +08:00
Jiaming Yuan
3a4f51f39f
Avoid calling CUDA code on CPU for linear model. ( #7154 )
2021-09-01 10:45:31 +08:00
Jiaming Yuan
ba69244a94
Restore the custom double atomic add. ( #7198 )
2021-08-28 18:30:42 +08:00
Jiaming Yuan
7a1d67f9cb
[breaking] Use integer atomic for GPU histogram. ( #7180 )
...
On GPU we use rouding factor to truncate the gradient for deterministic results. This PR changes the gradient representation to fixed point number with exponent aligned with rounding factor.
[breaking] Drop non-deterministic histogram.
Use fixed point for shared memory.
This PR is to improve the performance of GPU Hist.
Co-authored-by: Andy Adinets <aadinets@nvidia.com >
2021-08-28 05:17:05 +08:00
Jiaming Yuan
e7d7ab6bc3
Better error message for ncclUnhandledCudaError. ( #7190 )
2021-08-27 10:29:22 +08:00
Philip Hyunsu Cho
b70e07da1f
[CI] Clean up in beginning of each task in Win CI ( #7189 )
2021-08-25 04:15:22 -07:00
Jiaming Yuan
cdfaa705f3
Fix building on CUDA 11.0. ( #7187 )
2021-08-25 02:57:53 -07:00
Philip Hyunsu Cho
3060f0b562
[CI] Automatically build GPU-enabled R package for Windows ( #7185 )
...
* [CI] Automatically build GPU-enabled R package for Windows
* Update Jenkinsfile-win64
* Build R package for the release branch only
* Update install doc
2021-08-25 02:11:01 -07:00
Jiaming Yuan
9c64618cb6
[breaking] Remove CUDA sm_35, add sm_86 ( #7182 )
2021-08-25 16:04:23 +08:00
Philip Hyunsu Cho
d04312b9c0
[CI] Fix hanging Python setup in Windows CI ( #7186 )
2021-08-24 22:03:51 -07:00
Jiaming Yuan
ee8d1f5ed8
Fix histogram truncation. ( #7181 )
...
* Fix truncation.
* Lint.
* lint.
2021-08-24 18:34:32 -07:00
Jiaming Yuan
3290a4f3ed
Re-enable feature validation in predict proba. ( #7177 )
2021-08-22 15:28:08 +08:00
Jiaming Yuan
bf562bd33c
Remove unused code. ( #7175 )
2021-08-18 14:02:19 +08:00
Anton Kostin
01b7acba30
Update conf.py ( #7174 )
2021-08-17 03:38:26 +08:00
Anton Kostin
ec849ec335
Update README.md ( #7173 )
2021-08-17 03:37:53 +08:00
Martin Petříček
46c46829ce
Fix model loading from stream ( #7067 )
...
Fix bug introduced in 17913713b5 (allow loading from byte array)
When loading model from stream, only last buffer read from the input stream is used to construct the model.
This may work for models smaller than 1 MiB (if you are lucky enough to read the whole model at once), but will always fail if the model is larger.
2021-08-15 21:04:33 +08:00
Jiaming Yuan
6bcbc77226
[doc] Fix typo. [skip ci] ( #7170 )
2021-08-13 03:48:16 +08:00
Jiaming Yuan
3f38d983a6
Fix prediction configuration. ( #7159 )
...
After the predictor parameter was added to the constructor, this configuration was broken.
2021-08-11 16:34:36 +08:00
Jiaming Yuan
9600ca83f3
Remove synchronization in monitor. ( #7164 )
...
* Remove synchronization in monitor.
Calling rabit functions during destruction is flaky.
* Add xgboost prefix to nvtx marker.
2021-08-11 16:33:53 +08:00
Jiaming Yuan
149f209af6
Extract histogram builder from CPU Hist. ( #7152 )
...
* Extract the CPU histogram builder.
* Fix tests.
* Reduce number of histograms being built.
2021-08-09 21:15:21 +08:00
Philip Hyunsu Cho
336af4f974
Work around a segfault observed in SparsePage::Push() ( #7161 )
...
* Work around a segfault observed in SparsePage::Push()
* Revert "Work around a segfault observed in SparsePage::Push()"
This reverts commit 30934844d00908750a5442082eb4769b1489f6a9.
* Don't call vector::resize() inside OpenMP block
* Set GITHUB_PAT env var to fix R tests
* Use built-in GITHUB_TOKEN
2021-08-08 02:12:30 -07:00
AJ Schmidt
f7003dc819
Include cpack ( #7160 )
...
Co-authored-by: ptaylor <paul.e.taylor@me.com >
2021-08-07 00:57:34 +08:00
Jiaming Yuan
8a84be37b8
Pass scikit learn estimator checks for regressor. ( #7130 )
...
* Check data shape.
* Check labels.
2021-08-03 18:58:20 +08:00
Jiaming Yuan
8ee127469f
[R] Fix nthread in DMatrix constructor. ( #7127 )
...
* Break the R C API for nthread.
2021-08-03 17:39:25 +08:00
Jiaming Yuan
ba47eda61b
[doc] Use figure directive. ( #7143 )
2021-08-03 15:56:25 +08:00
Jiaming Yuan
e2c406f5c8
Support min_delta in early stopping. ( #7137 )
...
* Support `min_delta` in early stopping.
* Remove abs_tol.
2021-08-03 14:29:17 +08:00
Jiaming Yuan
7bdedacb54
Document for process_type. ( #7135 )
...
* Update document for prune and refresh.
* Add demo.
2021-08-03 13:11:52 +08:00
Jiaming Yuan
d080b5a953
Fix model slicing. ( #7149 )
...
* Use correct pointer.
* Remove best_iteration/best_score.
2021-08-03 11:51:56 +08:00
Jiaming Yuan
36346f8f56
C API demo for inference. ( #7151 )
2021-08-03 00:46:47 +08:00
Jiaming Yuan
1369133916
[dask] Remove the workaround for segfault. ( #7146 )
2021-07-30 03:57:53 +08:00
Philip Hyunsu Cho
f1a4a1ac95
[CI] Upgrade build image to CentOS 7 + GCC 8; require CUDA 10.1 and later ( #7141 )
2021-07-29 10:54:33 -07:00
graue70
dfdf0b08fc
Fix typo and grammatical mistake in error message ( #7134 )
2021-07-28 17:17:05 +08:00
Gil Forsyth
92ae3abc97
[dask] Disallow importing non-dask estimators from xgboost.dask ( #7133 )
...
* Disallow importing non-dask estimators from xgboost.dask
This is mostly a style change, but also avoids a user error (that I have
committed on a few occasions). Since `XGBRegressor` and `XGBClassifier`
are imported as parent classes for the `dask` estimators, without
defining an `__all__`, autocomplete (or muscle) memory will produce the
following with little prompting:
```
from xgboost.dask import XGBClassifier
```
There's nothing inherently wrong with that, but given that
`XGBClassifier` is not `dask` enabled, it can lead to confusing behavior
until you figure out you should've typed
```
from xgboost.dask import DaskXGBClassifier
```
Another option is to alias import the existing non-dask estimators.
* Remove base/iter class, add train predict funcs
2021-07-28 02:07:23 +08:00
Robert Maynard
1a75f43304
Allow compilation with nvcc 11.4 ( #7131 )
...
* Use type aliases for discard iterators
* update to include host_vector as thrust 1.12 doesn't bring it in as a side-effect
* cub::DispatchRadixSort requires signed offset types
2021-07-27 20:05:33 +08:00
Jiaming Yuan
7017dd5a26
[JVM-Packages] Use Python tracker in XGBoost for JVM package. ( #7132 )
2021-07-27 16:20:42 +08:00
Jiaming Yuan
48d5de80a2
[R] Fix softprob reshape. ( #7126 )
2021-07-27 15:25:17 +08:00
Jiaming Yuan
7ee7a95b84
Use upstream URI in distributed quantile tests. ( #7129 )
...
* Use upstream URI in distributed quantile tests.
* Fix test cv `PytestAssertRewriteWarning`.
2021-07-27 14:09:49 +08:00
Jiaming Yuan
e88ac9cc54
[dask] Extend tree stats tests. ( #7128 )
...
* Add tests to GPU.
* Assert cover in children sums up to the parent.
2021-07-27 12:22:13 +08:00
Jiaming Yuan
778135f657
Fix parameter loading with training continuation. ( #7121 )
...
* Add a demo for training continuation.
2021-07-23 10:51:47 +08:00
Taewoo Kim
41e882f80b
Check input value is duplicated when quantile queue is full ( #7091 )
...
Co-authored-by: Taewoo Kim <taewoo@layer6.com >
2021-07-23 03:07:01 +08:00
ShvetsKS
caa9e527dd
Remove extra sync for dense data ( #7120 )
...
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com >
2021-07-22 19:02:31 +08:00
Jiaming Yuan
e6088366df
Export Python Interface for external memory. ( #7070 )
...
* Add Python iterator interface.
* Add tests.
* Add demo.
* Add documents.
* Handle empty dataset.
2021-07-22 15:15:53 +08:00
farfarawayzyt
e64ee6592f
fix typo in src/common/hist.cc BuildHistKernel ( #7116 )
2021-07-21 19:53:05 +08:00
naveenkb
9f7f8b976d
[XGBoost4J-Spark] bestIteration and bestScore for early stopping ( #7095 )
2021-07-19 18:46:49 +08:00
farfarawayzyt
d7c14496d2
fix typo in arguments of PartitionBuilder::Init ( #7113 )
...
Co-authored-by: Yuntian Zhang <zhangyt@lamda.nju.edu.cn >
2021-07-16 15:46:22 +08:00
Jiaming Yuan
bd1f3a38f0
Rewrite sparse dmatrix using callbacks. ( #7092 )
...
- Reduce dependency on dmlc parsers and provide an interface for users to load data by themselves.
- Remove use of threaded iterator and IO queue.
- Remove `page_size`.
- Make sure the number of pages in memory is bounded.
- Make sure the cache can not be violated.
- Provide an interface for internal algorithms to process data asynchronously.
2021-07-16 12:33:31 +08:00
Jiaming Yuan
2f524e9f41
[dask] Work around segfault in prediction. ( #7112 )
2021-07-16 04:27:05 +08:00
Jiaming Yuan
abec3dbf6d
Fix thread safety of softmax prediction. ( #7104 )
2021-07-16 02:06:55 +08:00
Philip Hyunsu Cho
2801d69fb7
[CI] Pin libomp to 11.1.0 ( #7107 )
2021-07-15 11:16:51 +08:00
Jiaming Yuan
8e8232fb4c
[CI] Update R cache. ( #7102 )
2021-07-14 03:15:35 +08:00
Jiaming Yuan
345796825f
Optional find dependency in installed cmake config. ( #7099 )
...
* Find dependency only when xgboost is built as static library.
* Resolve msvc warning.
* Add test for linking shared library.
2021-07-11 17:20:55 +08:00
ZabelTech
1d91f71119
fix typo in XGDMatrixSetFloatInfo example ( #7097 )
2021-07-10 21:40:25 +08:00
Jiaming Yuan
77f6cf2d13
Support hessian in host sketch container. ( #7081 )
...
Prepare for migrating approx onto hist's codebase.
2021-07-08 16:33:58 +08:00
Jiaming Yuan
84d359efb8
Support host data in proxy DMatrix. ( #7087 )
2021-07-08 11:35:48 +08:00
Jiaming Yuan
5d7cdf2e36
[Breaking] Rename Quantile DMatrix C API. ( #7082 )
...
The role of ProxyDMatrix is going beyond what it was designed. Now it's used by both
QuantileDeviceDMatrix and inplace prediction. After the refactoring of sparse DMatrix it
will also be used for external memory. Renaming the C API to extract it from
QuantileDeviceDMatrix.
2021-07-08 11:34:14 +08:00
Jiaming Yuan
c766f143ab
Refactor external memory formats. ( #7089 )
...
* Save base_rowid.
* Return write size.
* Remove unused function.
2021-07-08 04:04:51 +08:00
Jiaming Yuan
689eb8f620
Check external memory support for exact tree method. ( #7088 )
2021-07-08 02:12:57 +08:00
Jiaming Yuan
615ab2b03e
Extract evaluate splits from CPU hist. ( #7079 )
...
Other than modularizing the split evaluation function, this PR also removes some more functions including `InitNewNodes` and `BuildNodeStats` among some other unused variables. Also, scattered code like setting leaf weights is grouped into the split evaluator and `NodeEntry` is simplified and made private. Another subtle difference with the original implementation is that the modified code doesn't call `tree[nidx].Parent()` to traversal upward.
2021-07-07 15:16:25 +08:00
Jeff H
d22b293f2f
Update reference to treelite website ( #7084 )
...
treelite.io is no longer a valid site and re-directs users to a parked domain. Re-directing to the documentation is safer at this point.
2021-07-06 22:15:07 -07:00
Jiaming Yuan
f937f514aa
Remove lz4 compression with external memory. ( #7076 )
2021-07-06 14:46:43 +08:00
Jiaming Yuan
116d711815
Make SimpleDMatrix ctor reusable. ( #7075 )
2021-07-06 13:38:24 +08:00
Jiaming Yuan
d7e1fa7664
Fix feature names and types in output model slice. ( #7078 )
2021-07-06 11:47:49 +08:00
Jiaming Yuan
ffa66aace0
Persist data in dask test. ( #7077 )
2021-07-06 11:47:17 +08:00
Jiaming Yuan
b56d3d5d5c
Fix with latest panda range index. ( #7074 )
2021-07-03 16:43:52 +08:00
Jiaming Yuan
93f3acdef9
Fix with latest pylint. ( #7071 )
2021-07-02 21:26:00 +08:00
Jiaming Yuan
a5d222fcdb
Handle categorical split in model histogram and dataframe. ( #7065 )
...
* Error on get_split_value_histogram when feature is categorical
* Add a category column to output dataframe
2021-07-02 13:10:36 +08:00
Jiaming Yuan
1cd20efe68
Move GHistIndex into DMatrix. ( #7064 )
2021-07-01 00:44:49 +08:00
Jiaming Yuan
1c8fdf2218
Remove use of device_idx in dh::LaunchN. ( #7063 )
...
It's an unused parameter, removing it can make the CI log more readable.
2021-06-29 11:37:26 +08:00
Philip Hyunsu Cho
dd4db347f3
Fix early stopping behavior with MAPE metric ( #7061 )
2021-06-26 03:02:33 +08:00
Jiaming Yuan
8fa32fdda2
Implement categorical data support for SHAP. ( #7053 )
...
* Add CPU implementation.
* Update GPUTreeSHAP.
* Add GPU implementation by defining custom split condition.
2021-06-25 19:02:46 +08:00
Jiaming Yuan
663136aa08
Implement feature score for linear model. ( #7048 )
...
* Add feature score support for linear model.
* Port R interface to the new implementation.
* Add linear model support in Python.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-06-25 14:34:02 +08:00
Philip Hyunsu Cho
b2d300e727
[CI] Upgrade to CMake 3.14 ( #7060 )
...
* [CI] Upgrade to CMake 3.14
* Add FATAL_ERROR directive, for users with CMake 2.x
2021-06-24 18:07:24 -07:00
Jiaming Yuan
1d4d345634
Tests for dask skl categorical data support. ( #7054 )
2021-06-24 16:33:57 +08:00
Jiaming Yuan
da1ad798ca
Convert numpy float to Python float in feat score. ( #7047 )
2021-06-21 20:58:43 +08:00
Jiaming Yuan
bbfffb444d
Fix race condition in CPU shap. ( #7050 )
2021-06-21 10:03:15 +08:00
Jiaming Yuan
29f8fd6fee
Support categorical split in tree model dump. ( #7036 )
2021-06-18 16:46:20 +08:00
Jiaming Yuan
7968c0d051
Test on s390x. ( #7038 )
...
* Fix && remove unused parameter.
2021-06-18 14:55:08 +08:00
Jiaming Yuan
86715e4cd4
Support categorical data for dask functional interface and DQM. ( #7043 )
...
* Support categorical data for dask functional interface and DQM.
* Implement categorical data support for GPU GK-merge.
* Add support for dask functional interface.
* Add support for DQM.
* Get newer cupy.
2021-06-18 13:06:52 +08:00
Jiaming Yuan
7dd29ffd47
Implement feature score in GBTree. ( #7041 )
...
* Categorical data support.
* Eliminate text parsing during feature score computation.
2021-06-18 11:53:16 +08:00
Jiaming Yuan
dcd84b3979
[CI] Configure RAPIDS, dask, modin ( #7033 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-06-18 10:27:51 +08:00
Jiaming Yuan
d9799b09d0
Categorical data support for cuDF. ( #7042 )
...
* Add support in DMatrix.
* Add support in DQM, except for iterator.
2021-06-17 13:54:33 +08:00
Jiaming Yuan
5c2d7a18c9
Parallel model dump for trees. ( #7040 )
2021-06-15 14:08:26 +08:00
ShvetsKS
2567404ab6
Simplify sparse and dense CPU hist kernels ( #7029 )
...
* Simplify sparse and dense kernels
* Extract row partitioner.
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
2021-06-11 18:26:30 +08:00
Jiaming Yuan
1faad825f4
Remove appveyor badge. [skip ci] ( #7035 )
2021-06-11 14:37:18 +08:00
Jiaming Yuan
b56614e9b8
[R] Use new predict function. ( #6819 )
...
* Call new C prediction API.
* Add `strict_shape`.
* Add `iterationrange`.
* Update document.
2021-06-11 13:03:29 +08:00
jmoralez
25514e104a
[dask] speed up tests ( #7020 )
2021-06-11 11:43:01 +08:00
Jiaming Yuan
f79cc4a7a4
Implement categorical prediction for CPU and GPU predict leaf. ( #7001 )
...
* Categorical prediction with CPU predictor and GPU predict leaf.
* Implement categorical prediction for CPU prediction.
* Implement categorical prediction for GPU predict leaf.
* Refactor the prediction functions to have a unified get next node function.
Co-authored-by: Shvets Kirill <kirill.shvets@intel.com >
2021-06-11 10:11:45 +08:00
Jiaming Yuan
72f9daf9b6
Fix gpu_id with custom objective. ( #7015 )
2021-06-09 14:51:17 +08:00
TP Boudreau
bd2ca543c4
Fix BinarySearchBin() argument types ( #7026 )
2021-06-08 19:05:46 +08:00
Jiaming Yuan
7beb2f7fae
Hide symbols in CI build + hide symbols for C and CUDA ( #6798 )
...
* Hide symbols in CI build.
* Hide symbols for other languages.
2021-06-04 02:35:46 +08:00
Jiaming Yuan
c4b9f4f622
Add enable_categorical to sklearn. ( #7011 )
2021-06-04 02:29:14 +08:00
Philip Hyunsu Cho
655e6992f6
[Dask] Add example of using custom callback in Dask ( #6995 )
2021-06-03 07:05:55 +08:00
ShvetsKS
5cdaac00c1
Remove feature grouping ( #7018 )
...
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
2021-06-03 04:35:26 +08:00
Philip Hyunsu Cho
05db6a6c29
[CI] Upgrade cuDF and RMM to 21.06 nightly ( #7012 )
...
* [CI] Upgrade cuDF and RMM to 21.06 nightly
* Trim outdated test cases
* Pin Dask version to 2021.05.0 for now
2021-06-02 11:59:30 -07:00
ShvetsKS
57c732655e
Merge lossgude and depthwise strategies for CPU hist ( #7007 )
...
* fix java/scala test: max depth is also valid parameter for lossguide
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
2021-06-03 01:49:43 +08:00
Jiaming Yuan
ee4f51a631
Support for all primitive types from array. ( #7003 )
...
* Change C API name.
* Test for all primitive types from array.
* Add native support for CPU 128 float.
* Convert boolean and float16 in Python.
* Fix dask version for now.
2021-06-01 08:34:48 +08:00
Jiaming Yuan
816b789bf0
Add predictor to skl constructor. ( #7000 )
2021-05-29 04:52:56 +08:00
ShvetsKS
55b823b27d
Reduce 'InitSampling' complexity and set gradients to zero ( #6922 )
...
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
2021-05-29 04:52:23 +08:00
Jiaming Yuan
89a49cf30e
Fix dask predict on DaskDMatrix with iteration_range. ( #7005 )
2021-05-29 04:43:12 +08:00
Jiaming Yuan
4cf95a6041
Support numpy array interface ( #6998 )
2021-05-27 16:08:22 +08:00
Jiaming Yuan
ab6fd304c4
[Python] Change development release postfix to dev ( #6988 )
2021-05-27 16:06:51 +08:00
Jiaming Yuan
29d6a5e2b8
[CI] Move appveyor tests to action ( #6986 )
...
* Drop support for VS14, use VS15 instead.
* Drop support for mingw.
* Remove debug build.
* Split up jvm tests.
* Split up Python tests.
2021-05-27 04:49:45 +08:00
Jiaming Yuan
86e60e3ba8
Guard against index error in prediction. ( #6982 )
...
* Remove `best_ntree_limit` from documents.
2021-05-25 23:24:59 +08:00
Philip Hyunsu Cho
c6d87e5e18
[CI] Remove stray build artifact to avoid error in artifact packaging ( #6994 )
2021-05-25 19:48:27 +08:00
Jiaming Yuan
a4bc7ecf27
Restore R cache on github action. ( #6985 )
2021-05-25 18:53:44 +08:00
Jiaming Yuan
6e52aefb37
Revert OMP guard. ( #6987 )
...
The guard protects the global variable from being changed by XGBoost. But this leads to a
bug that the `n_threads` parameter is no longer used after the first iteration. This is
due to the fact that `omp_set_num_threads` is only called once in `Learner::Configure` at
the beginning of the training process.
The guard is still useful for `gpu_id`, since this is called all the times in our codebase
doesn't matter which iteration we are currently running.
2021-05-25 08:56:28 +08:00
Jiaming Yuan
cf06a266a8
[dask][doc] Wrap the example in main guard. ( #6979 )
2021-05-25 08:24:47 +08:00
Mads R. B. Kristensen
81bdfb835d
lazy_isinstance(): use .__class__ for type check ( #6974 )
2021-05-21 11:33:08 +08:00
Emil Sadek
29c942f2a8
[doc] Capitalize section headers ( #6976 )
2021-05-21 11:31:05 +08:00
Adam Pocock
2320aa0da2
Making the Java library loader emit helpful error messages on missing dependencies. ( #6926 )
2021-05-19 14:53:56 +08:00
Jiaming Yuan
5cb51a191e
[dask][doc] Add small example for sklearn interface. ( #6970 )
2021-05-19 13:50:45 +08:00
Jiaming Yuan
7e846bb965
Fix prediction on df with latest dask. ( #6969 )
2021-05-19 12:23:03 +08:00
Jiaming Yuan
6e104f0570
Add news for 1.4.2. [skip ci] ( #6963 )
2021-05-17 02:50:55 +08:00
ReeceGoding
42fc7ca6a0
Corrected lapply comment in callbacks.R ( #6967 )
...
The comment was made false by the removal of the pipes.
2021-05-17 02:31:50 +08:00
Livius
a4886c404a
Fix compilation error on x86 ( #6964 )
...
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2021-05-14 13:31:49 +08:00
ReeceGoding
f94f479358
Simplify list2mat call from lapply in callbacks.R ( #6966 )
2021-05-14 03:40:58 +08:00
Jiaming Yuan
d245bc891e
Add tolerance to early stopping. ( #6942 )
2021-05-14 00:19:51 +08:00
James Lamb
894e9bc5d4
[R-package] remove dependency on {magrittr} ( #6928 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2021-05-13 04:34:59 +08:00
Jiaming Yuan
44cc9c04ea
Fix multiclass auc with empty dataset. ( #6947 )
2021-05-12 15:01:14 +08:00
Jiaming Yuan
05ac415780
[dask] Set dataframe index in predict. ( #6944 )
2021-05-12 13:24:21 +08:00
Andrew Ziem
3e7e426b36
Fix spelling in documents ( #6948 )
...
* Update roxygen2 doc.
Co-authored-by: fis <jm.yuan@outlook.com >
2021-05-11 20:44:36 +08:00
vslaykovsky
2a9979e256
Fixed incorrect feature mismatch error message ( #6949 )
...
data.shape[0] denotes the number of samples, data.shape[1] is the number of features
2021-05-11 13:52:11 +08:00
Philip Hyunsu Cho
90cd724be1
[CI] Fix CI/CD pipeline broken by latest auditwheel (4.0.0) ( #6951 )
2021-05-10 22:43:15 -07:00
Daniel Saxton
e41619b1fc
Link to valid tree_method values in docs ( #6935 )
2021-05-06 17:33:18 +08:00
Philip Hyunsu Cho
ec6ce08cd0
[jvm-packages] Make it easier to release GPU/CPU code artifacts to Maven Central ( #6940 )
2021-05-04 14:00:03 -07:00
Jose Manuel Llorens
4ddbaeea32
Improve warning when using np.ndarray subsets ( #6934 )
2021-05-04 13:24:41 +08:00
Ali
b35dd76dca
[R] don't remove CMakeLists in cleanup ( #6930 )
...
currently installing the R-pacakge will leave the repo in dirty state, since
`CmakeLists.txt` is already checked in. This fixes the `cleanup`
script to not delete this file.
2021-05-03 17:46:15 +08:00
Jiaming Yuan
37ad60fe25
Enforce input data is not object. ( #6927 )
...
* Check for object data type.
* Allow strided arrays with greater underlying buffer size.
2021-05-02 00:09:01 +08:00
Jiaming Yuan
a1d23f6613
Relax test for decision stump in distributed environment. ( #6919 )
2021-04-30 09:04:11 +08:00
Jiaming Yuan
45ddc39c1d
Relax shotgun test. ( #6918 )
2021-04-30 09:03:12 +08:00
Jiaming Yuan
34df1f588b
Reduce Travis environment setup time. ( #6912 )
...
* Remove unused r from travis.
* Don't update homebrew.
* Don't install indirect/unused dependencies like libgit2, wget, openssl.
* Move graphviz installation to conda.
2021-04-30 09:02:40 +08:00
Jiaming Yuan
b31d37eac5
[CI] Fix custom metric test with empty dataset. ( #6917 )
2021-04-30 09:00:05 +08:00
Jiaming Yuan
db6285fb55
[CI] Skip external memory gtest on osx. ( #6901 )
2021-04-30 08:59:33 +08:00
david-cortes
4e1a8b1fe5
Update R handles in-place ( #6903 )
...
* update R handles in-place #fixes 6896
* update test to expect non-null handle
* remove unused variable
* fix failing tests
* solve linter complains
2021-04-29 12:50:46 -07:00
Philip Hyunsu Cho
5472ef626c
[R] Re-generate Roxygen2 doc ( #6915 )
2021-04-29 11:55:07 -07:00
James Lamb
20f34d9776
[R-package] Update dependencies from CMake-based installation ( #6906 )
...
* remove stringi
* add Matrix and jsonlite
2021-04-29 01:32:01 +08:00
Jiaming Yuan
ef473b1f09
Disable pylint error. ( #6911 )
2021-04-29 01:01:37 +08:00
Jiaming Yuan
8760ec4827
Ensure predict leaf output 1-dim vector where there's only 1 tree. ( #6889 )
2021-04-23 15:07:48 +08:00
Jiaming Yuan
54afa3ac7a
Relax shotgun test. ( #6900 )
...
It's non-deterministic algorithm, the test is flaky.
2021-04-23 13:01:44 +08:00
Jiaming Yuan
a2ecbdaa31
Add an API guard to prevent global variables being changed. ( #6891 )
2021-04-23 10:27:57 +08:00
Jiaming Yuan
896aede340
Reorganize the installation documents. ( #6877 )
...
* Split up installation and building from source.
* Use consistent section titles.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-04-22 04:48:32 +08:00
Jiaming Yuan
74b41637de
Revert "[jvm-packages] Add XGBOOST_RABIT_TRACKER_IP_FOR_TEST to set rabit tracker IP. ( #6869 )" ( #6886 )
...
This reverts commit 2828da3c4c .
2021-04-21 11:20:10 -07:00
Kai Fricke
c8cc3eacc9
[docs] Add tutorial for XGBoost-Ray ( #6884 )
...
* Add XGBoost-Ray tutorial
* Add link to modin
2021-04-22 02:07:13 +08:00
Bobby Wang
2828da3c4c
[jvm-packages] Add XGBOOST_RABIT_TRACKER_IP_FOR_TEST to set rabit tracker IP. ( #6869 )
...
* Add `XGBOOST_RABIT_TRACKER_IP_FOR_TEST` to set rabit tracker IP
* change spark and rabit tracker IP to 127.0.0.1on GitHub Action.
Co-authored-by: fis <jm.yuan@outlook.com >
2021-04-22 02:00:22 +08:00
Jiaming Yuan
233bdf105f
Remove setDaemon in tracker. ( #6872 )
2021-04-22 01:57:13 +08:00
Jiaming Yuan
71b938f608
1.4.1 release news. ( #6876 )
2021-04-22 01:55:57 +08:00
Jiaming Yuan
146549260a
Bump version to 1.5.0 snapshot in master. ( #6875 )
2021-04-22 01:53:44 +08:00
Jiaming Yuan
bec2b4f094
Revert "Use CPU input for test_boost_from_prediction. ( #6818 )" ( #6858 )
...
This reverts commit 74f3a2f4b5 .
2021-04-20 14:54:02 +08:00
Bobby Wang
2c684ffd32
[jvm-packages] fix "key not found: train" issue ( #6842 )
...
* [jvm-packages] fix "key not found: train" issue
* fix bug
2021-04-18 23:28:39 -07:00
Jiaming Yuan
556a83022d
Implement unified update prediction cache for (gpu_)hist. ( #6860 )
...
* Implement utilites for linalg.
* Unify the update prediction cache functions.
* Implement update prediction cache for multi-class gpu hist.
2021-04-17 00:29:34 +08:00
Jiaming Yuan
1b26a2a561
Copy output data for argsort. ( #6866 )
...
Fix GPU AUC.
2021-04-16 21:05:01 +08:00
Jiaming Yuan
a5d7094a45
Update documents. ( #6856 )
...
* Add early stopping section to prediction doc.
* Remove best_ntree_limit.
* Better doxygen output.
2021-04-16 12:41:03 +08:00
ReeceGoding
d31a57cf5f
Removed typo in callbacks.R ( #6863 )
...
Changed "TURE" to "TRUE".
2021-04-16 05:43:22 +08:00
Jiaming Yuan
bccb7e87d1
Update dmlc-core. ( #6862 )
...
* Install pandoc, pandoc-citeproc on CI.
2021-04-16 00:14:17 +08:00
ReeceGoding
2e8c101b4a
Removed magrittr dependency in callbacks.R ( #6855 )
2021-04-15 18:45:17 +08:00
Philip Hyunsu Cho
4224c08cac
Add demo for using AFT survival with Dask ( #6853 )
2021-04-13 16:18:33 -07:00
Philip Hyunsu Cho
878b990fcd
[CI] Upload Doxygen to correct destination ( #6854 )
2021-04-13 16:18:13 -07:00
Jiaming Yuan
dee5ef2dfd
Typehint for Sklearn. ( #6799 )
2021-04-14 06:55:21 +08:00
Jiaming Yuan
3d919db0c0
Fix pip release script. [skip ci] ( #6845 )
2021-04-14 06:46:02 +08:00
Jiaming Yuan
b9a4f3336a
1.4 release notes. ( #6843 )
2021-04-13 08:38:27 +08:00
Philip Hyunsu Cho
ea7a6a0321
[CI] Pack R package tarball with pre-built xgboost.so (with GPU support) ( #6827 )
...
* Add scripts for packaging R package with GPU-enabled libxgboost.so
* [CI] Automatically build R package tarball
* Add comments
* Don't build tarball for pull requests
* Update the installation doc
2021-04-07 21:15:34 -07:00
Jiaming Yuan
f294c4e023
Use constexpr in dh::CopyIf. ( #6828 )
2021-04-08 07:37:47 +08:00
Viktor Szathmáry
b65e3c4444
[jvm] reduce scala-compiler, scalatest dependency scopes ( #6730 )
...
* [jvm] reduce scala-compiler, scalatest dependency scopes
* [jvm] workaround for GpuTestSuite scalatest dependency
* scalatest scope tweak
2021-04-07 15:22:08 -07:00
Jiaming Yuan
7bcc8b3e5c
Use batched copy if. ( #6826 )
2021-04-06 10:34:04 +08:00
giladmaya
aa0d8f20c1
Support configuring constraints by feature names ( #6783 )
...
Co-authored-by: fis <jm.yuan@outlook.com >
2021-04-04 06:53:33 +08:00
Jiaming Yuan
7e06c81894
Fix approximated predict contribution. ( #6811 )
2021-04-03 02:15:03 +08:00
Jiaming Yuan
0cced530ea
[doc] Clarify prediction function. ( #6813 )
2021-04-03 02:12:04 +08:00
Jiaming Yuan
b1fdb220f4
Remove deprecated n_gpus parameter. ( #6821 )
2021-04-02 03:02:32 +08:00
Jiaming Yuan
74f3a2f4b5
Use CPU input for test_boost_from_prediction. ( #6818 )
2021-04-02 00:11:35 +08:00
Jiaming Yuan
47b62480af
More general predict proba. ( #6817 )
...
* Use `output_margin` for `softmax`.
* Add test for dask binary cls.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-04-01 19:52:12 +08:00
Jiaming Yuan
a5c852660b
Update document for sklearn model IO. ( #6809 )
...
* Update the use of JSON.
* Remove unnecessary type cast.
2021-04-01 15:52:36 +08:00
Jiaming Yuan
905fdd3e08
Fix typos in AUC. ( #6795 )
2021-03-31 16:35:42 +08:00
Jiaming Yuan
ca998df912
Clarify the behavior of use_rmm. ( #6808 )
...
* Clarify the `use_rmm` flag in document and demo.
2021-03-31 15:43:11 +08:00
Jiaming Yuan
3039dd194b
Don't estimate sketch batch size when rmm is used. ( #6807 )
2021-03-31 15:29:56 +08:00
Jiaming Yuan
10ae0f9511
Fix doc for apply method. ( #6796 )
2021-03-31 15:28:31 +08:00
Jiaming Yuan
138fe8516a
Remove unnecessary calls to iota. ( #6797 )
2021-03-31 15:27:23 +08:00
Jiaming Yuan
79b8b560d2
Optimize dart inplace predict perf. ( #6804 )
2021-03-31 15:20:54 +08:00
JohanWork
4aa12e10c0
Update URL ( #6810 )
2021-03-30 22:27:30 +08:00
James Lamb
f01af43eb0
[dask] disable work stealing explicitly for training tasks ( #6794 )
2021-03-29 16:47:56 +08:00
Jiaming Yuan
a59c7323b4
Fix inplace predict missing value. ( #6787 )
2021-03-27 05:36:10 +08:00
Jiaming Yuan
5c87c2bba8
Update demo for prediction. ( #6789 )
...
* Remove use of deprecated ntree_limit.
* Add sklearn demo.
2021-03-27 03:09:25 +08:00
ShvetsKS
8825670c9c
Memory consumption fix for row-major adapters ( #6779 )
...
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
Co-authored-by: fis <jm.yuan@outlook.com >
2021-03-26 08:44:30 +08:00
Philip Hyunsu Cho
744c46995c
[CI] Upload xgboost4j.dll to S3 ( #6781 )
2021-03-25 11:34:34 -07:00
Jiaming Yuan
a7083d3c13
Fix dart inplace prediction with GPU input. ( #6777 )
...
* Fix dart inplace predict with data on GPU, which might trigger a fatal check
for device access right.
* Avoid copying data whenever possible.
2021-03-25 12:00:32 +08:00
Jiaming Yuan
1d90577800
Verify strictly positive labels for gamma regression. ( #6778 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2021-03-25 11:46:52 +08:00
Jiaming Yuan
794fd6a46b
Support v3 cuda array interface. ( #6776 )
2021-03-25 09:58:09 +08:00
Jiaming Yuan
bcc0277338
Re-implement ROC-AUC. ( #6747 )
...
* Re-implement ROC-AUC.
* Binary
* MultiClass
* LTR
* Add documents.
This PR resolves a few issues:
- Define a value when the dataset is invalid, which can happen if there's an
empty dataset, or when the dataset contains only positive or negative values.
- Define ROC-AUC for multi-class classification.
- Define weighted average value for distributed setting.
- A correct implementation for learning to rank task. Previous
implementation is just binary classification with averaging across groups,
which doesn't measure ordered learning to rank.
2021-03-20 16:52:40 +08:00
Jiaming Yuan
4ee8340e79
Support column major array. ( #6765 )
2021-03-20 05:19:46 +08:00
Jiaming Yuan
f6fe15d11f
Improve parameter validation ( #6769 )
...
* Add quotes to unused parameters.
* Check for whitespace.
2021-03-20 01:56:55 +08:00
Jiaming Yuan
23b4165a6b
Fix gamma deviance ( #6761 )
2021-03-20 01:56:17 +08:00
ReeceGoding
c2b6b80600
R documentation: Make construction of DMatrix consistent.
...
* Fix inconsistency of construction of DMatrix.
* Fix missing parameters.
2021-03-20 01:55:13 +08:00
Qingyun Wu
642336add7
[doc] Add FLAML as a fast tuning tool for XGBoost ( #6770 )
...
Co-authored-by: Qingyun Wu <qiw@microsoft.com >
2021-03-20 01:47:39 +08:00
Philip Hyunsu Cho
4230dcb614
Re-introduce double buffer in UpdatePosition, to fix perf regression in gpu_hist ( #6757 )
...
* Revert "gpu_hist performance tweaks (#5707 )"
This reverts commit f779980f7e .
* Address reviewer's comment
* Fix build error
2021-03-18 13:56:10 -07:00
Jiaming Yuan
e2d8a99413
Add document for tests directory. [skip ci] ( #6760 )
2021-03-18 15:15:50 +08:00
ReeceGoding
4e00737c60
Fix R documentation for xgb.train. ( #6764 )
...
The [general documentation](https://xgboost.readthedocs.io/en/latest/parameter.html#parameters-for-tree-booster ) clearly has alpha and lambda under its "Parameters for Tree Booster" heading. Furthermore, the R package clearly uses alpha and lambda when told to use the tree booster. This update adds those two parameters to the documentation for the R package.
Closed issue #6763 .
2021-03-18 15:04:00 +08:00
Jiaming Yuan
4f75f514ce
Fix GPU RF ( #6755 )
...
* Fix sampling.
2021-03-17 06:23:35 +08:00
Jiaming Yuan
1a73a28511
Add device argsort. ( #6749 )
...
This is part of https://github.com/dmlc/xgboost/pull/6747 .
2021-03-16 16:05:22 +08:00
Jiaming Yuan
325bc93e16
[dask] Use distributed.MultiLock ( #6743 )
...
* [dask] Use `distributed.MultiLock`
This enables training multiple models in parallel.
* Conditionally import `MultiLock`.
* Use async train directly in scikit learn interface.
* Use `worker_client` when available.
2021-03-16 14:19:41 +08:00
Igor Rukhovich
19a2c54265
Prediction by indices (subsample < 1) ( #6683 )
...
* Another implementation of predicting by indices
* Fixed omp parallel_for variable type
* Removed SparsePageView from Updater
2021-03-16 15:08:20 +13:00
Philip Hyunsu Cho
366f3cb9d8
Add use_rmm flag to global configuration ( #6656 )
...
* Ensure RMM is 0.18 or later
* Add use_rmm flag to global configuration
* Modify XGBCachingDeviceAllocatorImpl to skip CUB when use_rmm=True
* Update the demo
* [CI] Pin NumPy to 1.19.4, since NumPy 1.19.5 doesn't work with latest Shap
2021-03-09 14:53:05 -08:00
Philip Hyunsu Cho
e4894111ba
Update dmlc-core submodule ( #6745 )
2021-03-07 00:30:26 -08:00
Bobby Wang
49c22c23b4
[jvm-packages] fix early stopping doesn't work even without custom_eval setting ( #6738 )
...
* [jvm-packages] fix early stopping doesn't work even without custom_eval setting
* remove debug info
* resolve comment
2021-03-06 20:19:40 -08:00
Philip Hyunsu Cho
5ae7f9944b
[CI] Clear R package cache ( #6746 )
2021-03-06 08:37:16 -08:00
Jiaming Yuan
f20074e826
Check for invalid data. ( #6742 )
2021-03-04 14:37:20 +08:00
Jiaming Yuan
a9b4a95225
Fix learning rate scheduler with cv. ( #6720 )
...
* Expose more methods in cvpack and packed booster.
* Fix cv context in deprecated callbacks.
* Fix document.
2021-02-28 13:57:42 +08:00
kangsheng89
9c8523432a
fix relocatable include in CMakeList ( #6734 ) ( #6737 )
2021-02-27 19:17:29 +08:00
Roffild
1fa6793a4e
Tests for regression metrics with weights. ( #6729 )
2021-02-25 22:08:14 +08:00
Jiaming Yuan
9da2287ab8
[breaking] Save booster feature info in JSON, remove feature name generation. ( #6605 )
...
* Save feature info in booster in JSON model.
* [breaking] Remove automatic feature name generation in `DMatrix`.
This PR is to enable reliable feature validation in Python package.
2021-02-25 18:54:16 +08:00
capybara
b6167cd2ff
[dask] Use client to persist collections ( #6722 )
...
Co-authored-by: fis <jm.yuan@outlook.com >
2021-02-25 16:40:38 +08:00
Louis Desreumaux
9b530e5697
Improve OpenMP exception handling ( #6680 )
2021-02-25 13:56:16 +08:00
Jiaming Yuan
c375173dca
Support pylint 2.7.0 ( #6726 )
2021-02-25 12:49:58 +08:00
Honza Sterba
17913713b5
[jvm] Add ability to load booster direct from byte array ( #6655 )
...
* Add ability to load booster direct from byte array
* fix compiler error
* move InputStream to byte-buffer conversion
- move it from Booster to XGBoost facade class
2021-02-23 11:28:27 -08:00
Jiaming Yuan
872e559b91
Use inplace predict for sklearn. ( #6718 )
...
* Use inplace predict for sklearn when possible.
2021-02-22 12:27:04 +08:00
Benjamin Lehmann
25077564ab
Fixes small typo in sklearn documentation ( #6717 )
...
Replaces "dowm" with "down" on parameter n_jobs
2021-02-20 07:36:06 +08:00
Jiaming Yuan
bdedaab8d1
Fix pylint. ( #6714 )
2021-02-19 11:53:27 +08:00
ShvetsKS
9f15b9e322
Optimize CPU prediction ( #6696 )
...
Co-authored-by: Shvets Kirill <kirill.shvets@intel.com >
2021-02-16 14:41:22 +08:00
James Lamb
dc97b5f19f
[dask] remove outdated comment ( #6699 )
2021-02-15 18:49:11 +08:00
Roffild
4c5d2608e0
[python-package] Fix class Booster: feature_types = None ( #6705 )
2021-02-13 17:50:23 +08:00
ShvetsKS
9a0399e898
Removed unnecessary PredictBatch calls ( #6700 )
...
Co-authored-by: Shvets Kirill <kirill.shvets@intel.com >
2021-02-10 20:15:14 +08:00
Ali
9b267a435e
Bail out early if libxgboost exists in python setup ( #6694 )
...
Skip `copy_tree` when existing build is found.
2021-02-10 10:50:10 +08:00
Jiaming Yuan
e8c5c53e2f
Use Predictor for dart. ( #6693 )
...
* Use normal predictor for dart booster.
* Implement `inplace_predict` for dart.
* Enable `dart` for dask interface now that it's thread-safe.
* categorical data should be working out of box for dart now.
The implementation is not very efficient as it has to pull back the data and
apply weight for each tree, but still a significant improvement over previous
implementation as now we no longer binary search for each sample.
* Fix output prediction shape on dataframe.
2021-02-09 23:30:19 +08:00
Jiaming Yuan
dbf7e9d3cb
Remove R cache in github action. ( #6695 )
...
The cache stores outdated packages with wrong linkage. Right now there's no way to clear the cache.
2021-02-09 18:53:20 +08:00
Jiaming Yuan
1335db6113
[dask] Improve documents. ( #6687 )
...
* Add tag for versions.
* use autoclass in sphinx build.
Made some class methods to be private to avoid exporting documents.
2021-02-09 09:20:58 +08:00
Jiaming Yuan
5d48d40d9a
Fix DMatrix slice with feature types. ( #6689 )
2021-02-09 08:13:51 +08:00
Jiaming Yuan
218a5fb6dd
Simplify Span checks. ( #6685 )
...
* Stop printing out message.
* Remove R specialization.
The printed message is not really useful anyway, without a reproducible example
there's no way to fix it. But if there's a reproducible example, we can always
obtain these information by a debugger. Removing the `printf` function avoids
creating the context in kernel.
2021-02-09 08:12:58 +08:00
Jiaming Yuan
4656b09d5d
[breaking] Add prediction fucntion for DMatrix and use inplace predict for dask. ( #6668 )
...
* Add a new API function for predicting on `DMatrix`. This function aligns
with rest of the `XGBoosterPredictFrom*` functions on semantic of function
arguments.
* Purge `ntree_limit` from libxgboost, use iteration instead.
* [dask] Use `inplace_predict` by default for dask sklearn models.
* [dask] Run prediction shape inference on worker instead of client.
The breaking change is in the Python sklearn `apply` function, I made it to be
consistent with other prediction functions where `best_iteration` is used by
default.
2021-02-08 18:26:32 +08:00
Jiaming Yuan
dbb5208a0a
Use __array_interface__ for creating DMatrix from CSR. ( #6675 )
...
* Use __array_interface__ for creating DMatrix from CSR.
* Add configuration.
2021-02-05 21:09:47 +08:00
Jiaming Yuan
1e949110da
Use generic dispatching routine for array interface. ( #6672 )
2021-02-05 09:23:38 +08:00
Jiaming Yuan
a4101de678
Fix divide by 0 in feature importance when no split is found. ( #6676 )
2021-02-05 03:39:30 +08:00
Jiaming Yuan
72892cc80d
[dask] Disable gblinear and dart. ( #6665 )
2021-02-04 09:13:09 +08:00
Jiaming Yuan
9d62b14591
Fix document. [skip ci] ( #6669 )
2021-02-02 20:43:31 +08:00
Jiaming Yuan
411592a347
Enhance inplace prediction. ( #6653 )
...
* Accept array interface for csr and array.
* Accept an optional proxy dmatrix for metainfo.
This constructs an explicit `_ProxyDMatrix` type in Python.
* Remove unused doc.
* Add strict output.
2021-02-02 11:41:46 +08:00
Jiaming Yuan
87ab1ad607
[dask] Accept Future of model for prediction. ( #6650 )
...
This PR changes predict and inplace_predict to accept a Future of model, to avoid sending models to workers repeatably.
* Document is updated to reflect functionality additions in recent changes.
2021-02-02 08:45:52 +08:00
Jiaming Yuan
a9ec0ea6da
Align device id in predict transform with predictor. ( #6662 )
2021-02-02 08:33:29 +08:00
Jiaming Yuan
d8ec7aad5a
[dask] Add a 1 line sample to infer output shape. ( #6645 )
...
* [dask] Use a 1 line sample to infer output shape.
This is for inferring shape with direct prediction (without DaskDMatrix).
There are a few things that requires known output shape before carrying out
actual prediction, including dask meta data, output dataframe columns.
* Infer output shape based on local prediction.
* Remove set param in predict function as it's not thread safe nor necessary as
we now let dask to decide the parallelism.
* Simplify prediction on `DaskDMatrix`.
2021-01-30 18:55:50 +08:00
Jiaming Yuan
c3c8e66fc9
Make prediction functions thread safe. ( #6648 )
2021-01-28 23:29:43 +08:00
Philip Hyunsu Cho
0f2ed21a9d
[Breaking] Change default evaluation metric for binary:logitraw objective to logloss ( #6647 )
2021-01-29 00:12:12 +09:00
Jiaming Yuan
d167892c7e
[dask] Ensure model can be pickled. ( #6651 )
2021-01-28 21:47:57 +08:00
Philip Hyunsu Cho
0ad6e18a2a
[CI] Do not mix up stashed executable built for ARM and x86_64 platforms ( #6646 )
2021-01-27 23:57:26 +09:00
Philip Hyunsu Cho
55ee2bd77f
[CI] Add ARM64 test to Jenkins pipeline ( #6643 )
...
* Add ARM64 test to Jenkins pipeline
* Check for bundled libgomp
* Use a separate test suite for ARM64
* Ensure that x86 jobs don't run on ARM workers
2021-01-27 21:51:17 +09:00
Jiaming Yuan
1b70a323a7
Improve string view to reduce string allocation. ( #6644 )
2021-01-27 19:08:52 +08:00
Jiaming Yuan
bc08e0c9d1
Remove experimental_json_serialization from tests. ( #6640 )
2021-01-27 17:44:49 +08:00
Jiaming Yuan
8968ca7c0a
Disable s390x and arm64 tests on travis for now. ( #6641 )
2021-01-27 16:21:40 +08:00
Jiaming Yuan
d19a0ddacf
Move sdist test to action. ( #6635 )
...
* Move x86 linux and osx sdist test to action.
* Add Windows.
2021-01-26 08:25:59 +08:00
Jiaming Yuan
740d042255
Add base_margin for evaluation dataset. ( #6591 )
...
* Add base margin to evaluation datasets.
* Unify the code base for evaluation matrices.
2021-01-26 02:11:02 +08:00
Jiaming Yuan
4bf23c2391
Specify shape in prediction contrib and interaction. ( #6614 )
2021-01-26 02:08:22 +08:00
Jiaming Yuan
8942c98054
Define metainfo and other parameters for all DMatrix interfaces. ( #6601 )
...
This PR ensures all DMatrix types have a common interface.
* Fix logic in avoiding duplicated DMatrix in sklearn.
* Check for consistency between DMatrix types.
* Add doc for bounds.
2021-01-25 16:06:06 +08:00
Jiaming Yuan
561809200a
Fix document for tree methods. ( #6633 )
2021-01-25 15:52:08 +08:00
Adam Pocock
fec66d033a
[jvm-packages] JVM library loader extensions ( #6630 )
...
* [java] extending the library loader to use both OS and CPU architecture.
* Simplifying create_jni.py's architecture detection.
* Tidying up the architecture detection in create_jni.py
2021-01-25 15:51:39 +08:00
Jiaming Yuan
a275f40267
[dask] Rework base margin test. ( #6627 )
2021-01-22 17:49:13 +08:00
Jiaming Yuan
7bc56fa0ed
Use simple print in tracker print function. ( #6609 )
2021-01-21 21:15:43 +08:00
Jiaming Yuan
26982f9fce
Skip unused CMake argument in setup.py ( #6611 )
2021-01-21 17:25:33 +08:00
Jiaming Yuan
f0fd7629ae
Add helper script and doc for releasing pip package. ( #6613 )
...
* Fix `long_description_content_type`.
2021-01-21 14:46:52 +08:00
Bobby Wang
9d2832a3a3
fix potential TaskFailedListener's callback won't be called ( #6612 )
...
there is possibility that onJobStart of TaskFailedListener won't be called, if
the job is submitted before the other thread adds addSparkListener.
detail can be found at https://github.com/dmlc/xgboost/pull/6019#issuecomment-760937628
2021-01-21 14:20:32 +08:00
Jiaming Yuan
f8bb678c67
Exclude dmlc test on github action. ( #6625 )
2021-01-20 18:50:20 +08:00
Jiaming Yuan
d6d72de339
Revert ntree limit fix ( #6616 )
...
The old (before fix) best_ntree_limit ignores the num_class parameters, which is incorrect. In before we workarounded it in c++ layer to avoid possible breaking changes on other language bindings. But the Python interpretation stayed incorrect. The PR fixed that in Python to consider num_class, but didn't remove the old workaround, so tree calculation in predictor is incorrect, see PredictBatch in CPUPredictor.
2021-01-19 23:51:16 +08:00
Jiaming Yuan
d132933550
Remove type check for solaris. ( #6610 )
2021-01-16 02:58:19 +08:00
Jiaming Yuan
d356b7a071
Restore unknown data support. ( #6595 )
2021-01-14 04:51:16 +08:00
Jiaming Yuan
89a00a5866
[dask] Random forest estimators ( #6602 )
2021-01-13 20:59:20 +08:00
Jiaming Yuan
0027220aa0
[breaking] Remove duplicated predict functions, Fix attributes IO. ( #6593 )
...
* Fix attributes not being restored.
* Rename all `data` to `X`. [breaking]
2021-01-13 16:56:49 +08:00
ShvetsKS
7f4d3a91b9
Multiclass prediction caching for CPU Hist ( #6550 )
...
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
2021-01-13 04:42:07 +08:00
Jiaming Yuan
03cd087da1
Remove duplicated DMatrix. ( #6592 )
2021-01-12 09:36:56 +08:00
Jiaming Yuan
c709f2aaaf
Fix evaluation result for XGBRanker. ( #6594 )
...
* Remove duplicated code, which fixes typo `evals_result` -> `evals_result_`.
2021-01-12 09:36:41 +08:00
Jiaming Yuan
f2f7dd87b8
Use view for SparsePage exclusively. ( #6590 )
2021-01-11 18:04:55 +08:00
Jiaming Yuan
78f2cd83d7
Suppress hypothesis health check for dask client. ( #6589 )
2021-01-11 14:11:57 +08:00
Jiaming Yuan
80065d571e
[dask] Add DaskXGBRanker ( #6576 )
...
* Initial support for distributed LTR using dask.
* Support `qid` in libxgboost.
* Refactor `predict` and `n_features_in_`, `best_[score/iteration/ntree_limit]`
to avoid duplicated code.
* Define `DaskXGBRanker`.
The dask ranker doesn't support group structure, instead it uses query id and
convert to group ptr internally.
2021-01-08 18:35:09 +08:00
Jiaming Yuan
96d3d32265
[dask] Add shap tests. ( #6575 )
2021-01-08 14:59:27 +08:00
Jiaming Yuan
7c9dcbedbc
Fix best_ntree_limit for dart and gblinear. ( #6579 )
2021-01-08 10:05:39 +08:00
Jiaming Yuan
f5ff90cd87
Support _estimator_type. ( #6582 )
...
* Use `_estimator_type`.
For more info, see: https://scikit-learn.org/stable/developers/develop.html#estimator-types
* Model trained from dask can be loaded by single node skl interface.
2021-01-08 10:01:16 +08:00
Jiaming Yuan
8747885a8b
Support Solaris. ( #6578 )
...
* Add system header.
* Remove use of TR1 on Solaris
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2021-01-07 09:05:05 +08:00
TP Boudreau
b2246ae7ef
Update dmlc-core submodule and conform to new API ( #6431 )
...
* Update dmlc-core submodule and conform to new API
* Remove unsupported parameter from method signature
* Update dmlc-core submodule and conform to new API
* Update dmlc-core
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2021-01-05 16:12:22 -08:00
Jiaming Yuan
60cfd14349
[dask, sklearn] Fix predict proba. ( #6566 )
...
* For sklearn:
- Handles user defined objective function.
- Handles `softmax`.
* For dask:
- Use the implementation from sklearn, the previous implementation doesn't perform any extra handling.
2021-01-05 08:29:06 +08:00
Jiaming Yuan
516a93d25c
Fix best_ntree_limit. ( #6569 )
2021-01-03 05:58:54 +08:00
James Lamb
195a41cef1
[python-package] remove unnecessary files to reduce sdist size ( fixes #6560 ) ( #6565 )
2021-01-02 15:56:39 +08:00
Jiaming Yuan
2b049b32e9
Document various tree methods. ( #6564 )
2021-01-02 15:40:46 +08:00
Philip Hyunsu Cho
fa13992264
Calling XGBModel.fit() should clear the Booster by default ( #6562 )
...
* Calling XGBModel.fit() should clear the Booster by default
* Document the behavior of fit()
* Allow sklearn object to be passed in directly via xgb_model argument
* Fix lint
2020-12-31 11:02:08 -08:00
Jiaming Yuan
5e9e525223
Remove warnings in tests. ( #6554 )
2020-12-31 13:41:18 +08:00
James Lamb
8ad22bf4e7
Add credentials to .gitignore ( #6559 )
2020-12-30 15:58:14 -08:00
Jiaming Yuan
de8fd852a5
[dask] Add type hints. ( #6519 )
...
* Add validate_features.
* Show type hints in doc.
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-12-29 19:41:02 +08:00
Jiaming Yuan
610ee632cc
[Breaking] Rename data to X in predict_proba. ( #6555 )
...
New Scikit-Learn version uses keyword argument, and `X` is the predefined
keyword.
* Use pip to install latest Python graphviz on Windows CI.
2020-12-28 21:36:03 +08:00
Jiaming Yuan
cb207a355d
Add script for generating release tarball. ( #6544 )
2020-12-23 16:08:10 +08:00
Gorkem Ozkaya
2231940d1d
Clip small positive values in gamma-nloglik ( #6537 )
...
For the `gamma-nloglik` eval metric, small positive values in the labels are causing `NaN`'s in the outputs, as reported here: https://github.com/dmlc/xgboost/issues/5349 . This will add clipping on them, similar to what is done in other metrics like `poisson-nloglik` and `logloss`.
2020-12-22 03:11:40 +08:00
MBSMachineLearning
95cbfad990
"featue_map" typo changed to "feature_map" ( #6540 )
2020-12-21 22:11:11 +08:00
Philip Hyunsu Cho
fbb980d9d3
Expand ~ into the home directory on Linux and MacOS ( #6531 )
2020-12-19 23:35:13 -08:00
Philip Hyunsu Cho
cd0821500c
Add Saturn Cloud Dask XGBoost tutorial to Awesome XGBoost [skip ci] ( #6532 )
2020-12-19 15:57:05 -08:00
Philip Hyunsu Cho
380f6f4ab8
Remove cupy.array_equal, since it's not compatible with cuPy 7.8 ( #6528 )
2020-12-18 09:16:52 -08:00
Jiaming Yuan
ca3da55de4
Support early stopping with training continuation, correct num boosted rounds. ( #6506 )
...
* Implement early stopping with training continuation.
* Add new C API for obtaining boosted rounds.
* Fix off by 1 in `save_best`.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-12-17 19:59:19 +08:00
Philip Hyunsu Cho
125b3c0f2d
Lazy import cuDF and Dask ( #6522 )
...
* Lazy import cuDF
* Lazy import Dask
Co-authored-by: PSEUDOTENSOR / Jonathan McKinney <pseudotensor@gmail.com >
* Fix lint
Co-authored-by: PSEUDOTENSOR / Jonathan McKinney <pseudotensor@gmail.com >
2020-12-17 01:51:35 -08:00
Philip Hyunsu Cho
ad1a527709
Enable loading model from <1.0.0 trained with objective='binary:logitraw' ( #6517 )
...
* Enable loading model from <1.0.0 trained with objective='binary:logitraw'
* Add binary:logitraw in model compatibility testing suite
* Feedback from @trivialfis: Override ProbToMargin() for LogisticRaw
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2020-12-16 16:53:46 -08:00
Philip Hyunsu Cho
bf6cfe3b99
[Breaking] Upgrade cuDF and RMM to 0.18 nightlies; require RMM 0.18+ for RMM plugin ( #6510 )
...
* [CI] Upgrade cuDF and RMM to 0.18 nightlies
* Modify RMM plugin to be compatible with RMM 0.18
* Update src/common/device_helpers.cuh
Co-authored-by: Mark Harris <mharris@nvidia.com >
Co-authored-by: Mark Harris <mharris@nvidia.com >
2020-12-16 10:07:52 -08:00
Jiaming Yuan
d8d684538c
[CI] Split up main.yml, add mypy. ( #6515 )
2020-12-17 00:15:44 +08:00
Jiaming Yuan
c5876277a8
Drop saving binary format for memory snapshot. ( #6513 )
2020-12-17 00:14:57 +08:00
Jiaming Yuan
0e97d97d50
Fix merge conflict. ( #6512 )
2020-12-16 18:02:25 +08:00
hzy001
749364f25d
Update the C API comments ( #6457 )
...
Signed-off-by: Hao Ziyu <haoziyu@qiyi.com >
Co-authored-by: Hao Ziyu <haoziyu@qiyi.com >
2020-12-16 14:56:13 +08:00
Jiaming Yuan
347f593169
Accept numpy array for DMatrix slice index. ( #6368 )
2020-12-16 14:42:52 +08:00
Jiaming Yuan
ef4a0e0aac
Fix DMatrix feature names/types IO. ( #6507 )
...
* Fix feature names/types IO
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-12-16 14:24:27 +08:00
Jiaming Yuan
886486a519
Support categorical data in GPU weighted sketching. ( #6508 )
2020-12-16 14:23:28 +08:00
Igor Rukhovich
5c8ccf4455
Improved InitSampling function speed by 2.12 times ( #6410 )
...
* Improved InitSampling function speed by 2.12 times
* Added explicit conversion
2020-12-15 20:59:24 -08:00
Jiaming Yuan
3c3f026ec1
Move metric configuration into booster. ( #6504 )
2020-12-16 05:35:04 +08:00
Jiaming Yuan
d45c0d843b
Show partition status in dask error. ( #6366 )
2020-12-16 02:58:21 +08:00
James Lamb
1e2c3ade9e
[doc] [dask] Add example on early stopping with Dask ( #6501 )
...
Co-authored-by: fis <jm.yuan@outlook.com >
2020-12-15 22:23:23 +08:00
ShvetsKS
8139849ab6
Fix handling of print period in EvaluationMonitor ( #6499 )
...
Co-authored-by: Kirill Shvets <kirill.shvets@intel.com >
2020-12-15 19:20:19 +08:00
Philip Hyunsu Cho
9a194273cd
Add conda-forge badge ( #6502 )
2020-12-14 18:58:03 -08:00
Philip Hyunsu Cho
aac4eba2ef
Add release note for 1.3.0 in NEWS.md ( #6495 )
...
* Add release note for 1.3.0
* Address reviewer's comment
* Fix silly mistake
* Apply suggestions from code review
Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com >
Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com >
2020-12-14 14:42:30 -08:00
James Lamb
afc4567268
[doc] [dask] fix partitioning in Dask example ( #6389 )
2020-12-14 18:37:49 +08:00
Jiaming Yuan
a30461cf87
[dask] Support all parameters in regressor and classifier. ( #6471 )
...
* Add eval_metric.
* Add callback.
* Add feature weights.
* Add custom objective.
2020-12-14 07:35:56 +08:00
Philip Hyunsu Cho
c31e3efa7c
Pass correct split_type to GPU predictor ( #6491 )
...
* Pass correct split_type to GPU predictor
* Add a test
2020-12-11 19:30:00 -08:00
Philip Hyunsu Cho
0d483cb7c1
Bump version to 1.4.0 snapshot in master ( #6486 )
2020-12-10 07:38:08 -08:00
Philip Hyunsu Cho
b8044e6136
[CI] Use manylinux2010_x86_64 container to vendor libgomp ( #6485 )
2020-12-10 07:37:15 -08:00
Jiaming Yuan
0ffaf0f5be
Fix dask ip resolution. ( #6475 )
...
This adopts the solution used in dask/dask-xgboost#40 which employs the get_host_ip from dmlc-core tracker.
2020-12-07 16:36:23 -08:00
Jiaming Yuan
47b86180f6
Don't validate feature when number of rows is 0. ( #6472 )
2020-12-07 18:08:51 +08:00
Philip Hyunsu Cho
55bdf084cb
[Doc] Document that AUC and AUCPR are for binary classification/ranking [skip ci] ( #5899 )
2020-12-06 22:17:20 -08:00
Jiaming Yuan
703c2d06aa
Fix global config default value. ( #6470 )
2020-12-06 06:15:33 +08:00
Jiaming Yuan
d6386e45e8
Fix filtering callable objects in skl xgb param. ( #6466 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-12-05 17:20:36 +08:00
Philip Hyunsu Cho
05e5563c2c
[CI] Fix CentOS 6 Docker images ( #6467 )
2020-12-04 21:33:11 -08:00
Philip Hyunsu Cho
84b726ef53
Vendor libgomp in the manylinux Python wheel ( #6461 )
...
* Vendor libgomp in the manylinux2014_aarch64 wheel
* Use vault repo, since CentOS 6 has reached End-of-Life on Nov 30
* Vendor libgomp in the manylinux2010_x86_64 wheel
* Run verification step inside the container
2020-12-03 19:55:32 -08:00
Philip Hyunsu Cho
c103ec51d8
Enforce row-major order in cuPy array ( #6459 )
2020-12-03 18:29:10 -08:00
Philip Hyunsu Cho
4f70e14031
Fix docstring of config.py to use correct versionadded ( #6458 )
2020-12-03 10:41:53 -08:00
Philip Hyunsu Cho
fb56da5e8b
Add global configuration ( #6414 )
...
* Add management functions for global configuration: XGBSetGlobalConfig(), XGBGetGlobalConfig().
* Add Python interface: set_config(), get_config(), and config_context().
* Add unit tests for Python
* Add R interface: xgb.set.config(), xgb.get.config()
* Add unit tests for R
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2020-12-03 00:05:18 -08:00
hzy001
c2ba4fb957
Fix broken links. ( #6455 )
...
Co-authored-by: Hao Ziyu <haoziyu@qiyi.com >
Co-authored-by: fis <jm.yuan@outlook.com >
2020-12-02 17:39:12 +08:00
Jiaming Yuan
927c316aeb
Fix period in evaluation monitor. ( #6441 )
2020-11-29 03:18:33 +08:00
Jiaming Yuan
f4ff1c53fd
Fix CLI ranking demo. ( #6439 )
...
Save model at final round.
2020-11-29 03:12:06 +08:00
Honza Sterba
b0036b339b
Optionaly fail when gpu_id is set to invalid value ( #6342 )
2020-11-28 15:14:12 +08:00
ShvetsKS
956beead70
Thread local memory allocation for BuildHist ( #6358 )
...
* thread mem locality
* fix apply
* cleanup
* fix lint
* fix tests
* simple try
* fix
* fix
* apply comments
* fix comments
* fix
* apply simple comment
Co-authored-by: ShvetsKS <kirill.shvets@intel.com >
2020-11-25 17:50:12 +03:00
Philip Hyunsu Cho
4dbbeb635d
[CI] Upgrade cuDF and RMM to 0.17 nightlies ( #6434 )
2020-11-24 13:21:41 -08:00
Philip Hyunsu Cho
0c85b90671
[R] Fix R package installation via CMake ( #6423 )
2020-11-22 05:49:09 -08:00
Jiaming Yuan
42d31d9dcb
Fix MPI build. ( #6403 )
2020-11-21 13:38:21 +08:00
Jiaming Yuan
2ce2a1a4d8
[SKL] Propagate parameters to booster during set_param. ( #6416 )
2020-11-20 20:37:35 +08:00
zhang_jf
cc581b3b6b
Misleading exception information: no such param of "allow_non_zero_missing" ( #6418 )
2020-11-20 19:33:34 +08:00
Jiaming Yuan
00218d065a
[dask] Update document. [skip ci] ( #6413 )
2020-11-20 19:16:19 +08:00
Jiaming Yuan
c120822a24
Fix flaky sparse page dmatrix test. ( #6417 )
2020-11-20 19:15:45 +08:00
Jiaming Yuan
a7b42adb74
Fix dask predict ( #6412 )
2020-11-20 10:10:52 +08:00
Jiaming Yuan
44a9d69efb
Small cleanup to evaluator. ( #6400 )
2020-11-20 09:33:51 +08:00
Philip Hyunsu Cho
9c9070aea2
Use pytest conventions consistently ( #6337 )
...
* Do not derive from unittest.TestCase (not needed for pytest)
* assertRaises -> pytest.raises
* Simplify test_empty_dmatrix with test parametrization
* setUpClass -> setup_class, tearDownClass -> teardown_class
* Don't import unittest; import pytest
* Use plain assert
* Use parametrized tests in more places
* Fix test_gpu_with_sklearn.py
* Put back run_empty_dmatrix_reg / run_empty_dmatrix_cls
* Fix test_eta_decay_gpu_hist
* Add parametrized tests for monotone constraints
* Fix test names
* Remove test parametrization
* Revise test_slice to be not flaky
2020-11-19 17:00:15 -08:00
Philip Hyunsu Cho
c763b50dd0
[CI] Upgrade to MacOS Mojave image ( #6406 )
2020-11-18 20:29:10 -08:00
Nan Zhu
4d1d5d4010
[jvm-packages] fix potential unit test suites aborted issue ( #6373 )
...
* fix race conditio
* code cleaning
rm pom.xml-e
* clean again
* fix compilation issue
* recover
* avoid using getOrCreate
* interrupt zombie threads
* safe guard
* fix deadlock
* Update SparkParallelismTracker.scala
2020-11-17 10:59:26 -08:00
Philip Hyunsu Cho
e426b6e040
[R] Do not convert continuous labels to factors ( #6380 )
...
* [R] Do not convert continuous labels to factors
* Address reviewer's comment
2020-11-17 09:19:16 -08:00
James Lamb
3cca1c5fa1
[R] remove uses of exists() ( #6387 )
2020-11-17 15:06:23 +08:00
Jiaming Yuan
3ac173fc8b
Fix typo. ( #6399 )
2020-11-16 16:59:12 -08:00
Nikhil Choudhary
ae1662028a
Fixed few grammatical mistakes in doc ( #6393 )
2020-11-15 13:48:08 +08:00
Philip Hyunsu Cho
5cb24d0d39
Fix broken link in CLI doc ( #6396 )
2020-11-14 17:58:07 -08:00
ShvetsKS
512b464cfa
Disable HT for DMatrix creation ( #6386 )
...
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com >
2020-11-14 22:18:33 +08:00
Jiaming Yuan
fcd6fad822
[dask] Small cleanup. ( #6391 )
2020-11-14 22:15:05 +08:00
Jiaming Yuan
4ccf92ea34
[dask] Fix union of workers. ( #6375 )
2020-11-13 16:55:05 +08:00
Jiaming Yuan
fcfeb4959c
Deprecate positional arguments. ( #6365 )
...
Deprecate positional arguments in following functions:
- `__init__` for all classes in sklearn module.
- `fit` method for all classes in sklearn module.
- dask interface.
- `set_info` for `DMatrix` class.
Refactor the evaluation matrices handling.
2020-11-13 11:10:30 +08:00
Philip Hyunsu Cho
e5193c21a1
[dask] Allow empty data matrix in AFT survival ( #6379 )
...
* [dask] Allow empty data matrix in AFT survival
* Add unit test
2020-11-12 17:49:58 -08:00
Philip Hyunsu Cho
5a33c2f3a0
[CI] Add noLD R test ( #6382 )
...
* [CI] Add noLD test
* Make noLD test only trigger with a PR comment
* [CI] Don't install stringi
* Add the Titanic example as a unit test
* Document trigger
* add to index
* Clarify that it needs to be a review comment
2020-11-12 12:41:25 -08:00
Jiaming Yuan
c1a62b5fa2
Expect gpu external memory to fail. ( #6381 )
2020-11-12 19:24:48 +08:00
Jiaming Yuan
c90f968d92
Update Python documents. ( #6376 )
2020-11-12 17:51:32 +08:00
Philip Hyunsu Cho
c5645180a6
[R] Fix a crash that occurs with noLD R ( #6378 )
2020-11-11 21:09:08 -08:00
James Lamb
12d27f43ff
[doc] make Dask distributed example copy-pastable ( #6345 )
2020-11-11 20:22:17 -08:00
Jiaming Yuan
d711d648cb
Fix label errors in graph visualization ( #6369 )
2020-11-11 17:44:59 -08:00
Jiaming Yuan
debeae2509
[R] Fix warnings from R check --as-cran ( #6374 )
...
* Remove exit and printf.
* Fix warnings.
2020-11-11 18:39:37 +08:00
Jiaming Yuan
6e12c2a6f8
[dask] Supoort running on GKE. ( #6343 )
...
* Avoid accessing `scheduler_info()['workers']`.
* Avoid calling `client.gather` inside task.
* Avoid using `client.scheduler_address`.
2020-11-11 18:04:34 +08:00
Jiaming Yuan
8a17610666
Implement GPU predict leaf. ( #6187 )
2020-11-11 17:33:47 +08:00
Philip Hyunsu Cho
7f101d1b33
[CI] Remove R check from Jenkins ( #6372 )
...
* Remove R check from Jenkins
* Print stacktrace when CRAN test fail in GitHub Actions
* Add verbose flag in tests/ci_build/print_r_stacktrace.sh
* Fix path in tests/ci_build/print_r_stacktrace.sh
2020-11-10 22:46:54 -08:00
Jiaming Yuan
a5cfa7841e
Run R check as cran on action. [skip ci] ( #6371 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-11-11 12:02:53 +08:00
Jiaming Yuan
43efadea2e
Deterministic data partitioning for external memory ( #6317 )
...
* Make external memory data partitioning deterministic.
* Change the meaning of `page_size` from bytes to number of rows.
* Design a data pool.
* Note for external memory.
* Enable unity build on Windows CI.
* Force garbage collect on test.
2020-11-11 06:11:06 +08:00
Jean Lescut-Muller
9564886d9f
Update custom_metric_obj.rst ( #6367 )
2020-11-10 22:29:22 +08:00
Jiaming Yuan
e65e3cf36e
Support shared library in system path. ( #6362 )
2020-11-10 16:04:25 +08:00
Jiaming Yuan
184e2eac7d
Add period to evaluation monitor. ( #6348 )
2020-11-10 07:47:48 +08:00
ShvetsKS
d411f98d26
simple fix for static shedule in predict ( #6357 )
...
Co-authored-by: ShvetsKS <kirill.shvets@intel.com >
2020-11-09 17:01:30 +08:00
Jiaming Yuan
519cee115a
Avoid resetting seed for every configuration. ( #6349 )
2020-11-06 10:28:35 +08:00
James Lamb
f3a4253984
Ignore files from local Dask development ( #6346 )
2020-11-05 13:54:46 +08:00
Jack Dunn
51e6531315
Fix missing space in warning message ( #6340 )
2020-11-04 06:03:16 -05:00
Jiaming Yuan
2cc9662005
Support slicing tree model ( #6302 )
...
This PR is meant the end the confusion around best_ntree_limit and unify model slicing. We have multi-class and random forests, asking users to understand how to set ntree_limit is difficult and error prone.
* Implement the save_best option in early stopping.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-11-02 23:27:39 -08:00
Rory Mitchell
29745c6df2
Fix inclusive scan for large sizes ( #6234 )
2020-11-03 17:01:43 +13:00
Jiaming Yuan
7756192906
[dask] Fix prediction on DaskDMatrix with multiple meta data. ( #6333 )
...
* Unify the meta handling methods.
2020-11-02 19:18:44 -05:00
Jiaming Yuan
5a7b3592ed
Optional find_package for sanitizers. ( #6329 )
2020-11-02 19:17:17 -05:00
Jiaming Yuan
048acf81cd
Enable shap sparse test. ( #6332 )
2020-11-01 20:59:27 +08:00
Igor Moura
5e1e972aea
Clean up warnings ( #6325 )
2020-10-30 23:50:29 +08:00
nabokovas
f0fe18fc28
Add a new github actions badge ( #6321 )
2020-10-30 17:57:21 +08:00
Jiaming Yuan
6ff331b705
Fix Python callback. ( #6320 )
2020-10-30 05:03:44 +08:00
Sergio Gavilán
b181a88f9f
Reduced some C++ compiler warnings ( #6197 )
...
* Removed some warnings
* Rebase with master
* Solved C++ Google Tests errors made by refactoring in order to remove warnings
* Undo renaming path -> path_
* Fix style check
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-29 12:36:00 -07:00
Jiaming Yuan
c80657b542
Fix flaky data initialization test. ( #6318 )
2020-10-30 03:11:22 +08:00
Naveed Ahmed Saleem Janvekar
608bda7052
[jvm-packages] add example to handle missing value other than 0 ( #5677 )
...
add example to handle missing value other than 0 under Dealing with missing values section
2020-10-28 17:24:35 -07:00
Jiaming Yuan
74ea82209b
Lazy import dask libraries. ( #6309 )
...
* Lazy import dask libraries.
* Lint && fix.
* Use short name.
2020-10-28 15:50:11 -07:00
Jiaming Yuan
dfac5f89e9
Group CLI demo into subdirectory. ( #6258 )
...
CLI is not most developed interface. Putting them into correct directory can help new users to avoid it as most of the use cases are from a language binding.
2020-10-28 14:40:44 -07:00
James Lamb
6383757dca
[R] allow xgb.plot.importance() calls to fill a grid ( #6294 )
2020-10-28 14:37:28 -07:00
Tanuja Kirthi Doddapaneni
d261ba029a
Added USE_NCCL_LIB_PATH option to enable user to set NCCL_LIBRARY during build ( #6310 )
...
Description: To enable user to set NCCL_LIBRARY during build
2020-10-28 14:36:31 -07:00
vcarpani
671971e12e
Compiler warnings ( #6286 )
...
* Fix warnings for json.h
* Fix warnings for metric.h
* Fix warnings for updater_quantile_hist.cc.
* Fix warnings for updater_histmaker.cc.
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-28 13:46:15 -07:00
Jiaming Yuan
e8884c4637
Document tree method for feature weights. ( #6312 )
2020-10-28 13:42:13 -07:00
Philip Hyunsu Cho
143b278267
Mark flaky tests as XFAIL ( #6299 )
...
* Temporarily skip TestGPUUpdaters::test_categorical
* Temporarily skip test_boost_from_prediction[approx]
2020-10-28 11:50:57 -07:00
Jiaming Yuan
c4da967b5c
Support unity build. ( #6295 )
...
* Support unity build.
* Setup on Windows Jenkins.
* Revert "Setup on Windows Jenkins."
This reverts commit 8345cb8d2b009eec8ae9fa6f16412a7c9b6ec12c.
2020-10-28 11:49:28 -07:00
Philip Hyunsu Cho
f6169c0b16
[CI] Use separate Docker cache for each CUDA version ( #6305 )
2020-10-28 11:07:00 -07:00
Jiaming Yuan
3310e208fd
Fix inplace prediction interval. ( #6259 )
...
* Add back the interval in call.
* Make the interval non-optional.
2020-10-28 13:13:59 +08:00
Jiaming Yuan
cc76724762
Reduce warning. ( #6273 )
2020-10-27 12:24:19 -07:00
DIVYA CHAUHAN
4e9c4f2d73
Create a tutorial for using the C API in a C/C++ application ( #6285 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-27 12:19:20 -07:00
James Lamb
e1de390e6e
[ci] replace 'egrep' with 'grep -E' ( #6287 )
2020-10-27 12:05:48 -07:00
Rory Mitchell
f0c3ff313f
Update GPUTreeShap, add docs ( #6281 )
...
* Update GPUTreeShap, add docs
* Fix test
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-27 18:22:12 +13:00
Jiaming Yuan
b180223d18
Cleanup RABIT. ( #6290 )
...
* Remove recovery and MPI speed tests.
* Remove readme.
* Remove Python binding.
* Add checks in C API.
2020-10-27 08:48:22 +08:00
Akira Funahashi
8e0f5a6fc7
Update plugin instructions for CMake build ( #6289 )
2020-10-26 17:42:07 -07:00
Philip Hyunsu Cho
c8ec62103a
Deprecate LabelEncoder in XGBClassifier; Enable cuDF/cuPy inputs in XGBClassifier ( #6269 )
...
* Deprecate LabelEncoder in XGBClassifier; skip LabelEncoder for cuDF/cuPy inputs
* Add unit tests for cuDF and cuPy inputs with XGBClassifier
* Fix lint
* Clarify warning
* Move use_label_encoder option to XGBClassifier constructor
* Add a test for cudf.Series
* Add use_label_encoder to XGBRFClassifier doc
* Address reviewer feedback
2020-10-26 13:20:51 -07:00
Jiaming Yuan
bcfab4d726
Revert "Disable JSON full serialization for now. ( #6248 )" ( #6266 )
...
This reverts commit 6d293020fb .
2020-10-27 03:30:47 +08:00
Jiaming Yuan
d61b628bf5
Remove RABIT CMake targets. ( #6275 )
...
* Now it's built as part of libxgboost.
* Set correct C API error in RABIT initialization and finalization.
* Remove redundant message.
* Guard the tracker print C API.
2020-10-27 01:30:20 +08:00
Jiaming Yuan
2686d32a36
Skip dask tests on ARM. ( #6267 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-26 15:09:05 +08:00
Philip Hyunsu Cho
677f676172
Use UserWarning for old callback, as DeprecationWarning is not visible ( #6270 )
2020-10-22 01:10:52 -07:00
Philip Hyunsu Cho
1300467d36
Fix a typo in is_arm() in testing.py [skip ci] ( #6271 )
2020-10-22 13:07:14 +08:00
Jiaming Yuan
b5c2a47b20
Drop single point model recovery ( #6262 )
...
* Pass rabit params in JVM package.
* Implement timeout using poll timeout parameter.
* Remove OOB data check.
2020-10-21 15:27:03 +08:00
Jiaming Yuan
81c37c28d5
Time the CPU tests on Jenkins. ( #6257 )
...
* Time the CPU tests on Jenkins.
* Reduce thread contention.
* Add doc.
* Skip heavy tests on ARM.
2020-10-20 17:19:07 -07:00
Igor Moura
d1254808d5
Clean up C++ warnings ( #6213 )
2020-10-19 23:02:33 +08:00
Jiaming Yuan
ddf37cca30
Unify thread configuration. ( #6186 )
2020-10-19 16:05:42 +08:00
Philip Hyunsu Cho
7f6ed5780c
[CI] Build a Python wheel for aarch64 platform ( #6253 )
2020-10-18 22:35:19 -07:00
Jiaming Yuan
5037abeb86
Fix linear gpu input ( #6255 )
2020-10-19 12:02:36 +08:00
Yuan Tang
cdcdab98b8
Add sponsors link to FUNDING.yml ( #6252 )
2020-10-18 19:17:11 -07:00
Philip Hyunsu Cho
65ea42bd42
[CI] Reduce testing load with RMM ( #6249 )
...
* [CI] Reduce testing load with RMM
* Address reviewer's comment
2020-10-18 19:16:46 -07:00
Manikya Bardhan
549f361b71
Updated winning solutions list ( #6254 )
2020-10-19 04:06:48 +08:00
Jiaming Yuan
6d293020fb
Disable JSON full serialization for now. ( #6248 )
...
* Disable JSON serialization for now.
* Multi-class classification is checkpointing for each iteration.
This brings significant overhead.
Revert: 90355b4f00
* Set R tests to use binary.
2020-10-16 17:59:54 +08:00
Jiaming Yuan
52452bebb9
Fix cls typo. ( #6247 )
2020-10-16 16:40:44 +08:00
Yuan Tang
3098d7cee0
Add link to XGBoost's Twitter handle ( #6244 )
2020-10-15 16:54:34 -07:00
Jiaming Yuan
3da5a69dc9
Fix typo in dask interface. ( #6240 )
2020-10-15 15:26:29 +08:00
dependabot[bot]
06e453ddf4
Bump junit from 4.11 to 4.13.1 in /jvm-packages/xgboost4j ( #6230 )
...
Bumps [junit](https://github.com/junit-team/junit4 ) from 4.11 to 4.13.1.
- [Release notes](https://github.com/junit-team/junit4/releases )
- [Changelog](https://github.com/junit-team/junit4/blob/main/doc/ReleaseNotes4.11.md )
- [Commits](https://github.com/junit-team/junit4/compare/r4.11...r4.13.1 )
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2020-10-13 19:46:19 -07:00
dependabot[bot]
b51a717deb
Bump junit from 4.11 to 4.13.1 in /jvm-packages/xgboost4j-gpu ( #6233 )
...
Bumps [junit](https://github.com/junit-team/junit4 ) from 4.11 to 4.13.1.
- [Release notes](https://github.com/junit-team/junit4/releases )
- [Changelog](https://github.com/junit-team/junit4/blob/main/doc/ReleaseNotes4.11.md )
- [Commits](https://github.com/junit-team/junit4/compare/r4.11...r4.13.1 )
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2020-10-13 19:44:56 -07:00
Jiaming Yuan
bed7ae4083
Loop over thrust::reduce. ( #6229 )
...
* Check input chunk size of dqdm.
* Add doc for current limitation.
2020-10-14 10:40:56 +13:00
Rory Mitchell
734a911a26
Loop over copy_if ( #6201 )
...
* Loop over copy_if
* Catch OOM.
Co-authored-by: fis <jm.yuan@outlook.com >
2020-10-14 10:23:16 +13:00
Wittty-Panda
0fc263ead5
Update the list of winning solutions ( #6222 )
2020-10-13 20:05:12 +08:00
Jiaming Yuan
b05073bda5
[dask] Test for data initializaton. ( #6226 )
2020-10-13 11:08:35 +08:00
Jiaming Yuan
2443275891
Cleanup Python code. ( #6223 )
...
* Remove pathlike as XGBoost 1.2 requires Python 3.6.
* Move conditional import of dask/distributed into dask module.
2020-10-12 15:44:41 +08:00
Jiaming Yuan
70c2039748
Catch all standard exceptions in C API. ( #6220 )
...
* `std::bad_alloc` is not guaranteed to be caught.
2020-10-12 14:01:46 +08:00
Jiaming Yuan
2241563f23
Handle duplicated values in sketching. ( #6178 )
...
* Accumulate weights in duplicated values.
* Fix device id in iterative dmatrix.
2020-10-10 19:32:44 +08:00
Jiaming Yuan
ab5b35134f
Rework Python callback functions. ( #6199 )
...
* Define a new callback interface for Python.
* Deprecate the old callbacks.
* Enable early stopping on dask.
2020-10-10 17:52:36 +08:00
Jiaming Yuan
b5b24354b8
More categorical tests and disable shap sparse test. ( #6219 )
...
* Fix tree load with 32 category.
2020-10-10 16:12:37 +08:00
Philip Hyunsu Cho
c991eb612d
[jvm-packages] Fix up build for xgboost4j-gpu, xgboost4j-spark-gpu ( #6216 )
...
* [CI] Clean up build for JVM packages
* Use correct path for saving native lib
* Fix groupId of maven-surefire-plugin
* Fix stashing of xgboost4j_jar_gpu
* [CI] Don't run xgboost4j-tester with GPU, since it doesn't use gpu_hist
2020-10-09 14:08:15 -07:00
Jiaming Yuan
70ce5216b5
Add high level tests for categorical data. ( #6179 )
...
* Fix unique.
2020-10-09 09:27:23 +08:00
vcarpani
6bc9747df5
Reduce compile warnings ( #6198 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-08 23:14:59 +08:00
ShvetsKS
a4ce0eae43
CPU predict performance improvement ( #6127 )
...
Co-authored-by: ShvetsKS <kirill.shvets@intel.com >
2020-10-08 15:50:21 +03:00
Jiaming Yuan
4cfdcaaf7b
Move non-OpenMP gtest to GitHub Actions ( #6210 )
2020-10-08 00:58:21 -07:00
Jiaming Yuan
ddc4f20e54
Add JSON schema for categorical splits. ( #6194 )
2020-10-07 17:33:31 +08:00
odidev
a2fea33103
Added arm64 job in Travis-CI ( #6200 )
...
Signed-off-by: odidev <odidev@puresoftware.com >
2020-10-07 15:02:09 +08:00
Igor Moura
5908598666
[Doc] Add info on GPU compiler ( #6204 )
...
* Add note about the required compiler version for CUDA.
* Also added a link that gives a short explanation on compute capability version
2020-10-06 11:35:18 +08:00
Yuan Tang
1013224888
Consistent style for build status badge ( #6203 )
2020-10-05 18:23:21 -07:00
Philip Hyunsu Cho
f121f2738f
[CI] Fix Docker build for CUDA 11 ( #6202 )
2020-10-05 17:54:14 -07:00
Jiaming Yuan
fd58005edf
Ignore cachedir by joblib. [skip ci] ( #6193 )
2020-10-04 14:54:32 +08:00
DIVYA CHAUHAN
750bd0ae9a
Update the list of winning solutions using XGBoost ( #6192 )
...
Co-authored-by: divya <divyachauhan661@gmail.com >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-03 13:39:58 -07:00
Christian Lorentzen
cf4f019ed6
[Breaking] Change default evaluation metric for classification to logloss / mlogloss ( #6183 )
...
* Change DefaultEvalMetric of classification from error to logloss
* Change default binary metric in plugin/example/custom_obj.cc
* Set old error metric in python tests
* Set old error metric in R tests
* Fix missed eval metrics and typos in R tests
* Fix setting eval_metric twice in R tests
* Add warning for empty eval_metric for classification
* Fix Dask tests
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-10-02 12:06:47 -07:00
John Quitto-Graham
e0e4f15d0e
Fix a comment in demo to use correct reference ( #6190 )
...
Co-authored-by: John Quitto Graham <johnq@dgx07.aselab.nvidia.com >
2020-10-01 13:16:04 -07:00
Philip Hyunsu Cho
eb7946ff25
Hide C++ symbols from dmlc-core ( #6188 )
2020-10-01 10:07:13 -07:00
lacrosse91
6bc41df2fe
[Doc] Add list of winning solutions in data science competitions using XGBoost ( #6177 )
2020-09-30 14:41:29 -07:00
Jiaming Yuan
f0c63902ff
Use default allocator in sketching. ( #6182 )
2020-09-30 14:55:59 +08:00
Jiaming Yuan
444131a2e6
Add categorical data support to GPU Hist. ( #6164 )
2020-09-29 11:27:25 +08:00
Jiaming Yuan
798af22ff4
Add categorical data support to GPU predictor. ( #6165 )
2020-09-29 11:25:34 +08:00
Jiaming Yuan
7622b8cdb8
Enable categorical data support on Python DMatrix. ( #6166 )
...
* Only pandas is recognized.
2020-09-29 11:22:56 +08:00
Jiaming Yuan
52c0b3f100
Fix error message. ( #6176 )
2020-09-29 11:18:25 +08:00
Rory Mitchell
dda9e1e487
Update GPUTreeshap ( #6163 )
...
* Reduce shap test duration
* Test interoperability with shap package
* Add feature interactions
* Update GPUTreeShap
2020-09-28 09:43:47 +13:00
Jiaming Yuan
434a3f35a3
Add TAGS to gitignore. [skip ci] ( #6175 )
2020-09-27 21:27:40 +08:00
Jiaming Yuan
07355599c2
Option for generating device debug info. ( #6168 )
...
* Supply `-G;-src-in-ptx` when `USE_DEVICE_DEBUG` is set and debug mode is selected.
* Refactor CMake script to gather all CUDA configuration.
* Use CMAKE_CUDA_ARCHITECTURES. Close #6029 .
* Add compute 80. Close #5999
2020-09-27 03:26:56 +08:00
Kyle Nicholson
e6a238c020
Update base margin dask ( #6155 )
...
* Add `base-margin`
* Add `output_margin` to regressor.
Co-authored-by: fis <jm.yuan@outlook.com >
2020-09-26 21:30:52 +08:00
Alexander Gugel
03b8fdec74
Add DMatrix usage examples to c-api-demo ( #5854 )
...
* Add DMatrix usage examples to c-api-demo
* Add XGDMatrixCreateFromCSREx example
* Add XGDMatrixCreateFromCSCEx example
2020-09-26 02:10:12 -07:00
Philip Hyunsu Cho
2c4dedb7a0
[CI] Test C API demo ( #6159 )
...
* Fix CMake install config to use dependencies
* [CI] Test C API demo
* Explicitly cast num_feature, to avoid warning in Linux
2020-09-25 14:49:01 -07:00
Philip Hyunsu Cho
bd2b1eabd0
Add back support for scipy.sparse.coo_matrix ( #6162 )
2020-09-25 00:49:49 -07:00
Philip Hyunsu Cho
72ef553550
Fall back to CUB allocator if RMM memory pool is not set up ( #6150 )
...
* Fall back to CUB allocator if RMM memory pool is not set up
* Fix build
* Prevent memory leak
* Add note about lack of memory initialisation
* Add check for other fast allocators
* Set use_cub_allocator_ to true when RMM is not enabled
* Fix clang-tidy
* Do not demangle symbol; add check to ensure Linux+Clang/GCC combo
2020-09-24 11:04:50 -07:00
Zeno Gantner
5b05f88ba9
Cosmetic fixes in faq.rst ( #6161 )
2020-09-24 21:05:10 +08:00
Jiaming Yuan
14afdb4d92
Support categorical data in ellpack. ( #6140 )
2020-09-24 19:28:57 +08:00
Jiaming Yuan
78d72ef936
Add DaskDeviceQuantileDMatrix demo. ( #6156 )
2020-09-24 14:08:28 +08:00
Philip Hyunsu Cho
678ea40b24
[CI] Upgrade cuDF and RMM to 0.16 nightlies; upgrade to Ubuntu 18.04 ( #6157 )
...
* [CI] Upgrade cuDF and RMM to 0.16 nightlies
* Use Ubuntu 18.04 in RMM test, since RMM needs GCC 7+
2020-09-23 19:48:44 -07:00
James Lamb
c686bc0461
[R] remove warning in configure.ac ( fixes #6151 ) ( #6152 )
...
* [R] remove warning in configure.ac (fixes #6151 )
* update configure
2020-09-22 22:47:38 -07:00
Jiaming Yuan
e033caa3ba
Remove linking RMM library. ( #6146 )
...
* Remove linking RMM library.
* RMM is now header only.
* Remove remaining reference.
2020-09-22 16:59:33 -07:00
Jiaming Yuan
452ac8ea62
Time GPU tests on CI. ( #6141 )
2020-09-22 14:25:10 +08:00
Jiaming Yuan
33d80ffad0
[dask] Support more meta data on functional interface. ( #6132 )
...
* Add base_margin, label_(lower|upper)_bound.
* Test survival training with dask.
2020-09-21 16:56:37 +08:00
Jiaming Yuan
7065779afa
Improve JSON format for categorical features. ( #6128 )
...
* Gather categories for all nodes.
2020-09-21 15:35:05 +08:00
Jiaming Yuan
210c131ce7
Support categorical data in GPU sketching. ( #6137 )
2020-09-21 13:53:06 +08:00
Nan Zhu
c932fb50a1
[jvm-packages]add xgboost4j-gpu/xgboost4j-spark-gpu module to facilitate release ( #6136 )
...
* add xgboost4j-gpu/xgboost4j-spark-gpu module to facilitate release
* Update pom.xml
2020-09-20 09:20:38 -07:00
Jiaming Yuan
a069a21e03
Implement intrusive ptr ( #6129 )
...
* Use intrusive ptr for JSON.
2020-09-20 20:07:16 +08:00
Jiaming Yuan
e319b63f9e
Merge extract cuts into QuantileContainer. ( #6125 )
...
* Use pruning for initial summary construction.
2020-09-18 16:36:39 +08:00
Jiaming Yuan
cc82ca167a
[dask] Refactor meta data handling. ( #6130 )
2020-09-18 13:26:40 +08:00
Jiaming Yuan
5384ed85c8
Use caching allocator from RMM, when RMM is enabled ( #6131 )
2020-09-17 21:51:49 -07:00
neko
6bc9b9dc4f
Fix doc for CMake requirement. ( #6123 )
2020-09-16 17:59:43 +08:00
Philip Hyunsu Cho
9e955fb9b0
[R] Check warnings explicitly for model compatibility tests ( #6114 )
...
* [R] Check warnings explicitly for model compatibility tests
* Address reviewer's feedback
2020-09-15 10:49:48 -07:00
Philip Hyunsu Cho
33577ef5d3
Add MAPE metric ( #6119 )
2020-09-14 18:45:27 -07:00
Rory Mitchell
47350f6acb
Allow kwargs in dask predict ( #6117 )
2020-09-15 13:04:03 +12:00
Jiaming Yuan
b5f52f0b1b
Validate weights are positive values. ( #6115 )
2020-09-15 09:03:55 +08:00
Jiaming Yuan
c6f2b8c841
Upgrade gputreeshap. ( #6099 )
...
* Upgrade gputreeshap.
Co-authored-by: Rory Mitchell <r.a.mitchell.nz@gmail.com >
2020-09-15 12:57:22 +12:00
Vitalie Spinu
1453bee3e7
[R] Remove stringi dependency ( #6109 )
...
* [R] Fix empty empty tests and a test warnings
* [R] Remove stringi dependency (fix #5905 )
* Fix R lint check
* [R] Fix automatic conversion to factor in R < 4.0.0 in xgb.model.dt.tree
* Add `R` Makefile variable
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-09-12 13:18:08 -07:00
Jiaming Yuan
07945290a2
Remove unused RABIT targets. ( #6110 )
...
* Remove rabit mock.
* Remove rabit base.
2020-09-11 14:09:44 +08:00
Jiaming Yuan
c92d751ad1
Enable building rabit on Windows ( #6105 )
2020-09-11 11:54:46 +08:00
Jiaming Yuan
08bdb2efc8
Fix dask doc. [skip ci] ( #6108 )
2020-09-11 10:56:12 +08:00
Bobby Wang
00b0ad1293
[Doc] add doc for kill_spark_context_on_worker_failure parameter ( #6097 )
...
* [Doc] add doc for kill_spark_context_on_worker_failure parameter
* resolve comments
2020-09-09 21:28:44 -07:00
Philip Hyunsu Cho
d0ccb13d09
Work around a compiler bug in MacOS AppleClang 11 ( #6103 )
...
* Workaround a compiler bug in MacOS AppleClang
* [CI] Run C++ test with MacOS Catalina + AppleClang 11.0.3
* [CI] Migrate cmake_test on MacOS from Travis CI to GitHub Actions
* Install OpenMP runtime
* [CI] Use CMake to locate lz4 lib
2020-09-09 21:21:55 -07:00
Philip Hyunsu Cho
9338582d79
[CI] Fix CTest by running it in a correct directory ( #6104 )
...
* [CI] Fix CTest by running it in a correct directory
* [CI] Do not run dmlc-core unit tests with sanitizer
2020-09-09 10:31:09 -07:00
Jiaming Yuan
3dcd85fab5
Refactor rabit tests ( #6096 )
...
* Merge rabit tests into XGBoost.
* Run them On CI.
* Simplification for CMake scripts.
2020-09-09 12:30:29 +08:00
Jiaming Yuan
318bffaa10
Fix custom obj link. [skip ci] ( #6100 )
2020-09-09 10:55:38 +08:00
Jiaming Yuan
b0001a6e29
Correct style warnings from clang-tidy for rabit. ( #6095 )
2020-09-08 12:13:58 +08:00
Hristo Iliev
da61d9460b
[jvm-packages] Add getNumFeature method ( #6075 )
...
* Add getNumFeature to the Java API
* Add getNumFeature to the Scala API
* Add unit tests for getNumFeature
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-09-07 20:57:46 -07:00
Jiaming Yuan
93e9af43bb
Unify set index data. ( #6062 )
2020-09-08 11:38:41 +08:00
Jiaming Yuan
e5d40b39cd
[Breaking] Don't save leaf child count in JSON. ( #6094 )
...
The field is deprecated and not used anywhere in XGBoost.
2020-09-08 11:11:13 +08:00
Jiaming Yuan
5994f3b14c
Don't link imported target. ( #6093 )
2020-09-07 02:51:09 -07:00
Philip Hyunsu Cho
974ba12f38
Fix CMake build with BUILD_STATIC_LIB option ( #6090 )
...
* Fix CMake build with BUILD_STATIC_LIB option
* Disable BUILD_STATIC_LIB option when R/JVM pkg is enabled
* Add objxgboost to install target only when BUILD_STATIC_LIB=ON
2020-09-07 02:38:29 -07:00
Daniel Steinberg
68c55a37d9
Add cache name back to external_memory.py files. ( #6088 )
2020-09-06 16:01:09 +08:00
Boris Feld
24ca9348f7
Fix typo in xgboost.callback.early_stop docstring ( #6071 )
2020-09-06 13:37:07 +08:00
Rory Mitchell
2e907abdb8
Updates to GPUTreeShap ( #6087 )
...
* Extract paths on device
* Update GPUTreeShap
2020-09-06 13:39:08 +12:00
Bobby Wang
0e2d5669f6
[jvm-packages] cancel job instead of killing SparkContext ( #6019 )
...
* cancel job instead of killing SparkContext
This PR changes the default behavior that kills SparkContext. Instead, This PR
cancels jobs when coming across task failed. That means the SparkContext is
still alive even some exceptions happen.
* add a parameter to control if killing SparkContext
* cancel the jobs the failed task belongs to
* remove the jobId from the map when one job failed.
* resolve comments
2020-09-02 14:20:59 -07:00
Tong He
3912f3de06
Updates from 1.2.0 cran submission ( #6077 )
...
* update for 1.2.0 cran submission
* recover cmakelists
* fix unittest from the shap PR
* trigger CI
2020-09-02 20:50:23 +08:00
Philip Hyunsu Cho
9be969cc7a
Add release note for 1.2.0 in NEWS.md ( #6063 )
...
* Update query_contributors.py to account for pagination
* Add the release note for 1.2.0
* Add release note for patch releases
* Apply suggestions from code review
* Fix typo
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com >
2020-09-02 00:49:02 -07:00
Anthony D'Amato
ada964f16e
Clean the way deterministic paritioning is computed ( #6033 )
...
We propose to only use the rowHashCode to compute the partitionKey, adding the FeatureValue hashCode does not bring more value and would make the computation slower. Even though a collision would appear at 0.2% with MurmurHash3 this is bearable for partitioning, this won't have any impact on the data balancing.
2020-08-30 14:38:23 -07:00
ShvetsKS
c1ca872d1e
Modin DF support ( #6055 )
...
* Modin DF support
* mode change
* tests were added, ci env was extended
* mode change
* Remove redundant installation of modin
* Add a pytest skip marker for modin
* Install Modin[ray] from PyPI
* fix interfering
* avoid extra conversion
* delete cv test for modin
* revert cv function
Co-authored-by: ShvetsKS <kirill.shvets@intel.com >
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-29 22:33:30 +03:00
FelixYBW
3a990433f9
set maxBins to 256. Align with c code in src/tree/param.h ( #6066 )
2020-08-28 15:06:11 +03:00
Rory Mitchell
9bddecee05
Update GPUTreeShap ( #6064 )
...
* Update GPUTreeShap
* Update src/CMakeLists.txt
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-27 12:01:53 -07:00
Jiaming Yuan
2fcc4f2886
Unify evaluation functions. ( #6037 )
2020-08-26 14:23:27 +08:00
Jiaming Yuan
80c8547147
Make binary bin search reusable. ( #6058 )
...
* Move binary search row to hist util.
* Remove dead code.
2020-08-26 05:05:11 +08:00
Philip Hyunsu Cho
9c14e430af
[CI] Improve JVM test in GitHub Actions ( #5930 )
...
* [CI] Improve JVM test in GitHub Actions
* Use env var for Wagon options [skip ci]
* Move the retry flag to pom.xml [skip ci]
* Export env var RABIT_MOCK to run Spark tests [skip ci]
* Correct location of env var
* Re-try up to 5 times [skip ci]
* Don't run distributed training test on Windows
* Fix typo
* Update main.yml
2020-08-25 10:14:46 -07:00
Jiaming Yuan
81d8dd79ca
Bump header version. ( #6056 )
2020-08-26 00:29:00 +08:00
Jiaming Yuan
20c95be625
Expand categorical node. ( #6028 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-25 18:53:57 +08:00
Rory Mitchell
9a4e8b1d81
GPUTreeShap ( #6038 )
2020-08-25 12:47:41 +12:00
Philip Hyunsu Cho
b3193052b3
Bump version to 1.3.0 snapshot in master ( #6052 )
2020-08-23 17:13:46 -07:00
Philip Hyunsu Cho
4729458a36
[jvm-packages] [doc] Update install doc for JVM packages ( #6051 )
2020-08-23 14:14:53 -07:00
Philip Hyunsu Cho
cfced58c1c
[CI] Port CI fixes from the 1.2.0 branch ( #6050 )
...
* Fix a unit test on CLI, to handle RC versions
* [CI] Use mgpu machine to run gpu hist unit tests
* [CI] Build GPU-enabled JAR artifact and deploy to xgboost-maven-repo
2020-08-22 23:24:46 -07:00
Jiaming Yuan
a144daf034
Limit tree depth for GPU hist. ( #6045 )
2020-08-22 19:34:52 +08:00
Jiaming Yuan
b9ebbffc57
Fix plotting test. ( #6040 )
...
Previously the test loads a model generated by `test_basic.py`, now we generate
the model explicitly.
* Cleanup saved files for basic tests.
2020-08-22 13:18:48 +08:00
Jiaming Yuan
7a46515d3d
Remove win2016 jvm github action test. ( #6042 )
2020-08-20 19:39:46 -07:00
Jiaming Yuan
7be2e04bd4
Fix scikit learn cls doc. ( #6041 )
2020-08-20 19:23:06 -07:00
Philip Hyunsu Cho
1fd29edf66
[CI] Migrate linters to GitHub Actions ( #6035 )
...
* [CI] Move lint to GitHub Actions
* [CI] Move Doxygen to GitHub Actions
* [CI] Move Sphinx build test to GitHub Actions
* [CI] Reduce workload for Windows R tests
* [CI] Move clang-tidy to Build stage
2020-08-19 12:33:51 -07:00
ShvetsKS
24f2e6c97e
Optimize DMatrix build time. ( #5877 )
...
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com >
2020-08-20 01:37:03 +08:00
Jiaming Yuan
29b7fea572
Optimize cpu sketch allreduce for sparse data. ( #6009 )
...
* Bypass RABIT serialization reducer and use custom allgather based merging.
2020-08-19 10:03:45 +08:00
Jiaming Yuan
90355b4f00
Make JSON the default full serialization format. ( #6027 )
2020-08-19 09:57:43 +08:00
Anthony D'Amato
f58e41bad8
Fix deterministic partitioning with dataset containing Double.NaN ( #5996 )
...
The functions featureValueOfSparseVector or featureValueOfDenseVector could return a Float.NaN if the input vectore was containing any missing values. This would make fail the partition key computation and most of the vectors would end up in the same partition. We fix this by avoid returning a NaN and simply use the row HashCode in this case.
We added a test to ensure that the repartition is indeed now uniform on input dataset containing values by checking that the partitions size variance is below a certain threshold.
Signed-off-by: Anthony D'Amato <anthony.damato@hotmail.fr >
2020-08-18 18:55:37 -07:00
Cuong Duong
e51cba6195
Add SHAP summary plot using ggplot2 ( #5882 )
...
* add SHAP summary plot using ggplot2
* Update xgb.plot.shap
* Update example in xgb.plot.shap documentation
* update logic, add tests
* whitespace fixes
* whitespace fixes for test_helpers
* namespace for sd function
* explicitly declare variables that are automatically evaluated by data.table
* Fix R lint
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-18 18:04:09 -07:00
Qi Zhang
989ddd036f
Swap byte-order in binary serializer to support big-endian arch ( #5813 )
...
* fixed some endian issues
* Use dmlc::ByteSwap() to simplify code
* Fix lint check
* [CI] Add test for s390x
* Download latest CMake on s390x
* Fix a bug in my code
* Save magic number in dmatrix with byteswap on big-endian machine
* Save version in binary with byteswap on big-endian machine
* Load scalar with byteswap in MetaInfo
* Add a debugging message
* Handle arrays correctly when byteswapping
* EOF can also be 255
* Handle magic number in MetaInfo carefully
* Skip Tree.Load test for big-endian, since the test manually builds little-endian binary model
* Handle missing packages in Python tests
* Don't use boto3 in model compatibility tests
* Add s390 Docker file for local testing
* Add model compatibility tests
* Add R compatibility test
* Revert "Add R compatibility test"
This reverts commit c2d2bdcb7dbae133cbb927fcd20f7e83ee2b18a8.
Co-authored-by: Qi Zhang <q.zhang@ibm.com >
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-18 14:47:17 -07:00
Jiaming Yuan
4d99c58a5f
Feature weights ( #5962 )
2020-08-18 19:55:41 +08:00
Jiaming Yuan
a418278064
Merge pull request #6023 from trivialfis/merge-rabit
...
Merge rabit
2020-08-18 09:01:56 +08:00
Philip Hyunsu Cho
14d5ce712c
[CI] Fix Dask Pytest fixture ( #6024 )
2020-08-17 16:45:22 -07:00
fis
111968ca58
Merge rabit
2020-08-18 03:52:33 +08:00
fis
1c5904df3f
Remove rabit.
2020-08-18 03:48:36 +08:00
Jiaming Yuan
d240463b38
Revert "Remove warning about memset. ( #6003 )" ( #6020 )
...
This reverts commit 12e3fb6a6c .
2020-08-17 20:10:15 +08:00
Philip Hyunsu Cho
511bb22ffd
[Doc] Add dtreeviz as a showcase example of integration with 3rd-party software ( #6013 )
2020-08-13 20:53:59 -07:00
Philip Hyunsu Cho
e3ec7b01df
[CI] Cancel builds on subsequent pushes ( #6011 )
...
* [CI] Cancel builds on subsequent pushes
* Use a more secure method
* test commit
2020-08-13 11:17:39 -07:00
Jiaming Yuan
674c409e9d
Remove rabit dependency on public headers. ( #6005 )
2020-08-13 08:26:20 +08:00
Jiaming Yuan
12e3fb6a6c
Remove warning about memset. ( #6003 )
2020-08-13 08:25:46 +08:00
Philip Hyunsu Cho
9adb812a0a
RMM integration plugin ( #5873 )
...
* [CI] Add RMM as an optional dependency
* Replace caching allocator with pool allocator from RMM
* Revert "Replace caching allocator with pool allocator from RMM"
This reverts commit e15845d4e72e890c2babe31a988b26503a7d9038.
* Use rmm::mr::get_default_resource()
* Try setting default resource (doesn't work yet)
* Allocate pool_mr in the heap
* Prevent leaking pool_mr handle
* Separate EXPECT_DEATH() in separate test suite suffixed DeathTest
* Turn off death tests for RMM
* Address reviewer's feedback
* Prevent leaking of cuda_mr
* Fix Jenkinsfile syntax
* Remove unnecessary function in Jenkinsfile
* [CI] Install NCCL into RMM container
* Run Python tests
* Try building with RMM, CUDA 10.0
* Do not use RMM for CUDA 10.0 target
* Actually test for test_rmm flag
* Fix TestPythonGPU
* Use CNMeM allocator, since pool allocator doesn't yet support multiGPU
* Use 10.0 container to build RMM-enabled XGBoost
* Revert "Use 10.0 container to build RMM-enabled XGBoost"
This reverts commit 789021fa31112e25b683aef39fff375403060141.
* Fix Jenkinsfile
* [CI] Assign larger /dev/shm to NCCL
* Use 10.2 artifact to run multi-GPU Python tests
* Add CUDA 10.0 -> 11.0 cross-version test; remove CUDA 10.0 target
* Rename Conda env rmm_test -> gpu_test
* Use env var to opt into CNMeM pool for C++ tests
* Use identical CUDA version for RMM builds and tests
* Use Pytest fixtures to enable RMM pool in Python tests
* Move RMM to plugin/CMakeLists.txt; use PLUGIN_RMM
* Use per-device MR; use command arg in gtest
* Set CMake prefix path to use Conda env
* Use 0.15 nightly version of RMM
* Remove unnecessary header
* Fix a unit test when cudf is missing
* Add RMM demos
* Remove print()
* Use HostDeviceVector in GPU predictor
* Simplify pytest setup; use LocalCUDACluster fixture
* Address reviewers' commments
Co-authored-by: Hyunsu Cho <chohyu01@cs.wasshington.edu >
2020-08-12 01:26:02 -07:00
Jiaming Yuan
c3ea3b7e37
Fix nightly build doc. [skip ci] ( #6004 )
...
* Fix nightly build doc. [skip ci]
* Fix title too short. [skip ci]
2020-08-12 15:00:40 +08:00
Jiaming Yuan
ee70a2380b
Unify CPU hist sketching ( #5880 )
2020-08-12 01:33:06 +08:00
jameskrach
bd6b7f4aa7
[Breaking] Fix .predict() method and add .predict_proba() in xgboost.dask.DaskXGBClassifier ( #5986 )
2020-08-11 16:11:28 +08:00
Jiaming Yuan
6f7112a848
Move warning about empty dataset. ( #5998 )
2020-08-11 14:10:51 +08:00
Jiaming Yuan
f93f1c03fc
Rabit update. ( #5978 )
...
* Remove parameter on JVM Packages.
2020-08-11 09:17:32 +08:00
Jiaming Yuan
0b2a26fa74
Remove skmaker. ( #5971 )
2020-08-09 15:23:31 +08:00
Vladislav Epifanov
388f975cf5
Introducing DPC++-based plugin (predictor, objective function) supporting oneAPI programming model ( #5825 )
...
* Added plugin with DPC++-based predictor and objective function
* Update CMakeLists.txt
* Update regression_obj_oneapi.cc
* Added README.md for OneAPI plugin
* Added OneAPI predictor support to gbtree
* Update README.md
* Merged kernels in gradient computation. Enabled multiple loss functions with DPC++ backend
* Aligned plugin CMake files with latest master changes. Fixed whitespace typos
* Removed debug output
* [CI] Make oneapi_plugin a CMake target
* Added tests for OneAPI plugin for predictor and obj. functions
* Temporarily switched to default selector for device dispacthing in OneAPI plugin to enable execution in environments without gpus
* Updated readme file.
* Fixed USM usage in predictor
* Removed workaround with explicit templated names for DPC++ kernels
* Fixed warnings in plugin tests
* Fix CMake build of gtest
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-08 18:40:40 -07:00
Anthony D'Amato
7cf3e9be59
Fix typo in tracker logging ( #5994 )
2020-08-09 03:45:46 +08:00
James Lamb
589b385ec6
[R] fix uses of 1:length(x) and other small things ( #5992 )
2020-08-09 03:31:33 +08:00
Jiaming Yuan
801e6b6800
Fix dask predict shape infer. ( #5989 )
2020-08-08 14:29:22 +08:00
Jiaming Yuan
4acdd7c6f6
Remove stop process. ( #143 )
2020-08-05 10:12:00 -07:00
Jiaming Yuan
9c6e791e64
Enforce tree order in JSON. ( #5974 )
...
* Make JSON model IO more future proof by using tree id in model loading.
2020-08-05 16:44:52 +08:00
Jiaming Yuan
dde9c5aaff
Fix missing data warning. ( #5969 )
...
* Fix data warning.
* Add numpy/scipy test.
2020-08-05 16:19:12 +08:00
Jiaming Yuan
8599f87597
Update JSON schema. ( #5982 )
...
* Update JSON schema for pseudo huber.
* Update JSON model schema.
2020-08-05 15:21:11 +08:00
Jiaming Yuan
9c93531709
Update Python custom objective demo. ( #5981 )
2020-08-05 12:27:19 +08:00
Jiaming Yuan
1149a7a292
Fix sklearn doc. ( #5980 )
2020-08-05 12:26:19 +08:00
Jiaming Yuan
b069431c28
Export DaskDeviceQuantileDMatrix in doc. [skip ci] ( #5975 )
2020-08-05 00:48:10 +08:00
Shaochen Shi
71197d1dfa
[jvm-packages] Fix wrong method name setAllowZeroForMissingValue. ( #5740 )
...
* Allow non-zero for missing value when training.
* Fix wrong method names.
* Add a unit test
* Move the getter/setter unit test to MissingValueHandlingSuite
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-08-01 17:16:42 -07:00
Philip Hyunsu Cho
5a2dcd1c33
[R] Provide better guidance for persisting XGBoost model ( #5964 )
...
* [R] Provide better guidance for persisting XGBoost model
* Update saving_model.rst
* Add a paragraph about xgb.serialize()
2020-07-31 20:00:26 -07:00
Philip Hyunsu Cho
bf2990e773
Add missing Pytest marks to AsyncIO unit test ( #5968 )
2020-08-01 10:56:24 +08:00
Philip Hyunsu Cho
5f3c811e84
[CI] Assign larger /dev/shm to NCCL ( #5966 )
...
* [CI] Assign larger /dev/shm to NCCL
* Use 10.2 artifact to run multi-GPU Python tests
* Add CUDA 10.0 -> 11.0 cross-version test; remove CUDA 10.0 target
2020-07-31 10:05:04 -07:00
Philip Hyunsu Cho
3fcfaad577
Add CMake flag to log C API invocations, to aid debugging ( #5925 )
...
* Add CMake flag to log C API invocations, to aid debugging
* Remove unnecessary parentheses
2020-07-30 19:24:28 -07:00
James Bourbeau
3b88bc948f
Update XGBoost + Dask overview documentation ( #5961 )
...
* Add imports to code snippet
* Better writing.
2020-07-31 09:58:50 +08:00
Jiaming Yuan
70903c872f
Force colored output for ninja build. ( #5959 )
2020-07-30 20:48:03 +08:00
boxdot
d268a2a463
Thread-safe prediction by making the prediction cache thread-local. ( #5853 )
...
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2020-07-30 12:33:50 +08:00
Jiaming Yuan
fa3715f584
[Dask] Asyncio support. ( #5862 )
2020-07-30 06:23:58 +08:00
Jiaming Yuan
e4a273e1da
Fix evaluate root split. ( #5948 )
2020-07-29 19:33:29 +08:00
Philip Hyunsu Cho
071e10c1d1
[CI] Fix broken Docker container 'cpu' ( #5956 )
2020-07-29 04:29:57 -07:00
Jiaming Yuan
f5fdcbe194
Disable feature validation on sklearn predict prob. ( #5953 )
...
* Fix issue when scikit learn interface receives transformed inputs.
2020-07-29 19:26:44 +08:00
Jiaming Yuan
18349a7ccf
[Breaking] Fix custom metric for multi output. ( #5954 )
...
* Set output margin to true for custom metric. This fixes only R and Python.
2020-07-29 19:25:27 +08:00
Jiaming Yuan
75b8c22b0b
Fix prediction heuristic ( #5955 )
...
* Relax check for prediction.
* Relax test in spark test.
* Add tests in C++.
2020-07-29 19:24:07 +08:00
Philip Hyunsu Cho
5879acde9a
[CI] Improve R linter script ( #5944 )
...
* [CI] Move lint to a separate script
* [CI] Improved lintr launcher
* Add lintr as a separate action
* Add custom parsing logic to print out logs
* Fix lintr issues in demos
* Run R demos
* Fix CRAN checks
* Install XGBoost into R env before running lintr
* Install devtools (needed to run demos)
2020-07-27 00:55:35 -07:00
Bobby Wang
8943eb4314
[BLOCKING] [jvm-packages] add gpu_hist and enable gpu scheduling ( #5171 )
...
* [jvm-packages] add gpu_hist tree method
* change updater hist to grow_quantile_histmaker
* add gpu scheduling
* pass correct parameters to xgboost library
* remove debug info
* add use.cuda for pom
* add CI for gpu_hist for jvm
* add gpu unit tests
* use gpu node to build jvm
* use nvidia-docker
* Add CLI interface to create_jni.py using argparse
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-07-26 21:53:24 -07:00
Philip Hyunsu Cho
6347fa1c2e
[R] Enable weighted learning to rank ( #5945 )
...
* [R] enable weighted learning to rank
* Add R unit test for ranking
* Fix lint
2020-07-26 21:10:36 -07:00
Philip Hyunsu Cho
ace7fd328b
[R] Add a compatibility layer to load Booster object from an old RDS file ( #5940 )
...
* [R] Add a compatibility layer to load Booster from an old RDS
* Modify QuantileHistMaker::LoadConfig() to be backward compatible with 1.1.x
* Add a big warning about compatibility in QuantileHistMaker::LoadConfig()
* Add testing suite
* Discourage use of saveRDS() in CRAN doc
2020-07-26 00:06:49 -07:00
Jiaming Yuan
40361043ae
[BLOCKING] Remove to_string. ( #5934 )
2020-07-26 10:21:26 +08:00
Philip Hyunsu Cho
12110c900e
[CI] Make Python model compatibility test runnable locally ( #5941 )
2020-07-25 16:58:02 -07:00
Philip Hyunsu Cho
487ab0ce73
[BLOCKING] Handle empty rows in data iterators correctly ( #5929 )
...
* [jvm-packages] Handle empty rows in data iterators correctly
* Fix clang-tidy error
* last empty row
* Add comments [skip ci]
Co-authored-by: Nan Zhu <nanzhu@uber.com >
2020-07-25 13:46:19 -07:00
FelixYBW
e6cd74ead3
Set a minimal reducer size and parent_down size ( #139 )
...
* set a minimal reducer msg size. Receive the same data size from parent each time.
* When parent read from a child, check it receive minimal reduce size.
fix bug. Rewrite the minimal reducer size check, make sure it's 1~N times of minimal reduce size
Assume the minimal reduce size is X, the logic here is
1: each child upload total_size of message
2: each parent receive X message at least, up to total_size
3: parent reduce X or NxX or total_size message
4: parent sends X or NxX or total_size message to its parent
4: parent's parent receive X message at least, up to total_size. Then reduce X or NxX or total_size message
6: parent's parent sends X or NxX or total_size message to its children
7: parent receives X or NxX or total_size message, sends to its children
8: child receive X or NxN or total_size message.
During the whole process, each transfer is (1~N)xX Byte message or up to total_size.
if X is larger than total_size, then allreduce allways reduce the whole messages and pass down.
* Follow style check rule
* fix the cpplint check
* fix allreduce_base header seq
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-07-25 12:46:45 -07:00
Jiaming Yuan
a4de2f68e4
Use cudaOccupancyMaxPotentialBlockSize to calculate the block size. ( #5926 )
2020-07-23 14:24:42 +08:00
Jiaming Yuan
fbfbd525d8
Cache dependencies on Github Action. ( #5928 )
2020-07-23 14:00:19 +08:00
Philip Hyunsu Cho
4af857f95d
Add explicit template specialization for portability ( #5921 )
...
* Add explicit template specializations
* Adding Specialization for FileAdapterBatch
2020-07-22 12:31:17 -07:00
Jiaming Yuan
bc1d3ee230
Fix r early stop with custom objective. ( #5923 )
...
* Specify `ntreelimit`.
2020-07-23 03:28:17 +08:00
Jiaming Yuan
30363d9c35
Remove R and JVM from appveyor. ( #5922 )
2020-07-23 03:26:48 +08:00
Jiaming Yuan
66cc1e02aa
Setup github action. ( #5917 )
2020-07-22 15:05:25 +08:00
Philip Hyunsu Cho
627cf41a60
Add option to enable all compiler warnings in GCC/Clang ( #5897 )
...
* Add option to enable all compiler warnings in GCC/Clang
* Fix -Wall for CUDA sources
* Make -Wall private req for xgboost-r
2020-07-21 23:34:03 -07:00
Jiaming Yuan
9b688aca3b
Fix mingw build with R. ( #5918 )
2020-07-22 02:56:49 +08:00
Philip Hyunsu Cho
8d7702766a
[Doc] Document new objectives and metrics available on GPUs ( #5909 )
2020-07-21 02:10:59 -07:00
Jiaming Yuan
03fb98fbde
Fix typo in CI. [skip ci] ( #5919 )
2020-07-21 14:25:27 +08:00
Jiaming Yuan
8b1afce316
Add Github Action for R. ( #5911 )
...
* Fix lintr errors.
2020-07-20 19:23:36 +08:00
Andy Adinets
b3d2e7644a
Support building XGBoost with CUDA 11 ( #5808 )
...
* Change serialization test.
* Add CUDA 11 tests on Linux CI.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-07-20 07:58:41 +08:00
Philip Hyunsu Cho
ac9136ee49
Further improvements and savings in Jenkins pipeline ( #5904 )
...
* Publish artifacts only on the master and release branches
* Build CUDA only for Compute Capability 7.5 when building PRs
* Run all Windows jobs in a single worker image
* Build nightly XGBoost4J SNAPSHOT JARs with Scala 2.12 only
* Show skipped Python tests on Windows
* Make Graphviz optional for Python tests
* Add back C++ tests
* Unstash xgboost_cpp_tests
* Fix label to CUDA 10.1
* Install cuPy for CUDA 10.1
* Install jsonschema
* Address reviewer's feedback
2020-07-18 03:30:40 -07:00
Jiaming Yuan
6c0c87216f
Fix Windows 2016 build. ( #5902 )
2020-07-18 05:50:17 +08:00
Philip Hyunsu Cho
71b0528a2f
GPU implementation of AFT survival objective and metric ( #5714 )
...
* Add interval accuracy
* De-virtualize AFT functions
* Lint
* Refactor AFT metric using GPU-CPU reducer
* Fix R build
* Fix build on Windows
* Fix copyright header
* Clang-tidy
* Fix crashing demo
* Fix typos in comment; explain GPU ID
* Remove unnecessary #include
* Add C++ test for interval accuracy
* Fix a bug in accuracy metric: use log pred
* Refactor AFT objective using GPU-CPU Transform
* Lint
* Fix lint
* Use Ninja to speed up build
* Use time, not /usr/bin/time
* Add cpu_build worker class, with concurrency = 1
* Use concurrency = 1 only for CUDA build
* concurrency = 1 for clang-tidy
* Address reviewer's feedback
* Update link to AFT paper
2020-07-17 01:18:13 -07:00
Jiaming Yuan
7c2686146e
Dask device dmatrix ( #5901 )
...
* Fix softprob with empty dmatrix.
2020-07-17 13:17:43 +08:00
Jiaming Yuan
e471056ec4
Fix sketch size calculation. ( #5898 )
2020-07-17 08:33:16 +08:00
Bobby Wang
730866a7bc
[CI] update spark version to 3.0.0 ( #5890 )
...
* [CI] update spark version to 3.0.0
* Update Dockerfile.jvm_cross
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-07-16 00:23:44 -07:00
Jiaming Yuan
029a8b533f
Simplify the data backends. ( #5893 )
2020-07-16 15:17:31 +08:00
Philip Hyunsu Cho
7aee0e51ed
Fix R package build with CMake 3.13 ( #5895 )
...
* Fix R package build with CMake 3.13
* Require OpenMP for xgboost-r target
2020-07-15 20:22:11 -07:00
Philip Hyunsu Cho
3c40f4a7f5
[CI] Reduce load on Windows CI pipeline ( #5892 )
2020-07-14 18:47:05 -07:00
Jiaming Yuan
3cae287dea
Fix NDK Build. ( #5886 )
...
* Explicit cast for slice.
2020-07-14 18:34:19 +08:00
Alexander Gugel
970b4b3fa2
Add XGBoosterGetNumFeature ( #5856 )
...
- add GetNumFeature to Learner
- add XGBoosterGetNumFeature to C API
- update c-api-demo accordingly
2020-07-13 23:25:17 -07:00
Philip Hyunsu Cho
e0c179c7cc
[CI] Enforce daily budget in Jenkins CI ( #5884 )
...
* [CI] Throttle Jenkins CI
* Don't use Jenkins master instance
2020-07-13 21:51:11 -07:00
Jiaming Yuan
dd445af56e
Cleanup on device sketch. ( #5874 )
...
* Remove old functions.
* Merge weighted and un-weighted into a common interface.
2020-07-14 10:15:54 +08:00
Bobby Wang
9f85e92602
[jvm-packages] update spark dependency to 3.0.0 ( #5836 )
2020-07-12 20:58:30 -07:00
Philip Hyunsu Cho
23e2c6ec91
Upgrade Rabit ( #5876 )
2020-07-09 16:18:33 -07:00
Zhang Zhang
1813804e36
Add new parameter singlePrecisionHistogram to xgboost4j-spark ( #5811 )
...
Expose the existing 'singlePrecisionHistogram' param to the Spark layer.
2020-07-08 16:29:35 -07:00
Philip Hyunsu Cho
0d411b0397
[CI] Simplify CMake build with modern CMake techniques ( #5871 )
...
* [CI] Simplify CMake build
* Make sure that plugins can be built
* [CI] Install lz4 on Mac
2020-07-08 04:23:24 -07:00
Philip Hyunsu Cho
22a31b1faa
[Doc] Document that CUDA 10.0 is required [skip ci] ( #5872 )
2020-07-07 18:55:19 -07:00
Rong Ou
06320729d4
fix device sketch with weights in external memory mode ( #5870 )
2020-07-08 08:44:07 +08:00
Jiaming Yuan
d0a29c3135
Remove print. ( #5867 )
2020-07-08 04:12:14 +08:00
Jiaming Yuan
a3ec964346
Accept iterator in device dmatrix. ( #5783 )
...
* Remove Device DMatrix.
2020-07-07 21:44:48 +08:00
Jiaming Yuan
048d969be4
Implement GK sketching on GPU. ( #5846 )
...
* Implement GK sketching on GPU.
* Strong tests on quantile building.
* Handle sparse dataset by binary searching the column index.
* Hypothesis test on dask.
2020-07-07 12:16:21 +08:00
Andy Adinets
ac3f0e78dc
Split Features into Groups to Compute Histograms in Shared Memory ( #5795 )
2020-07-07 15:04:35 +12:00
Jiaming Yuan
93c44a9a64
Move feature names and types of DMatrix from Python to C++. ( #5858 )
...
* Add thread local return entry for DMatrix.
* Save feature name and feature type in binary file.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-07-07 09:40:13 +08:00
Jiaming Yuan
4b0852ee41
Use dmlc stream when URI protocol is not local file. ( #5857 )
2020-07-07 03:07:12 +08:00
Alexander Gugel
0f17e35bce
Add c-api-demo to .gitignore ( #5855 )
2020-07-05 04:35:22 +08:00
Philip Hyunsu Cho
efe3e48ae2
Ensure that LoadSequentialFile() actually read the whole file ( #5831 )
2020-07-04 16:17:11 +08:00
Jiaming Yuan
1a0801238e
Implement iterative DMatrix. ( #5837 )
2020-07-03 11:44:52 +08:00
Jiaming Yuan
4d277d750d
Relax linear test. ( #5849 )
...
* Increased error in coordinate is mostly due to floating point error.
* Shotgun uses Hogwild!, which is non-deterministic and can have even greater
floating point error.
2020-07-03 07:49:53 +08:00
Jiaming Yuan
eb067c1c34
Relax test for shotgun. ( #5835 )
2020-07-01 19:20:29 +08:00
Jiaming Yuan
90a9c68874
Implement a DMatrix Proxy. ( #5803 )
2020-06-29 15:03:10 +08:00
Philip Hyunsu Cho
74bf00a5ab
De-duplicate macro _CRT_SECURE_NO_WARNINGS / _CRT_SECURE_NO_DEPRECATE ( #136 )
...
* De-duplicate macro _CRT_SECURE_NO_WARNINGS / _CRT_SECURE_NO_DEPRECATE
* Move all macros to base.h
* Fix CI
2020-06-28 09:51:50 -07:00
Jiaming Yuan
47c89775d6
Accept string for ArrayInterface constructor. ( #5799 )
2020-06-27 00:06:54 +08:00
Yuan Tang
95f11ed27e
Rename Ant Financial to Ant Group ( #5827 )
2020-06-25 15:25:36 -04:00
Jiaming Yuan
8234091368
Remove unweighted GK quantile. ( #5816 )
2020-06-23 14:27:46 +08:00
Philip Hyunsu Cho
dcff96ed27
[Doc] Fix rendering of Markdown docs, e.g. R doc ( #5821 )
2020-06-21 23:49:22 -07:00
Jiaming Yuan
8104f10328
Update document for model dump. ( #5818 )
...
* Clarify the relationship between dump and save.
* Mention the schema.
2020-06-22 14:33:54 +08:00
Jiaming Yuan
26143ad0b1
Update rabit. ( #5680 )
2020-06-22 14:32:43 +08:00
Jiaming Yuan
c4d721200a
Implement extend method for meta info. ( #5800 )
...
* Implement extend for host device vector.
2020-06-20 03:32:03 +08:00
Philip Hyunsu Cho
a6d9a06b7b
[CI] Fix cuDF install; merge 'gpu' and 'cudf' test suite ( #5814 )
2020-06-19 16:42:57 +08:00
Philip Hyunsu Cho
a67bc64819
Add an option to run brute-force test for JSON round-trip ( #5804 )
...
* Add an option to run brute-force test for JSON round-trip
* Apply reviewer's feedback
* Remove unneeded objects
* Parallel run.
* Max.
* Use signed 64-bit loop var, to support MSVC
* Add exhaustive test to CI
* Run JSON test in Win build worker
* Revert "Run JSON test in Win build worker"
This reverts commit c97b2c7dda37b3585b445d36961605b79552ca89.
* Revert "Add exhaustive test to CI"
This reverts commit c149c2ce9971a07a7289f9b9bc247818afd5a667.
Co-authored-by: fis <jm.yuan@outlook.com >
2020-06-17 23:46:02 -07:00
Rory Mitchell
abdf894fcf
Add cupy to Windows CI ( #5797 )
...
* Add cupy to Windows CI
* Update Jenkinsfile-win64
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
* Update Jenkinsfile-win64
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
* Update tests/python-gpu/test_gpu_prediction.py
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-06-17 21:55:09 -07:00
Jiaming Yuan
38ee514787
Implement fast number serialization routines. ( #5772 )
...
* Implement ryu algorithm.
* Implement integer printing.
* Full coverage roundtrip test.
2020-06-17 12:39:23 +08:00
fis
7c3a168ffd
Revert "Accept string for ArrayInterface constructor."
...
This reverts commit e8ecafb8dc .
2020-06-16 20:02:35 +08:00
fis
e8ecafb8dc
Accept string for ArrayInterface constructor.
2020-06-16 20:00:24 +08:00
Rory Mitchell
b47b5ac771
Use hypothesis ( #5759 )
...
* Use hypothesis
* Allow int64 array interface for groups
* Add packages to Windows CI
* Add to travis
* Make sure device index is set correctly
* Fix dask-cudf test
* appveyor
2020-06-16 12:45:59 +12:00
Ram Rachum
02884b08aa
Fix exception causes all over the codebase ( #5787 )
2020-06-15 21:06:07 +08:00
Alex
ae18a094b0
Add new skl model attribute for number of features ( #5780 )
2020-06-15 18:01:59 +08:00
James Lamb
d39da42e69
[R] Remove dependency on gendef for Visual Studio builds ( fixes #5608 ) ( #5764 )
...
* [R-package] Remove dependency on gendef for Visual Studio builds (fixes #5608 )
* clarify docs
* removed debugging print statement
* Make R CMake install more robust
* Fix doc format; add ToC
* Update build.rst
* Fix AppVeyor
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-06-15 00:20:44 +00:00
Jiaming Yuan
529b5c2cfd
[DOC] Mention dask blog post in doc. [skip ci] ( #5789 )
2020-06-14 13:00:19 +08:00
anttisaukko
1bcbe1fc14
Bump com.esotericsoftware to 4.0.2 ( #5690 )
...
Co-authored-by: Antti Saukko <antti.saukko@verizonmedia.com >
2020-06-13 21:06:14 -07:00
Jiaming Yuan
1fa84b61c1
Implement Empty method for host device vector. ( #5781 )
...
* Fix accessing nullptr.
2020-06-13 19:02:26 +08:00
Jiaming Yuan
306e38ff31
Avoid including c_api.h in header files. ( #5782 )
2020-06-12 16:24:24 +08:00
Jiaming Yuan
3028fa6b42
Implement weighted sketching for adapter. ( #5760 )
...
* Bounded memory tests.
* Fixed memory estimation.
2020-06-12 06:20:39 +08:00
James Lamb
c35be9dc40
[R] replace uses of T and F with TRUE and FALSE ( #5778 )
...
* [R-package] replace uses of T and F with TRUE and FALSE
* enable linting
* Remove skip
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-06-11 06:08:02 -04:00
Elliot Hershberg
cb7f7e542c
Added conda environment file for building docs ( #5773 )
2020-06-11 16:51:24 +08:00
James Lamb
c96e1ef283
[python-package] remove unused imports ( #5776 )
2020-06-11 16:50:27 +08:00
Philip Hyunsu Cho
1d22a9be1c
Revert "Reorder includes. ( #5749 )" ( #5771 )
...
This reverts commit d3a0efbf16 .
2020-06-09 10:29:28 -07:00
Philip Hyunsu Cho
d087a12b04
Add release note for 1.1.0 in NEWS.md ( #5763 )
...
* Add release note for 1.1.0 in NEWS.md
* Address reviewer's feedback
2020-06-08 14:16:10 -07:00
Philip Hyunsu Cho
b5ab009c19
Document addition of new committer @SmirnovEgorRu ( #5762 )
2020-06-07 22:57:49 -07:00
Jiaming Yuan
cacff9232a
Remove column major specialization. ( #5755 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-06-05 16:19:14 +08:00
Philip Hyunsu Cho
8fe7f5dc43
[CI] Pass new cpplint check ( #141 )
2020-06-05 00:45:53 -07:00
Jiaming Yuan
bd9d57f579
Add helper for generating batches of data. ( #5756 )
...
* Add helper for generating batches of data.
* VC keyword clash.
* Another clash.
2020-06-05 09:53:56 +08:00
Rory Mitchell
359023c0fa
Speed up python test ( #5752 )
...
* Speed up tests
* Prevent DeviceQuantileDMatrix initialisation with numpy
* Use joblib.memory
* Use RandomState
2020-06-05 11:39:24 +12:00
Jiaming Yuan
cfc23c6a6b
Remove max.depth in R gblinear example. ( #5753 )
2020-06-04 02:59:22 +08:00
Jiaming Yuan
d3a0efbf16
Reorder includes. ( #5749 )
...
* Reorder includes.
* R.
2020-06-03 17:30:47 +12:00
ShvetsKS
cd3d14ad0e
Add float32 histogram ( #5624 )
...
* new single_precision_histogram param was added.
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com >
Co-authored-by: fis <jm.yuan@outlook.com >
2020-06-03 11:24:53 +08:00
Jiaming Yuan
e49607af19
Add Python binding for rabit ops. ( #5743 )
2020-06-02 19:47:23 +08:00
Jiaming Yuan
e533908922
Expose device sketching in header. ( #5747 )
2020-06-02 13:02:53 +08:00
Peter Jung
0be0e6fd88
Add pkgconfig to cmake ( #5744 )
...
* Add pkgconfig to cmake
* Move xgboost.pc.in to cmake/
Co-authored-by: Peter Jung <peter.jung@heureka.cz >
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-06-01 18:22:33 -07:00
Philip Hyunsu Cho
b77e3e3fcc
[CI] Remove CUDA 9.0 from CI ( #5745 )
2020-06-01 18:15:45 -07:00
Jiaming Yuan
325156c7a9
Bump version in header. ( #5742 )
2020-06-01 18:21:18 +08:00
Jiaming Yuan
d19cec70f1
Don't use mask in array interface. ( #5730 )
2020-06-01 12:17:24 +08:00
Peter Jung
267c1ed784
Add swift package reference ( #5728 )
...
Co-authored-by: Peter Jung <peter.jung@heureka.cz >
2020-06-01 15:29:23 +12:00
Philip Hyunsu Cho
073b625bde
Bump version to 1.2.0 snapshot in master ( #5733 )
2020-05-31 00:11:34 -07:00
Jiaming Yuan
9e1b29944e
Fix loading old model. ( #5724 )
...
* Add test.
2020-05-31 14:55:32 +08:00
ShvetsKS
057c762ecd
Fix release degradation ( #5720 )
...
* fix release degradation, related to 5666
* less resizes
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com >
2020-05-31 04:37:54 +03:00
Peter Jung
251dc8a663
Allow pass fmap to importance plot ( #5719 )
...
Co-authored-by: Peter Jung <peter.jung@heureka.cz >
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
2020-05-29 19:55:35 +08:00
Rory Mitchell
f779980f7e
gpu_hist performance tweaks ( #5707 )
...
* Remove device vectors
* Remove allreduce synchronize
* Remove double buffer
2020-05-29 16:48:53 +12:00
Philip Hyunsu Cho
ca0d605b34
[Doc] Fix typos in AFT tutorial ( #5716 )
2020-05-28 14:04:34 -07:00
Jiaming Yuan
35e2205256
[dask] Return GPU Series when input is from cuDF. ( #5710 )
...
* Refactor predict function.
2020-05-28 17:51:20 +08:00
Philip Hyunsu Cho
91c646392d
Require Python 3.6+; drop Python 3.5 from CI ( #5715 )
2020-05-27 16:19:30 -07:00
Philip Hyunsu Cho
fdbb6ae856
Require CUDA 10.0+ in CMake build ( #5718 )
2020-05-27 16:18:18 -07:00
Jiaming Yuan
75a0025a3d
[CI] Remove CUDA 9.0 from Windows CI. ( #5674 )
...
* Remove CUDA 9.0 on Windows CI.
* Require cuda10 tag, to differentiate
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-05-27 12:23:36 -07:00
Dmitry Mottl
78b4e95f25
Changed build.rst (binary wheels are supported for macOS also) ( #5711 )
2020-05-27 07:18:45 -07:00
Philip Hyunsu Cho
e3aa7f1441
Define _CRT_SECURE_NO_WARNINGS to remove unneeded warnings in MSVC ( #5434 )
2020-05-25 22:46:07 -07:00
Jiaming Yuan
f145241593
Let XGBoostError inherit ValueError. ( #5696 )
2020-05-26 08:34:56 +08:00
Jiaming Yuan
8438c7d0e4
Fix IsDense. ( #5702 )
2020-05-26 08:24:37 +08:00
Philip Hyunsu Cho
e35ad8a074
[R] Fix duplicated libomp.dylib error on Mac OSX ( #5701 )
2020-05-24 23:37:33 -07:00
Jiaming Yuan
1ba24a7597
Remove redundant sketching. ( #5700 )
2020-05-24 08:47:20 +08:00
James Lamb
f656ef2fed
[R-package] Reduce duplication in configure.ac ( #5693 )
...
* updated configure
2020-05-22 12:15:22 +08:00
Jiaming Yuan
5af8161a1a
Implement Python data handler. ( #5689 )
...
* Define data handlers for DMatrix.
* Throw ValueError in scikit learn interface.
2020-05-22 11:53:55 +08:00
Andy Adinets
646def51e0
C++14 for xgboost ( #5664 )
2020-05-21 12:26:40 +12:00
Lorenz Walthert
60511a3222
Document more objective parameters in R package ( #5682 )
2020-05-20 14:00:55 +08:00
ShvetsKS
dd01e4ba8d
Distributed optimizations for 'hist' method with CPUs ( #5557 )
...
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com >
2020-05-20 06:03:03 +03:00
Rong Ou
e21a608552
add pointers to the gpu external memory paper ( #5684 )
2020-05-19 19:46:16 -07:00
Jiaming Yuan
7903286961
Remove silent from R demos. ( #5675 )
...
* Remove silent from R demos.
* Vignettes.
2020-05-19 18:20:46 +08:00
Jiaming Yuan
a6008d5d93
Add RABIT_DLL tag to definitions of rabit APIs. ( #140 )
...
* Add RABIT_DLL tag to definitions of rabit APIs.
* Fix Travis tests.
2020-05-19 18:20:31 +08:00
Jiaming Yuan
dd9aeb60ae
[JVM Packages] Catch dmlc error by ref. ( #5678 )
2020-05-19 13:00:12 +08:00
LionOrCatThatIsTheQuestion
83981a9ce3
Pseudo-huber loss metric added ( #5647 )
...
- Add pseudo huber loss objective.
- Add pseudo huber loss metric.
Co-authored-by: Reetz <s02reetz@iavgroup.local >
2020-05-18 21:08:07 +08:00
Jiaming Yuan
535479e69f
Add JSON schema to model dump. ( #5660 )
2020-05-15 10:18:43 +08:00
Jiaming Yuan
2c1a439869
Update Python demos with tests. ( #5651 )
...
* Remove GPU memory usage demo.
* Add tests for demos.
* Remove `silent`.
* Remove shebang as it's not portable.
2020-05-12 12:04:42 +08:00
Oleksandr Kuvshynov
4e64e2ef8e
skip missing lookup if nothing is missing in CPU hist partition kernel. ( #5644 )
...
* [xgboost] skip missing lookup if nothing is missing
2020-05-12 05:50:08 +03:00
Jiaming Yuan
9ad40901a8
Upgrade to CUDA 10.0 ( #5649 ) ( #5652 )
...
Co-authored-by: fis <jm.yuan@outlook.com >
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
2020-05-11 22:27:36 +08:00
Rory Mitchell
fcf57823b6
Reduce device synchronisation ( #5631 )
...
* Reduce device synchronisation
* Initialise pinned memory
2020-05-07 21:19:46 +12:00
Rory Mitchell
9910265064
Resolve vector<bool>::iterator crash ( #5642 )
2020-05-07 21:18:01 +12:00
Jiaming Yuan
21ed1f0c6d
Support 64bit seed. ( #5643 )
2020-05-07 14:52:38 +08:00
Jiaming Yuan
eaf2a00b5c
Enhance nvtx support. ( #5636 )
2020-05-06 22:54:24 +08:00
Jiaming Yuan
67d267f9da
Move device dmatrix construction code into ellpack. ( #5623 )
2020-05-06 19:43:59 +08:00
Jiaming Yuan
33e052b1e5
Remove dead code. ( #5635 )
2020-05-06 17:03:48 +08:00
Philip Hyunsu Cho
8de7f1928e
Fix build on big endian CPUs ( #5617 )
...
* Fix build on big endian CPUs
* Clang-tidy
2020-04-29 21:56:34 -07:00
Rory Mitchell
b9649e7b8e
Refactor gpu_hist split evaluation ( #5610 )
...
* Refactor
* Rewrite evaluate splits
* Add more tests
2020-04-30 08:58:12 +12:00
Yuan Tang
dfcdfabf1f
Move dask tutorial closer other distributed tutorials ( #5613 )
2020-04-28 02:24:00 +08:00
Jiaming Yuan
c90457f489
Refactor the CLI. ( #5574 )
...
* Enable parameter validation.
* Enable JSON.
* Catch `dmlc::Error`.
* Show help message.
2020-04-26 10:56:33 +08:00
Jiaming Yuan
7d93932423
Better message when no GPU is found. ( #5594 )
2020-04-26 10:00:57 +08:00
Jason E. Aten, Ph.D
8dfe7b3686
Clarify meaning of training parameter in XGBoosterPredict() ( #5604 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu >
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com >
2020-04-25 16:48:42 -07:00
Philip Hyunsu Cho
4fd95272c8
Instruct Mac users to install libomp ( #5606 )
2020-04-25 15:50:30 -07:00
Philip Hyunsu Cho
474cfddf91
[R] Address warnings to comply with CRAN submission policy ( #5600 )
...
* [R] Address warnings to comply with CRAN submission policy
* Include <xgboost/logging.h>
2020-04-25 13:34:36 -07:00
Philip Hyunsu Cho
a23de1c108
[CI] Grant public read access to Mac OSX wheels ( #5602 )
2020-04-25 11:51:26 -07:00
Philip Hyunsu Cho
f68155de6c
Fix compilation on Mac OSX High Sierra (10.13) ( #5597 )
...
* Fix compilation on Mac OSX High Sierra
* [CI] Build Mac OSX binary wheel using Travis CI
2020-04-25 10:53:03 -07:00
Jiaming Yuan
e726dd9902
Set device in device dmatrix. ( #5596 )
2020-04-25 13:42:53 +08:00
Philip Hyunsu Cho
4fb34a008d
Use 'default' visibility for C symbols ( #138 )
2020-04-23 20:48:52 -07:00
Philip Hyunsu Cho
2f7fcff4d7
Fix build on FreeBSD ( #133 )
2020-01-27 12:15:32 -08:00
Nan Zhu
6e563951af
fix hanging trainings ( #132 )
...
* fix hanging connections
* remove logging
2020-01-27 09:12:02 -08:00
Chen Qin
0d6a853212
fix xgboost build failure introduced by allgather interface ( #129 )
...
* fix missing allgether rabit declaration
* fix allgather signature mismatch
* fix type conversion
* fix GetRingPrevRank
2020-01-01 22:45:14 +08:00
Chen Qin
493ad834a1
allow duplicated bootstrap allreduce overwrite previous results ( #128 )
...
* allow timeout to 0 to eanble immediate exit
* disable duplicated signature check, overwrite results with same key
2019-11-13 10:19:58 +08:00
nateagr
1907b25cd0
Expose RabitAllGatherRing and RabitGetRingPrevRank ( #113 )
...
* add unittests
* Expose RabitAllGatherRing and RabitGetRingPrevRank
* Enabled TCP_NODELAY to decrease latency
2019-11-12 19:55:32 +08:00
Jiaming Yuan
90e2239372
Fix cmake variable. ( #126 )
2019-11-05 01:27:08 -05:00
Chen Qin
2f25347168
allow timeout to 0 to eanble immediate exit ( #125 )
2019-10-22 14:38:55 -07:00
Chen Qin
d22e0809a8
throw dmlc::Error ( #120 )
...
* throw dmlc::Error handled by xgboost jni
2019-10-16 13:12:15 -04:00
Philip Hyunsu Cho
33dbc10aab
Fix compilation failure on Windows ( #119 )
...
* Fix compilation failure on Windows
* Fix lint
2019-10-15 23:37:42 +07:00
Chen Qin
8e2c201d23
fix assert timeout_sec ( #117 )
2019-10-14 04:44:26 -04:00
Jiaming Yuan
ed9328ceae
Fix lint. ( #115 )
2019-10-13 07:38:29 -04:00
Jiaming Yuan
6dab74689c
Add SeekEnd to MemoryFixSizeBuffer. ( #109 )
...
* Don't assert buffer size.
2019-10-13 00:09:25 -04:00
Chen Qin
5d1b613910
exit when allreduce/broadcast error cause timeout ( #112 )
...
* keep async timeout task
* add missing pthread to cmake
* add tests
* Add a sleep period to avoid flushing the tracker.
2019-10-11 03:39:39 -04:00
Chen Qin
af7281afe3
unittests mock, cleanup ( #111 )
...
* cleanup, fix issue involved after remove is_bootstrap parameter
* misc
* clean
* add unittests
2019-10-01 13:36:11 -07:00
Chen Qin
ddcc2d85da
Clean up cmake script and code includes ( #106 )
...
* Clean up CMake scripts and related include paths.
* Add unittests.
2019-09-26 02:29:04 -04:00
Xu Xiao
e92641887b
remove unreached code of AllreduceRobust::CheckAndRecover ( #108 )
2019-09-18 23:06:59 -04:00
Jiaming Yuan
d4ce6807c7
Don't use _builtin_FUNCTION. ( #107 )
2019-09-18 12:05:23 -04:00
Chen Qin
9a7ac85d7e
remove is_bootstrap parameter ( #102 )
...
* apply openmp simd
* clean __buildin detection, moving windows build check from xgboost project, add openmp support for vectorize reduce
* apply openmp only to rabit
* orgnize rabit signature
* remove is_bootstrap, use load_checkpoint as implict flag
* visual studio don't support latest openmp
* orgnize omp declarations
* replace memory copy with vector cast
* Revert "replace memory copy with vector cast"
This reverts commit 28de4792dcdff40d83d458510d23b7ef0b191d79.
* Revert "orgnize omp declarations"
This reverts commit 31341233d31ce93ccf34d700262b1f3f6690bbfe.
* remove openmp settings, merge into a upcoming pr
* mis
* per feedback, update comments
2019-09-10 11:45:50 -07:00
Chen Qin
5797dcb64e
support bootstrap allreduce/broadcast ( #98 )
...
* support run rabit tests as xgboost subproject using xgboost/dmlc-core
* support tracker config set/get
* remove redudant printf
* remove redudant printf
* add c++0x declaration
* log allreduce/broadcast caller, engine should track caller stack for
investigation
* tracker support binary config format
* Revert "tracker support binary config format"
This reverts commit 2a28e5e2b55c200cb621af8d19f17ab1bc62503b.
* remove caller, prototype fetch allreduce/broadcast results from resbuf
* store cached allreduce/broadcast seq_no to tracker
* allow restore all caches from other nodes
* try new rabit collective cache, todo: recv_link seems down
* link up cache restore with main recovery
* cleanup load cache state
* update cache api
* pass test.mk
* have a working tests
* try to unify check into actionsummary
* more logging to debug distributed hist three method issue
* update rabit interface to support caller signature matching
* splite seq_counter from cur_cache_seq to different variables
* still see issue with inf loop
* support debug print caller as well as allreduce op
* cleanup
* remove get/set cache from model_recover, adding recover in
loadcheckpoint
* clarify rabit cache strategy, cache is set only by successful collective
call involving all nodes with unique cache key. if all nodes call
getcache at same time, we keep rabit run collective call. If some nodes
call getcache while others not, we backfill cache from those nodes with
most entries
* revert caller logs
* fix lint error
* fix engine mpi signature
* support getcache by ref
* allow result buffer presiet to filestream
* add loging
* try fix checkpoint failure recovery case
* use int64_t to avoid overflow caused seq fault
* try avoid int overflow
* try fix checkpoint failure recovery case
* try avoid seqno overflow to negative by offseting specifial flag value
adding cache seq no to checkpoint/load checkpoint/check point ack to avoid
confusion from cache recovery
* fix cache seq assert error
* remove loging, handle edge case
* add extensive log to checkpoint state with different seq no
* fix lint errors
* clean up comments before merge back to master
* add logs to allreduce/broadcast/checkpoint
* use unsinged int 32 and give seq no larger range
* address remove allreduce dropseq code segment
* using caller signature to filter bootstrapallreduces
* remove get/set cache from empty
* apply signature to reducer
* apply signature to broadcast
* add key to broadcat log
* fix broadcast signature
* fix default _line value for non linux system
* adding comments, remove sleep(1)
* fix osx build issue
* try fix mpi
* fix doc
* fix engine_empty api
* logging, adding more logs, restore immutable assertion
* print unsinged int with ud
* fix lint
* rename seqtype to kSeq and KCache indicating it's usage
apply kDiffSeq check to load_cache routine
* comment allreduce/broadcast log
* allow tests run on arm
* enable flag to turn on / off cache
* add log info alert if user choose to enable rabit bootstrap cache
* add rabit_debug setting so user can use config to turn on
* log flags when user turn on rabit_debug
* force rabit restart if tracker assign -1 rank
* use OPENMP to vecotrize reducer
* address comment
* Revert "address comment"
This reverts commit 1dc61f33e7357dad8fa65528abeb81db92c5f9ed.
* fix checkpoint size print 0
* per feedback, remove DISABLEOPEMP, address race condition
* - remove openmp from this pr
- update name from cache to boostrapcache
* add default value of signature macros
* remove openmp from cmake file
* Update src/allreduce_robust.cc
Co-Authored-By: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
* Update src/allreduce_robust.cc
Co-Authored-By: Philip Hyunsu Cho <chohyu01@cs.washington.edu >
* run test with cmake
* remove openmp
* fix cmake based tests
* use cmake test fix darwin .dylib issue
* move around rabit_signature definition due to windows build
* misc, add c++ check in CMakeFile
* per feedback
* resolve CMake file
* update rabit version
2019-08-27 18:12:33 -07:00
Nan Zhu
dba32d54d1
shutdown for multiple times ( #99 )
2019-07-16 12:41:39 -07:00
Nan Zhu
65b718a5e7
return values in Init and Finalize ( #96 )
...
* make inti function return values
* address the comments
2019-06-25 20:05:54 -07:00
Nan Zhu
fc85f776f4
allow not stop process in error ( #97 )
...
* allow not stop process in error
* fix merge error
2019-06-25 13:04:39 -07:00
Nan Zhu
a429748e24
allow multi call on init ( #92 )
2019-04-26 18:41:02 -07:00
Chen Qin
5c3b36f346
Allow using external dmlc-core ( #91 )
...
* Set `RABIT_BUILD_DMLC=1` if use dmlc-core in rabit
* remove dmlc-core
2019-04-26 15:28:45 +08:00
Chen Qin
e3d51d3e62
[rabit harden] Enable all tests ( #90 )
...
* include osx in tests
* address `time_wait` on port assignment
* increase submit attempts.
* cleanup tests
2019-04-24 19:12:11 +08:00
Chen Qin
ecd4bf7aae
[rabit harden] replace hardcopy dmlc-core headers with submodule links ( #86 )
...
* backport dmlc header changes to rabit
* use gitmodule to reference latest dmlc header files
* include ref to dmlc-core
fix cmake
* update cmake file, add cmake build traivs task
* try force using g++-4.8
* per feedback, update cmake
2019-03-23 13:11:29 +08:00
Chen Qin
785d7e54d3
[mpi] add engine_mpi travis build ( #83 )
2019-03-15 22:58:47 +08:00
Chen Qin
ed06e0c6af
[rabit harden] fix rabit tests ( #81 )
...
* enable model recovery tests
* force use gcc4.8 in Travis
2019-03-15 07:16:45 +08:00
Jiaming Yuan
1cc34f01db
Fix ssize_t definition. ( #80 )
...
* Fix linter.
2019-02-18 19:25:08 +08:00
Jiaming Yuan
0101a4719c
Remove dmlc logging. ( #78 )
...
* Remove dmlc logging header.
* Fix lint.
2019-02-16 18:37:54 -08:00
Jiaming Yuan
05941a5f96
Try fixing mingw build error when using CMake. ( #77 )
...
* Try fixing mingw build error when using CMake.
* Check __MINGW32__ .
* Fix linter.
2019-02-16 22:35:43 +08:00
Chen Qin
eb2590b774
workaround macosx java test race condition ( #74 )
...
* fix error in dmlc#57, clean up comments and naming
* include missing packages, disable recovery tests for now
* disable local_recover tests until we have a bug fix
* support larger cluster
* fix lint, merge with master
* fix mac osx test failure in https://github.com/dmlc/xgboost/pull/3818
* Update allreduce_robust.cc
2018-10-26 12:39:31 -07:00
Chen Qin
3a35dabfae
support larger cluster ( #73 )
...
* fix error in dmlc#57, clean up comments and naming
* include missing packages, disable recovery tests for now
* disable local_recover tests until we have a bug fix
* support larger cluster
* fix lint, merge with master
2018-10-22 10:13:45 -07:00
Chen Qin
69cdfae22f
disable travis model_recover tests, fix doc generate failure ( #71 )
...
* add missing packackges used in dmlc submit
* disable local_recovery tests til we have code fix
* fix doc gen failure
2018-10-19 18:18:16 -07:00
Chen Qin
785bde6f87
add missing packackges used in dmlc submit ( #70 )
2018-10-19 13:04:33 -07:00
Ruifeng Zheng
edc403fb2c
init ( #60 )
2018-07-04 12:31:24 -07:00
Philip Hyunsu Cho
87143deb4c
Don't define DMLC_LOG_STACK_TRACE on Solaris ( #59 )
...
DMLC_LOG_STACK_TRACE involves use of non-standard header execinfo.h, which
causes compilation failure on Solaris.
2018-06-15 22:33:46 -07:00
trivialfis
fc5072b100
Fix building shared library. ( #58 )
2018-05-24 09:05:37 -07:00
Will Storey
7bc46b8c75
Allow compiling with -Werror=strict-prototypes ( #56 )
...
Without this, with gcc 7.3.0, we see things like:
/xgboost/include/xgboost/c_api.h:98:1: error: function
declaration isn't a prototype [-Werror=strict-prototypes]
XGB_DLL const char *XGBGetLastError();
^~~~~~~
2018-03-18 22:21:35 -07:00
Dennis O'Brien
440e81db0b
Fixed print statements and xrange to be compatibile with Python 2 and 3. ( #55 )
2018-02-26 12:19:04 -08:00
David Hirvonen
0759d5ed2b
add cmake w/ relocatable pkgconfig installation ( #53 )
2018-01-07 14:49:39 -08:00
snehlatamohite
2eb1a1a371
Use -msse2 flag depending upon architecure while compiling the rabit code ( #49 )
2017-09-01 08:42:45 -07:00
Qiang Kou (KK)
41c96a25a9
To compile on ARM CPU ( #46 )
2017-07-12 20:24:19 -07:00
Artem Krylysov
0b406754fa
Fix C API header compatibility with C compilers ( #44 )
2017-06-01 09:21:48 -07:00
Ziyue Huang
ab5f203b44
fix error: ‘nullptr’ was not declared in this scope ( #43 )
2017-04-23 10:44:11 -07:00
tqchen
a1acf23b60
only doc rabit
2017-03-17 22:09:13 -07:00
tqchen
a764d45cfb
sync dmlc headers
2017-03-16 10:16:23 -07:00
AbdealiJK
21b5e12913
allreduce_robust.cc: Allow num_global_replica to be 0 ( #38 )
...
In some cases, users may not want to have any global replica of
the data being broadcasted/all-reduced. In such cases, set the
result_buffer_round to -1 as a flag that this is not necessary
and check for it.
2016-11-23 19:34:11 -08:00
Tianqi Chen
032152ad24
Update .travis.yml
2016-11-23 10:14:32 -08:00
kabu4i
af1b7d6e7a
Applied FreeBSD support ( #37 )
2016-11-15 21:10:51 -08:00
tqchen
a9a2a69dc1
Merge branch 'master' of ssh://github.com/tqchen/rabit
2016-08-26 15:06:03 -07:00
tqchen
cd1db1afaa
sync dmlc header
2016-08-26 15:05:42 -07:00
tomlaube
1007a26641
Fixing the imports to work with MPI ( #30 )
2016-08-26 15:04:41 -07:00
elferdo
7e15fdd9c6
FreeBSD does not have fopen64 (as of 10.3). Detect it and replace with ( #29 )
...
fopen.
2016-08-20 08:35:01 -07:00
Tianqi Chen
2dd7476ad7
Merge pull request #28 from randomjohnnyh/master
...
Use getaddrinfo instead of gethostbyname for thread safety
2016-07-27 10:40:24 -07:00
Johnny Ho
9d235c31a7
Use getaddrinfo instead of gethostbyname for thread safety
2016-07-27 02:35:02 -04:00
Tianqi Chen
8f61535b83
Update README.md
2016-05-10 20:14:53 -07:00
Tianqi Chen
b8aec1730c
Update README.md
2016-05-10 20:14:29 -07:00
tqchen
e19fced5cb
[FIX] rabit on single node
2016-05-10 20:05:59 -07:00
tqchen
849b20b7c8
add distributed checking
2016-04-11 15:43:01 -07:00
tqchen
be50e7b632
Make rabit library thread local
2016-03-01 20:12:51 -08:00
tqchen
aeb4008606
remove connect msg
2016-02-29 16:27:48 -08:00
tqchen
1392e9f3da
fix travis
2016-02-29 15:51:36 -08:00
tqchen
225f5258c7
[DMLC] Add dep to dmlc logging
2016-02-29 14:59:44 -08:00
tqchen
56ec4263f9
fix type
2016-02-28 13:19:54 -08:00
tqchen
e3188afbe8
fix
2016-02-28 13:09:18 -08:00
tqchen
c7d53aecc3
add link tag
2016-02-28 09:44:11 -08:00
tqchen
26c87ec6e7
fix test
2016-02-28 09:35:08 -08:00
tqchen
f0f07ecd22
fix
2016-02-27 20:51:00 -08:00
tqchen
e814dc8a4b
Fix docstring
2016-02-27 18:13:42 -08:00
tqchen
d45fca0298
fix build
2016-02-27 18:10:58 -08:00
tqchen
7479791f6a
refactor: librabit
2016-02-27 10:14:41 -08:00
tqchen
73b6e9bbd0
[TRACKER] remove tracker in rabit, use DMLC
2016-02-27 09:07:40 -08:00
tqchen
112d866dc9
[RABIT] fix rabit in local mode
2016-01-12 21:34:26 -08:00
tqchen
05b958c178
[RABIT] Sync with dmlc
2016-01-09 21:43:29 -08:00
Tianqi Chen
bed63208af
Merge pull request #26 from DrAndrey/master
...
Fix bug with name of sleep function
2015-11-18 09:58:21 -08:00
Andrey
291ab05023
Remove redundant whitespace again
2015-11-18 10:21:03 +03:00
Andrey
de251635b1
Remove redundant whitespace
2015-11-18 00:53:53 +03:00
Andrey
3a6be65a20
Fix bug with name of sleep function
2015-11-17 21:45:52 +03:00
Tianqi Chen
e81a11dd7e
Merge pull request #25 from daiyl0320/master
...
add retry mechanism to ConnectTracker and modify Listen backlog to 128 in rabit_traker.py
2015-10-20 19:34:01 -07:00
yonglong.dyl
35c3b371ea
add retry mechanism to ConnectTracker and modify Listen backlog to 128
...
in rabit_traker.py
2015-10-21 10:24:07 +08:00
tqchen
c71ed6fccb
try deply doxygen
2015-08-22 21:37:14 -07:00
tqchen
62e5647a33
try deply doxygen
2015-08-22 21:33:23 -07:00
tqchen
732f1c634c
try
2015-08-21 08:40:55 -07:00
tqchen
2fa6e0245a
ok
2015-08-21 08:08:32 -07:00
tqchen
053766503c
minor
2015-08-21 08:07:25 -07:00
tqchen
7b59dcb8b8
minor
2015-08-21 07:59:06 -07:00
tqchen
5934950ce2
new doc
2015-08-02 17:54:46 -07:00
tqchen
f5381871a3
ok
2015-08-01 21:40:05 -07:00
tqchen
44b60490f4
new doc
2015-08-01 21:36:09 -07:00
tqchen
387339bf17
add more
2015-07-30 18:16:15 -07:00
tqchen
9d4397aa4a
chg
2015-07-30 17:59:16 -07:00
tqchen
2879a4853b
chg
2015-07-30 17:58:42 -07:00
tqchen
30e3110170
ok
2015-07-28 23:18:15 -07:00
tqchen
9ff0301515
add link translation
2015-07-28 23:16:48 -07:00
tqchen
6b629c2e81
k
2015-07-27 18:41:17 -07:00
tqchen
32e19558e6
ok
2015-07-27 18:38:22 -07:00
tqchen
8f4839d1d9
fix
2015-07-27 18:34:43 -07:00
tqchen
93137b2e52
ok
2015-07-27 18:34:07 -07:00
tqchen
7eeeb79599
reload recommonmark
2015-07-27 18:33:19 -07:00
tqchen
a8f00cc4a5
minor
2015-07-27 18:16:03 -07:00
tqchen
19b0f019c7
ok
2015-07-27 18:14:01 -07:00
tqchen
dd011849b7
minor
2015-07-27 17:59:22 -07:00
tqchen
c1cdc194e9
minor
2015-07-27 17:50:02 -07:00
tqchen
fcf0f4351a
try rst
2015-07-27 17:47:28 -07:00
tqchen
cbc21ae531
try
2015-07-27 17:46:08 -07:00
tqchen
62ddfa7709
tiny
2015-07-26 21:13:35 -07:00
tqchen
aefc05cb91
final change
2015-07-26 21:09:58 -07:00
tqchen
2aee9b4959
minor
2015-07-26 20:57:48 -07:00
tqchen
fe4e7c2b96
ok
2015-07-26 20:56:54 -07:00
tqchen
800198349f
change to subtitle
2015-07-26 20:54:10 -07:00
tqchen
5ca33e48ea
ok
2015-07-26 20:52:52 -07:00
tqchen
88f7d24de9
update guide
2015-07-26 20:52:34 -07:00
tqchen
29d43ab52f
add code
2015-07-26 14:57:24 -07:00
tqchen
fe8bb3b60e
minor hack for readthedocs
2015-07-26 14:47:40 -07:00
tqchen
229c71d9b5
Merge branch 'master' of ssh://github.com/dmlc/rabit
2015-07-26 14:46:24 -07:00
tqchen
7424218392
ok
2015-07-26 14:46:16 -07:00
Tianqi Chen
d1d45bbdae
Update README.md
2015-07-26 14:43:08 -07:00
Tianqi Chen
1e8813f3bd
Update README.md
2015-07-26 14:42:57 -07:00
Tianqi Chen
1ccc9903a1
Update README.md
2015-07-26 14:41:25 -07:00
tqchen
0323e0670e
remove readme
2015-07-26 14:22:50 -07:00
tqchen
679a835d38
remove theme
2015-07-26 14:14:19 -07:00
tqchen
7ea5b7c209
remove numpydoc to napoleon
2015-07-26 14:02:43 -07:00
tqchen
b73e2be55e
Merge branch 'master' of ssh://github.com/dmlc/rabit
...
Conflicts:
doc/python-requirements.txt
2015-07-26 13:52:31 -07:00
tqchen
174228356d
ok
2015-07-26 13:51:56 -07:00
Tianqi Chen
1838e25b8a
Update python-requirements.txt
2015-07-26 13:05:52 -07:00
tqchen
bc4e957c39
ok
2015-07-26 13:00:18 -07:00
tqchen
fba6fc208c
ok
2015-07-26 12:54:21 -07:00
tqchen
025110185e
ok
2015-07-26 12:52:37 -07:00
tqchen
d50b905824
ok
2015-07-26 12:46:19 -07:00
tqchen
d4f2509178
ok
2015-07-26 12:43:49 -07:00
tqchen
cdf401a77c
ok
2015-07-26 12:40:21 -07:00
tqchen
fef0ef26f1
new doc
2015-07-26 12:29:18 -07:00
tqchen
cef360d782
ok
2015-07-26 12:15:00 -07:00
tqchen
c125d2a8bb
ok
2015-07-26 12:14:54 -07:00
tqchen
270a49ee75
add requirments
2015-07-23 22:22:52 -07:00
tqchen
744f9015bb
get the basic doc
2015-07-23 22:14:42 -07:00
tqchen
1cb5cad50c
Merge branch 'master' of ssh://github.com/dmlc/rabit
2015-07-03 17:42:00 -07:00
tqchen
8cc07ba391
minor
2015-07-03 17:41:52 -07:00
Tianqi Chen
d74f126592
Update .travis.yml
2015-07-03 15:35:47 -07:00
Tianqi Chen
52b3dcdf07
Update .travis.yml
2015-07-03 15:33:38 -07:00
Tianqi Chen
099581b591
Update .travis.yml
2015-07-03 15:31:43 -07:00
Tianqi Chen
1258046f14
Update .travis.yml
2015-07-03 15:29:30 -07:00
Tianqi Chen
7addac910b
Update Makefile
2015-07-03 15:23:26 -07:00
Tianqi Chen
0ea7adff92
Update .travis.yml
2015-07-03 15:21:20 -07:00
Tianqi Chen
f858856586
Update travis_script.sh
2015-07-03 15:20:59 -07:00
Tianqi Chen
d8eac4ae27
Update README.md
2015-07-03 15:17:22 -07:00
tqchen
3cc49ad0e8
lint and travis
2015-07-03 15:15:11 -07:00
tqchen
ceedf4ea96
fix
2015-05-28 12:37:06 -07:00
Tianqi Chen
fd8920c71d
fix win32
2015-05-28 12:24:26 -07:00
tqchen
8bbed35736
modify
2015-05-28 10:44:19 -07:00
Tianqi Chen
9520b90c4f
Merge pull request #14 from dmlc/hjk41
...
add kLongLong and kULongLong
2015-05-20 05:38:01 +02:00
Chuntao Hong
df14bb1671
fix type
2015-05-20 11:36:17 +08:00
Chuntao Hong
f441dc7ed8
replace tab with blankspace
2015-05-20 11:33:48 +08:00
Chuntao Hong
2467942886
remove unnecessary include
2015-05-20 11:32:16 +08:00
Chuntao Hong
181ef47053
defined long long and ulonglong
2015-05-20 11:27:50 +08:00
Chuntao Hong
1582180e5b
use int32_t to define int and int64_t to define long. in VC long is 32bit
2015-05-20 10:09:09 +08:00
tqchen
e0b7da0302
fix
2015-05-02 21:47:43 -07:00
tqchen
fa99857467
try fix warning on some platforms
2015-05-01 22:45:11 -07:00
tqchen
24f17df782
ok
2015-04-29 20:23:39 -07:00
tqchen
4fe8d1d66b
ok io
2015-04-29 20:21:37 -07:00
tqchen
a5d77ca08d
checkin new dmlc interface
2015-04-29 20:17:27 -07:00
tqchen
d1d2ab4599
remove at end
2015-04-28 10:49:44 -07:00
tqchen
e1ddcc2eb7
Merge branch 'master' of ssh://github.com/dmlc/rabit
2015-04-27 15:55:58 -07:00
tqchen
6745667eb0
new dmlc io
2015-04-27 15:55:51 -07:00
tqchen
c5b4610cfe
sge scheduler change
2015-04-26 22:08:47 -07:00
tqchen
fed1683b9b
minor
2015-04-25 21:24:38 -07:00
Tianqi Chen
c01520f173
change
2015-04-25 21:23:16 -07:00
tqchen
27340f95e4
final minor
2015-04-25 21:19:42 -07:00
Tianqi Chen
e03eabccda
allow win32
2015-04-25 21:18:36 -07:00
tqchen
82ca10acb6
better handling at msvc
2015-04-25 20:52:07 -07:00
Tianqi Chen
6601939588
Merge pull request #12 from zjf/patch-2
...
Update rabit-inl.h
2015-04-23 23:16:49 -07:00
Jianfeng Zhu
df8f917463
Update rabit-inl.h
...
Fix missing parenthese
2015-04-24 14:09:47 +08:00
tqchen
c60b284e1f
resize during tracker print
2015-04-20 11:37:45 -07:00
tqchen
c67967161e
fix io style
2015-04-19 00:21:38 -07:00
tqchen
f52daf9be1
make timer cross platform
2015-04-19 00:01:48 -07:00
tqchen
7568f75f45
new io interface
2015-04-17 20:35:44 -07:00
tqchen
3bf8661ec1
add std before basic
2015-04-13 13:43:34 -07:00
tqchen
18f4d6c0ba
remove rabit learn
2015-04-11 20:25:52 -07:00
tqchen
bcfbe51e7e
fix dmlc io
2015-04-11 18:16:52 -07:00
tqchen
ad383b084d
ok
2015-04-11 17:55:20 -07:00
tqchen
3b8c04a902
Merge branch 'master' of ssh://github.com/dmlc/rabit
2015-04-11 17:35:11 -07:00
tqchen
9dd97cc141
keepup with dmlc core
2015-04-11 17:35:03 -07:00
Ubuntu
ef13aaf379
ch
2015-04-11 05:29:07 +00:00
tqchen
50a66b3855
fix empty engine
2015-04-09 08:44:33 -07:00
tqchen
e08542c635
fix doc
2015-04-08 15:30:56 -07:00
tqchen
e95c96232a
remove I prefix from interface, serializable now takes in pointer
2015-04-08 15:25:58 -07:00
tqchen
b15f6cd2ac
rabit unifires with dmlc
2015-04-05 09:55:24 -07:00
tqchen
5634ec3008
ok
2015-04-03 22:25:33 -07:00
tqchen
2dd6c2f0c9
Merge branch 'master' of ssh://github.com/dmlc/rabit
2015-03-30 22:18:20 -07:00
tqchen
38d7f999a7
checkin wormhole spliter
2015-03-30 22:18:02 -07:00
Tianqi Chen
8acb96a627
Merge pull request #10 from ryanzz/master
...
fixed a mistake
2015-03-30 08:46:15 -07:00
ryanzz
911a1f0ce2
fixed a mistake
2015-03-30 16:25:36 +08:00
tqchen
732d8c33d1
inteface changing
2015-03-29 22:00:37 -07:00
tqchen
684ea0ad26
inteface changing
2015-03-29 22:00:33 -07:00
tqchen
8cb4c02165
add dmlc support
2015-03-28 22:44:10 -07:00
tqchen
be2ff703bc
allow adapting wormhole
2015-03-27 17:33:51 -07:00
tqchen
16975b447c
try pass on tokens during application submission
2015-03-27 11:04:19 -07:00
tqchen
eb1f4a4003
change auto to ip
2015-03-26 23:26:30 -07:00
tqchen
59e63bc135
minor
2015-03-21 00:38:37 -07:00
tqchen
62330505e1
ok
2015-03-21 00:37:59 -07:00
tqchen
14477f9f5a
add namenode
2015-03-21 00:35:30 -07:00
tqchen
75a6d349c6
add libhdfs opts
2015-03-21 00:26:30 -07:00
tqchen
e3c76bfafb
minmum fix
2015-03-21 00:25:16 -07:00
tqchen
8b3c435241
chg
2015-03-20 15:11:50 -07:00
tqchen
2035799817
test code
2015-03-20 13:02:46 -07:00
tqchen
7751b2b320
add debug
2015-03-15 23:52:16 -07:00
tqchen
769031375a
ok
2015-03-15 23:47:24 -07:00
tqchen
bd346b4844
ok
2015-03-15 23:44:32 -07:00
tqchen
faba1dca6c
add testload
2015-03-15 23:42:18 -07:00
tqchen
6f7783e4f6
add testload
2015-03-15 23:42:17 -07:00
tqchen
e5f034040e
ok
2015-03-15 23:20:30 -07:00
tqchen
3ed9ec808f
chg
2015-03-15 23:19:54 -07:00
tqchen
e552ac401e
ask for more ram in am
2015-03-15 23:14:56 -07:00
tqchen
b2505e3d6f
only stop nm when sucess
2015-03-15 23:02:15 -07:00
tqchen
bc696c9273
add queue info
2015-03-15 22:54:09 -07:00
tqchen
f3e867ed97
add option queue
2015-03-15 22:38:51 -07:00
tqchen
5dc843cff3
refactor fileio
2015-03-14 16:46:54 -07:00
tqchen
cd9c81be91
quick fix
2015-03-14 09:20:04 -07:00
tqchen
1e23af2adc
add virtual destructor to iseekstream
2015-03-14 00:20:37 -07:00
tqchen
f165ffbc95
fix hdfs
2015-03-13 22:59:04 -07:00
tqchen
8cc650847a
allow demo to pass in env
2015-03-13 22:27:36 -07:00
tqchen
fad4d69ee4
ok
2015-03-13 21:38:03 -07:00
tqchen
0fd6197b8b
fix more
2015-03-13 21:36:09 -07:00
tqchen
7423837303
fix more
2015-03-13 21:36:08 -07:00
tqchen
d25de54008
add temporal solution, run_yarn_prog.py
2015-03-13 21:13:19 -07:00
tqchen
e5a9e31d13
final attempt
2015-03-13 00:04:51 -07:00
tqchen
ed3bee84c2
add command back
2015-03-12 22:48:30 -07:00
tqchen
07740003b8
add hdfs to resource
2015-03-12 22:43:41 -07:00
tqchen
9b66e7edf2
fix hadoop
2015-03-12 20:57:49 -07:00
tqchen
6812f14886
ok
2015-03-12 09:44:43 -07:00
tqchen
08e1c16dd2
change hadoop prefix back to hadoop home
2015-03-12 09:06:42 -07:00
Tianqi Chen
d6b68286ee
Update build.sh
2015-03-12 09:03:02 -07:00
tqchen
146e069000
bugfix: logical boundary for ring buffer
2015-03-11 20:28:34 -07:00
tqchen
19cb685c40
ok
2015-03-12 02:59:50 +00:00
tqchen
4cf3c13750
Merge branch 'master' of ssh://github.com/tqchen/rabit
...
Conflicts:
tracker/rabit_tracker.py
2015-03-11 13:35:35 -07:00
tqchen
20daddbeda
add tracker
2015-03-11 13:27:23 -07:00
tqchen
c57dad8b17
add ringbased passing and batch schedule
2015-03-11 12:00:19 -07:00
tqchen
295d8a12f1
update
2015-03-10 15:28:10 -07:00
tqchen
994cb02a66
add sge
2015-03-10 15:26:40 -07:00
tqchen
014c86603d
OK
2015-03-10 10:51:39 -07:00
tqchen
091634b259
fix
2015-03-09 14:56:01 -07:00
tqchen
d558f6f550
redefine distributed means
2015-03-09 14:43:05 -07:00
tqchen
c8efc01367
more complicated yarn script
2015-03-09 14:36:44 -07:00
tqchen
28ca7becbd
add linear readme
2015-03-09 13:12:40 -07:00
tqchen
ca4b20fad1
add linear readme
2015-03-09 13:12:04 -07:00
tqchen
1133628c01
add linear readme
2015-03-09 13:11:17 -07:00
tqchen
6a1167611c
update docs
2015-03-09 13:00:34 -07:00
Tianqi Chen
a607047aa1
Update build.sh
2015-03-08 23:55:42 -07:00
tqchen
2c1cfd8be6
complete yarn
2015-03-08 23:51:42 -07:00
tqchen
4f28e32ebd
change formater
2015-03-08 12:29:07 -07:00
tqchen
2fbda812bc
fix stdin input
2015-03-08 12:22:11 -07:00
tqchen
3258bcf531
checkin yarn master
2015-03-08 11:03:13 -07:00
tqchen
67ebf81e7a
allow setup from env variables
2015-03-07 16:45:31 -08:00
tqchen
9b6bf57e79
fix hdfs
2015-03-07 09:08:21 -08:00
tqchen
395d5c29d5
add make system
2015-03-06 22:30:23 -08:00
tqchen
88ce76767e
refactor io, initial hdfs file access need test
2015-03-06 22:17:27 -08:00
tqchen
19be870562
chgs
2015-03-06 21:12:04 -08:00
tqchen
a1bd3c64f0
Merge branch 'master' of ssh://github.com/tqchen/rabit
2015-03-06 21:09:59 -08:00
tqchen
1a573f987b
introduce input split
2015-03-06 21:08:04 -08:00
tqchen
29476f1c6b
fix timer issue
2015-03-06 20:59:10 -08:00
tqchen
d4ec037f2e
fix rabit
2015-03-03 13:12:05 -08:00
tqchen
6612fcf36c
Merge branch 'master' of ssh://github.com/tqchen/rabit
2015-03-02 16:10:15 -08:00
tqchen
d29892cb22
add mock option statis
2015-03-02 16:10:08 -08:00
tqchen
4fa054e26e
new tracker
2015-03-02 07:32:25 +00:00
tqchen
75c647cd84
update tracker for host IP
2015-03-01 23:27:59 -08:00
tqchen
e4ce8efab5
add hadoop linear example
2015-03-02 04:36:48 +00:00
tqchen
76ecb4a031
add hadoop linear example
2015-03-02 04:35:56 +00:00
Ubuntu
2e1c4c945e
add hadoop linear example
2015-03-02 04:35:01 +00:00
tqchen
4db0a62a06
bugfix of lazy prepare
2015-02-11 20:31:46 -08:00
tqchen
87017bd4cd
license
2015-02-11 14:49:51 -08:00
tqchen
dc703e1b62
license
2015-02-11 14:48:59 -08:00
tqchen
c171440324
change license to bsd
2015-02-11 14:44:26 -08:00
Tianqi Chen
7db2070598
Update README.md
2015-02-09 20:53:29 -08:00
tqchen
581fe06a9b
add mocktest
2015-02-09 20:46:38 -08:00
tqchen
d2f252f87a
ok
2015-02-09 20:35:30 -08:00
tqchen
4a5b9e5f78
add all
2015-02-09 20:26:39 -08:00
tqchen
12ee049a74
init version of lbfgs
2015-02-09 17:44:32 -08:00
tqchen
37a28376bb
complete lbfgs solver
2015-02-09 11:04:19 -08:00
tqchen
6ade7cba94
complete lbfgs
2015-02-08 23:08:59 -08:00
tqchen
1bb8fe9615
chg makefile
2015-01-30 16:46:10 -08:00
tqchen
fb13cab216
change makefile
2015-01-30 16:30:45 -08:00
Tyler
1479e370f8
fixed small bug in mpi submission script
2015-01-25 00:12:46 -08:00
Tianqi Chen
0ca7a63670
Update README.md
2015-01-22 09:16:46 -08:00
tqchen
5ef4830b55
ok
2015-01-20 20:30:22 -08:00
tqchen
93a13381c1
chg note
2015-01-20 20:27:43 -08:00
tqchen
4ebe657dd7
fix in cxx11
2015-01-19 21:37:02 -08:00
tqchen
85b746394e
change def of reducer to take function ptr
2015-01-19 21:24:52 -08:00
tqchen
fe6366eb40
add engine base
2015-01-19 19:11:15 -08:00
Tianqi Chen
a98720ebc9
more deps
2015-01-19 08:20:43 -08:00
tqchen
1db6449b01
remove include in -I, make things easier to direct compile
2015-01-18 21:30:19 -08:00
tqchen
c7282acb2a
doc
2015-01-18 19:55:04 -08:00
tqchen
f332750359
minor fix
2015-01-18 18:17:41 -08:00
tqchen
9edb3b306f
update doc
2015-01-18 18:14:20 -08:00
tqchen
c46120a46b
add win32 ver
2015-01-16 21:10:47 -08:00
Tianqi Chen
537497f520
changes
2015-01-16 21:10:01 -08:00
Tianqi Chen
56a80f431b
check in windows solutions, pass small test in windows
2015-01-16 20:56:34 -08:00
tqchen
774d501c1f
add languages
2015-01-16 11:13:27 -08:00
tqchen
7396c87249
chg
2015-01-16 10:53:31 -08:00
tqchen
c7533f92bb
desgin goal
2015-01-16 10:50:05 -08:00
tqchen
38b7fec37a
ok
2015-01-16 10:46:55 -08:00
tqchen
c798fc2a29
change toolkit to rabitlearn
2015-01-16 10:45:54 -08:00
tqchen
f5245c615c
ok
2015-01-16 10:12:47 -08:00
nachocano
aebb7998a3
updating doc
2015-01-16 00:45:04 -08:00
nachocano
b87da8fe9a
small typo
2015-01-15 10:52:39 -08:00
tqchen
1f35478b82
chg docs
2015-01-15 10:29:32 -08:00
tqchen
6d5ac6446c
chg test folder
2015-01-15 10:24:58 -08:00
tqchen
8f23eb11d7
change convention
2015-01-15 10:22:59 -08:00
tqchen
0617281863
phrase python as a lib
2015-01-15 10:09:14 -08:00
nachocano
7d67f6f26d
removing section
2015-01-15 01:24:04 -08:00
nachocano
34c8253ad6
Merge branch 'master' of https://github.com/tqchen/allreduce
2015-01-15 01:22:21 -08:00
nachocano
86e61ad6a5
adding changes suggested by Tianqi
2015-01-15 01:21:40 -08:00
tqchen
6dbaddd2b9
ok
2015-01-14 22:11:00 -08:00
tqchen
a7faac2f09
ok
2015-01-14 21:59:45 -08:00
tqchen
f161d2f1e5
fix bug in initialization of routing
2015-01-14 19:40:41 -08:00
tqchen
797fe27efe
struct return type version
2015-01-14 15:43:28 -08:00
tqchen
a57c5c5425
add more error report when things goes wrong, need review
2015-01-14 15:32:36 -08:00
tqchen
968b33ec79
set all tracker thread to deamon
2015-01-14 12:05:00 -08:00
tqchen
87c7817124
add lazy check, need test, find a race condition
2015-01-14 11:58:43 -08:00
Tianqi Chen
bddfa2fc24
Merge pull request #7 from lqhl/master
...
update the fault tolerence section
2015-01-14 10:09:04 -08:00
Tianqi Chen
d05df9836b
Merge pull request #8 from cblsjtu/master
...
correct a mistake
2015-01-14 10:07:17 -08:00
Boliang Chen
2f2e481fc3
correct a mistake
2015-01-14 20:53:34 +08:00
Qin Liu
1dda51f1fa
update the fault tolerence section
2015-01-14 17:07:30 +08:00
tqchen
348a1e7619
change default behavior to behave normal
2015-01-13 22:21:15 -08:00
tqchen
478d250818
minor change
2015-01-13 20:01:15 -08:00
tqchen
532575b752
ok
2015-01-13 14:41:37 -08:00
tqchen
c127f9650c
Merge branch 'master' of ssh://github.com/tqchen/rabit
2015-01-13 14:29:20 -08:00
tqchen
3419cf9aa7
add auto caching of python in hadoop script, mock test module to python, with checkpt
2015-01-13 14:29:10 -08:00
tqchen
877fc42e40
add data
2015-01-13 12:51:55 -08:00
nachocano
f79e5fc041
adding more stuff
2015-01-13 01:00:58 -08:00
nachocano
95c6d7398f
adding more stuff
2015-01-13 00:59:20 -08:00
nachocano
5c7967e863
adding link
2015-01-13 00:49:57 -08:00
nachocano
54e2f7e90d
adding wrapper section
2015-01-13 00:48:37 -08:00
nachocano
48c42bf189
fixing stuff
2015-01-13 00:18:46 -08:00
nachocano
92c94176c1
adding some changes to kmeans
2015-01-13 00:13:05 -08:00
tqchen
15e085cd32
basic allreduce lib ready
2015-01-12 22:59:36 -08:00
tqchen
2d72c853df
checkin broadcast python module
2015-01-12 22:32:13 -08:00
tqchen
9a4a81f100
add wrapper
2015-01-12 21:33:01 -08:00
tqchen
61626aaf85
add more data types
2015-01-12 20:45:07 -08:00
tqchen
5a457d69fc
Merge branch 'master' of ssh://github.com/tqchen/rabit
...
Conflicts:
tracker/rabit_hadoop.py
2015-01-12 12:03:00 -08:00
tqchen
7572794add
add stacklevel for rabit
2015-01-12 12:02:28 -08:00
Tianqi Chen
60a10b3322
Merge pull request #6 from cblsjtu/master
...
modify some explanation
2015-01-12 08:55:41 -08:00
Boliang Chen
ec3fd9bd2a
modify some explanation
2015-01-12 23:46:20 +08:00
Boliang Chen
34cde09b2b
modify some explanation
2015-01-12 23:41:45 +08:00
nachocano
8dd94461e1
guide
2015-01-12 00:24:46 -08:00
nachocano
9e04ab62fb
adding breaks
2015-01-12 00:23:42 -08:00
nachocano
9907bafa1d
fix
2015-01-12 00:20:43 -08:00
nachocano
30f3971bee
adding more description to toolkit
2015-01-12 00:14:40 -08:00
tqchen
6b651176a3
yarn is part of hadoop script
2015-01-11 21:28:13 -08:00
tqchen
a120edc56e
shorter
2015-01-11 11:48:08 -08:00
tqchen
5146409a1d
simpler
2015-01-11 11:47:37 -08:00
tqchen
db2ebf7410
use unified script, auto detect hadoop version
2015-01-11 11:46:12 -08:00
tqchen
bfc3f61010
minor
2015-01-11 11:15:12 -08:00
tqchen
78bfe867e6
unify hadoop and yarn script
2015-01-11 11:13:02 -08:00
Tianqi Chen
03dca6d6b3
Merge pull request #5 from EricChenDM/master
...
add yarn script
2015-01-11 10:41:08 -08:00
chenshuaihua
b2dec95862
yarn script
2015-01-12 00:09:00 +08:00
chenshuaihua
26b5fdac40
yarn script
2015-01-11 23:54:31 +08:00
chenshuaihua
00323f462a
yarn script
2015-01-11 23:32:14 +08:00
chenshuaihua
981f69ff55
yarn script
2015-01-11 23:23:58 +08:00
chenshuaihua
5e843cfbbd
yarn script
2015-01-11 23:22:26 +08:00
chenshuaihua
b5ac85f103
yarn script
2015-01-11 23:19:04 +08:00
chenshuaihua
d81fb6a9e6
test
2015-01-11 21:59:38 +08:00
nachocano
d269cb9c50
guide stuff
2015-01-11 01:43:32 -08:00
nachocano
2d97833f48
slightly change
2015-01-11 01:35:04 -08:00
nachocano
eef79067a8
more cosmetic stuff
2015-01-11 01:31:10 -08:00
nachocano
aea4c10847
cosmetic changes to tutorial
2015-01-11 01:07:51 -08:00
Tianqi Chen
7eb4258951
Merge pull request #4 from cblsjtu/master
...
explain time out
2015-01-10 23:43:01 -08:00
Boliang Chen
c6d0be57d4
explain timeout
2015-01-11 15:39:50 +08:00
Boliang Chen
80b0d06b7e
merger from tqchen
2015-01-11 14:56:20 +08:00
Boliang Chen
8685b740cc
Merge remote-tracking branch 'tqchen/master'
2015-01-11 14:53:10 +08:00
Boliang Chen
7fa23f2d2f
modify default jobname
2015-01-11 14:52:48 +08:00
tqchen
ed264002a0
Merge branch 'master' of ssh://github.com/tqchen/rabit
...
Conflicts:
tracker/rabit_hadoop.py
2015-01-10 22:50:38 -08:00
tqchen
2e3361f0e0
fix -f
2015-01-10 22:49:56 -08:00
Boliang Chen
363994f29d
Merge remote-tracking branch 'tqchen/master'
2015-01-11 13:46:32 +08:00
Boliang Chen
3f4bf96c5d
temp
2015-01-11 13:46:18 +08:00
tqchen
0100fdd18d
auto jobname
2015-01-10 21:21:39 -08:00
cblsjtu
c0f85c681e
Merge pull request #1 from tqchen/master
...
merge from tqchen
2015-01-11 11:00:08 +08:00
tqchen
43c129f431
chg script
2015-01-10 17:49:09 -08:00
tqchen
500a57697d
chg script
2015-01-10 17:45:53 -08:00
tqchen
c2ab64afe3
fix comment
2015-01-10 10:01:31 -08:00
tqchen
6b30fb2bea
update cache script
2015-01-10 09:58:10 -08:00
Tianqi Chen
9d34d2e036
Merge pull request #1 from cblsjtu/master
...
fix several bugs
2015-01-10 09:29:55 -08:00
Boliang Chen
76c15dffde
remove blank
2015-01-11 00:16:05 +08:00
Boliang Chen
d986693fbd
fix bugs
2015-01-11 00:14:37 +08:00
Boliang Chen
7f5cb3aa0e
modify hs
2015-01-10 10:58:53 +08:00
Boliang Chen
697a01bfb4
har -> jar
2015-01-10 10:54:12 +08:00
tqchen
1b4921977f
update doc
2015-01-03 05:20:18 -08:00
tqchen
be355c1e60
minor
2015-01-01 06:06:55 -08:00
tqchen
d10a435d64
correct
2015-01-01 06:06:02 -08:00
tqchen
eb2b086b65
ok
2015-01-01 06:04:02 -08:00
tqchen
08ca3b0849
add more links
2015-01-01 06:02:32 -08:00
tqchen
61f21859d9
add api
2015-01-01 05:57:46 -08:00
tqchen
2bfbbfb381
checkin API doc
2015-01-01 05:48:34 -08:00
tqchen
31a3d22af4
add broadcast
2015-01-01 05:42:38 -08:00
tqchen
90a8505208
update guide
2015-01-01 05:42:03 -08:00
tqchen
06206e1d03
start checkin guides
2014-12-30 06:22:54 -08:00
tqchen
bfb9aa3d77
add native script
2014-12-30 04:37:50 -08:00
tqchen
1bcea65117
change nslave to nworker
2014-12-29 18:44:30 -08:00
tqchen
bdfa1a0220
change nslave to nworker
2014-12-29 18:42:24 -08:00
tqchen
39504825d8
add kmeans example
2014-12-29 18:32:56 -08:00
tqchen
76abd80cb7
change indentation
2014-12-29 18:17:20 -08:00
tqchen
b1340bf310
add auto cache
2014-12-29 06:50:17 -08:00
tqchen
c731e82fae
add command
2014-12-29 06:37:07 -08:00
tqchen
491716c418
chg
2014-12-29 06:21:34 -08:00
tqchen
d64d0ef1dc
cleanup submission script
2014-12-29 06:11:58 -08:00
tqchen
27d6977a3e
cpplint pass
2014-12-28 05:12:07 -08:00
tqchen
15836eb98e
add task id
2014-12-22 04:17:23 -08:00
tqchen
0dd51d5dd0
add attempt id for hadoop
2014-12-22 04:12:38 -08:00
tqchen
6e6031cbe9
add mock
2014-12-22 03:59:01 -08:00
tqchen
d82a6ed811
add file command
2014-12-22 03:48:14 -08:00
tqchen
ab7492dbc2
add support for yarn
2014-12-22 03:24:00 -08:00
tqchen
d3433c5946
change script
2014-12-22 01:54:11 -08:00
tqchen
975bcc8261
fix
2014-12-22 01:26:59 -08:00
tqchen
dd8d9646c4
rm mpi dep
2014-12-22 01:25:06 -08:00
tqchen
bb2ecc6ad5
remove c++11
2014-12-22 01:10:14 -08:00
tqchen
7a2ae105ea
fix script
2014-12-22 01:03:12 -08:00
tqchen
fd533d9a76
add kmeans
2014-12-22 00:32:08 -08:00
tqchen
5fe3c58b4a
add kmeans hadoop
2014-12-22 00:31:01 -08:00
tqchen
dcb6e22a9e
add mapred tasks
2014-12-22 00:20:13 -08:00
tqchen
12399a1d42
add more mocktest
2014-12-21 17:59:12 -08:00
tqchen
a624051b85
add keepalive to socket, fix recover problem when a node is requester and pass data
2014-12-21 17:55:08 -08:00
tqchen
cfea4dbe85
fix rabit for single node without initialization
2014-12-21 04:35:32 -08:00
tqchen
e40047f9c2
new mock test
2014-12-20 18:38:54 -08:00
tqchen
10bb407a2c
add mock engine
2014-12-20 18:31:33 -08:00
tqchen
ecf91ee081
change usage
2014-12-20 16:54:15 -08:00
tqchen
925d014271
change file structure
2014-12-20 16:19:54 -08:00
tqchen
77d74f6c0d
fix bug in lambda allreduce
2014-12-20 05:04:16 -08:00
tqchen
5570e7ceae
add complex types
2014-12-19 21:12:10 -08:00
tqchen
e72a869fd1
add complex reducer in
2014-12-19 20:57:53 -08:00
tqchen
2c0a0671ad
skip actions when there is only 1 node
2014-12-19 19:21:21 -08:00
tqchen
6151899ce2
add tracker print
2014-12-19 18:40:06 -08:00
tqchen
6bf282c6c2
isolate iserializable
2014-12-19 17:36:42 -08:00
tqchen
8c35cff02c
improve script
2014-12-19 04:21:16 -08:00
tqchen
9f42b78a18
improve tracker script
2014-12-19 04:20:45 -08:00
tqchen
69d7f71ae8
change kmeans to using lambda
2014-12-19 02:12:53 -08:00
tqchen
1754fdbf4e
enable support for lambda preprocessing function, and c++11
2014-12-19 02:00:43 -08:00
tqchen
58331067f8
cleanup testcases
2014-12-18 23:50:59 -08:00
tqchen
aa2cb38543
ResetLink still not ok
2014-12-18 21:45:38 -08:00
tqchen
6b18ee9edb
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-18 19:02:05 -08:00
tqchen
c8faed0b54
pass local model recover test
2014-12-18 18:53:58 -08:00
tqchen
dbd05a65b5
nice fix, start check local check
2014-12-18 18:39:24 -08:00
Tianqi Chen
31403a41cd
Update rabit.h
2014-12-09 21:03:41 -08:00
tqchen
3f22596e3c
check in license
2014-12-09 20:57:54 -08:00
tqchen
cc5efb8d81
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-09 20:56:33 -08:00
root
5aff7fab29
adding :
2014-12-08 17:15:49 +00:00
root
dfb3961eea
changing port
2014-12-08 17:13:42 +00:00
Tianqi Chen
39f2dcdfef
Update rabit_tracker.py
2014-12-08 08:36:55 -08:00
tqchen
2750679270
normal state running ok
2014-12-07 20:57:29 -08:00
tqchen
b38fa40fa6
fix ring passing
2014-12-07 20:25:42 -08:00
tqchen
8d570b54c7
add code to help link reuse, start test numreplica
2014-12-07 16:22:02 -08:00
tqchen
e2adce1cc1
add ring setup version
2014-12-07 16:09:28 -08:00
tqchen
322e40c72e
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-06 23:00:18 -08:00
tqchen
328cf187ba
check in the ring passing
2014-12-06 23:00:10 -08:00
nachocano
20b03e781c
to run all executables
2014-12-06 15:37:09 -08:00
nachocano
fcf2f0a03d
to stderr
2014-12-06 15:22:29 -08:00
nachocano
cd8ab469ff
Merge branch 'master' of https://github.com/tqchen/allreduce
2014-12-06 15:14:19 -08:00
nachocano
659b9cd517
changing number of repetitions
2014-12-06 15:14:14 -08:00
root
52d472c209
using hostfile
2014-12-06 20:30:35 +00:00
nachocano
9ed59e71f6
speed runner
2014-12-06 12:09:40 -08:00
nachocano
e0053c62e1
adding executable
2014-12-06 12:05:08 -08:00
nachocano
8f0d7d1d3e
changing to -ho not to conflict with help
2014-12-06 12:01:05 -08:00
nachocano
771891491c
Merge branch 'master' of https://github.com/tqchen/allreduce
2014-12-06 11:59:22 -08:00
nachocano
f203d13efc
speed runner
2014-12-06 11:59:16 -08:00
nachocano
14e400226a
submit mpi to include machine file
2014-12-06 11:33:05 -08:00
tqchen
58f80c5675
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-06 11:25:18 -08:00
tqchen
4a7d84e861
chg string bcast
2014-12-06 11:25:08 -08:00
tqchen
1519f74f3c
ok
2014-12-06 11:20:52 -08:00
tqchen
0e012cb05e
add speed test
2014-12-06 11:05:24 -08:00
tqchen
19631ecef6
more tracker renaming
2014-12-06 09:24:12 -08:00
tqchen
a569bf2698
change gitignore
2014-12-06 09:19:08 -08:00
tqchen
dc12958fc7
rename master to tracker, to emphasie rabit is p2p in computing
2014-12-06 09:15:31 -08:00
nachocano
67b68ceae6
adding timing
2014-12-05 16:00:47 -08:00
nachocano
54eb5623cb
worked on my machine !!! finally
2014-12-05 15:24:00 -08:00
nachocano
d9c22e54de
closer, but still does not work... stays in map 100%. I think an exception is being thrown
2014-12-05 13:28:42 -08:00
tqchen
7765e2dc55
add status report
2014-12-05 09:49:26 -08:00
tqchen
ab278513ab
ok
2014-12-05 09:39:51 -08:00
Tianqi Chen
e7a22792ac
Update submit_job_hadoop.py
2014-12-05 09:14:44 -08:00
Tianqi Chen
e05098cacb
Update submit_job_hadoop.py
2014-12-05 09:10:26 -08:00
Tianqi Chen
f9e95ab522
Update submit_job_hadoop.py
2014-12-05 09:09:20 -08:00
nachocano
bb7d6814a7
creating initial version of hadoop submit script. Not working.
...
Not sure how to get the master uri and port. I believe I cannot do it before I launch the job.
Updating the name from submit_job to submit_job_mpi
2014-12-05 03:27:02 -08:00
nachocano
e00fb99e7b
cosmetic
2014-12-04 19:02:11 -08:00
nachocano
e9a3f5169e
cosmetic changes
2014-12-04 18:02:07 -08:00
tqchen
1af3e81ada
chg robust to reliable
2014-12-04 17:32:22 -08:00
tqchen
7cd5474f1a
chg interface
2014-12-04 17:31:40 -08:00
tqchen
821eb21ae2
before make rabit public
2014-12-04 17:30:58 -08:00
tqchen
cc410b8c90
add local model in checkpoint interface, a new goal
2014-12-04 11:09:15 -08:00
tqchen
79e7862583
change note
2014-12-04 09:09:56 -08:00
tqchen
f9d634ce06
change notes
2014-12-04 09:09:29 -08:00
tqchen
65a1cdf8e5
remove doc from main repo
2014-12-04 09:07:36 -08:00
tqchen
67229fd7a9
change model
2014-12-04 09:05:48 -08:00
tqchen
3033177e9e
ok
2014-12-03 22:36:16 -08:00
tqchen
656a8fa3a2
ok
2014-12-03 22:32:30 -08:00
tqchen
0e9b64649a
ok
2014-12-03 22:30:23 -08:00
tqchen
9da3c6c573
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-03 22:28:59 -08:00
tqchen
09a1305628
chg readme
2014-12-03 22:27:52 -08:00
nachocano
7d314fef78
open for writing
2014-12-03 21:58:58 -08:00
nachocano
dece767084
Revert "open for writing"
...
This reverts commit 63bf9c7995 .
2014-12-03 21:58:33 -08:00
nachocano
63bf9c7995
open for writing
2014-12-03 21:58:17 -08:00
tqchen
1c76483b4b
ok
2014-12-03 21:53:34 -08:00
tqchen
9abe6ad4d8
checkin makefile
2014-12-03 21:30:11 -08:00
tqchen
8175df1002
bug fix in kmeans
2014-12-03 20:05:16 -08:00
tqchen
a1a1a8895e
add kmeans
2014-12-03 18:23:58 -08:00
tqchen
69af79d45d
sparse kmeans
2014-12-03 18:15:28 -08:00
nachocano
e3a95b2d1a
Merge branch 'master' of https://github.com/tqchen/allreduce
2014-12-03 15:39:05 -08:00
nachocano
5c23b94069
updating kmeans based on Tianqi feedback. More efficient now
2014-12-03 15:38:58 -08:00
tqchen
85bb6cd027
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-03 15:13:09 -08:00
tqchen
90b9f1a98a
add keepalive script
2014-12-03 15:04:30 -08:00
nachocano
55c2a5dc83
Merge branch 'master' of https://github.com/tqchen/allreduce
2014-12-03 14:21:42 -08:00
nachocano
1d0d5bb141
kmeans seems to be working.. not restarting anything though
2014-12-03 14:21:10 -08:00
tqchen
7a983a4079
add keepalive
2014-12-03 13:21:30 -08:00
tqchen
2523288509
basic recovery works
2014-12-03 12:19:08 -08:00
tqchen
8a6768763d
bug fixed ver
2014-12-03 11:51:39 -08:00
tqchen
a186f8c3aa
ok
2014-12-03 11:19:43 -08:00
tqchen
ceeb6f0690
bug version, check in and rollback
2014-12-03 11:17:39 -08:00
tqchen
f3e5b6e13c
ok
2014-12-03 10:00:47 -08:00
tqchen
34f2f887b1
add more broadcast and basic broadcast
2014-12-03 09:59:13 -08:00
nachocano
20b51cc9ce
cleaner
2014-12-03 01:44:34 -08:00
nachocano
56aad86231
adding incomplete kmeans.
...
I'm having a problem with the broadcast, and still need to implement the logic
2014-12-03 01:16:13 -08:00
tqchen
ed1de6df80
change AllReduce to Allreduce
2014-12-02 21:11:48 -08:00
nachocano
8cb5b68cb6
Merge branch 'master' of https://github.com/tqchen/allreduce
2014-12-02 11:28:27 -08:00
nachocano
e4abca9494
changing report folder to doc
2014-12-02 11:28:20 -08:00
tqchen
0a3300d773
rabit run on MPI
2014-12-02 11:20:19 -08:00
nachocano
2fab05c83e
adding some design goals.
2014-12-02 11:07:07 -08:00
nachocano
40f7ee1cab
adding simple image
2014-12-02 01:49:54 -08:00
nachocano
2c166d7a3a
adding some initial skeleton of the report.
2014-12-02 01:19:36 -08:00
tqchen
dcea64c838
check in model recover
2014-12-01 21:41:37 -08:00
tqchen
255218a2f3
change in interface, seems resetlink is still bad
2014-12-01 21:39:51 -08:00
tqchen
b76cd5858c
seems ok version
2014-12-01 20:18:25 -08:00
tqchen
46b5d46111
fix one bug, another comes
2014-12-01 19:53:41 -08:00
tqchen
993ff8bb91
find one bug, continue to next one
2014-12-01 19:34:27 -08:00
tqchen
2cde04867f
Merge branch 'master' of ssh://github.com/tqchen/rabit
2014-12-01 16:57:33 -08:00
tqchen
337840d29b
recover not yet working
2014-12-01 16:57:26 -08:00
Tianqi Chen
fd2c57b8a4
Update engine_robust.cc
2014-12-01 15:32:57 -08:00
tqchen
1c5167d96e
rabit seems ready to run
2014-12-01 10:32:30 -08:00
Tianqi Chen
0d63646015
Update README.md
2014-12-01 10:04:10 -08:00
Tianqi Chen
b5367f48f6
Update README.md
2014-12-01 10:03:45 -08:00
Tianqi Chen
62c8ce9657
Update README.md
2014-12-01 10:03:31 -08:00
tqchen
eb2ca06d67
fresh name fresh start
2014-12-01 09:17:05 -08:00
tqchen
16f729115e
checkin allreduce recover
2014-11-30 22:41:04 -08:00
tqchen
9355f5faf2
more conservative exception watching
2014-11-30 21:39:22 -08:00
tqchen
8cef2086f5
smarter select for allreduce and bcast
2014-11-30 21:31:45 -08:00
tqchen
f7928c68a3
next round try more careful select design
2014-11-30 21:07:34 -08:00
tqchen
ecb09a23bc
add recover data, do a round of review
2014-11-30 20:59:55 -08:00
tqchen
b9b58a1275
bugfix in decide
2014-11-30 17:48:30 -08:00
tqchen
4a6c01c83c
minor change in decide
2014-11-30 17:48:02 -08:00
tqchen
27f6f8ea9e
bugfix in msg passing
2014-11-30 17:42:18 -08:00
tqchen
d8d648549f
finish message passing, do a review on msg passing and decide
2014-11-30 17:40:30 -08:00
tqchen
38cd595235
check in message passing
2014-11-30 16:38:47 -08:00
tqchen
7a60cb7f3e
checkin decide request, todo message passing
2014-11-30 16:37:26 -08:00
tqchen
68f13cd739
tight
2014-11-30 11:46:21 -08:00
tqchen
d1ce3c697c
inline
2014-11-30 11:45:50 -08:00
tqchen
2e536eda29
check in the recover strategy
2014-11-30 11:42:59 -08:00
tqchen
155ed3a814
seems a OK version of reset, start to work on decide exec
2014-11-29 22:22:51 -08:00
tqchen
5b0bb53184
refactor code style, reset link still need thoughts
2014-11-29 20:15:27 -08:00
tqchen
42505f473d
finish reset link log
2014-11-29 15:14:43 -08:00
tqchen
98756c068a
livelock in oob send recv
2014-11-28 21:58:15 -08:00
tqchen
aa54a038f2
livelock in oob send recv
2014-11-28 21:56:58 -08:00
tqchen
a30075794b
initial version of robust engine, add discard link, need more random mock test, next milestone will be recovery
2014-11-28 15:56:12 -08:00
nachocano
a8128493c2
execute it like this: ./test.sh 4 4000 testcase0.conf ./
...
Now we are passing the folder where the round instances are saved.
The problem is that calling utils::Check or utils::Assert on 1 or 2 nodes, shutdowns all of them. Only those should be shutdown and this will work. There maybe some other mechanism to shutdown a particular node. Tianqi?
2014-11-28 01:48:26 -08:00
nachocano
faed8285cd
execute it like ./test.sh 4 4000 testcase0.conf to obtain a successful execution
...
updating mock. It now wraps the calls to sync and reads config from configuration file.
I believe it's better not to use the preprocessor directive, i.e. not to put any test code in the engine_tcp. I just call the mock in the test_allreduce file. It's a file purely for testing purposes, so it's fine to use the mock there.
2014-11-28 00:16:35 -08:00
nachocano
21f3f3eec4
adding const to variable to comply with google code convention...
...
may need to change more stuff though. Taint what else do you mean? Spaces, tabs, names?
2014-11-27 17:03:31 -08:00
tqchen
2f1ba40786
change in socket, to pass out error code
2014-11-27 16:17:07 -08:00
nachocano
c565104491
adding some references to mock inside TEST preprocessor directive.
...
It shouldn't be an assert because it shutdowns the process. Instead should check on the value and return some sort of error, so that we can recover.
The mock contains queues, indexed by the rank of the process. For each node, you can configure the behavior you expect (success or failure for now) when you call any of the methods (AllReduce, Broadcast, LoadCheckPoint and CheckPoint)... If you call several times AllReduce, the outputs will pop from the queue, i.e., first you can retrieve a success, then a failure and so on.
Pretty basic for now, need to tune it better
2014-11-26 17:24:29 -08:00
nachocano
54fcff189f
dummy mock for now
2014-11-26 16:37:23 -08:00
tqchen
d37f38c455
initial version of allreduce
2014-11-25 16:15:56 -08:00
Tianqi Chen
5e5bdda491
Initial commit
2014-11-25 14:37:18 -08:00