Jiaming Yuan
a30461cf87
[dask] Support all parameters in regressor and classifier. ( #6471 )
...
* Add eval_metric.
* Add callback.
* Add feature weights.
* Add custom objective.
2020-12-14 07:35:56 +08:00
Philip Hyunsu Cho
0d483cb7c1
Bump version to 1.4.0 snapshot in master ( #6486 )
2020-12-10 07:38:08 -08:00
Jiaming Yuan
0ffaf0f5be
Fix dask ip resolution. ( #6475 )
...
This adopts the solution used in dask/dask-xgboost#40 which employs the get_host_ip from dmlc-core tracker.
2020-12-07 16:36:23 -08:00
Jiaming Yuan
47b86180f6
Don't validate feature when number of rows is 0. ( #6472 )
2020-12-07 18:08:51 +08:00
Jiaming Yuan
703c2d06aa
Fix global config default value. ( #6470 )
2020-12-06 06:15:33 +08:00
Jiaming Yuan
d6386e45e8
Fix filtering callable objects in skl xgb param. ( #6466 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-12-05 17:20:36 +08:00
Philip Hyunsu Cho
c103ec51d8
Enforce row-major order in cuPy array ( #6459 )
2020-12-03 18:29:10 -08:00
Philip Hyunsu Cho
4f70e14031
Fix docstring of config.py to use correct versionadded ( #6458 )
2020-12-03 10:41:53 -08:00
Philip Hyunsu Cho
fb56da5e8b
Add global configuration ( #6414 )
...
* Add management functions for global configuration: XGBSetGlobalConfig(), XGBGetGlobalConfig().
* Add Python interface: set_config(), get_config(), and config_context().
* Add unit tests for Python
* Add R interface: xgb.set.config(), xgb.get.config()
* Add unit tests for R
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>
2020-12-03 00:05:18 -08:00
Jiaming Yuan
927c316aeb
Fix period in evaluation monitor. ( #6441 )
2020-11-29 03:18:33 +08:00
Jiaming Yuan
2ce2a1a4d8
[SKL] Propagate parameters to booster during set_param. ( #6416 )
2020-11-20 20:37:35 +08:00
Jiaming Yuan
a7b42adb74
Fix dask predict ( #6412 )
2020-11-20 10:10:52 +08:00
Jiaming Yuan
3ac173fc8b
Fix typo. ( #6399 )
2020-11-16 16:59:12 -08:00
Nikhil Choudhary
ae1662028a
Fixed few grammatical mistakes in doc ( #6393 )
2020-11-15 13:48:08 +08:00
Jiaming Yuan
fcd6fad822
[dask] Small cleanup. ( #6391 )
2020-11-14 22:15:05 +08:00
Jiaming Yuan
4ccf92ea34
[dask] Fix union of workers. ( #6375 )
2020-11-13 16:55:05 +08:00
Jiaming Yuan
fcfeb4959c
Deprecate positional arguments. ( #6365 )
...
Deprecate positional arguments in following functions:
- `__init__` for all classes in sklearn module.
- `fit` method for all classes in sklearn module.
- dask interface.
- `set_info` for `DMatrix` class.
Refactor the evaluation matrices handling.
2020-11-13 11:10:30 +08:00
Jiaming Yuan
c90f968d92
Update Python documents. ( #6376 )
2020-11-12 17:51:32 +08:00
Jiaming Yuan
6e12c2a6f8
[dask] Supoort running on GKE. ( #6343 )
...
* Avoid accessing `scheduler_info()['workers']`.
* Avoid calling `client.gather` inside task.
* Avoid using `client.scheduler_address`.
2020-11-11 18:04:34 +08:00
Jiaming Yuan
e65e3cf36e
Support shared library in system path. ( #6362 )
2020-11-10 16:04:25 +08:00
Jiaming Yuan
184e2eac7d
Add period to evaluation monitor. ( #6348 )
2020-11-10 07:47:48 +08:00
Jiaming Yuan
2cc9662005
Support slicing tree model ( #6302 )
...
This PR is meant the end the confusion around best_ntree_limit and unify model slicing. We have multi-class and random forests, asking users to understand how to set ntree_limit is difficult and error prone.
* Implement the save_best option in early stopping.
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2020-11-02 23:27:39 -08:00
Rory Mitchell
29745c6df2
Fix inclusive scan for large sizes ( #6234 )
2020-11-03 17:01:43 +13:00
Jiaming Yuan
7756192906
[dask] Fix prediction on DaskDMatrix with multiple meta data. ( #6333 )
...
* Unify the meta handling methods.
2020-11-02 19:18:44 -05:00
Jiaming Yuan
6ff331b705
Fix Python callback. ( #6320 )
2020-10-30 05:03:44 +08:00
Jiaming Yuan
74ea82209b
Lazy import dask libraries. ( #6309 )
...
* Lazy import dask libraries.
* Lint && fix.
* Use short name.
2020-10-28 15:50:11 -07:00
Jiaming Yuan
e8884c4637
Document tree method for feature weights. ( #6312 )
2020-10-28 13:42:13 -07:00
Jiaming Yuan
b180223d18
Cleanup RABIT. ( #6290 )
...
* Remove recovery and MPI speed tests.
* Remove readme.
* Remove Python binding.
* Add checks in C API.
2020-10-27 08:48:22 +08:00
Philip Hyunsu Cho
c8ec62103a
Deprecate LabelEncoder in XGBClassifier; Enable cuDF/cuPy inputs in XGBClassifier ( #6269 )
...
* Deprecate LabelEncoder in XGBClassifier; skip LabelEncoder for cuDF/cuPy inputs
* Add unit tests for cuDF and cuPy inputs with XGBClassifier
* Fix lint
* Clarify warning
* Move use_label_encoder option to XGBClassifier constructor
* Add a test for cudf.Series
* Add use_label_encoder to XGBRFClassifier doc
* Address reviewer feedback
2020-10-26 13:20:51 -07:00
Jiaming Yuan
d61b628bf5
Remove RABIT CMake targets. ( #6275 )
...
* Now it's built as part of libxgboost.
* Set correct C API error in RABIT initialization and finalization.
* Remove redundant message.
* Guard the tracker print C API.
2020-10-27 01:30:20 +08:00
Philip Hyunsu Cho
677f676172
Use UserWarning for old callback, as DeprecationWarning is not visible ( #6270 )
2020-10-22 01:10:52 -07:00
Jiaming Yuan
81c37c28d5
Time the CPU tests on Jenkins. ( #6257 )
...
* Time the CPU tests on Jenkins.
* Reduce thread contention.
* Add doc.
* Skip heavy tests on ARM.
2020-10-20 17:19:07 -07:00
Jiaming Yuan
52452bebb9
Fix cls typo. ( #6247 )
2020-10-16 16:40:44 +08:00
Jiaming Yuan
3da5a69dc9
Fix typo in dask interface. ( #6240 )
2020-10-15 15:26:29 +08:00
Jiaming Yuan
bed7ae4083
Loop over thrust::reduce. ( #6229 )
...
* Check input chunk size of dqdm.
* Add doc for current limitation.
2020-10-14 10:40:56 +13:00
Jiaming Yuan
b05073bda5
[dask] Test for data initializaton. ( #6226 )
2020-10-13 11:08:35 +08:00
Jiaming Yuan
2443275891
Cleanup Python code. ( #6223 )
...
* Remove pathlike as XGBoost 1.2 requires Python 3.6.
* Move conditional import of dask/distributed into dask module.
2020-10-12 15:44:41 +08:00
Jiaming Yuan
ab5b35134f
Rework Python callback functions. ( #6199 )
...
* Define a new callback interface for Python.
* Deprecate the old callbacks.
* Enable early stopping on dask.
2020-10-10 17:52:36 +08:00
Jiaming Yuan
70ce5216b5
Add high level tests for categorical data. ( #6179 )
...
* Fix unique.
2020-10-09 09:27:23 +08:00
Jiaming Yuan
7622b8cdb8
Enable categorical data support on Python DMatrix. ( #6166 )
...
* Only pandas is recognized.
2020-09-29 11:22:56 +08:00
Kyle Nicholson
e6a238c020
Update base margin dask ( #6155 )
...
* Add `base-margin`
* Add `output_margin` to regressor.
Co-authored-by: fis <jm.yuan@outlook.com>
2020-09-26 21:30:52 +08:00
Philip Hyunsu Cho
bd2b1eabd0
Add back support for scipy.sparse.coo_matrix ( #6162 )
2020-09-25 00:49:49 -07:00
Jiaming Yuan
78d72ef936
Add DaskDeviceQuantileDMatrix demo. ( #6156 )
2020-09-24 14:08:28 +08:00
Jiaming Yuan
33d80ffad0
[dask] Support more meta data on functional interface. ( #6132 )
...
* Add base_margin, label_(lower|upper)_bound.
* Test survival training with dask.
2020-09-21 16:56:37 +08:00
Jiaming Yuan
cc82ca167a
[dask] Refactor meta data handling. ( #6130 )
2020-09-18 13:26:40 +08:00
Rory Mitchell
47350f6acb
Allow kwargs in dask predict ( #6117 )
2020-09-15 13:04:03 +12:00
Boris Feld
24ca9348f7
Fix typo in xgboost.callback.early_stop docstring ( #6071 )
2020-09-06 13:37:07 +08:00
ShvetsKS
c1ca872d1e
Modin DF support ( #6055 )
...
* Modin DF support
* mode change
* tests were added, ci env was extended
* mode change
* Remove redundant installation of modin
* Add a pytest skip marker for modin
* Install Modin[ray] from PyPI
* fix interfering
* avoid extra conversion
* delete cv test for modin
* revert cv function
Co-authored-by: ShvetsKS <kirill.shvets@intel.com>
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-08-29 22:33:30 +03:00
Philip Hyunsu Cho
b3193052b3
Bump version to 1.3.0 snapshot in master ( #6052 )
2020-08-23 17:13:46 -07:00
Jiaming Yuan
a144daf034
Limit tree depth for GPU hist. ( #6045 )
2020-08-22 19:34:52 +08:00