Philip Hyunsu Cho
741fbf47c4
[CI] Update lint configuration to support latest pylint convention ( #4971 )
...
* Update lint configuration
* Use gcc 8 consistently in build instruction
2019-10-21 16:40:57 -07:00
Jacob Kim
a78d4e7aa8
Follow PEP 257 -- Docstring Conventions ( #4959 )
2019-10-17 23:45:25 -04:00
Jiaming Yuan
185e3f1916
Update GPU doc. ( #4953 )
2019-10-16 05:54:09 -04:00
Jiaming Yuan
7e72a12871
Don't set_params at the end of set_state. ( #4947 )
...
* Don't set_params at the end of set_state.
* Also fix another issue found in dask prediction.
* Add note about prediction.
Don't support other prediction modes at the moment.
2019-10-15 10:08:26 -04:00
Jiaming Yuan
2ebdec8aa6
Fix dask prediction. ( #4941 )
...
* Fix dask prediction.
* Add better error messages for wrong partition.
2019-10-14 23:19:34 -04:00
Peter Badida
a9053aff83
Fix incorrectly displayed Note in the doc ( #4943 )
2019-10-14 03:45:23 -04:00
Jiaming Yuan
4bbf062ed3
[Breaking] Update sklearn interface. ( #4929 )
...
* Remove nthread, seed, silent. Add tree_method, gpu_id, num_parallel_tree. Fix #4909 .
* Check data shape. Fix #4896 .
* Check element of eval_set is tuple. Fix #4875
* Add doc for random_state with hogwild. Fixes #4919
2019-10-12 02:50:09 -04:00
Jiaming Yuan
6c9b6f11da
Use cudf.concat explicitly. ( #4918 )
...
* Use `cudf.concat` explicitly.
* Add test.
2019-10-10 16:02:10 +13:00
Rory Mitchell
aefb1e5c2f
Resolve dask performance issues ( #4914 )
...
* Set dask client.map as impure function
* Remove nrows
* Remove slow check in verbose mode
2019-10-10 16:01:30 +13:00
Jiaming Yuan
d30e63a0a5
Support feature names/types for cudf. ( #4902 )
...
* Implement most of the pandas procedure for cudf except for type conversion.
* Requires an array of interfaces in metainfo.
2019-09-29 15:07:51 -04:00
Vibhu Jawa
2fa8b359e0
Add support for cudf.Series ( #4891 )
2019-09-25 23:52:28 -04:00
Jiaming Yuan
b8433c455a
Rewrite Dask interface. ( #4819 )
2019-09-25 01:30:14 -04:00
Jiaming Yuan
c7416002e9
Fix DMatrix doc. ( #4884 )
2019-09-23 01:55:04 -04:00
Jiaming Yuan
d669ea1eaa
Deprecate set group ( #4864 )
...
* Convert jvm package and R package.
* Restore for compatibility.
2019-09-17 21:26:54 -04:00
Jiaming Yuan
5374f52531
Complete cudf support. ( #4850 )
...
* Handles missing value.
* Accept all floating point and integer types.
* Move to cudf 9.0 API.
* Remove requirement on `null_count`.
* Arbitrary column types support.
2019-09-16 23:52:00 -04:00
Cyprien Ricque
830e73901d
eval_metrics print fixed ( #4803 )
2019-08-28 23:52:18 -04:00
Jiaming Yuan
9700776597
Cudf support. ( #4745 )
...
* Initial support for cudf integration.
* Add two C APIs for consuming data and metainfo.
* Add CopyFrom for SimpleCSRSource as a generic function to consume the data.
* Add FromDeviceColumnar for consuming device data.
* Add new MetaInfo::SetInfo for consuming label, weight etc.
2019-08-19 16:51:40 +12:00
Evan Kepner
53d4272c2a
add os.PathLike support for file paths to DMatrix and Booster Python classes ( #4757 )
2019-08-15 04:46:25 -04:00
Xu Xiao
97eece6ea0
[python package] include dmlc-tracker into xgb python pkg ( #4731 )
2019-08-05 12:21:07 -04:00
Jiaming Yuan
e930a8e54f
Remove old Python trouble shooting doc. [skip ci] ( #4729 )
2019-08-03 12:51:29 -04:00
Nan Zhu
1595e3f57b
upgrade version num ( #4670 )
...
* upgrade version num
* missign changes
* fix version script
* change versions
* rm files
* Update CMakeLists.txt
2019-07-17 15:25:35 -07:00
Oleksandr Pryimak
5544a730f1
Add optional dependencies to setup.py ( #4655 )
2019-07-16 17:12:43 +08:00
Philip Hyunsu Cho
1aaf4a679d
Fix early stopping in the Python package ( #4638 )
...
* Fix #4630 , #4421 : Preserve correct ordering between metrics, and always use last metric for early stopping
* Clarify semantics of early stopping in presence of multiple valid sets and metrics
* Add a test
* Fix lint
2019-07-07 01:01:03 -07:00
Marcos
562d9ae963
Eliminate FutureWarning: Series.base is deprecated ( #4337 )
...
* Remove all references to data.base
Should eliminate the deprecation warning in issue #4300
* Fix lint
2019-07-04 21:06:23 -07:00
Philip Hyunsu Cho
b7a1f22d24
Empty evaluation list in early stopping should produce meaningful error message ( #4633 )
...
* Empty evaluation list should not break early stopping
* Fix lint
* Update callback.py
2019-07-04 13:27:18 -07:00
Philip Hyunsu Cho
4df246191f
Add warning when save_model() is called from scikit-learn interface ( #4632 )
2019-07-03 23:37:53 -07:00
Oleksandr Pryimak
986fee6022
pytest tests/python fails if no pandas installed ( #4620 )
...
* _maybe_pandas_xxx should return their arguments unchanged if no pandas installed
* Tests should not assume pandas is installed
* Mark tests which require pandas as such
2019-07-01 02:54:08 +08:00
Philip Hyunsu Cho
a30176907f
Support Dask 2.0 ( #4617 )
2019-06-27 20:42:35 -07:00
Jiaming Yuan
8bdf15120a
Implement tree model dump with code generator. ( #4602 )
...
* Implement tree model dump with a code generator.
* Split up generators.
* Implement graphviz generator.
* Use pattern matching.
* [Breaking] Return a Source in `to_graphviz` instead of Digraph in Python package.
Co-Authored-By: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2019-06-26 15:20:44 +08:00
Jiaming Yuan
2cff735126
Update doc for feature constraints and n_gpus. ( #4596 )
...
* Update doc for feature constraints.
* Fix some warnings.
* Clean up doc for `n_gpus`.
2019-06-23 14:37:22 +08:00
Andy Adinets
9fa29ad753
Set reg_lambda=1e-5 for scikit-learn-like random forest classes. ( #4558 )
2019-06-22 08:02:13 +12:00
Philip Hyunsu Cho
30e1cb4e9e
Fix docstring for XGBModel.predict() [skip ci] ( #4592 )
2019-06-21 12:44:42 +08:00
Jiaming Yuan
9494950ee7
Address some sphinx warnings and errors, add doc for building doc. ( #4589 )
2019-06-20 15:07:36 -07:00
Jiaming Yuan
4591039eba
Remove remaining reg:linear. ( #4544 )
2019-06-11 16:04:09 +08:00
Jiaming Yuan
4e9965cb9d
Fix Python demo and doc. ( #4545 )
...
* Remove old doc.
* Fix checking __stdin__.
2019-06-11 08:58:41 +08:00
Philip Hyunsu Cho
c2a3902ba3
Fix #4497 : Enable feature importance property for DART booster ( #4525 )
2019-05-31 15:11:57 -07:00
Rory Mitchell
972f693eaf
Fix dask API sphinx docstrings ( #4507 )
...
* Fix dask API sphinx docstrings
* Update GPU docs page
2019-05-28 16:39:26 +12:00
yellowdolphin
3f7e5d9c47
add dll_path for cygwin users ( #4499 )
2019-05-27 12:04:28 +08:00
Rory Mitchell
09b90d9329
Add native support for Dask ( #4473 )
...
* Add native support for Dask
* Add multi-GPU demo
* Add sklearn example
2019-05-27 13:29:28 +12:00
Bryan Woods
278562db13
Add support for cross-validation using query ID ( #4474 )
...
* adding support for matrix slicing with query ID for cross-validation
* hail mary test of unrar installation for windows tests
* trying to modify tests to run in Github CI
* Remove dependency on wget and unrar
* Save error log from R test
* Relax assertion in test_training
* Use int instead of bool in C function interface
* Revise R interface
* Add XGDMatrixSliceDMatrixEx and keep old XGDMatrixSliceDMatrix for API compatibility
2019-05-23 10:45:02 -07:00
Sean Owen
5a567ec249
Ensure pandas DataFrame column names are treated as strings in type error message ( #4481 )
2019-05-21 16:19:35 +08:00
Philip Hyunsu Cho
515f5f5c47
[RFC] Version 0.90 release candidate ( #4475 )
...
* Release 0.90
* Add script to automatically generate acknowledgment
* Update NEWS.md
2019-05-20 01:02:44 -07:00
Philip Hyunsu Cho
bbe0dbd7ec
Migrate pylint check to Python 3 ( #4381 )
...
* Migrate lint to Python 3
* Fix lint errors
* Use Miniconda3 to use Python 3.7
* Use latest pylint and astroid
2019-04-21 01:01:54 -07:00
Mayank Suman
360f25ec27
Added language classifier for python ( #4327 )
...
* Added language classifier for python
* Removed python2 language classifier
* Fix formatting
2019-04-08 11:13:26 -07:00
Jiaming Yuan
82dca3c108
Don't store DMatrix handle until it's initialized. ( #4317 )
...
* Use a temporary variable to store the handle.
* Decode c++ error message.
* Simple note about saved binary.
2019-04-01 18:29:28 +08:00
Philip Hyunsu Cho
263e2038e9
Bump Python version number ( #4285 )
2019-03-21 14:40:44 -07:00
Jiaming Yuan
7b1b11390a
Mark Scikit-Learn RF interface as experimental in doc. ( #4258 )
...
* Mark Scikit-Learn RF interface as experimental in doc.
2019-03-16 00:45:32 +08:00
Andy Adinets
4352fcdb15
Brought the silent parameter for the SKLearn-like API back, marked it deprecated. ( #4255 )
...
* Brought the silent parameter for the SKLearn-like API back, marked it deprecated.
- added deprecation notice and warning
- removed silent from the tests for the SKLearn-like API
2019-03-14 09:45:08 +13:00
Andy Adinets
a36c3ed4f4
Added SKLearn-like random forest Python API. ( #4148 )
...
* Added SKLearn-like random forest Python API.
- added XGBRFClassifier and XGBRFRegressor classes to SKL-like xgboost API
- also added n_gpus and gpu_id parameters to SKL classes
- added documentation describing how to use xgboost for random forests,
as well as existing caveats
2019-03-12 22:28:19 +08:00
Patrick Ford
74009afcac
Added trees_to_df() method for Booster class ( #4153 )
...
* add test_parse_tree.py to tests/python
* Fix formatting
* Fix pylint error
* Ignore 'no member' error for Pandas dataframe
2019-02-26 13:28:24 -08:00