Jiaming Yuan
74ea82209b
Lazy import dask libraries. ( #6309 )
...
* Lazy import dask libraries.
* Lint && fix.
* Use short name.
2020-10-28 15:50:11 -07:00
Jiaming Yuan
2443275891
Cleanup Python code. ( #6223 )
...
* Remove pathlike as XGBoost 1.2 requires Python 3.6.
* Move conditional import of dask/distributed into dask module.
2020-10-12 15:44:41 +08:00
Ram Rachum
02884b08aa
Fix exception causes all over the codebase ( #5787 )
2020-06-15 21:06:07 +08:00
Jiaming Yuan
35e2205256
[dask] Return GPU Series when input is from cuDF. ( #5710 )
...
* Refactor predict function.
2020-05-28 17:51:20 +08:00
Jiaming Yuan
5af8161a1a
Implement Python data handler. ( #5689 )
...
* Define data handlers for DMatrix.
* Throw ValueError in scikit learn interface.
2020-05-22 11:53:55 +08:00
Jiaming Yuan
a461a9a90a
Define lazy isinstance for Python compat. ( #5364 )
...
* Avoid importing datatable.
* Fix #5363 .
2020-02-26 14:23:33 +08:00
Jiaming Yuan
472ded549d
Save Scikit-Learn attributes into learner attributes. ( #5245 )
...
* Remove the recommendation for pickle.
* Save skl attributes in booster.attr
* Test loading scikit-learn model with native booster.
2020-01-30 16:00:18 +08:00
Rory Mitchell
9c56480c61
Support dmatrix construction from cupy array ( #5206 )
2020-01-22 13:15:27 +13:00
K.O
018df6004e
Fix feature_name crated from int64index dataframe. ( #5081 )
2019-12-30 12:26:22 +08:00
Jiaming Yuan
6848d0426f
Clean up Python 2 compatibility code. ( #5161 )
2019-12-27 18:34:53 +08:00
Jiaming Yuan
ac457c56a2
Use `UpdateAllowUnknown' for non-model related parameter. ( #4961 )
...
* Use `UpdateAllowUnknown' for non-model related parameter.
Model parameter can not pack an additional boolean value due to binary IO
format. This commit deals only with non-model related parameter configuration.
* Add tidy command line arg for use-dmlc-gtest.
2019-10-23 05:50:12 -04:00
Jiaming Yuan
6c9b6f11da
Use cudf.concat explicitly. ( #4918 )
...
* Use `cudf.concat` explicitly.
* Add test.
2019-10-10 16:02:10 +13:00
Jiaming Yuan
d30e63a0a5
Support feature names/types for cudf. ( #4902 )
...
* Implement most of the pandas procedure for cudf except for type conversion.
* Requires an array of interfaces in metainfo.
2019-09-29 15:07:51 -04:00
Vibhu Jawa
2fa8b359e0
Add support for cudf.Series ( #4891 )
2019-09-25 23:52:28 -04:00
Jiaming Yuan
b8433c455a
Rewrite Dask interface. ( #4819 )
2019-09-25 01:30:14 -04:00
Jiaming Yuan
9700776597
Cudf support. ( #4745 )
...
* Initial support for cudf integration.
* Add two C APIs for consuming data and metainfo.
* Add CopyFrom for SimpleCSRSource as a generic function to consume the data.
* Add FromDeviceColumnar for consuming device data.
* Add new MetaInfo::SetInfo for consuming label, weight etc.
2019-08-19 16:51:40 +12:00
Evan Kepner
53d4272c2a
add os.PathLike support for file paths to DMatrix and Booster Python classes ( #4757 )
2019-08-15 04:46:25 -04:00
Jiaming Yuan
4e9965cb9d
Fix Python demo and doc. ( #4545 )
...
* Remove old doc.
* Fix checking __stdin__.
2019-06-11 08:58:41 +08:00
Rory Mitchell
09b90d9329
Add native support for Dask ( #4473 )
...
* Add native support for Dask
* Add multi-GPU demo
* Add sklearn example
2019-05-27 13:29:28 +12:00
Philip Hyunsu Cho
bbe0dbd7ec
Migrate pylint check to Python 3 ( #4381 )
...
* Migrate lint to Python 3
* Fix lint errors
* Use Miniconda3 to use Python 3.7
* Use latest pylint and astroid
2019-04-21 01:01:54 -07:00
Pasha Stetsenko
ff2d4c99fa
Update datatable usage ( #4123 )
2019-02-17 03:44:09 +08:00
PSEUDOTENSOR / Jonathan McKinney
9ac163d0bb
Allow import via python datatable. ( #3272 )
...
* Allow import via python datatable.
* Write unit tests
* Refactor dt API functions
* Refactor python code
* Lint fixes
* Address review comments
2018-06-20 13:16:18 -07:00
Rory Mitchell
16c63f30d0
Fix MultiIndex detection (breaks for latest pandas==0.21.0). ( #2872 )
2017-11-11 11:12:23 +13:00
AbdealiJK
6f16f0ef58
Use bst_float consistently throughout ( #1824 )
...
* Fix various typos
* Add override to functions that are overridden
gcc gives warnings about functions that are being overridden by not
being marked as oveirridden. This fixes it.
* Use bst_float consistently
Use bst_float for all the variables that involve weight,
leaf value, gradient, hessian, gain, loss_chg, predictions,
base_margin, feature values.
In some cases, when due to additions and so on the value can
take a larger value, double is used.
This ensures that type conversions are minimal and reduces loss of
precision.
2016-11-30 10:02:10 -08:00
Yuan (Terry) Tang
63829d656c
Fix mknfold using new StratifiedKFold API ( #1660 )
2016-10-12 14:43:37 -07:00
Jonathan Rahn
c8ae52f17a
add scikit-learn v0.18 compatibility ( #1636 )
...
* add scikit-learn v0.18 compatibility
import KFold & StratifiedKFold from sklearn.model_selection instead of sklearn.cross_validation
* change DeprecationWarning to ImportError
DeprecationWarning isn't an exception, so it should work the other way around.
2016-10-09 20:37:28 -07:00
tqchen
149589c583
[PYTHON] Refactor trainnig API to use callback
2016-05-19 21:31:23 -07:00
sinhrks
9da2f3e613
DOC/TST: Fix Python sklearn dep
2016-05-01 17:27:43 +09:00
sinhrks
8fc2456c87
Enable flake8
2016-04-24 17:32:31 +09:00
tqchen
ecb3a271be
[PYTHON-DIST] Distributed xgboost python training API.
2016-02-29 16:54:13 -08:00
tqchen
ebc802756f
temp compatibility with sklearn
2016-02-25 21:57:00 -08:00
Faron
4b3a053913
stratified cv for python wrapper
...
finalize docstring
2016-02-15 16:06:17 +01:00
terrytangyuan
0eb6240fd0
Fixed all lint errors
2015-12-11 18:46:15 -06:00
sinhrks
25c4fbd0cb
Cleanup pandas support
2015-11-13 06:55:04 +09:00