Jiaming Yuan
7e477a2adb
Fix data loading ( #4862 )
...
* Fix loading text data.
* Fix config regex.
* Try to explain the error better in exception.
* Update doc.
2019-10-22 12:33:14 -04:00
Philip Hyunsu Cho
95295ce026
[CI] Use latest dask ( #4973 )
...
* Remove version spec, to use latest dask always
2019-10-22 07:00:13 -04:00
Philip Hyunsu Cho
741fbf47c4
[CI] Update lint configuration to support latest pylint convention ( #4971 )
...
* Update lint configuration
* Use gcc 8 consistently in build instruction
2019-10-21 16:40:57 -07:00
Jiaming Yuan
4771bb0d41
Catch exception in transform function omp context. ( #4960 )
2019-10-21 17:03:38 +08:00
Jiaming Yuan
010b8f1428
Revert "[jvm-packages] update rabit, surface new changes to spark, add parity and failure tests ( #4876 )" ( #4965 )
...
This reverts commit 86ed01c4bbecef66e1bc4d02fb13116bd6130fae.
2019-10-18 14:02:35 -07:00
Chen Qin
86ed01c4bb
[jvm-packages] update rabit, surface new changes to spark, add parity and failure tests ( #4876 )
...
* Expose sets of rabit configurations to spark layer
2019-10-18 15:07:31 -04:00
Jiaming Yuan
31030a8d3a
Set correct file permission. ( #4964 )
2019-10-18 12:54:29 -04:00
Jiaming Yuan
ae536756ae
Add Model and Configurable interface. ( #4945 )
...
* Apply Configurable to objective functions.
* Apply Model to Learner and Regtree, gbm.
* Add Load/SaveConfig to objs.
* Refactor obj tests to use smart pointer.
* Dummy methods for Save/Load Model.
2019-10-18 01:56:02 -04:00
Jiaming Yuan
9fc681001a
Copy CMake parameter from dmlc-core. ( #4948 )
2019-10-17 23:46:32 -04:00
Jacob Kim
a78d4e7aa8
Follow PEP 257 -- Docstring Conventions ( #4959 )
2019-10-17 23:45:25 -04:00
Rory Mitchell
60748b2071
Use heuristic to select histogram node, avoid rabit call ( #4951 )
2019-10-18 11:33:54 +13:00
Jiaming Yuan
185e3f1916
Update GPU doc. ( #4953 )
2019-10-16 05:54:09 -04:00
Jiaming Yuan
7e72a12871
Don't set_params at the end of set_state. ( #4947 )
...
* Don't set_params at the end of set_state.
* Also fix another issue found in dask prediction.
* Add note about prediction.
Don't support other prediction modes at the moment.
2019-10-15 10:08:26 -04:00
Jiaming Yuan
2ebdec8aa6
Fix dask prediction. ( #4941 )
...
* Fix dask prediction.
* Add better error messages for wrong partition.
2019-10-14 23:19:34 -04:00
Jiaming Yuan
b61d534472
Span: use `size_t' for index_type, add `front' and `back'. ( #4935 )
...
* Use `size_t' for index_type. Add `front' and `back'.
* Remove a batch of `static_cast'.
2019-10-14 09:13:33 -04:00
Peter Badida
a9053aff83
Fix incorrectly displayed Note in the doc ( #4943 )
2019-10-14 03:45:23 -04:00
Jiaming Yuan
0e0849fa1e
Mention dask in readme. [skip ci] ( #4942 )
2019-10-14 03:44:08 -04:00
Jiaming Yuan
3d46bd0fa5
Ignore columnar alignment requirement. ( #4928 )
...
* Better error message for wrong type.
* Fix stride size.
2019-10-13 06:41:43 -04:00
Yuan Tang
05d4751540
Update README.md ( #4940 )
2019-10-13 02:37:19 -04:00
Yuan Tang
08ff510e48
Mention Kubernetes on README ( #4939 )
2019-10-13 01:43:09 -04:00
Philip Hyunsu Cho
f7487e4c2a
[CI] Run cuDF tests in Jenkins CI server ( #4927 )
2019-10-13 00:04:54 -04:00
Philip Hyunsu Cho
5b4f28cc46
[CI] Raise timeout threshold in Jenkins ( #4938 )
2019-10-12 23:47:35 -04:00
Jiaming Yuan
4bbf062ed3
[Breaking] Update sklearn interface. ( #4929 )
...
* Remove nthread, seed, silent. Add tree_method, gpu_id, num_parallel_tree. Fix #4909 .
* Check data shape. Fix #4896 .
* Check element of eval_set is tuple. Fix #4875
* Add doc for random_state with hogwild. Fixes #4919
2019-10-12 02:50:09 -04:00
Jiaming Yuan
c2cce4fac3
Update dmlc-core. ( #4924 )
...
* Fixed some threading errors.
* Allow updating parameters.
2019-10-09 23:16:45 -04:00
Jiaming Yuan
6c9b6f11da
Use cudf.concat explicitly. ( #4918 )
...
* Use `cudf.concat` explicitly.
* Add test.
2019-10-10 16:02:10 +13:00
Rory Mitchell
aefb1e5c2f
Resolve dask performance issues ( #4914 )
...
* Set dask client.map as impure function
* Remove nrows
* Remove slow check in verbose mode
2019-10-10 16:01:30 +13:00
Oleksandr Pryimak
80977182c5
Use bundled gtest ( #4900 )
...
* Suggest to use gtest bundled with dmlc
* Use dmlc bundled gtest in all CI scripts
* Make clang-tidy use dmlc embedded gtest
2019-10-09 16:26:19 -07:00
Jiaming Yuan
095de3bf5f
Export c++ headers in CMake installation. ( #4897 )
...
* Move get transpose into cc.
* Clean up headers in host device vector, remove thrust dependency.
* Move span and host device vector into public.
* Install c++ headers.
* Short notes for c and c++.
Co-Authored-By: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2019-10-06 23:53:09 -04:00
Jiaming Yuan
4ab1df5fe6
Check deprecated n_gpus. ( #4908 )
2019-10-02 02:05:14 -04:00
Jiaming Yuan
7e24a8d245
Improve doc and demo for dask. ( #4907 )
...
* Add a readme with link to doc.
* Add more comments in the demonstrations code.
* Workaround https://github.com/dask/distributed/issues/3081 .
2019-09-30 23:59:37 -04:00
Jiaming Yuan
d30e63a0a5
Support feature names/types for cudf. ( #4902 )
...
* Implement most of the pandas procedure for cudf except for type conversion.
* Requires an array of interfaces in metainfo.
2019-09-29 15:07:51 -04:00
Vibhu Jawa
2fa8b359e0
Add support for cudf.Series ( #4891 )
2019-09-25 23:52:28 -04:00
Liangcai Li
82ee2317e8
Add case for LongParam. ( #4885 )
...
Support specifying a long parameter as a String, the same as other basic
types such as Int, Double ...
2019-09-25 05:41:53 -07:00
Jiaming Yuan
b8433c455a
Rewrite Dask interface. ( #4819 )
2019-09-25 01:30:14 -04:00
Rong Ou
562bb0ae31
remove device shards ( #4867 )
2019-09-25 13:15:46 +08:00
Jiaming Yuan
0b89cd1dfa
Support gamma in GPU_Hist. ( #4874 )
...
* Just prevent building the tree instead of using an explicit pruner.
2019-09-24 10:16:08 +08:00
Jiaming Yuan
a40b72d127
Workaround isnan across different environments. ( #4883 )
2019-09-23 21:34:27 -04:00
Jiaming Yuan
c7416002e9
Fix DMatrix doc. ( #4884 )
2019-09-23 01:55:04 -04:00
Nan Zhu
fc8c9b0521
[jvm-packages] enable deterministic repartitioning when checkpoint is enabled ( #4807 )
...
* do repartitioning in DataUtil
* keep previous behavior of partitioning without checkpoint
* deterministic repartitioning
* change
2019-09-19 15:21:05 -07:00
Xu Xiao
277e25797b
[jvm-packages] refine numAliveCores method of SparkParallelismTracker ( #4858 )
...
* refine numAliveCores
* refine XGBoostToMLlibParams
* fix waitForCondition
* resolve conflicts
* Update SparkParallelismTracker.scala
2019-09-19 15:18:29 -07:00
Honza Sterba
22209b7b95
[jvm-packages] Add BigDenseMatrix ( #4383 )
...
* Add BigDenseMatrix
* ability to create DMatrix with bigger than Integer.MAX_VALUE size arrays
* uses sun.misc.Unsafe
* make DMatrix test work from a jar as well
2019-09-18 20:46:14 -07:00
Jiaming Yuan
57106a3459
Fix parsing empty json object. ( #4868 )
...
* Fix parsing empty json object.
* Better error message.
2019-09-18 03:31:46 -04:00
Rong Ou
006eb80578
ignore vscode and clion files ( #4866 )
...
* ignore vscode and clion files
* ignore all .idea directories
2019-09-17 21:27:40 -04:00
Jiaming Yuan
d669ea1eaa
Deprecate set group ( #4864 )
...
* Convert jvm package and R package.
* Restore for compatibility.
2019-09-17 21:26:54 -04:00
Philip Hyunsu Cho
0e0955a6d8
Add link to Ruby XGBoost gem ( #4856 )
2019-09-17 10:40:44 -07:00
Jiaming Yuan
5374f52531
Complete cudf support. ( #4850 )
...
* Handles missing value.
* Accept all floating point and integer types.
* Move to cudf 0.9 API.
* Remove requirement on `null_count`.
* Arbitrary column types support.
2019-09-16 23:52:00 -04:00
Rong Ou
125bcec62e
Move ellpack page construction into DMatrix ( #4833 )
2019-09-16 23:50:55 -04:00
Chen Qin
512f037e55
[rabit_bootstrap_cache] failed xgb worker recover from other workers ( #4808 )
...
* Better recovery support. Restarting only the failed workers.
2019-09-16 23:31:52 -04:00
Xu Xiao
c89bcc4de5
[blocking] fix parallel eval_split of hist updater ( #4851 )
...
* Don't call rabit functions inside parallel loop.
2019-09-13 09:35:03 -04:00
Jiaming Yuan
6a5e805886
Add n_jobs as an alias of nthread. ( #4842 )
2019-09-09 19:57:12 -04:00