Jiaming Yuan
c635d4c46a
Implement ubjson. ( #7549 )
...
* Implement ubjson.
This is a partial implementation of UBJSON with support for typed arrays. Some missing
features are `f64`, typed object, and the no-op.
2022-01-10 23:24:23 +08:00
Jiaming Yuan
001503186c
Rewrite approx ( #7214 )
...
This PR rewrites the approx tree method to use codebase from hist for better performance and code sharing.
The rewrite has many benefits:
- Support for both `max_leaves` and `max_depth`.
- Support for `grow_policy`.
- Support for mono constraint.
- Support for feature weights.
- Support for easier bin configuration (`max_bin`).
- Support for categorical data.
- Faster performance for most of the datasets. (many times faster)
- Support for prediction cache.
- Significantly better performance for external memory.
- Unites the code base between approx and hist.
2022-01-10 21:15:05 +08:00
Jiaming Yuan
ed95e77752
[jvm-packages] Update JNI header. ( #7550 )
2022-01-10 14:59:40 +08:00
Jiaming Yuan
91c1a1c52f
Fix index type for bitfield. ( #7541 )
2022-01-05 19:23:29 +08:00
Jiaming Yuan
0df2ae63c7
Fix num_boosted_rounds for linear model. ( #7538 )
...
* Add note.
* Fix n boosted rounds.
2022-01-05 03:29:33 +08:00
Jiaming Yuan
28af6f9abb
Remove omp_get_max_threads in gbm and linear. ( #7537 )
...
* Use ctx in gbm.
* Use ctx threads in gbm and linear.
2022-01-05 03:28:52 +08:00
Jiaming Yuan
eea094e1bc
Remove some warnings from clang. ( #7533 )
...
* Unused variable.
* Unnecessary virtual function.
2022-01-05 03:28:21 +08:00
Jiaming Yuan
ec56d5869b
[doc] Include dask examples into doc. ( #7530 )
2022-01-05 03:27:22 +08:00
Jiaming Yuan
54582f641a
[doc] Use cross references in sphinx doc. ( #7522 )
...
* Use cross references instead of URL.
* Fix auto doc for callback.
2022-01-05 03:21:25 +08:00
Jiaming Yuan
eb1efb54b5
Define feature_names_in_. ( #7526 )
...
* Define `feature_names_in_`.
* Raise attribute error if it's not defined.
2022-01-05 01:35:34 +08:00
Jiaming Yuan
8f0a42a266
Initial support for multi-label classification. ( #7521 )
...
* Add support in sklearn classifier.
2022-01-04 23:58:21 +08:00
Jiaming Yuan
68cdbc9c16
Remove omp_get_max_threads in CPU predictor. ( #7519 )
...
This is part of the on going effort to remove the dependency on global omp variables.
2022-01-04 22:12:15 +08:00
Ikko Ashimine
5516281881
Fix typo in tree_model.cc ( #7539 )
...
occurance -> occurrence
2021-12-30 20:12:25 +08:00
Randall Britten
a4a0ebb85d
[doc] Lowercase omega for per tree complexity ( #7532 )
...
As suggested on issue #7480
2021-12-29 23:05:54 +08:00
Louis Desreumaux
3886c3dd8f
Remove macro definitions of snprintf and vsnprintf ( #7536 )
2021-12-26 08:05:59 +08:00
Ginko Balboa
29bfa94bb6
Fix external memory with gpu_hist and subsampling combination bug. ( #7481 )
...
Instead of accessing data from the `original_page_`, access the data from the first page of the available batch.
fix #7476
Co-authored-by: jiamingy <jm.yuan@outlook.com>
2021-12-24 11:15:35 +08:00
Jiaming Yuan
7f399eac8b
Use double for GPU Hist node sum. ( #7507 )
2021-12-22 08:41:35 +08:00
Jiaming Yuan
eabec370e4
[R] Fix single sample prediction. ( #7524 )
2021-12-21 14:11:07 +08:00
Bobby Wang
e8c1eb99e4
[jvm-package] Clean up the legacy gpu support tests ( #7523 )
2021-12-21 09:15:51 +08:00
Xiaochang Wu
59bd1ab17e
Skip callback demo test if matplotlib is not installed ( #7520 )
2021-12-19 08:20:38 +08:00
Jiaming Yuan
58a6723eb1
Initial support for multioutput regression. ( #7514 )
...
* Add num target model parameter, which is configured from input labels.
* Change elementwise metric and indexing for weights.
* Add demo.
* Add tests.
2021-12-18 09:28:38 +08:00
Jiaming Yuan
9ab73f737e
Extract Sketch Entry from hist maker. ( #7503 )
...
* Extract Sketch Entry from hist maker.
* Add a new sketch container for sorted inputs.
* Optimize bin search.
2021-12-18 05:36:56 +08:00
Qingyun Wu
b4a1236cfc
[doc] Update the link to the tuning example in FLAML
2021-12-17 14:31:00 +08:00
Bobby Wang
24e25802a7
[jvm-packages] Add Rapids plugin support ( #7491 )
...
* Add GPU pre-processing pipeline.
2021-12-17 13:11:12 +08:00
Jiaming Yuan
5b1161bb64
Convert labels into tensor. ( #7456 )
...
* Add a new ctor to tensor for `initilizer_list`.
* Change labels from host device vector to tensor.
* Rename the field from `labels_` to `labels` since it's a public member.
2021-12-17 00:58:35 +08:00
Jiaming Yuan
6f8a4633b7
Fix Python typehint with upgraded mypy. ( #7513 )
2021-12-16 23:08:08 +08:00
Jiaming Yuan
70b12d898a
[dask] Fix ddqdm with empty partition. ( #7510 )
...
* Fix empty partition.
* war.
2021-12-16 20:37:29 +08:00
Jiaming Yuan
a512b4b394
[doc] Promote dask from experimental. [skip ci] ( #7509 )
2021-12-16 14:17:06 +08:00
Jiaming Yuan
05497a9141
[dask] Fix asyncio. ( #7508 )
2021-12-13 01:48:25 +08:00
Jiaming Yuan
01152f89ee
Remove unused parameters. ( #7499 )
2021-12-09 14:24:51 +08:00
Harvey
1864fab592
Minor edits to Parameters doc page. ( #7500 )
...
* bost -> both
* doc improvement
* use original filename
* syntax highlight false
* missed a few highlights
2021-12-07 15:46:44 +08:00
Jiaming Yuan
021f8bf28b
Fix pylint. ( #7498 )
2021-12-07 13:23:30 +08:00
Jiaming Yuan
eee527d264
Add approx partitioner. ( #7467 )
2021-11-27 15:22:06 +08:00
Jiaming Yuan
85cbd32c5a
Add range-based slicing to tensor view. ( #7453 )
2021-11-27 13:42:36 +08:00
danmarinescu
6f38f5affa
Updated CMake version requirement in build.rst ( #7487 )
...
The documentation states that to build from source you need CMake 3.13 or higher. However, according to https://github.com/dmlc/xgboost/blob/master/CMakeLists.txt#L1 CMake 3.14 or higher is required.
2021-11-27 09:58:01 +08:00
Jiaming Yuan
557ffc4bf5
Reduce base margin to 2 dim for now. ( #7455 )
2021-11-27 00:46:13 +08:00
Jiaming Yuan
bf7bb575b4
Test CPU histogram with cat data. ( #7465 )
2021-11-27 00:43:28 +08:00
Bobby Wang
24be04e848
[jvm-packages] Add DeviceQuantileDMatrix to Scala binding ( #7459 )
2021-11-24 20:23:18 +08:00
Philip Hyunsu Cho
619c450a49
[CI] Add missing step extract_branch ( #7479 )
2021-11-24 17:35:59 +08:00
Jiaming Yuan
820e1c01ef
Fix macos package upload. ( #7475 )
...
* Split up the tests.
2021-11-24 03:43:49 +08:00
Jiaming Yuan
488f12a996
Fix github macos package upload. ( #7474 )
2021-11-24 00:29:11 +08:00
Jiaming Yuan
c024c42dce
Modernize XGBoost Python document. ( #7468 )
...
* Use sphinx gallery to integrate examples.
* Remove mock objects.
* Add dask doc inventory.
2021-11-23 23:24:52 +08:00
Philip Hyunsu Cho
96a9848c9e
[CI] Fix continuous delivery pipeline for MacOS ( #7472 )
2021-11-23 22:22:08 +08:00
Jiaming Yuan
b124a27f57
Support scipy sparse in dask. ( #7457 )
2021-11-23 16:45:36 +08:00
Jiaming Yuan
5262e933f7
Remove unnecessary constexpr. ( #7466 )
2021-11-23 16:42:08 +08:00
Philip Hyunsu Cho
0c67685e43
[CI] Add a helper script to aid Maven release ( #7470 )
...
* [CI] Add a helper script to aid Maven release
* Move script to dev/ [skip ci]
* Update command [skip ci]
2021-11-23 00:11:07 -08:00
Harvey
0552ca8021
Fix typo ( #7469 )
2021-11-23 08:58:45 +08:00
Jiaming Yuan
176110a22d
Support external memory in CPU histogram building. ( #7372 )
2021-11-23 01:13:33 +08:00
Jiaming Yuan
d33854af1b
[Breaking] Accept multi-dim meta info. ( #7405 )
...
This PR changes base_margin into a 3-dim array, with one of them being reserved for multi-target classification. Also, a breaking change is made for binary serialization due to extra dimension along with a fix for saving the feature weights. Lastly, it unifies the prediction initialization between CPU and GPU. After this PR, the meta info setter in Python will be based on array interface.
2021-11-18 23:02:54 +08:00
Jiaming Yuan
9fb4338964
Add test for eta and mitigate float error. ( #7446 )
...
* Add eta test.
* Don't skip test.
2021-11-18 20:42:48 +08:00