xgboost

Author	SHA1	Message	Date
Jiaming Yuan	bc267dd729	Use ptr from `mmap` for `GHistIndexMatrix` and `ColumnMatrix`. (#9315 ) * Use ptr from mmap for `GHistIndexMatrix` and `ColumnMatrix`. - Define a resource for holding various types of memory pointers. - Define ref vector for holding resources. - Swap the underlying resources for GHist and ColumnM. - Add documentation for current status. - s390x support is removed. It should work if you can compile XGBoost, all the old workaround code does is to get GCC to compile.	2023-06-27 19:05:46 +08:00
Jiaming Yuan	cfa9c42eb4	Fix callback in AFT viz demo. (#9333 ) * Fix callback in AFT viz demo. - Update the callback function. - Add lint check.	2023-06-26 22:35:02 +08:00
Jiaming Yuan	54da4b3185	Cleanup to prepare for using mmap pointer in external memory. (#9317 ) - Update SparseDMatrix comment. - Use a pointer in the bitfield. We will replace the `std::vector<bool>` in `ColumnMatrix` with bitfield. - Clean up the page source. The timer is removed as it's inaccurate once we swap the mmap pointer into the page.	2023-06-22 06:43:11 +08:00
Jiaming Yuan	6d22ea793c	Test QDM with sparse data on CPU. (#9316 )	2023-06-19 21:27:03 +08:00
Jiaming Yuan	ee6809e642	Use mmap for external memory. (#9282 ) - Have basic infrastructure for mmap. - Release file write handle.	2023-06-19 18:52:55 +08:00
Rong Ou	d8beb517ed	Support bitwise allreduce in NCCL communicator (#9300 )	2023-06-17 01:56:50 +08:00
Rong Ou	e70810be8a	Refactor device communicator to make allreduce more flexible (#9295 )	2023-06-14 03:53:03 +08:00
Philip Hyunsu Cho	c2f0486d37	[CI] Run two pipeline loaders for responsiveness (#9294 )	2023-06-12 09:52:40 -07:00
Jiaming Yuan	152e2fb072	Unify test helpers for creating ctx. (#9274 )	2023-06-10 03:35:22 +08:00
Jiaming Yuan	ea0deeca68	Disable dense optimization in hist for distributed training. (#9272 )	2023-06-10 02:31:34 +08:00
github-actions[bot]	8c1065f645	[CI] Update RAPIDS to latest stable (#9278 ) Co-authored-by: hcho3 <hcho3@users.noreply.github.com>	2023-06-09 09:55:08 -07:00
Jiaming Yuan	1fcc26a6f8	Set `ndcg` to default for LTR. (#8822 ) - Add document. - Add tests. - Use `ndcg` with `topk` as default.	2023-06-09 23:31:33 +08:00
Rong Ou	ff122d61ff	More tests for cpu predictor with column split (#9270 )	2023-06-08 22:47:19 +08:00
Boris	7f9cb921f4	Rearranged maven profiles so that scala-2.13 artifacts are published without gpu-related libraries (#9253 )	2023-06-05 13:52:10 -07:00
Rong Ou	962a20693f	More support for column split in cpu predictor (#9244 ) - Added column split support to `PredictInstance` and `PredictLeaf`. - Refactoring of tests.	2023-06-05 08:05:38 +08:00
Philip Hyunsu Cho	288539ac78	[CI] Automatically bump Rapids version in containers (#9234 ) * [CI] Use RAPIDS 23.04 * [CI] Remove outdated filters in dependabot * [CI] Automatically bump Rapids version in containers * Automate pull request	2023-06-02 08:17:41 -07:00
Jiaming Yuan	9fbde21e9d	Rework the precision metric. (#9222 ) - Rework the precision metric for both CPU and GPU. - Mention it in the document. - Cleanup old support code for GPU ranking metric. - Deterministic GPU implementation. * Drop support for classification. * type. * use batch shape. * lint. * cpu build. * cpu build. * lint. * Tests. * Fix. * Cleanup error message.	2023-06-02 20:49:43 +08:00
Philip Hyunsu Cho	db8288121d	Revert "Publishing scala-2.13 artifacts to the maven S3 repo. (#9224 )" (#9233 ) This reverts commit bb2a17b90c4f212191932dce3453591bd7f47cfc.	2023-06-01 14:39:39 -07:00
Boris	bb2a17b90c	Publishing scala-2.13 artifacts to the maven S3 repo. (#9224 )	2023-06-01 10:45:18 -07:00
Jiaming Yuan	62e9387cd5	[ci] Update PySpark version. (#9214 )	2023-05-31 03:00:44 +08:00
Jiaming Yuan	17fd3f55e9	Optimize adapter element counting on GPU. (#9209 ) - Implement a simple `IterSpan` for passing iterators with size. - Use shared memory for column size counts. - Use one thread for each sample in row count to reduce atomic operations.	2023-05-30 23:28:43 +08:00
Jiaming Yuan	03bc6e6427	Remove unused variables. (#9210 ) - remove used variables. - Remove signed comparison warnings.	2023-05-28 05:24:15 +08:00
Boris	a01df102c9	Scala 2.13 support. (#9099 ) 1. Updated the test logic 2. Added smoke tests for Spark examples. 3. Added integration tests for Spark with Scala 2.13	2023-05-27 19:34:02 +08:00
Jiaming Yuan	8c174ef2d3	[CI] Update images that are not related to binary release. (#9205 ) * [CI] Update images that are not related to the binary release. - Update clang-tidy, prefer tools from the Ubuntu repository. - Update GPU image to 22.04. - Small cleanup to the tidy script. - Remove gpu_jvm, which seems to be unused.	2023-05-27 17:40:46 +08:00
Rong Ou	5b69534b43	Support column split in multi-target `hist` (#9171 )	2023-05-26 16:56:05 +08:00
Jiaming Yuan	3913ff470f	Import data lazily during tests. (#9176 )	2023-05-23 03:58:31 +08:00
Rong Ou	603f8ce2fa	Support `hist` in the partition builder under column split (#9120 )	2023-05-11 05:24:29 +08:00
Rong Ou	52311dcec9	Fix multi-threaded gtests (#9148 )	2023-05-10 19:15:32 +08:00
Jiaming Yuan	85988a3178	Wait for data CUDA stream instead of sync. (#9144 ) --------- Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2023-05-09 09:52:21 +08:00
Uriya Harpeness	a075aa24ba	Move python tool configurations to pyproject.toml, and add the python 3.11 classifier. (#9112 )	2023-05-06 02:59:06 +08:00
Jiaming Yuan	55968ed3fa	Fix monotone constraints on CPU. (#9122 )	2023-05-06 01:07:54 +08:00
Jiaming Yuan	47b3cb6fb7	Remove unused parameters in RABIT. (#9108 )	2023-05-05 05:26:24 +08:00
Jiaming Yuan	08ce495b5d	Use Booster context in DMatrix. (#8896 ) - Pass context from booster to DMatrix. - Use context instead of integer for `n_threads`. - Check the consistency configuration for `max_bin`. - Test for all combinations of initialization options.	2023-04-28 21:47:14 +08:00
Jiaming Yuan	1f9a57d17b	[Breaking] Require format to be specified in input URI. (#9077 ) Previously, we use `libsvm` as default when format is not specified. However, the dmlc data parser is not particularly robust against errors, and the most common type of error is undefined format. Along with which, we will recommend users to use other data loader instead. We will continue the maintenance of the parsers as it's currently used for many internal tests including federated learning.	2023-04-28 19:45:15 +08:00
Jiaming Yuan	e206b899ef	Rework MAP and Pairwise for LTR. (#9075 )	2023-04-28 02:39:12 +08:00
Rong Ou	511d4996b5	Rely on gRPC to generate random port (#9102 )	2023-04-27 09:48:26 +08:00
Scott Gustafson	353ed5339d	Convert ``DaskXGBClassifier.classes_`` to an array (#8452 ) --------- Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2023-04-27 02:23:35 +08:00
Rong Ou	a320b402a5	More refactoring to take advantage of collective aggregators (#9081 )	2023-04-26 03:36:09 +08:00
Philip Hyunsu Cho	a5cd2412de	Replace setup.py with pyproject.toml (#9021 ) * Create pyproject.toml * Implement a custom build backend (see below) in packager directory. Build logic from setup.py has been refactored and migrated into the new backend. * Tested: pip wheel . (build wheel), python -m build --sdist . (source distribution)	2023-04-20 13:51:39 -07:00
Rong Ou	42d100de18	Make sure metrics work with federated learning (#9037 )	2023-04-19 15:39:11 +08:00
Jiaming Yuan	ef13dd31b1	Rework the NDCG objective. (#9015 )	2023-04-18 21:16:06 +08:00
Rong Ou	ba9d24ff7b	Make sure metrics work with column-wise distributed training (#9020 )	2023-04-18 03:48:23 +08:00
WeichenXu	191d0aa5cf	[spark] Make spark model have the same UID with its estimator (#9022 ) Signed-off-by: Weichen Xu <weichen.xu@databricks.com>	2023-04-14 02:53:30 +08:00
Philip Hyunsu Cho	8e0f320db3	[CI] Don't run CI automatically for dependabot (#9034 )	2023-04-13 08:19:56 -07:00
Jiaming Yuan	fe9dff339c	Convert federated learner test into test suite. (#9018 ) * Convert federated learner test into test suite. - Add specialization to learning to rank.	2023-04-11 09:52:55 +08:00
Jiaming Yuan	2c8d735cb3	Fix tests with pandas 2.0. (#9014 ) * Fix tests with pandas 2.0. - `is_categorical` is replaced by `is_categorical_dtype`. - one hot encoding returns boolean type instead of integer type.	2023-04-11 00:17:34 +08:00
Jiaming Yuan	1cf4d93246	Convert federated tests into test suite. (#9006 ) - Add specialization for learning to rank.	2023-04-04 01:29:47 +08:00
Rong Ou	15e073ca9d	Make objectives work with vertical distributed and federated learning (#9002 )	2023-04-03 17:07:42 +08:00
Jiaming Yuan	bac22734fb	Remove ntree limit in python package. (#8345 ) - Remove `ntree_limit`. The parameter has been deprecated since 1.4.0. - The SHAP package compatibility is broken.	2023-03-31 19:01:55 +08:00
Jiaming Yuan	d062a9e009	Define pair generation strategies for LTR. (#8984 )	2023-03-30 12:00:35 +08:00

1 2 3 4 5 ...

1369 Commits