xgboost

Author	SHA1	Message	Date
Jiaming Yuan	83a66b4994	Support categorical data for hist. (#7695 ) * Extract partitioner from hist. * Implement categorical data support by passing the gradient index directly into the partitioner. * Organize/update document. * Remove code for negative hessian.	2022-02-25 03:47:14 +08:00
Jiaming Yuan	f60d95b0ba	[R] Construct booster object in `load.raw`. (#7686 )	2022-02-24 10:06:18 +08:00
Bobby Wang	89aa8ddf52	[jvm-packages] fix the prediction issue for multi:softmax (#7694 )	2022-02-24 01:09:45 +08:00
Jiaming Yuan	6762c45494	Small cleanup to gradient index and hist. (#7668 ) * Code comments. * Const accessor to index. * Remove some weird variables in the `Index` class. * Simplify the `MemStackAllocator`.	2022-02-23 11:37:21 +08:00
Jiaming Yuan	49c74a5369	Update R package description. (#7691 ) * Change role. * Remove cmake file when building the package.	2022-02-23 08:36:37 +08:00
Bobby Wang	e3e6de5ed9	[jvm-packages] unify the set features API (#7692 ) xgboost4j-spark provides 2 sets of API for setting features, one for CPU, another for GPU, which may cause confusion. This PR removes the GPU API and adds an override CPU function setFeaturesCol to accept Array[String] parameters.	2022-02-23 03:37:25 +08:00
Jiaming Yuan	c859764d29	[doc] Clarify that states in callbacks are mutated. (#7685 ) * Fix copy for cv. This prevents inserting default callbacks into the input list. * Clarify the behavior of callbacks in training/cv. * Fix typos in doc.	2022-02-22 11:45:00 +08:00
Jiaming Yuan	584bae1fc6	Fix document build with scikit-learn (#7684 ) * Require sphinx >= 4.4 for RTD. * Install sklearn.	2022-02-22 08:58:54 +08:00
Jiaming Yuan	e56d1779e1	Require Python 3.7. (#7682 ) * Update setup.py.	2022-02-21 05:46:48 +08:00
Jiaming Yuan	549f3bd781	Honor CPU counts from CFS. (#7654 )	2022-02-21 03:13:26 +08:00
Jiaming Yuan	671b3c8d8e	Fix typo. (#7680 )	2022-02-20 03:42:47 +08:00
Jiaming Yuan	b2341eab0c	[R] Fix broken links. (#7670 )	2022-02-20 00:55:48 +08:00
Bobby Wang	131858e7cb	[jvm-packages] Do not repartition when nWorker = 1 (#7676 )	2022-02-19 21:45:54 +08:00
Jiaming Yuan	f08c5dcb06	Cleanup some pylint errors. (#7667 ) * Cleanup some pylint errors. * Cleanup pylint errors in rabit modules. * Make data iter an abstract class and cleanup private access. * Cleanup no-self-use for booster.	2022-02-19 18:53:12 +08:00
Jiaming Yuan	b76c5d54bf	Define export symbols in callback module. (#7665 )	2022-02-19 18:52:41 +08:00
Jiaming Yuan	7366d3b20c	Ensure models with categorical splits don't use old binary format. (#7666 )	2022-02-19 08:05:28 +08:00
Jiaming Yuan	14d61b0141	[doc] Update document for building from source. (#7664 ) - Mention standard install command for R package. - Remove repeated "get source" step. - Remove troubleshooting on Windows. It's outdated considering VS 2022 is already out.	2022-02-19 04:57:03 +08:00
Jiaming Yuan	d625dc2047	Work around nvcc error. (#7673 )	2022-02-19 01:41:46 +08:00
Jiaming Yuan	3877043d41	Avoid print for R package. (#7672 )	2022-02-18 08:06:24 +08:00
Jiaming Yuan	711f7f3851	Avoid `std::terminate` for R package. (#7661 ) This is part of CRAN policies.	2022-02-17 01:27:20 +08:00
Jiaming Yuan	12949c6b31	[R] Implement feature weights. (#7660 )	2022-02-16 22:20:52 +08:00
Philip Hyunsu Cho	0149f81a5a	[CI] Fix S3 upload (#7662 )	2022-02-16 01:35:27 -08:00
Jiaming Yuan	93eebe8664	[doc] Fix broken link. [skip ci] (#7655 )	2022-02-15 14:07:34 +08:00
Jiaming Yuan	0da7d872ef	[doc] Update for prediction. (#7648 )	2022-02-15 05:01:55 +08:00
Jiaming Yuan	0d0abe1845	Support optimal partitioning for GPU hist. (#7652 ) * Implement `MaxCategory` in quantile. * Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function. * Extract an evaluator from GPU Hist to store the needed states. * Added some CUDA stream/event utilities. * Update document with references. * Fixed a bug in approx evaluator where the number of data points is less than the number of categories.	2022-02-15 03:03:12 +08:00
Jiaming Yuan	2369d55e9a	Add tests for prediction cache. (#7650 ) * Extract the test from approx for other tree methods. * Add note on how it works.	2022-02-15 00:28:00 +08:00
Jiaming Yuan	5cd1f71b51	[dask] Improve configuration for port. (#7645 ) - Try port 0 to let the OS return the available port. - Add port configuration.	2022-02-14 21:34:34 +08:00
Jiaming Yuan	b52c4e13b0	[dask] Fix empty partition with pandas input. (#7644 ) Empty partition is different from empty dataset. For the former case, each worker has non-empty dask collections, but each collection might contain empty partition.	2022-02-14 19:35:51 +08:00
Jiaming Yuan	1f020a6097	Add maintainer for R package. (#7649 )	2022-02-12 23:45:30 +08:00
Jiaming Yuan	1441a6cd27	[CI] Update R cache. (#7646 )	2022-02-11 19:50:11 +08:00
Jiaming Yuan	2775c2a1ab	Prepare external memory support for hist. (#7638 ) This PR prepares the GHistIndexMatrix to host the column matrix which is used by the hist tree method by accepting sparse_threshold parameter. Some cleanups are made to ensure the correct batch param is being passed into DMatrix along with some additional tests for correctness of SimpleDMatrix.	2022-02-10 16:58:02 +08:00
dependabot[bot]	87c01f49d8	Bump hadoop-common from 2.7.3 to 2.10.1 in /jvm-packages/xgboost4j-flink (#7641 ) Bumps hadoop-common from 2.7.3 to 2.10.1. --- updated-dependencies: - dependency-name: org.apache.hadoop:hadoop-common dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-02-09 17:07:35 -08:00
Jiaming Yuan	fe4ce920b2	[dask] Cleanup dask module. (#7634 ) * Add a new utility for mapping function onto workers. * Unify the type for feature names. * Clean up the iterator. * Fix prediction with DaskDMatrix worker specification. * Fix base margin with DeviceQuantileDMatrix. * Support vs 2022 in setup.py.	2022-02-08 20:41:46 +08:00
Jiaming Yuan	926af9951e	Add missing train parameter for sklearn interface. (#7629 ) Some other parameters are still missing and rely on **kwargs, for instance parameters from dart.	2022-02-08 13:20:19 +08:00
Jiaming Yuan	3e693e4f97	[dask] Fix nthread config with dask sklearn wrapper. (#7633 )	2022-02-08 06:38:32 +08:00
Ed Shee	d152c59a9c	fixed broken link to Seldon XGBoost server (#7628 )	2022-02-05 01:03:29 +08:00
Philip Hyunsu Cho	34a238ca98	[CI] Clean up Python wheel build pipeline (#7626 ) * [CI] Always upload artifacts to [branch_name]/ * [CI] Move detailed setup inside build_python_wheels.sh * Fix typo	2022-02-03 00:55:44 -08:00
Philip Hyunsu Cho	f6e6d0b2c0	[CI] Build Python wheels for MacOS (x86_64 and arm64) (#7621 ) * Build Python wheels for OSX (x86_64 and arm64) * Use Conda's libomp when running Python tests * fix * Add comment to explain CIBW_TARGET_OSX_ARM64 * Update release script * Add comments in build_python_wheels.sh * Document wheel pipeline	2022-02-02 17:35:48 -08:00
Philip Hyunsu Cho	271a7c5d43	[Doc] fix typo in install doc (#7623 )	2022-01-31 13:35:56 -08:00
Philip Hyunsu Cho	c621775f34	Replace all uses of deprecated function sklearn.datasets.load_boston (#7373 ) * Replace all uses of deprecated function sklearn.datasets.load_boston * More renaming * Fix bad name * Update assertion * Fix n boosted rounds. * Avoid over regularization. * Rebase. * Avoid over regularization. * Whac-a-mole Co-authored-by: fis <jm.yuan@outlook.com>	2022-01-30 04:27:57 -08:00
Philip Hyunsu Cho	b4340abf56	Add special handling for multi:softmax in sklearn predict (#7607 ) * Add special handling for multi:softmax in sklearn predict * Add test coverage	2022-01-29 15:54:49 -08:00
david-cortes	7f738e7f6f	[R] Accept CSR data for predictions (#7615 )	2022-01-30 00:54:57 +08:00
Michael Chirico	549bd419bb	use exit hook to remove temp file (#7611 ) This guarantees the removal will trigger for unexpected early exits	2022-01-29 16:06:52 +08:00
Philip Hyunsu Cho	f21301c749	[Doc] Add instruction to install XGBoost for Apple Silicon using Conda (#7612 )	2022-01-28 01:06:39 -08:00
Jiaming Yuan	81210420c6	Remove `omp_get_max_threads` (#7608 ) This is the one last PR for removing omp global variable. * Add context object to the `DMatrix`. This bridges `DMatrix` with https://github.com/dmlc/xgboost/issues/7308 . * Require context to be available at the construction time of booster. * Add `n_threads` support for R csc DMatrix constructor. * Remove `omp_get_max_threads` in R glue code. * Remove threading utilities that rely on omp global variable.	2022-01-28 16:09:22 +08:00
Philip Hyunsu Cho	028bdc1740	[R] Fix typo in docstring (#7606 )	2022-01-26 23:33:25 +08:00
Jiaming Yuan	e060519d4f	Avoid regenerating the gradient index for approx. (#7591 )	2022-01-26 21:41:30 +08:00
Jiaming Yuan	5d7818e75d	Remove `omp_get_max_threads` in tree updaters. (#7590 )	2022-01-26 19:55:47 +08:00
Jiaming Yuan	24789429fd	Support latest pandas Index type. (#7595 )	2022-01-26 18:20:10 +08:00
AJ Schmidt	511805c981	Compress fatbins (#7601 ) * compress CUDA device code Co-authored-by: ptaylor <paul.e.taylor@me.com>	2022-01-25 18:30:59 +08:00

... 2 3 4 5 6 ...

5832 Commits