xgboost

Author	SHA1	Message	Date
Jiaming Yuan	07cf3d3e53	Fix threads in DMatrix slice. (#8667 )	2023-01-14 07:16:57 +08:00
Jiaming Yuan	cfa994d57f	Multi-target support for L1 error. (#8652 ) - Add matrix support to the median function. - Iterate through each target for quantile computation.	2023-01-11 05:51:14 +08:00
Jiaming Yuan	badeff1d74	Init estimation for regression. (#8272 )	2023-01-11 02:04:56 +08:00
Jiaming Yuan	26c9882e23	Fix loading GPU pickle with a CPU-only xgboost distribution. (#8632 ) We can handle loading the pickle on a CPU-only machine if the XGBoost is built with CUDA enabled (Linux and Windows PyPI package), but not if the distribution is CPU-only (macOS PyPI package).	2023-01-05 02:14:30 +08:00
Jiaming Yuan	beefd28471	Split up SHAP from `RegTree`. (#8612 ) * Split up SHAP from `RegTree`. Simplify the tree interface.	2023-01-04 18:17:47 +08:00
James Lamb	fa44a33ee6	remove unused variables in JSON-parsing code (#8627 )	2023-01-04 15:50:33 +08:00
Jiaming Yuan	8d545ab2a2	Implement fit stump. (#8607 )	2023-01-04 04:14:51 +08:00
Rong Ou	3ceeb8c61c	Add data split mode to DMatrix MetaInfo (#8568 )	2022-12-25 20:37:37 +08:00
Rong Ou	77b069c25d	Support bitwise allreduce operations in the communicator (#8623 )	2022-12-25 06:40:05 +08:00
Jiaming Yuan	c6a8754c62	Define CUDA Context. (#8604 ) We will transition to non-default and non-blocking CUDA stream.	2022-12-20 15:15:07 +08:00
Jiaming Yuan	a10e4cba4e	Fix linalg iterator. (#8603 )	2022-12-16 23:05:03 +08:00
Jiaming Yuan	43a647a4dd	Fix inference with categorical feature. (#8591 )	2022-12-15 17:57:26 +08:00
Rong Ou	15a88ceef0	Fix deprecated CUB calls in CUDA 12.0 (#8578 )	2022-12-12 17:02:30 +08:00
Jiaming Yuan	3e26107a9c	Rename and extract `Context`. (#8528 ) * Rename `GenericParameter` to `Context`. * Rename header file to reflect the change. * Rename all references.	2022-12-07 04:58:54 +08:00
Jiaming Yuan	e38fe21e0d	Cleanup regression objectives. (#8539 )	2022-12-07 01:05:42 +08:00
Jiaming Yuan	e3bf5565ab	Extract transform iterator. (#8498 )	2022-12-05 21:37:07 +08:00
Rong Ou	78d65a1928	Initial support for column-wise data split (#8468 )	2022-12-04 01:37:51 +08:00
Rong Ou	a8255ea678	Add an in-memory collective communicator (#8494 )	2022-12-01 00:24:12 +08:00
Jiaming Yuan	157e98edf7	Support half type from cupy. (#8487 )	2022-11-30 17:56:42 +08:00
Jiaming Yuan	addaa63732	Support null value in CUDA array interface. (#8486 ) * Support null value in CUDA array interface. - Fix for potential null value in array interface. - Fix incorrect check on mask stride. * Simple tests. * Extract mask.	2022-11-28 17:48:25 -08:00
Jiaming Yuan	3fc1046fd3	Reduce compiler warnings on CPU-only build. (#8483 )	2022-11-29 00:04:16 +08:00
Jiaming Yuan	e07245f110	Take datatable as row major input. (#8472 ) * Take datatable as row major input. Try to avoid a transform with dense table.	2022-11-24 09:20:13 +08:00
Jiaming Yuan	5f1a6fca0d	[R] Use new interface for creating DMatrix from CSR. (#8455 ) * [R] Use new interface for creating DMatrix from CSR. - CSC is still using the old API. The old API is not aware of `nthread` parameter, which makes DMatrix to use all available thread during construction and during transformation lie `SparsePage` -> `CSCPage`.	2022-11-23 21:36:43 +08:00
Robert Maynard	16f96b6cfb	Work with newer thrust and libcudacxx (#8454 ) * Thrust 1.17 removes the experimental/pinned_allocator. When xgboost is brought into a large project it can be compiled against Thrust 1.17+ which don't offer this experimental allocator. To ensure that going forward xgboost works in all environments we provide a xgboost namespaced version of the pinned_allocator that previously was in Thrust.	2022-11-11 04:22:53 +08:00
Rong Ou	8e76f5f595	Use `DataSplitMode` to configure data loading (#8434 ) * Use `DataSplitMode` to configure data loading	2022-11-08 16:21:50 +08:00
Jiaming Yuan	a408c34558	Update JSON parser demo with categorical feature. (#8401 ) - Parse categorical features in the Python example. - Add tests. - Update document.	2022-10-28 20:57:43 +08:00
Jiaming Yuan	bb5e18c29c	Fix CUDA async stream. (#8380 )	2022-10-22 23:13:28 +08:00
Dmitry Razdoburdin	5bd849f1b5	Unify the partitioner for hist and approx. Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com> Co-authored-by: jiamingy <jm.yuan@outlook.com>	2022-10-20 02:49:20 +08:00
Rong Ou	8f3dee58be	Speed up tests with federated learning enabled (#8350 ) * Speed up tests with federated learning enabled * Re-enable timeouts Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-17 15:17:04 -07:00
Jiaming Yuan	031d66ec27	Configuration for init estimation. (#8343 ) * Configuration for init estimation. * Check whether the model needs configuration based on const attribute `ModelFitted` instead of a mutable state. * Add parameter `boost_from_average` to tell whether the user has specified base score. * Add tests.	2022-10-18 01:52:24 +08:00
Jiaming Yuan	3ef1703553	Allow using string view to find JSON value. (#8332 ) - Allow comparison between string and string view. - Fix compiler warnings.	2022-10-13 17:10:13 +08:00
Rong Ou	39afdac3be	Better error message when world size and rank are set as strings (#8316 ) Co-authored-by: jiamingy <jm.yuan@outlook.com>	2022-10-12 15:53:25 +08:00
Rory Mitchell	210915c985	Use integer gradients in gpu_hist split evaluation (#8274 )	2022-10-11 12:16:27 +02:00
Philip Hyunsu Cho	bc7a6ec603	Fix clang tidy (#8314 ) * Fix clang-tidy * Exempt clang-tidy from budget check * Move clang-tidy	2022-10-06 05:16:06 -08:00
Dmitry Razdoburdin	c24e9d712c	Dispatcher for template parameters of BuildHist Kernels (#8259 ) * Intoducing Column Wise Hist Building * linting * more linting * bug fixing * Removing column samping optimization for a while to simplify the review process. * linting * Removing unnecessary changes * Use DispatchBinType in hist_util.cc * Adding force_read_by column flag to buildhist. Adding tests for column wise buiilhist. * Introducing new dispatcher for compile time flags in hist building * fixing bug with using of DispatchBinType * Fixing building * Merging with master branch Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com> Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2022-10-06 03:02:29 -08:00
Rong Ou	8d4038da57	Don't split input data in federated mode (#8279 ) Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-05 18:19:28 -08:00
Rong Ou	668b8a0ea4	[Breaking] Switch from rabit to the collective communicator (#8257 ) * Switch from rabit to the collective communicator * fix size_t specialization * really fix size_t * try again * add include * more include * fix lint errors * remove rabit includes * fix pylint error * return dict from communicator context * fix communicator shutdown * fix dask test * reset communicator mocklist * fix distributed tests * do not save device communicator * fix jvm gpu tests * add python test for federated communicator * Update gputreeshap submodule Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-05 14:39:01 -08:00
Jiaming Yuan	97c3a80a34	Add C document to sphinx, fix arrow. (#8300 ) - Group C API. - Add C API sphinx doc. - Consistent use of `OptionalArg` and the parameter name `config`. - Remove call to deprecated functions in demo. - Fix some formatting errors. - Add links to c examples in the document (only visible with doxygen pages) - Fix arrow.	2022-10-05 09:52:15 +08:00
Rory Mitchell	d686bf52a6	Reduce time for some multi-gpu tests (#8288 ) * Faster dask tests * Reuse AllReducer objects in tests. * Faster boost from prediction tests. * Use rmm dask fixture. * Speed up dask demo. * mypy * Format with black. * mypy * Clang-tidy Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-04 02:49:33 -08:00
Philip Hyunsu Cho	ca0547bb65	[CI] Use RAPIDS 22.10 (#8298 ) * [CI] Use RAPIDS 22.10 * Store CUDA and RAPIDS versions in one place * Fix * Add missing #include * Update gputreeshap submodule * Fix * Remove outdated distributed tests	2022-10-03 23:18:07 -08:00
Jiaming Yuan	55cf24cc32	Obtain CSR matrix from DMatrix. (#8269 )	2022-09-29 20:41:43 +08:00
Jiaming Yuan	6d1452074a	Remove MGPU cpp tests. (#8276 ) Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-09-27 21:18:23 +08:00
Rory Mitchell	8f77677193	Use quantised gradients in gpu_hist histograms (#8246 )	2022-09-26 17:35:35 +02:00
Jiaming Yuan	4056974e37	Fix sparse threshold warning. (#8268 )	2022-09-26 22:22:11 +08:00
Jiaming Yuan	3fd331f8f2	Add checks to C pointer arguments. (#8254 )	2022-09-22 19:02:22 +08:00
Dmitry Razdoburdin	eb7bbee2c9	Optional by-column histogram build. (#8233 ) Co-authored-by: dmitry.razdoburdin <drazdobu@jfldaal005.jf.intel.com>	2022-09-22 05:16:13 +08:00
Jiaming Yuan	b791446623	Initial support for IPv6 (#8225 ) - Merge rabit socket into XGBoost. - Dask interface support. - Add test to the socket.	2022-09-21 18:06:50 +08:00
Jiaming Yuan	fffb1fca52	Calculate `base_score` based on input labels for mae. (#8107 ) Fit an intercept as base score for abs loss.	2022-09-20 20:53:54 +08:00
Jiaming Yuan	bdf265076d	Make `QuantileDMatrix` default to sklearn esitmators. (#8220 )	2022-09-13 13:52:19 +08:00
Rong Ou	a2686543a9	Common interface for collective communication (#8057 ) * implement broadcast for federated communicator * implement allreduce * add communicator factory * add device adapter * add device communicator to factory * add rabit communicator * add rabit communicator to the factory * add nccl device communicator * add synchronize to device communicator * add back print and getprocessorname * add python wrapper and c api * clean up types * fix non-gpu build * try to fix ci * fix std::size_t * portable string compare ignore case * c style size_t * fix lint errors * cross platform setenv * fix memory leak * fix lint errors * address review feedback * add python test for rabit communicator * fix failing gtest * use json to configure communicators * fix lint error * get rid of factories * fix cpu build * fix include * fix python import * don't export collective.py yet * skip collective communicator pytest on windows * add review feedback * update documentation * remove mpi communicator type * fix tests * shutdown the communicator separately Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2022-09-12 15:21:12 -07:00

... 2 3 4 5 6 ...

1504 Commits