xgboost

Author	SHA1	Message	Date
Jiaming Yuan	8bdea72688	[Python] Require black and isort for new Python files. (#8096 ) * [Python] Require black and isort for new Python files. - Require black and isort for spark and dask module. These files are relatively new and are more conform to the black formatter. We will convert the rest of the library as we move forward. Other libraries including dask/distributed and optuna use the same formatting style and have a more strict standard. The black formatter is indeed quite nice, automating it can help us unify the code style. - Gather Python checks into a single script.	2022-07-20 10:25:24 +08:00
WeichenXu	f23cc92130	[pyspark] User guide doc and tutorials (#8082 ) Co-authored-by: Bobby Wang <wbo4958@gmail.com>	2022-07-19 22:25:14 +08:00
Bobby Wang	f801d3cf15	[PySpark] change the returning model type to string from binary (#8085 ) * [PySpark] change the returning model type to string from binary XGBoost pyspark can be can be accelerated by RAPIDS Accelerator seamlessly by changing the returning model type from binary to string.	2022-07-19 18:39:20 +08:00
Jiaming Yuan	e28f6f6657	[doc] Integrate pyspark module into sphinx doc [skip ci] (#8066 )	2022-07-17 10:46:09 +08:00
Rafail Giavrimis	579ab23b10	Check cudf lazily (#8084 )	2022-07-17 09:27:43 +08:00
Bobby Wang	a33f35eecf	[PySpark] add gpu support for spark local mode (#8068 )	2022-07-17 07:59:06 +08:00
Bobby Wang	91bb9e2cb3	[PySpark] fix raw_prediction_col parameter and minor cleanup (#8067 )	2022-07-16 17:58:57 +08:00
Jiaming Yuan	dae7a41baa	Update Python requirement to >=3.8. (#8071 ) Additional changes: - Use mamba for CPU test on Jenkins. - Cleanup CPU test dependencies. - Restore some of the modin tests	2022-07-14 18:01:47 +08:00
WeichenXu	176fec8789	PySpark XGBoost integration (#8020 ) Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu> Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2022-07-13 13:11:18 +08:00
Jiaming Yuan	a5bc8e2c6a	Fix mypy error with the latest dask. (#8052 ) * Fix mypy error with latest dask. Dask is adding type hints to its codebase and as the result, checks in XGBoost can be performed more rigorously. - Remove compatibility with old dask version where multi lock was missing. - Restrict input of `X` to be non-series. - Adopt latest definition of `Delayed`. - Avoid passing optional `host_ip`. - Avoid deprecated `worker.nthreads`.	2022-07-09 08:02:42 +08:00
Jiaming Yuan	701f32b227	[py-sckl] Raise import error if skl is not installed. (#8049 )	2022-07-09 05:56:46 +08:00
Jiaming Yuan	ff1c559084	Remove unused variable. (#8046 )	2022-07-05 01:59:22 +08:00
Jiaming Yuan	dcaf580476	Fix Python package source install. (#8036 ) * Copy gputreeshap.	2022-06-29 21:45:09 +08:00
Joris LIMONIER	f470ad3af9	Fix multiple typos (#8028 ) Fix 4 "graphiz" instead of "graphviz".	2022-06-27 19:21:58 +08:00
Gavin Zhang	6426449c8b	Support IBM i OS (#7920 )	2022-06-02 23:38:35 +08:00
Jiaming Yuan	6b55150e80	Fix pylint errors. (#7967 )	2022-06-02 18:04:46 +08:00
Gyeongjae Choi	cc6d57aa0d	Add minimal emscripten build support (#7954 )	2022-05-30 14:11:40 +08:00
Tim Sabsch	7a039e03fe	Fix incomplete type hints for `verbose` (#7945 )	2022-05-30 12:08:24 +08:00
Jiaming Yuan	d314680a15	Verify shared object version at load. (#7928 )	2022-05-23 20:53:30 +08:00
Jiaming Yuan	f93a727869	Address remaining mypy errors in python package. (#7914 )	2022-05-18 22:46:15 +08:00
Chengyang	806c92c80b	Add Type Hints for Python Package (#7742 ) Co-authored-by: Chengyang Gu <bridgream@gmail.com> Co-authored-by: Jiamingy <jm.yuan@outlook.com>	2022-05-17 22:14:09 +08:00
Rong Ou	77d4a53c32	use RabitContext intead of init/finalize (#7911 )	2022-05-17 12:15:41 +08:00
Rong Ou	af907e2d0d	Demo of federated learning using NVFlare (#7879 ) Co-authored-by: jiamingy <jm.yuan@outlook.com>	2022-05-14 22:45:41 +08:00
Jiaming Yuan	c8f9d4b6e6	Show libxgboost.so path in build info. (#7893 )	2022-05-13 18:08:56 +08:00
Jiaming Yuan	db80671d6b	Fix monotone constraint with tuple input. (#7891 )	2022-05-13 04:00:03 +08:00
Jiaming Yuan	8ba4722d04	Remove pyarrow workaround. (#7884 )	2022-05-11 20:54:48 +08:00
Rong Ou	14ef38b834	Initial support for federated learning (#7831 ) Federated learning plugin for xgboost: * A gRPC server to aggregate MPI-style requests (allgather, allreduce, broadcast) from federated workers. * A Rabit engine for the federated environment. * Integration test to simulate federated learning. Additional followups are needed to address GPU support, better security, and privacy, etc.	2022-05-05 21:49:22 +08:00
Jiaming Yuan	ad06172c6b	Refactor pandas dataframe handling. (#7843 )	2022-04-26 18:53:43 +08:00
Jiaming Yuan	f0f76259c9	Remove `STRING_TYPES`. (#7827 )	2022-04-22 19:07:51 +08:00
Jiaming Yuan	c70fa502a5	Expose `feature_types` to sklearn interface. (#7821 )	2022-04-21 20:23:35 +08:00
Jiaming Yuan	52d4eda786	Deprecate `use_label_encoder` in XGBClassifier. (#7822 ) * Deprecate `use_label_encoder` in XGBClassifier. * We have removed the encoder, now prepare to remove the indicator.	2022-04-21 13:14:02 +08:00
Jiaming Yuan	bcce17e688	Remove text loading in basic walk through demo. (#7753 )	2022-04-01 00:59:42 +08:00
Jiaming Yuan	02dd7b6913	Remove use of distutils. (#7770 ) distutils is deprecated and replaced by other stdlib constructs.	2022-03-31 19:03:10 +08:00
Jiaming Yuan	522636cb52	Bump version. (#7769 )	2022-03-31 06:33:22 +08:00
Jiaming Yuan	9150fdbd4d	Support pandas nullable types. (#7760 )	2022-03-30 08:51:52 +08:00
Jiaming Yuan	a50b84244e	Cleanup configuration for constraints. (#7758 )	2022-03-29 04:22:46 +08:00
Jiaming Yuan	3c9b04460a	Move `num_parallel_tree` to model parameter. (#7751 ) The size of forest should be a property of model itself instead of a training hyper-parameter.	2022-03-29 02:32:42 +08:00
Jiaming Yuan	b3ba0e8708	Check cupy lazily. (#7752 )	2022-03-26 06:09:58 +08:00
Chengyang	c92ab2ce49	Add type hints to core.py (#7707 ) Co-authored-by: Chengyang Gu <bridgream@gmail.com> Co-authored-by: jiamingy <jm.yuan@outlook.com>	2022-03-23 21:12:14 +08:00
Xiaochang Wu	613ec36c5a	Support building SimpleDMatrix from Arrow data format (#7512 ) * Integrate with Arrow C data API. * Support Arrow dataset. * Support Arrow table. Co-authored-by: Xiaochang Wu <xiaochang.wu@intel.com> Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com> Co-authored-by: Zhang Zhang <zhang.zhang@intel.com>	2022-03-15 13:25:19 +08:00
Jiaming Yuan	a62a3d991d	[dask] prediction with categorical data. (#7708 )	2022-03-10 00:21:48 +08:00
Pradipta Ghosh	68b6d6bbe2	Fix for Feature shape mismatch error (#7715 )	2022-03-03 21:36:29 +08:00
Cheng Li	a92e0f6240	multi groups in the constraints (#7711 )	2022-03-01 18:10:15 +08:00
Jiaming Yuan	83a66b4994	Support categorical data for hist. (#7695 ) * Extract partitioner from hist. * Implement categorical data support by passing the gradient index directly into the partitioner. * Organize/update document. * Remove code for negative hessian.	2022-02-25 03:47:14 +08:00
Jiaming Yuan	c859764d29	[doc] Clarify that states in callbacks are mutated. (#7685 ) * Fix copy for cv. This prevents inserting default callbacks into the input list. * Clarify the behavior of callbacks in training/cv. * Fix typos in doc.	2022-02-22 11:45:00 +08:00
Jiaming Yuan	e56d1779e1	Require Python 3.7. (#7682 ) * Update setup.py.	2022-02-21 05:46:48 +08:00
Jiaming Yuan	f08c5dcb06	Cleanup some pylint errors. (#7667 ) * Cleanup some pylint errors. * Cleanup pylint errors in rabit modules. * Make data iter an abstract class and cleanup private access. * Cleanup no-self-use for booster.	2022-02-19 18:53:12 +08:00
Jiaming Yuan	b76c5d54bf	Define export symbols in callback module. (#7665 )	2022-02-19 18:52:41 +08:00
Jiaming Yuan	0d0abe1845	Support optimal partitioning for GPU hist. (#7652 ) * Implement `MaxCategory` in quantile. * Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function. * Extract an evaluator from GPU Hist to store the needed states. * Added some CUDA stream/event utilities. * Update document with references. * Fixed a bug in approx evaluator where the number of data points is less than the number of categories.	2022-02-15 03:03:12 +08:00
Jiaming Yuan	5cd1f71b51	[dask] Improve configuration for port. (#7645 ) - Try port 0 to let the OS return the available port. - Add port configuration.	2022-02-14 21:34:34 +08:00

... 3 4 5 6 7 ...

900 Commits