- Introduce an abstract XGBoost Estimator
- Update to the latest XGBoost parameters
- Add all XGBoost parameters supported in XGBoost4j-Spark
- Add setters and getters for these parameters (see the sketch after this list)
- Remove deprecated parameters
- Address missing-value handling
- Remove any ETL operations in XGBoost
- Rework the GPU plugin
- Expand sanity tests for CPU and GPU consistency
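A minimal sketch of what the setter/getter surface looks like, assuming the estimator keeps the Spark ML Params conventions already used by xgboost4j-spark (the particular parameters shown are illustrative, not the full set added here):

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

// Illustrative only: these setters follow the Spark ML Params convention
// used by xgboost4j-spark; the exact parameter list added by this change
// may differ.
val classifier = new XGBoostClassifier()
  .setNumRound(100)           // number of boosting rounds
  .setMissing(Float.NaN)      // value treated as missing
  .setFeaturesCol("features")
  .setLabelCol("label")

// Each parameter gets a matching getter.
assert(classifier.getNumRound == 100)
assert(classifier.getMissing.isNaN)
```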
This PR replaces the original RABIT implementation with a new one, which has already been partially merged into XGBoost. The new implementation features:
- Federated learning for both CPU and GPU.
- NCCL support.
- Support for more data types.
- A unified interface for all the underlying implementations.
- Improved timeout handling for both tracker and workers.
- Exhaustive tests with metrics (a couple of bugs were fixed along the way).
- A reusable tracker for the Python and JVM packages (see the sketch after this list).
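The tracker lifecycle shared by the Python and JVM packages looks roughly like this (a sketch based on the pre-existing JVM `RabitTracker` API; the unified implementation is assumed to keep a similar start/getWorkerEnvs/waitFor shape):

```scala
import ml.dmlc.xgboost4j.java.RabitTracker

// Sketch: start a tracker for N workers, hand its environment to the
// workers, then block until training finishes. Method names mirror the
// historical RabitTracker API; the unified tracker may differ slightly.
val numWorkers = 4
val tracker = new RabitTracker(numWorkers)
require(tracker.start(60000L), "Tracker failed to start within the timeout")

// Connection info that each worker needs to join the collective group.
val workerEnvs = tracker.getWorkerEnvs

// ... launch the distributed training tasks, passing workerEnvs to each ...

tracker.waitFor(0L) // block until all workers finish (0 = no timeout)
```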
- Add `numBoostedRound` to the JVM packages.
- Remove the RABIT checkpoint version.
- Change the starting version of training continuation in the JVM packages. [breaking]
- Redefine the checkpoint version policy in the JVM package. [breaking]
- Rename the Python checkpoint callback parameter. [breaking]
- Unify the checkpoint policy between Python and JVM (see the sketch after this list).
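A sketch of how checkpointed training continuation looks from xgboost4j-spark (the setters shown already exist; the round-based versioning described in the comments is an assumption about the unified policy):

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

// Sketch of checkpointed training. Under the unified policy the checkpoint
// version is assumed to track the number of completed boosting rounds
// rather than a separate RABIT counter.
val classifier = new XGBoostClassifier()
  .setNumRound(100)
  .setCheckpointPath("hdfs:///tmp/xgboost-checkpoints")
  .setCheckpointInterval(10) // persist a checkpoint every 10 rounds

// If the job is restarted with the same checkpoint path, training resumes
// from the latest checkpoint and runs only the remaining rounds.
```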
* 1. Add parameters to set feature names and feature types (see the sketch below).
  2. Save feature names and feature types to the native JSON model.
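A minimal sketch of the new setters, assuming they mirror the Python `DMatrix` API (`setFeatureNames`/`setFeatureTypes` are assumed names):

```scala
import ml.dmlc.xgboost4j.scala.{DMatrix, XGBoost}

// Sketch only: the setter names are assumed to mirror the Python DMatrix
// API; check the change itself for the exact signatures.
val train = new DMatrix("train.libsvm")
train.setFeatureNames(Array("age", "income", "score"))
train.setFeatureTypes(Array("int", "float", "float"))

val booster = XGBoost.train(train, Map("max_depth" -> 3), round = 10)

// Feature names and types now round-trip through the native JSON model.
booster.saveModel("model.json")
```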
* Change the serialization and deserialization format to UBJSON (`.ubj`), as shown below.
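A short sketch, assuming the format is selected from the file extension as in recent native XGBoost releases:

```scala
import ml.dmlc.xgboost4j.scala.XGBoost

// Sketch: the file extension selects the serialization format
// (.ubj -> UBJSON, .json -> JSON).
val booster = XGBoost.loadModel("model.json") // a previously trained model
booster.saveModel("model.ubj")                // re-save in UBJSON
```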
* [jvm-packages] Bump rapids version to 22.12.0
This PR bumps the Spark version to 3.1.1 and the rapids version
to 22.12.0, which means the latest XGBoost can no longer run
with the old rapids packages.
* [jvm-packages] fix spark-rapids compatibility issue
spark-rapids (from 22.10 onward) has shimmed GpuColumnVector, which means
we can't call it directly, so this PR calls UnshimmedGpuColumnVector instead.
* [jvm-packages] fix executor crash when transforming on xgboost4j-spark-gpu
The XGBoosterSetParam API is not thread-safe. During the transform phase,
XGBoost runs several transform tasks at a time, and each of them sets
the "gpu_id" and "predictor" parameters; if several tasks (threads)
call XGBoosterSetParam simultaneously, memory may be corrupted,
causing a SIGSEGV.
This PR first gets the booster from the broadcast variable and sets the
correct gpu_id and predictor on it once, and then all transform tasks use
that same booster to do the transform (see the sketch below).
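A simplified sketch of the approach (the model path and parameter values are placeholders):

```scala
import ml.dmlc.xgboost4j.scala.{Booster, XGBoost}
import org.apache.spark.sql.SparkSession

// Simplified sketch of the idea: configure the booster once, broadcast it,
// and let every transform task reuse the shared, already-configured
// instance so that no task calls XGBoosterSetParam concurrently.
val spark = SparkSession.builder().getOrCreate()

val booster: Booster = XGBoost.loadModel("model.json") // placeholder path
booster.setParam("gpu_id", "0")
booster.setParam("predictor", "gpu_predictor")

val bcBooster = spark.sparkContext.broadcast(booster)
// Executor-side code only reads bcBooster.value and calls predict; it
// never calls setParam, so the native memory is not mutated concurrently.
```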
With the introduction of barrier execution mode, we no longer need to kill the SparkContext when some XGBoost tasks fail; instead, Spark handles the errors for us. So in this PR the `killSparkContextOnWorkerFailure` parameter is removed (a migration sketch follows).
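A hedged migration sketch, assuming the removed parameter was exposed through a `setKillSparkContextOnWorkerFailure` setter in earlier releases (`params` and `trainDf` are placeholders):

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

// `params` (Map[String, Any]) and `trainDf` are assumed to exist; the
// commented-out setter name reflects earlier releases.
// Before:
// val model = new XGBoostClassifier(params)
//   .setKillSparkContextOnWorkerFailure(false)
//   .fit(trainDf)

// After: barrier execution mode lets Spark handle failed tasks, so the
// parameter is gone and a plain fit is all that is needed.
val model = new XGBoostClassifier(params).fit(trainDf)
```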