xgboost

Author	SHA1	Message	Date
Philip Hyunsu Cho	1168a68872	[jvm-packages] Update release scripts (#9983 ) * [jvm-packages] Add Scala version suffix to xgboost-jvm package (#9776) * Update JVM script (#9714) * Revamp pom.xml * Update instructions in prepare_jvm_release.py * Fix formatting * [jvm-packages] Fix POM for xgboost-jvm metapackage (#9893) * [jvm-packages] Fix POM for xgboost-jvm metapackage * Add script for updating the Scala version * Update change_scala_version.py to also change scala.version property (#9897) * Remove 'release-cpu-only' profile * Remove scala-2.13 profile; enable gpu package for Scala 2.13	2024-01-12 10:37:55 -08:00
Jiaming Yuan	3976455af9	[jvm-packages] Use UBJ for checkpoints. (#9954 )	2024-01-08 13:26:12 +08:00
Jiaming Yuan	38dd91f491	Save model in ubj as the default. (#9947 )	2024-01-05 17:53:36 +08:00
Bobby Wang	36a552ac98	[jvm-packages] support stage-level scheduling (#9775 )	2023-11-14 08:59:45 +08:00
Jiaming Yuan	c75a3bc0a9	[breaking] [jvm-packages] Remove rabit check point. (#9599 ) - Add `numBoostedRound` to jvm packages - Remove rabit checkpoint version. - Change the starting version of training continuation in JVM [breaking]. - Redefine the checkpoint version policy in jvm package. [breaking] - Rename the Python check point callback parameter. [breaking] - Unifies the checkpoint policy between Python and JVM.	2023-09-26 18:06:34 +08:00
Jiaming Yuan	58530b1bc4	Bump version to 2.1. (#9498 )	2023-08-18 01:04:04 +08:00
Bobby Wang	344f90b67b	[jvm-packages] throw exception when tree_method=approx and device=cuda (#9478 ) --------- Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2023-08-14 17:52:14 +08:00
jinmfeng001	04c99683c3	Change training stage from ResultStage to ShuffleMapStage (#9423 )	2023-08-03 23:40:04 +08:00
Bobby Wang	8f0efb4ab3	[jvm-packages] automatically set the max/min direction for best score (#9404 )	2023-07-27 11:09:55 +08:00
Bobby Wang	1b657a5513	[jvm-packages] set device to cuda when tree method is "gpu_hist" (#9412 )	2023-07-24 18:32:25 +08:00
Jiaming Yuan	f4fb2be101	[jvm-packages] Add the new `device` parameter. (#9385 )	2023-07-17 18:40:39 +08:00
Jiaming Yuan	04aff3af8e	Define the new `device` parameter. (#9362 )	2023-07-13 19:30:25 +08:00
jinmfeng001	a1367ea1f8	Set feature_names and feature_types in jvm-packages (#9364 ) * 1. Add parameters to set feature names and feature types 2. Save feature names and feature types to native json model * Change serialization and deserialization format to ubj.	2023-07-12 15:18:46 +08:00
Jiaming Yuan	1fcc26a6f8	Set `ndcg` to default for LTR. (#8822 ) - Add document. - Add tests. - Use `ndcg` with `topk` as default.	2023-06-09 23:31:33 +08:00
Boris	a01df102c9	Scala 2.13 support. (#9099 ) 1. Updated the test logic 2. Added smoke tests for Spark examples. 3. Added integration tests for Spark with Scala 2.13	2023-05-27 19:34:02 +08:00
austinzh	3b742dc4f1	Stop using Rabit in predition (#9054 )	2023-04-21 19:38:07 +08:00
Emil Ejbyfeldt	a84a1fde02	[jvm-packages] Update scalatest to 3.2.15 (#8925 ) --------- Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2023-04-20 22:16:56 +08:00
Jiaming Yuan	564df59204	[breaking] [jvm-packages] Remove scala-implemented tracker. (#9045 )	2023-04-20 16:29:35 +08:00
Jiaming Yuan	26209a42a5	Define git attributes for renormalization. (#8921 )	2023-03-16 02:43:11 +08:00
Jiaming Yuan	badeff1d74	Init estimation for regression. (#8272 )	2023-01-11 02:04:56 +08:00
Bobby Wang	4e12f3e1bc	[Breaking][jvm-packages] Bump rapids version to 22.12.0 (#8648 ) * [jvm-packages] Bump rapids version to 22.12.0 This PR bumps spark version to 3.1.1 and the rapids version to 22.12.0, which results in the latest xgboost can't run with the old rapids packages.	2023-01-07 18:59:17 +08:00
Jiaming Yuan	f73520bfff	Bump development version to 2.0. (#8390 )	2022-10-28 15:21:19 +08:00
Rong Ou	668b8a0ea4	[Breaking] Switch from rabit to the collective communicator (#8257 ) * Switch from rabit to the collective communicator * fix size_t specialization * really fix size_t * try again * add include * more include * fix lint errors * remove rabit includes * fix pylint error * return dict from communicator context * fix communicator shutdown * fix dask test * reset communicator mocklist * fix distributed tests * do not save device communicator * fix jvm gpu tests * add python test for federated communicator * Update gputreeshap submodule Co-authored-by: Hyunsu Philip Cho <chohyu01@cs.washington.edu>	2022-10-05 14:39:01 -08:00
Jiaming Yuan	f835368bcf	Mark next release as 1.7 instead of 2.0 (#8281 )	2022-09-28 14:33:37 +08:00
Bobby Wang	8d247f0d64	[jvm-packages] fix spark-rapids compatibility issue (#8240 ) * [jvm-packages] fix spark-rapids compatibility issue spark-rapids (from 22.10) has shimmed GpuColumnVector, which means we can't call it directly. So this PR call the UnshimmedGpuColumnVector	2022-09-22 23:31:29 +08:00
Rong Ou	7d43e74e71	JNI wrapper for the collective communicator (#8242 )	2022-09-21 04:20:25 +08:00
Bobby Wang	a68580e2a7	[jvm-packages] fix executor crashing issue when transforming on xgboost4j-spark-gpu (#8025 ) * [jvm-packages] fix executor crashing issue when transforming on xgboost4j-spark-gpu the API XGBoosterSetParam is not thread-safe. Dring the phase of transforming, XGBoost runs several transforming tasks at a time, and each of them will set the "gpu_id" and "predictor" parameters, so if several tasks (multi-threads) all XGBoosterSetParam simultaneously, it may cause the memory to be corrupted and cause SIGSEGV. This PR first get the booster from broadcast and set to the correct gpu_id and predictor, and then all transforming taskes will use the same booster to do the transforming.	2022-06-24 01:18:41 +08:00
Rong Ou	e5ec546da5	[Breaking] Remove rabit support for custom reductions and `grow_local_histmaker` updater (#7992 )	2022-06-21 15:08:23 +08:00
Bobby Wang	545fd4548e	[jvm-packages] refactor xgboost read/write (#7956 ) 1. Removed the duplicated Default XGBoost read/write which is copied from spark 2.3.x 2. Put some utils into util package	2022-06-01 11:38:49 +08:00
Bobby Wang	6275cdc486	[jvm-packages] add format option when saving a model (#7940 )	2022-05-30 15:49:59 +08:00
Bobby Wang	fbc3d861bb	[jvm-packages] remove default parameters (#7938 )	2022-05-28 10:31:19 +08:00
Bobby Wang	5ef33adf68	[jvm-packges] set the correct objective if user doesn't explicitly set it (#7781 )	2022-05-18 14:05:18 +08:00
Bobby Wang	b41cf92dc2	[jvm-packages] move dmatrix building into rabit context for cpu pipeline (#7908 )	2022-05-17 14:52:25 +08:00
Bobby Wang	11e46e4bc0	[Breaking][jvm-packages] make classification model be xgboost-compatible (#7896 )	2022-05-14 15:43:05 +08:00
Bobby Wang	9fa7ed1743	[Breaking][jvm-packages] remove timeoutRequestWorkers parameter (#7839 )	2022-05-13 16:26:25 +08:00
Bobby Wang	a94e1b172e	[jvm-packages] Fix model compatibility (#7845 )	2022-04-28 02:05:38 +08:00
Bobby Wang	dc2e699656	[Breaking][jvm-packages] Use barrier execution mode (#7836 ) With the introduction of the barrier execution mode. we don't need to kill SparkContext when some xgboost tasks failed. Instead, Spark will handle the errors for us. So in this PR, `killSparkContextOnWorkerFailure` parameter is deleted.	2022-04-25 17:09:52 +08:00
Bobby Wang	c45665a55a	[jvm-packages] move the dmatrix building into rabit context (#7823 ) This fixes the QuantileDeviceDMatrix in distributed environment.	2022-04-23 00:06:50 +08:00
Bobby Wang	2d83b2ad8f	[jvm-packages] add hostIp and python exec for rabit tracker (#7808 )	2022-04-15 16:28:43 +08:00
Bobby Wang	3f536b5308	[jvm-packages] fix evaluation when featuresCols is used (#7798 )	2022-04-13 12:52:50 +08:00
Bobby Wang	118192f116	[jvm-packages] xgboost4j-spark should work when featuresCols is specified (#7789 )	2022-04-08 13:21:04 +08:00
Bobby Wang	2454407f3a	[jvm-packages] unify setFeaturesCol API for XGBoostRegressor (#7784 )	2022-04-05 13:35:33 +08:00
Jiaming Yuan	522636cb52	Bump version. (#7769 )	2022-03-31 06:33:22 +08:00
Bobby Wang	89aa8ddf52	[jvm-packages] fix the prediction issue for multi:softmax (#7694 )	2022-02-24 01:09:45 +08:00
Bobby Wang	e3e6de5ed9	[jvm-packages] unify the set features API (#7692 ) xgboost4j-spark provides 2 sets of API for setting features, one for CPU, another for GPU, which may cause confusion. This PR removes the GPU API and adds an override CPU function setFeaturesCol to accept Array[String] parameters.	2022-02-23 03:37:25 +08:00
Jiaming Yuan	ac7a36367c	[jvm-packages] Implement new `save_raw` in jvm-packages. (#7570 ) * New `toByteArray` that accepts a parameter for format.	2022-01-19 16:00:14 +08:00
Jiaming Yuan	001503186c	Rewrite approx (#7214 ) This PR rewrites the approx tree method to use codebase from hist for better performance and code sharing. The rewrite has many benefits: - Support for both `max_leaves` and `max_depth`. - Support for `grow_policy`. - Support for mono constraint. - Support for feature weights. - Support for easier bin configuration (`max_bin`). - Support for categorical data. - Faster performance for most of the datasets. (many times faster) - Support for prediction cache. - Significantly better performance for external memory. - Unites the code base between approx and hist.	2022-01-10 21:15:05 +08:00
Bobby Wang	e8c1eb99e4	[jvm-package] Clean up the legacy gpu support tests (#7523 )	2021-12-21 09:15:51 +08:00
Bobby Wang	24e25802a7	[jvm-packages] Add Rapids plugin support (#7491 ) * Add GPU pre-processing pipeline.	2021-12-17 13:11:12 +08:00
Bobby Wang	7cfb310eb4	Rework transform (#7440 ) extract the common part of transform code from XGBoostClassifier and XGBoostRegressor	2021-11-18 15:48:57 +08:00

1 2 3 4 5 ...

272 Commits