xgboost

Author	SHA1	Message	Date
Jiaming Yuan	38a23f66a8	Fix typo in release script. [skip ci] (#7238 )	2021-09-17 11:14:05 +08:00
Jiaming Yuan	8ad7e8eeb0	[doc] Fix typo. [skip ci] (#7226 )	2021-09-17 11:13:49 +08:00
Jiaming Yuan	22d56cebf1	Encode pandas categorical data automatically. (#7231 )	2021-09-17 11:09:55 +08:00
Jiaming Yuan	32e0858501	Fix travis. (#7237 )	2021-09-17 10:06:23 +08:00
Jiaming Yuan	31c1e13f90	Categorical data support in CPU sketching. (#7221 )	2021-09-17 04:37:09 +08:00
Jiaming Yuan	9f63d6fead	[jvm-packages] Deprecate constructors with implicit missing value. (#7225 )	2021-09-17 04:35:04 +08:00
Jiaming Yuan	0ed979b096	Support more input types for categorical data. (#7220 ) * Support more input types for categorical data. * Shorten the type name from "categorical" to "c". * Tests for np/cp array and scipy csr/csc/coo. * Specify the type for feature info.	2021-09-16 20:39:30 +08:00
Jiaming Yuan	2942dc68e4	Fix mixed types in GPU sketching. (#7228 )	2021-09-16 00:10:25 +08:00
Jiaming Yuan	037dd0820d	Implement `__sklearn_is_fitted__`. (#7230 )	2021-09-15 19:09:04 +08:00
Jiaming Yuan	d997c967d5	Demo for experimental categorical data support. (#7213 )	2021-09-15 08:20:12 +08:00
Jiaming Yuan	3515931305	Initial support for external memory in gradient index. (#7183 ) * Add hessian to batch param in preparation of new approx impl. * Extract a push method for gradient index matrix. * Use span instead of vector ref for hessian in sketching. * Create a binary format for gradient index.	2021-09-13 12:40:56 +08:00
Christian Lorentzen	a0dcf6f5c1	[DOC] Improve tutorial on feature interactions (#7219 )	2021-09-12 21:40:02 +08:00
Jiaming Yuan	804b2ac60f	Expose DMatrix API for CUDA columnar and array. (#7217 ) * Use JSON encoded configurations. * Expose them into header file.	2021-09-09 17:55:25 +08:00
Jiaming Yuan	68a2c7b8d6	Fix memory leak in demo. (#7216 )	2021-09-09 13:51:03 +08:00
Jiaming Yuan	b12e7f7edd	Add noexcept to JSON objects. (#7205 )	2021-09-07 13:56:48 +08:00
Jiaming Yuan	3a4f51f39f	Avoid calling CUDA code on CPU for linear model. (#7154 )	2021-09-01 10:45:31 +08:00
Jiaming Yuan	ba69244a94	Restore the custom double atomic add. (#7198 )	2021-08-28 18:30:42 +08:00
Jiaming Yuan	7a1d67f9cb	[breaking] Use integer atomic for GPU histogram. (#7180 ) On GPU we use rouding factor to truncate the gradient for deterministic results. This PR changes the gradient representation to fixed point number with exponent aligned with rounding factor. [breaking] Drop non-deterministic histogram. Use fixed point for shared memory. This PR is to improve the performance of GPU Hist. Co-authored-by: Andy Adinets <aadinets@nvidia.com>	2021-08-28 05:17:05 +08:00
Jiaming Yuan	e7d7ab6bc3	Better error message for `ncclUnhandledCudaError`. (#7190 )	2021-08-27 10:29:22 +08:00
Philip Hyunsu Cho	b70e07da1f	[CI] Clean up in beginning of each task in Win CI (#7189 )	2021-08-25 04:15:22 -07:00
Jiaming Yuan	cdfaa705f3	Fix building on CUDA 11.0. (#7187 )	2021-08-25 02:57:53 -07:00
Philip Hyunsu Cho	3060f0b562	[CI] Automatically build GPU-enabled R package for Windows (#7185 ) * [CI] Automatically build GPU-enabled R package for Windows * Update Jenkinsfile-win64 * Build R package for the release branch only * Update install doc	2021-08-25 02:11:01 -07:00
Jiaming Yuan	9c64618cb6	[breaking] Remove CUDA sm_35, add sm_86 (#7182 )	2021-08-25 16:04:23 +08:00
Philip Hyunsu Cho	d04312b9c0	[CI] Fix hanging Python setup in Windows CI (#7186 )	2021-08-24 22:03:51 -07:00
Jiaming Yuan	ee8d1f5ed8	Fix histogram truncation. (#7181 ) * Fix truncation. * Lint. * lint.	2021-08-24 18:34:32 -07:00
Jiaming Yuan	3290a4f3ed	Re-enable feature validation in predict proba. (#7177 )	2021-08-22 15:28:08 +08:00
Jiaming Yuan	bf562bd33c	Remove unused code. (#7175 )	2021-08-18 14:02:19 +08:00
Anton Kostin	01b7acba30	Update conf.py (#7174 )	2021-08-17 03:38:26 +08:00
Anton Kostin	ec849ec335	Update README.md (#7173 )	2021-08-17 03:37:53 +08:00
Martin Petříček	46c46829ce	Fix model loading from stream (#7067 ) Fix bug introduced in 17913713b554d820a8ce94226d854b4a5f1d8bbc (allow loading from byte array) When loading model from stream, only last buffer read from the input stream is used to construct the model. This may work for models smaller than 1 MiB (if you are lucky enough to read the whole model at once), but will always fail if the model is larger.	2021-08-15 21:04:33 +08:00
Jiaming Yuan	6bcbc77226	[doc] Fix typo. [skip ci] (#7170 )	2021-08-13 03:48:16 +08:00
Jiaming Yuan	3f38d983a6	Fix prediction configuration. (#7159 ) After the predictor parameter was added to the constructor, this configuration was broken.	2021-08-11 16:34:36 +08:00
Jiaming Yuan	9600ca83f3	Remove synchronization in monitor. (#7164 ) * Remove synchronization in monitor. Calling rabit functions during destruction is flaky. * Add xgboost prefix to nvtx marker.	2021-08-11 16:33:53 +08:00
Jiaming Yuan	149f209af6	Extract histogram builder from CPU Hist. (#7152 ) * Extract the CPU histogram builder. * Fix tests. * Reduce number of histograms being built.	2021-08-09 21:15:21 +08:00
Philip Hyunsu Cho	336af4f974	Work around a segfault observed in SparsePage::Push() (#7161 ) * Work around a segfault observed in SparsePage::Push() * Revert "Work around a segfault observed in SparsePage::Push()" This reverts commit 30934844d00908750a5442082eb4769b1489f6a9. * Don't call vector::resize() inside OpenMP block * Set GITHUB_PAT env var to fix R tests * Use built-in GITHUB_TOKEN	2021-08-08 02:12:30 -07:00
AJ Schmidt	f7003dc819	Include cpack (#7160 ) Co-authored-by: ptaylor <paul.e.taylor@me.com>	2021-08-07 00:57:34 +08:00
Jiaming Yuan	8a84be37b8	Pass scikit learn estimator checks for regressor. (#7130 ) * Check data shape. * Check labels.	2021-08-03 18:58:20 +08:00
Jiaming Yuan	8ee127469f	[R] Fix nthread in DMatrix constructor. (#7127 ) * Break the R C API for nthread.	2021-08-03 17:39:25 +08:00
Jiaming Yuan	ba47eda61b	[doc] Use figure directive. (#7143 )	2021-08-03 15:56:25 +08:00
Jiaming Yuan	e2c406f5c8	Support `min_delta` in early stopping. (#7137 ) * Support `min_delta` in early stopping. * Remove abs_tol.	2021-08-03 14:29:17 +08:00
Jiaming Yuan	7bdedacb54	Document for `process_type`. (#7135 ) * Update document for prune and refresh. * Add demo.	2021-08-03 13:11:52 +08:00
Jiaming Yuan	d080b5a953	Fix model slicing. (#7149 ) * Use correct pointer. * Remove best_iteration/best_score.	2021-08-03 11:51:56 +08:00
Jiaming Yuan	36346f8f56	C API demo for inference. (#7151 )	2021-08-03 00:46:47 +08:00
Jiaming Yuan	1369133916	[dask] Remove the workaround for segfault. (#7146 )	2021-07-30 03:57:53 +08:00
Philip Hyunsu Cho	f1a4a1ac95	[CI] Upgrade build image to CentOS 7 + GCC 8; require CUDA 10.1 and later (#7141 )	2021-07-29 10:54:33 -07:00
graue70	dfdf0b08fc	Fix typo and grammatical mistake in error message (#7134 )	2021-07-28 17:17:05 +08:00
Gil Forsyth	92ae3abc97	[dask] Disallow importing non-dask estimators from xgboost.dask (#7133 ) * Disallow importing non-dask estimators from xgboost.dask This is mostly a style change, but also avoids a user error (that I have committed on a few occasions). Since `XGBRegressor` and `XGBClassifier` are imported as parent classes for the `dask` estimators, without defining an `__all__`, autocomplete (or muscle) memory will produce the following with little prompting: ``` from xgboost.dask import XGBClassifier ``` There's nothing inherently wrong with that, but given that `XGBClassifier` is not `dask` enabled, it can lead to confusing behavior until you figure out you should've typed ``` from xgboost.dask import DaskXGBClassifier ``` Another option is to alias import the existing non-dask estimators. * Remove base/iter class, add train predict funcs	2021-07-28 02:07:23 +08:00
Robert Maynard	1a75f43304	Allow compilation with nvcc 11.4 (#7131 ) * Use type aliases for discard iterators * update to include host_vector as thrust 1.12 doesn't bring it in as a side-effect * cub::DispatchRadixSort requires signed offset types	2021-07-27 20:05:33 +08:00
Jiaming Yuan	7017dd5a26	[JVM-Packages] Use Python tracker in XGBoost for JVM package. (#7132 )	2021-07-27 16:20:42 +08:00
Jiaming Yuan	48d5de80a2	[R] Fix softprob reshape. (#7126 )	2021-07-27 15:25:17 +08:00

... 2 3 4 5 6 ...

5608 Commits