xgboost

Author	SHA1	Message	Date
Jiaming Yuan	570f8ae4ba	Use black on more Python files. (#8137 )	2022-08-11 01:38:11 +08:00
Praateek Mahajan	ff471b3fab	In PySpark Estimator example use the model with validation_indicator (#8131 ) * use the validation_indicator model * use the validation_indicator model for regression	2022-08-03 13:57:41 +08:00
WeichenXu	f23cc92130	[pyspark] User guide doc and tutorials (#8082 ) Co-authored-by: Bobby Wang <wbo4958@gmail.com>	2022-07-19 22:25:14 +08:00
Jiaming Yuan	52d4eda786	Deprecate `use_label_encoder` in XGBClassifier. (#7822 ) * Deprecate `use_label_encoder` in XGBClassifier. * We have removed the encoder, now prepare to remove the indicator.	2022-04-21 13:14:02 +08:00
Jiaming Yuan	bcce17e688	Remove text loading in basic walk through demo. (#7753 )	2022-04-01 00:59:42 +08:00
Jiaming Yuan	4d81c741e9	External memory support for hist (#7531 ) * Generate column matrix from gHistIndex. * Avoid synchronization with the sparse page once the cache is written. * Cleanups: Remove member variables/functions, change the update routine to look like approx and gpu_hist. * Remove pruner.	2022-03-22 00:13:20 +08:00
Jiaming Yuan	cd55823112	Demo for using custom objective with multi-target regression. (#7736 )	2022-03-20 17:44:25 +08:00
Jiaming Yuan	1d468e20a4	Optimize GPU evaluation function for categorical data. (#7705 ) * Use transform and cache.	2022-02-28 17:46:29 +08:00
Jiaming Yuan	0da7d872ef	[doc] Update for prediction. (#7648 )	2022-02-15 05:01:55 +08:00
Jiaming Yuan	0d0abe1845	Support optimal partitioning for GPU hist. (#7652 ) * Implement `MaxCategory` in quantile. * Implement partition-based split for GPU evaluation. Currently, it's based on the existing evaluation function. * Extract an evaluator from GPU Hist to store the needed states. * Added some CUDA stream/event utilities. * Update document with references. * Fixed a bug in approx evaluator where the number of data points is less than the number of categories.	2022-02-15 03:03:12 +08:00
Philip Hyunsu Cho	c621775f34	Replace all uses of deprecated function sklearn.datasets.load_boston (#7373 ) * Replace all uses of deprecated function sklearn.datasets.load_boston * More renaming * Fix bad name * Update assertion * Fix n boosted rounds. * Avoid over regularization. * Rebase. * Avoid over regularization. * Whac-a-mole Co-authored-by: fis <jm.yuan@outlook.com>	2022-01-30 04:27:57 -08:00
Jiaming Yuan	b4ec1682c6	Update document for multi output and categorical. (#7574 ) * Group together categorical related parameters. * Update documents about multioutput and categorical.	2022-01-19 04:35:17 +08:00
Jiaming Yuan	001503186c	Rewrite approx (#7214 ) This PR rewrites the approx tree method to use codebase from hist for better performance and code sharing. The rewrite has many benefits: - Support for both `max_leaves` and `max_depth`. - Support for `grow_policy`. - Support for mono constraint. - Support for feature weights. - Support for easier bin configuration (`max_bin`). - Support for categorical data. - Faster performance for most of the datasets. (many times faster) - Support for prediction cache. - Significantly better performance for external memory. - Unites the code base between approx and hist.	2022-01-10 21:15:05 +08:00
Jiaming Yuan	58a6723eb1	Initial support for multioutput regression. (#7514 ) * Add num target model parameter, which is configured from input labels. * Change elementwise metric and indexing for weights. * Add demo. * Add tests.	2021-12-18 09:28:38 +08:00
Jiaming Yuan	c024c42dce	Modernize XGBoost Python document. (#7468 ) * Use sphinx gallery to integrate examples. * Remove mock objects. * Add dask doc inventory.	2021-11-23 23:24:52 +08:00
Jiaming Yuan	45aef75cca	Move skl `eval_metric` and `early_stopping rounds` to model params. (#6751 ) A new parameter `custom_metric` is added to `train` and `cv` to distinguish the behaviour from the old `feval`. And `feval` is deprecated. The new `custom_metric` receives transformed prediction when the built-in objective is used. This enables XGBoost to use cost functions from other libraries like scikit-learn directly without going through the definition of the link function. `eval_metric` and `early_stopping_rounds` in sklearn interface are moved from `fit` to `__init__` and is now saved as part of the scikit-learn model. The old ones in `fit` function are now deprecated. The new `eval_metric` in `__init__` has the same new behaviour as `custom_metric`. Added more detailed documents for the behaviour of custom objective and metric.	2021-10-28 17:20:20 +08:00
Jiaming Yuan	2eee87423c	Remove old custom objective demo. (#7369 ) We have 2 new custom objective demos covering both regression and classification with accompanying tutorials in documents.	2021-10-27 16:31:48 +08:00
Jiaming Yuan	15685996fc	[doc] Small improvements for categorical data document. (#7330 )	2021-10-20 18:04:32 +08:00
Jiaming Yuan	6cdcfe8128	Improve external memory demo. (#7320 ) * Use npy format. * Add evaluation. * Use make_regression.	2021-10-17 11:25:24 +08:00
Jiaming Yuan	0bd8f21e4e	Add document for categorical data. (#7307 )	2021-10-12 16:10:59 +08:00
Jiaming Yuan	0ed979b096	Support more input types for categorical data. (#7220 ) * Support more input types for categorical data. * Shorten the type name from "categorical" to "c". * Tests for np/cp array and scipy csr/csc/coo. * Specify the type for feature info.	2021-09-16 20:39:30 +08:00
Jiaming Yuan	d997c967d5	Demo for experimental categorical data support. (#7213 )	2021-09-15 08:20:12 +08:00
Jiaming Yuan	7bdedacb54	Document for `process_type`. (#7135 ) * Update document for prune and refresh. * Add demo.	2021-08-03 13:11:52 +08:00
Jiaming Yuan	778135f657	Fix parameter loading with training continuation. (#7121 ) * Add a demo for training continuation.	2021-07-23 10:51:47 +08:00
Jiaming Yuan	e6088366df	Export Python Interface for external memory. (#7070 ) * Add Python iterator interface. * Add tests. * Add demo. * Add documents. * Handle empty dataset.	2021-07-22 15:15:53 +08:00
Jiaming Yuan	663136aa08	Implement feature score for linear model. (#7048 ) * Add feature score support for linear model. * Port R interface to the new implementation. * Add linear model support in Python. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2021-06-25 14:34:02 +08:00
Jiaming Yuan	5c87c2bba8	Update demo for prediction. (#6789 ) * Remove use of deprecated ntree_limit. * Add sklearn demo.	2021-03-27 03:09:25 +08:00
MBSMachineLearning	95cbfad990	"featue_map" typo changed to "feature_map" (#6540 )	2020-12-21 22:11:11 +08:00
Jiaming Yuan	c90f968d92	Update Python documents. (#6376 )	2020-11-12 17:51:32 +08:00
Jiaming Yuan	81c37c28d5	Time the CPU tests on Jenkins. (#6257 ) * Time the CPU tests on Jenkins. * Reduce thread contention. * Add doc. * Skip heavy tests on ARM.	2020-10-20 17:19:07 -07:00
Jiaming Yuan	ab5b35134f	Rework Python callback functions. (#6199 ) * Define a new callback interface for Python. * Deprecate the old callbacks. * Enable early stopping on dask.	2020-10-10 17:52:36 +08:00
Christian Lorentzen	cf4f019ed6	[Breaking] Change default evaluation metric for classification to logloss / mlogloss (#6183 ) * Change DefaultEvalMetric of classification from error to logloss * Change default binary metric in plugin/example/custom_obj.cc * Set old error metric in python tests * Set old error metric in R tests * Fix missed eval metrics and typos in R tests * Fix setting eval_metric twice in R tests * Add warning for empty eval_metric for classification * Fix Dask tests Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-10-02 12:06:47 -07:00
John Quitto-Graham	e0e4f15d0e	Fix a comment in demo to use correct reference (#6190 ) Co-authored-by: John Quitto Graham <johnq@dgx07.aselab.nvidia.com>	2020-10-01 13:16:04 -07:00
Jiaming Yuan	b5f52f0b1b	Validate weights are positive values. (#6115 )	2020-09-15 09:03:55 +08:00
Daniel Steinberg	68c55a37d9	Add cache name back to external_memory.py files. (#6088 )	2020-09-06 16:01:09 +08:00
Jiaming Yuan	4d99c58a5f	Feature weights (#5962 )	2020-08-18 19:55:41 +08:00
Jiaming Yuan	9c93531709	Update Python custom objective demo. (#5981 )	2020-08-05 12:27:19 +08:00
Jiaming Yuan	18349a7ccf	[Breaking] Fix custom metric for multi output. (#5954 ) * Set output margin to true for custom metric. This fixes only R and Python.	2020-07-29 19:25:27 +08:00
Jiaming Yuan	a3ec964346	Accept iterator in device dmatrix. (#5783 ) * Remove Device DMatrix.	2020-07-07 21:44:48 +08:00
Jiaming Yuan	5af8161a1a	Implement Python data handler. (#5689 ) * Define data handlers for DMatrix. * Throw ValueError in scikit learn interface.	2020-05-22 11:53:55 +08:00
Jiaming Yuan	2c1a439869	Update Python demos with tests. (#5651 ) * Remove GPU memory usage demo. * Add tests for demos. * Remove `silent`. * Remove shebang as it's not portable.	2020-05-12 12:04:42 +08:00
Jiaming Yuan	9c1103e06c	[Breaking] Set output margin to True for custom objective. (#5564 ) * Set output margin to True for custom objective in Python and R. * Add a demo for writing multi-class custom objective function. * Run tests on selected demos.	2020-04-20 20:44:12 +08:00
Nicolas Scozzaro	04f69b43e6	fix typo "customized" (#5515 )	2020-04-12 14:43:48 +08:00
Jiaming Yuan	0202e04a8e	Add base margin to sklearn interface. (#5151 )	2019-12-24 09:43:41 +08:00
Jiaming Yuan	5b2f805e74	Doc and demo for customized metric and obj. (#4598 ) Co-Authored-By: Theodore Vasiloudis <theodoros.vasiloudis@gmail.com>	2019-06-26 16:13:12 +08:00
Sergei Chipiga	8dac0d1009	Fix typo in python demo (#3676 )	2018-09-06 14:56:21 -07:00
Philip Hyunsu Cho	983cb0b374	Add option to disable default metric (#3606 )	2018-08-18 11:39:20 -07:00
Philip Hyunsu Cho	ac7fc1306b	Fix #3598 : document that custom objective can't contain colon (:) (#3601 )	2018-08-16 19:05:40 -07:00
zhaocc	8fb3388af2	fix typo (#3188 )	2018-03-21 19:24:29 -04:00
Peter M. Landwehr	86bf930497	Fix typo: cutomize -> customize (#3073 )	2018-02-04 22:56:04 +01:00

1 2

91 Commits