xgboost

Author	SHA1	Message	Date
Jiaming Yuan	1d0ca49761	Example JSON model parser and Schema. (#5137 )	2019-12-23 19:47:35 +08:00
Jiaming Yuan	a4b929385e	Note for `DaskDMatrix`. (#5144 ) * Brief introduction to `DaskDMatrix`. * Add xgboost.dask.train to API doc	2019-12-23 18:55:32 +08:00
Jiaming Yuan	c8bdb652c4	Add check for length of weights. (#4872 )	2019-12-21 11:30:58 +08:00
Rory Mitchell	3d04a8cc97	Use dynamic types for array interface columns instead of templates (#5108 )	2019-12-21 16:08:10 +13:00
Jiaming Yuan	b915788708	Remove benchmark code in GPU test. (#5141 ) * Update Jenkins script.	2019-12-21 11:00:21 +08:00
Philip Hyunsu Cho	74f545bde3	[CI] Repair download URL for Maven 3.6.1 (#5139 )	2019-12-20 10:07:40 +08:00
Jiaming Yuan	e521bb6f83	Added new train_folds parameter (#5114 ) https://stackoverflow.com/questions/32433458/how-to-specify-train-and-test-indices-for-xgb-cv-in-r-package-xgboost/51412073	2019-12-19 15:02:19 +01:00
Philip Hyunsu Cho	37fdfa03f8	[jvm-packages] Comply with scala style convention + fix broken unit test (#5134 ) * Fix scala style check * fix messed unit test	2019-12-18 17:26:58 -08:00
cpfarrell	bc9d88259f	[jvm-packages] Allow for bypassing spark missing value check (#4805 ) * Allow for bypassing spark missing value check * Update documentation for dealing with missing values in spark xgboost	2019-12-18 10:48:20 -08:00
Jiaming Yuan	27b3646d29	Tests and documents for new JSON routines. (#5120 )	2019-12-18 08:44:27 +08:00
Jiaming Yuan	63ffd2f686	Check against R seed. (#5125 ) * Handle it in R instead.	2019-12-17 19:14:59 +08:00
Jiaming Yuan	2fdb34ed2e	Fix metric name loading. (#5122 )	2019-12-16 10:14:02 +08:00
Jiaming Yuan	3136185bc5	JSON configuration IO. (#5111 ) * Add saving/loading JSON configuration. * Implement Python pickle interface with new IO routines. * Basic tests for training continuation.	2019-12-15 17:31:53 +08:00
Rory Mitchell	5aa007d7b2	Fix visual studio output library directories (#5119 )	2019-12-15 15:08:20 +13:00
Jiaming Yuan	ad4a1c732c	Small refinements for JSON model. (#5112 ) * Naming consistency. * Remove duplicated test.	2019-12-11 19:49:01 +08:00
Jiaming Yuan	208ab3b1ff	Model IO in JSON. (#5110 )	2019-12-11 11:20:40 +08:00
Rory Mitchell	c7cc657a4d	Use adapters for SparsePageDMatrix (#5092 )	2019-12-11 15:59:23 +13:00
Jiaming Yuan	e089e16e3d	Pass pointer to model parameters. (#5101 ) * Pass pointer to model parameters. This PR de-duplicates most of the model parameters except the one in `tree_model.h`. One difficulty is `base_score` is a model property but can be changed at runtime by objective function. Hence when performing model IO, we need to save the one provided by users, instead of the one transformed by objective. Here we created an immutable version of `LearnerModelParam` that represents the value of model parameter after configuration.	2019-12-10 12:11:22 +08:00
Rory Mitchell	979f74d51a	Group builder modified for incremental building (#5098 )	2019-12-10 14:33:56 +13:00
Jiaming Yuan	1cb6bcc382	Remove dead code in colmaker. (#5105 )	2019-12-10 09:32:37 +08:00
Egor Smirnov	b1789b0346	added tracking execution time for UpdatePredictionCache function (#5107 )	2019-12-10 01:32:56 +08:00
Jiaming Yuan	38763aa4fa	Update document for tree_method. [skip ci] (#5106 )	2019-12-09 22:55:00 +08:00
Jiaming Yuan	608ebbe444	Fix GPU ID and prediction cache from pickle (#5086 ) * Hack for saving GPU ID. * Declare prediction cache on GBTree. * Add a simple test. * Add `auto` option for GPU Predictor.	2019-12-07 16:02:06 +08:00
Jiaming Yuan	7ef5b78003	Implement JSON IO for updaters (#5094 ) * Implement JSON IO for updaters. * Remove parameters in split evaluator.	2019-12-07 00:24:00 +08:00
Jiaming Yuan	2dcb62ddfb	Add IO utilities. (#5091 ) * Add fixed size stream for reading model stream. * Add file extension.	2019-12-05 22:15:34 +08:00
Jiaming Yuan	64af1ecf86	[Breaking] Remove num roots. (#5059 )	2019-12-05 21:58:43 +08:00
Jiaming Yuan	f3d8536702	Don't use 0 for "fresh leaf". (#5084 ) * Allow using right child as marker for Exact tree_method.	2019-12-05 11:50:51 +08:00
Jiaming Yuan	df9bdbbcb9	Fix parsing empty vector in parameter. (#5087 )	2019-12-05 11:42:01 +08:00
Jiaming Yuan	f5e13dcb9b	Implement training observer. (#5088 )	2019-12-05 11:12:20 +08:00
Jiaming Yuan	f0ca53d9ec	Convenient methods for JSON integer. (#5089 ) * Fix parsing empty object.	2019-12-05 11:01:12 +08:00
yage	dcde433402	Fix MacOS build error. (#5080 ) * Update build script. * Update build doc.	2019-12-04 19:34:35 +08:00
Rory Mitchell	e3c34c79be	External data adapters (#5044 ) * Use external data adapters as lightweight intermediate layer between external data and DMatrix	2019-12-04 10:56:17 +13:00
Kodi Arfer	f2277e7106	Use DART tree weights when computing SHAPs (#5050 ) This PR fixes tree weights in dart being ignored when computing contributions. * Fix ellpack page source link. * Add tree weights to compute contribution.	2019-12-03 19:55:53 +08:00
Philip Hyunsu Cho	64f4361b47	[CI] Locate vcomp140.dll from System32 directory (#5078 )	2019-12-02 02:09:32 -08:00
Jiaming Yuan	761e938dbe	Support dask dataframe as y for classifier. (#5077 ) * Support dask dataframe as y for classifier. * Lint.	2019-12-02 11:53:30 +08:00
yage	b9dbfe0931	Update doc for building on OSX (#5074 ) Co-Authored-By: Jiaming Yuan <jm.yuan@outlook.com>	2019-11-28 20:14:44 +08:00
Jiaming Yuan	9f52e834dc	[doc] Some notes for external memory. (#5065 )	2019-11-26 00:22:02 +08:00
Jiaming Yuan	d667ea9335	[CI] Fix Travis tests. (#5062 ) - Install wget explicitly to match openssl. - Install CMake explicitly. - Use newer miniconda link. - Reenable unittests. - gcc@9 + xcode@10 for osx due to missing <_stdio.h>. Other versions of gcc should also work. But as homebrew pour gcc@9 after update by default, so I just stick with latest version. - Disabled one external memory test for OSX. Not sure about the thread implementation in there and fixing external memory is beyond the scope of this PR. - Use Python3 with conda in jvm package.	2019-11-25 03:32:10 +08:00
Jiaming Yuan	04c640f562	Add cuDF DataFrame to doc. (#5053 )	2019-11-19 18:29:40 +08:00
Jiaming Yuan	a4f5c86276	Allow using RandomState object from Numpy in sklearn interface. (#5049 )	2019-11-19 10:56:39 +08:00
Philip Hyunsu Cho	4d2779663e	Require Python 3.5+ in setup.py (#5021 )	2019-11-18 18:55:58 -08:00
Jiaming Yuan	98b051269b	Assert dask client at early stage. (#5048 )	2019-11-19 10:55:26 +08:00
Rory Mitchell	e67388fb8f	Some guidelines on device memory usage (#5038 ) * Add memory usage demo * Update documentation	2019-11-17 07:48:24 +13:00
Rong Ou	0afcc55d98	Support multiple batches in gpu_hist (#5014 ) * Initial external memory training support for GPU Hist tree method.	2019-11-16 14:50:20 +08:00
Jiaming Yuan	97abcc7ee2	Extract interaction constraint from split evaluator. (#5034 ) * Extract interaction constraints from split evaluator. The reason for doing so is mostly for model IO, where num_feature and interaction_constraints are copied in split evaluator. Also interaction constraint by itself is a feature selector, acting like column sampler and it's inefficient to bury it deep in the evaluator chain. Lastly removing one another copied parameter is a win. * Enable inc for approx tree method. As now the implementation is spited up from evaluator class, it's also enabled for approx method. * Removing obsoleted code in colmaker. They are never documented nor actually used in real world. Also there isn't a single test for those code blocks. * Unifying the types used for row and column. As the size of input dataset is marching to billion, incorrect use of int is subject to overflow, also singed integer overflow is undefined behaviour. This PR starts the procedure for unifying used index type to unsigned integers. There's optimization that can utilize this undefined behaviour, but after some testings I don't see the optimization is beneficial to XGBoost.	2019-11-14 20:11:41 +08:00
Sebastian	886bf93ba4	fix: reset hit counter for next batch (#5035 )	2019-11-14 10:22:02 +08:00
sriramch	2abe69d774	- ndcg ltr implementation on gpu (#5004 ) * - ndcg ltr implementation on gpu - this is a follow-up to the pairwise ltr implementation	2019-11-13 11:21:04 +13:00
Philip Hyunsu Cho	f4e7b707c9	Revert #4529 (#5008 ) * Revert " Optimize ‘hist’ for multi-core CPU (#4529)" This reverts commit 4d6590be3c9a043d44d9e4fe0a456a9f8179ec72. * Fix build	2019-11-12 09:35:03 -08:00
KaiJin Ji	1733c9e8f7	Improve operation efficiency for single predict (#5016 ) * Improve operation efficiency for single predict	2019-11-10 02:01:28 +08:00
mitama	374648c21a	Add better error message for invalid feature names (#5024 )	2019-11-10 01:58:14 +08:00

1 2 3 4 5 ...

3983 Commits