xgboost

Author	SHA1	Message	Date
Jiaming Yuan	088c43d666	Fix changing locale. (#5314 ) * Fix changing locale. * Don't use locale guard. As number parsing is implemented in house, we don't need locale. * Update doc.	2020-02-17 13:01:48 +08:00
Jiaming Yuan	213f4fa45a	Fix loading old logit model, helper for converting old pickle. (#5281 ) * Fix loading old logit model. * Add a helper script for converting old pickle file. * Add version as a model parameter. * Remove the size check in R test to relax the size constraint. * Add missing R doc for passing linting. Run devtools. * Cleanup old model IO logic. * Test compatibility on CI. * Make the argument as required.	2020-02-13 15:28:13 +08:00
Jiaming Yuan	472ded549d	Save Scikit-Learn attributes into learner attributes. (#5245 ) * Remove the recommendation for pickle. * Save skl attributes in booster.attr * Test loading scikit-learn model with native booster.	2020-01-30 16:00:18 +08:00
Jiaming Yuan	ef19480eda	Add dart to JSON schema. (#5218 ) * Add dart to JSON schema. * Use spaces instead of tab.	2020-01-28 13:29:09 +08:00
Kodi Arfer	f100b8d878	[Breaking] Don't drop trees during DART prediction by default (#5115 ) * Simplify DropTrees calling logic * Add `training` parameter for prediction method. * [Breaking]: Add `training` to C API. * Change for R and Python custom objective. * Correct comment. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu> Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>	2020-01-13 21:48:30 +08:00
Jiaming Yuan	7b65698187	Enforce correct data shape. (#5191 ) * Fix syncing DMatrix columns. * notes for tree method. * Enable feature validation for all interfaces except for jvm. * Better tests for boosting from predictions. * Disable validation on JVM.	2020-01-13 15:48:17 +08:00
Jiaming Yuan	1d0ca49761	Example JSON model parser and Schema. (#5137 )	2019-12-23 19:47:35 +08:00
Jiaming Yuan	a4b929385e	Note for `DaskDMatrix`. (#5144 ) * Brief introduction to `DaskDMatrix`. * Add xgboost.dask.train to API doc	2019-12-23 18:55:32 +08:00
Jiaming Yuan	27b3646d29	Tests and documents for new JSON routines. (#5120 )	2019-12-18 08:44:27 +08:00
Jiaming Yuan	9f52e834dc	[doc] Some notes for external memory. (#5065 )	2019-11-26 00:22:02 +08:00
Jiaming Yuan	d667ea9335	[CI] Fix Travis tests. (#5062 ) - Install wget explicitly to match openssl. - Install CMake explicitly. - Use newer miniconda link. - Reenable unittests. - gcc@9 + xcode@10 for osx due to missing <_stdio.h>. Other versions of gcc should also work. But as homebrew pours gcc@9 after update by default, I just stick with the latest version. - Disabled one external memory test for OSX. Not sure about the thread implementation in there and fixing external memory is beyond the scope of this PR. - Use Python3 with conda in jvm package.	2019-11-25 03:32:10 +08:00
Jiaming Yuan	97abcc7ee2	Extract interaction constraint from split evaluator. (#5034 ) * Extract interaction constraints from split evaluator. The reason for doing so is mostly for model IO, where num_feature and interaction_constraints are copied in split evaluator. Also interaction constraint by itself is a feature selector, acting like column sampler and it's inefficient to bury it deep in the evaluator chain. Lastly removing one another copied parameter is a win. * Enable inc for approx tree method. As now the implementation is split up from evaluator class, it's also enabled for approx method. * Removing obsoleted code in colmaker. They are never documented nor actually used in real world. Also there isn't a single test for those code blocks. * Unifying the types used for row and column. As the size of input dataset is marching to billion, incorrect use of int is subject to overflow, also signed integer overflow is undefined behaviour. This PR starts the procedure for unifying used index type to unsigned integers. There's optimization that can utilize this undefined behaviour, but after some testings I don't see the optimization is beneficial to XGBoost.	2019-11-14 20:11:41 +08:00
Jiaming Yuan	7e477a2adb	Fix data loading (#4862 ) * Fix loading text data. * Fix config regex. * Try to explain the error better in exception. * Update doc.	2019-10-22 12:33:14 -04:00
Jiaming Yuan	b8433c455a	Rewrite Dask interface. (#4819 )	2019-09-25 01:30:14 -04:00
sriramch	f22b1c0348	Fix external memory documentation [skip ci] (#4747 ) * - fix external memory documentation [skip ci] - to state that it is supported now on gpu algorithms	2019-08-08 09:27:02 +08:00
Rong Ou	851b5b3808	Remove gpu_exact tree method (#4742 )	2019-08-07 11:43:20 +12:00
Jiaming Yuan	ad1192e8a3	Remove `silent` in doc. [skip ci] (#4689 )	2019-07-20 05:53:42 -04:00
Mingjie Tang	beb7b295a8	Add tutorial for distributed training and batch prediction with Kubernetes (#4621 ) * provide the readme * update for format * reformat * reformat -2 * update again * update format * update w.r.t yinlou's comments * Add kubernetes tutorial to Table of Contents * Style edit	2019-07-14 23:27:27 -07:00
Philip Hyunsu Cho	cd3a3f99da	Fix doc for customized objective/metric [skip ci] (#4608 )	2019-06-26 13:40:34 -07:00
Jiaming Yuan	5b2f805e74	Doc and demo for customized metric and obj. (#4598 ) Co-Authored-By: Theodore Vasiloudis <theodoros.vasiloudis@gmail.com>	2019-06-26 16:13:12 +08:00
Jiaming Yuan	2cff735126	Update doc for feature constraints and `n_gpus`. (#4596 ) * Update doc for feature constraints. * Fix some warnings. * Clean up doc for `n_gpus`.	2019-06-23 14:37:22 +08:00
Jiaming Yuan	9494950ee7	Address some sphinx warnings and errors, add doc for building doc. (#4589 )	2019-06-20 15:07:36 -07:00
Philip Hyunsu Cho	515f5f5c47	[RFC] Version 0.90 release candidate (#4475 ) * Release 0.90 * Add script to automatically generate acknowledgment * Update NEWS.md	2019-05-20 01:02:44 -07:00
tqchen	91c513a0c1	fix doc	2019-04-29 17:50:46 -07:00
Ravi Kalia	146e83f3b3	Fix typo in model.rst (#4393 )	2019-04-27 14:22:07 -07:00
Philip Hyunsu Cho	331cd3e4f7	Document limitation of one-split-at-a-time Greedy tree learning heuristic (#4233 )	2019-03-08 10:05:39 -08:00
Philip Hyunsu Cho	4f26053b09	Fix typo in Feature Interaction Constraints tutorial (#3975 )	2018-12-06 19:38:40 -08:00
Philip Hyunsu Cho	828d75714d	Fix #3857 : take down AWS YARN tutorial, as it is outdated (#3885 )	2018-11-08 23:08:32 -08:00
Andrew Thia	9254c58e4d	[TREE] add interaction constraints (#3466 ) * add interaction constraints * enable both interaction and monotonic constraints at the same time * fix lint * add R test, fix lint, update demo * Use dmlc::JSONReader to express interaction constraints as nested lists; Use sparse arrays for bookkeeping * Add Python test for interaction constraints * make R interaction constraints parameter based on feature index instead of column names, fix R coding style * Fix lint * Add BlueTea88 to CONTRIBUTORS.md * Short circuit when no constraint is specified; address review comments * Add tutorial for feature interaction constraints * allow interaction constraints to be passed as string, remove redundant column_names argument * Fix typo * Address review comments * Add comments to Python test	2018-09-04 09:35:39 -07:00
Philip Hyunsu Cho	cb4de521c1	Document CUDA requirement, lack of external memory on GPU (#3624 ) * Document fact that GPU doesn't support external memory * Document CUDA requirement	2018-08-22 22:47:10 -07:00
Grant W Schneider	57f3c2f252	Remove errant $ (#3618 )	2018-08-21 12:32:38 -07:00
Philip Hyunsu Cho	0b607fb884	Add link to XGBoost4J-Spark tutorial on AWS Yarn tutorial (#3582 )	2018-08-12 07:27:28 -07:00
Philip Hyunsu Cho	3c72654e3b	Revert "Fix #3485 , #3540 : Don't use dropout for predicting test sets" (#3563 ) * Revert "Fix #3485, #3540: Don't use dropout for predicting test sets (#3556)" This reverts commit 44811f233071c5805d70c287abd22b155b732727. * Document behavior of predict() for DART booster * Add notice to parameter.rst	2018-08-08 09:48:55 -07:00
Zeno Gantner	e3e776bd58	grammar fixes and typos (#3568 )	2018-08-08 09:48:27 -07:00
Nan Zhu	31d1baba3d	[jvm-packages] Tutorial of XGBoost4J-Spark (#3534 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * add new * update doc * finish Gang Scheduling * more * intro * Add sections: Prediction, Model persistence and ML pipeline. * Add XGBoost4j-Spark MLlib pipeline example * partial finished version * finish the doc * adjust code * fix the doc * use rst * Convert XGBoost4J-Spark tutorial to reST * Bring XGBoost4J up to date * add note about using hdfs * remove duplicate file * fix descriptions * update doc * Wrap HDFS/S3 export support as a note * update * wrap indexing_mode example in code block	2018-08-03 21:17:50 -07:00
Philip Hyunsu Cho	05b089405d	Doc modernization (#3474 ) * Change doc build to reST exclusively * Rewrite Intro doc in reST; create toctree * Update parameter and contribute * Convert tutorials to reST * Convert Python tutorials to reST * Convert CLI and Julia docs to reST * Enable markdown for R vignettes * Done migrating to reST * Add guzzle_sphinx_theme to requirements * Add breathe to requirements * Fix search bar * Add link to user forum	2018-07-19 14:22:16 -07:00
redditur	d5f1b74ef5	'hist': Monotonic Constraints (#3085 ) * Extended monotonic constraints support to 'hist' tree method. * Added monotonic constraints tests. * Fix the signature of NoConstraint::CalcSplitGain() * Document monotonic constraint support in 'hist' * Update signature of Update to account for latest refactor	2018-03-05 16:45:49 -08:00
Dmitry Mottl	20b733e1a0	Minor: removed extra parenthesis in doc (#3119 )	2018-02-20 02:55:29 -08:00
Philip Hyunsu Cho	375d75304d	Fix typos, addressing issues #2212 and #3090 (#3105 )	2018-02-09 11:16:44 -08:00
Kyle Willett	7e07b2b93d	Correcting small typos in documentation. (#1901 )	2016-12-31 20:47:51 +08:00
Matthew Drury	edc356f7ec	Add monotonic tutorial. (#1870 )	2016-12-14 20:17:19 -06:00
Dr. Kashif Rasul	da2556f58a	fixed some typos (#1814 )	2016-11-25 16:34:57 -05:00
marugari	c332eb5a2b	add Dart tutorial	2016-07-10 20:12:42 +09:00
tqchen	84ae514d7e	[DOC] refactor doc	2016-05-20 13:09:42 -07:00

44 Commits