xgboost

Author	SHA1	Message	Date
Yuan (Terry) Tang	090b37e85d	Bumped up err assert in glm test (#1792)	2016-11-20 18:23:19 -06:00
AbdealiJK	b94fcab4dc	Add dump_format=json option (#1726 ) * Add format to the params accepted by DumpModel Currently, only the test format is supported when trying to dump a model. The plan is to add more such formats like JSON which are easy to read and/or parse by machines. And to make the interface for this even more generic to allow other formats to be added. Hence, we make some modifications to make these function generic and accept a new parameter "format" which signifies the format of the dump to be created. * Fix typos and errors in docs * plugin: Mention all the register macros available Document the register macros currently available to the plugin writers so they know what exactly can be extended using hooks. * sparce_page_source: Use same arg name in .h and .cc * gbm: Add JSON dump The dump_format argument can be used to specify what type of dump file should be created. Add functionality to dump gblinear and gbtree into a JSON file. The JSON file has an array, each item is a JSON object for the tree. For gblinear: - The item is the bias and weights vectors For gbtree: - The item is the root node. The root node has a attribute "children" which holds the children nodes. This happens recursively. * core.py: Add arg dump_format for get_dump()	2016-11-04 09:55:25 -07:00
Yuan (Terry) Tang	63829d656c	Fix mknfold using new StratifiedKFold API (#1660)	2016-10-12 14:43:37 -07:00
Yuan (Terry) Tang	a64fd74421	Fix wrong expected feature types (#1646)	2016-10-08 21:16:29 -07:00
Vadim Khotilovich	693ddb860e	More robust DMatrix creation from a sparse matrix (#1606 ) * [CORE] DMatrix from sparse w/ explicit #col #row; safer arg types * [python-package] c-api change for _init_from_csr _init_from_csc * fix spaces * [R-package] adopt the new XGDMatrixCreateFromCSCEx interface * [CORE] redirect old sparse creators to new ones	2016-09-25 10:01:22 -07:00
Yoshinori Nakano	949d1e3027	add Dart booster (#1220)	2016-06-08 14:04:01 -07:00
Vadim Khotilovich	9a48a40cf1	Fixes for multiple and default metric (#1239 ) * fix multiple evaluation metrics * create DefaultEvalMetric only when really necessary * py test for #1239 * make travis happy	2016-06-04 22:17:35 -07:00
tqchen	149589c583	[PYTHON] Refactor training API to use callback	2016-05-19 21:31:23 -07:00
Alistair Johnson	6750c8b743	Added other feature importances in python package (#1135 ) * added new function to calculate other feature importances * added capability to plot other feature importance measures * changed plotting default to fscore * added info on importance_type to boilerplate comment * updated text of error statement * added self module name to fix call * added unit test for feature importances * style fixes	2016-05-02 12:25:24 -05:00
sinhrks	9da2f3e613	DOC/TST: Fix Python sklearn dep	2016-05-01 17:27:43 +09:00
Faron	ad3f49e881	[py] eta decay bugfix	2016-04-30 15:51:57 +02:00
sinhrks	6bab164d80	Bug mixing DMatrix's with and without feature names	2016-04-30 14:42:57 +09:00
Faron	cf607e2448	[py] split value histograms	2016-04-28 20:26:21 +02:00
sinhrks	c55cc809e5	BUG: XGBClassifier.feature_importances_ raises ValueError if input is pandas DataFrame	2016-04-27 21:50:03 +09:00
sinhrks	8fc2456c87	Enable flake8	2016-04-24 17:32:31 +09:00
tqchen	ec2fb5bc48	Fix multi-class loading	2016-03-10 19:22:26 -08:00
terrytangyuan	803a6fe474	Separate dependencies and lightweight test env for Python	2016-02-28 20:11:10 -06:00
tqchen	4a16b729fc	[PYTHON] Simplify training logic, update rabit lib	2016-02-28 13:20:55 -08:00
tqchen	90bc7f8f6b	[TEST] Fix travis test when reading hdfs	2016-02-27 18:15:32 -08:00
Tianqi Chen	758a77de9c	Fix testcase after update and allow hdfs load	2016-02-26 17:04:51 -08:00
ivallesp	ed5c98f0ee	re-using the verbose-eval parameter in the cv and aggcv methods and tests adapted	2016-02-19 17:14:57 +01:00
Alexis Mignon	52e9085579	Merge branch 'master' of github.com:AlexisMignon/xgboost	2016-02-16 11:00:57 +01:00
Alexis Mignon	6e27d7539f	- Added test cases for the use of custom objective functions - Made the indentation more consistent with pep8	2016-02-16 10:59:25 +01:00
Faron	4b3a053913	stratified cv for python wrapper finalize docstring	2016-02-15 16:06:17 +01:00
FrozenFingerz	177259a0a7	unittest for cv bugfixes added	2015-12-29 14:13:40 +01:00
sinhrks	25c4fbd0cb	Cleanup pandas support	2015-11-13 06:55:04 +09:00
Faron	7f2628acd7	unittest for 'num_class > 2' added	2015-11-12 08:23:11 +01:00
Far0n	ce5930c365	best_ntree_limit attribute added - best_ntree_limit as new booster attribute added - usage of bst.best_ntree_limit in python doc added - fixed wrong 'best_iteration' after training continuation	2015-11-10 15:37:22 +01:00
Yuan (Terry) Tang	1dd96b6cdc	Merge pull request #597 from JohanManders/python-pandas-dtypes Python pandas dtypes	2015-11-09 18:08:41 -06:00
Yuan (Terry) Tang	01053f8f2f	Merge pull request #594 from Far0n/feval python: multiple eval_metrics changes	2015-11-08 10:10:28 -05:00
FrozenFingerz	3d36fa8f4e	python: unittest for early stopping of cv	2015-11-08 11:42:57 +01:00
FrozenFingerz	b59018aa05	python: multiple eval_metrics changes - allows feval to return a list of tuples (name, error/score value) - changed behavior for multiple eval_metrics in conjunction with early_stopping: Instead of raising an error, the last passed eval_metric (or last entry in return value of feval) is used for early stopping - allows list of eval_metrics in dict-typed params - unittest for new features / behavior documentation updated - example for assigning a list to 'eval_metric' - note about early stopping on last passed eval metric - info msg for used eval metric added	2015-11-08 11:23:54 +01:00
Yuan (Terry) Tang	190e58a8c6	Added test for maximize parameter	2015-11-04 22:25:10 -06:00
Johan Manders	b0f38e9352	Changed 4 tests Changed symbol test to give error on < sign, not on = sign Changed 3 other functions, so that float is used instead of q	2015-11-03 21:32:47 +01:00
Yuan (Terry) Tang	deb802b2be	Merge pull request #587 from Far0n/py_train python training continuation & maximize parameter	2015-11-03 08:16:12 -06:00
Far0n	8e1adddc2b	added unittest for training continuation	2015-11-03 14:44:17 +01:00
terrytangyuan	7d297b418f	Added more thorough test for early stopping (+1 squashed commit) Squashed commits: [4f78cc0] Added test for early stopping (+1 squashed commit)	2015-11-02 20:37:27 -06:00
terrytangyuan	166e878830	Added tests for additional params in sklearn wrapper (+1 squashed commit) Squashed commits: [43892b9] Added tests for additional params in sklearn wrapper	2015-11-02 19:54:36 -06:00
terrytangyuan	15a0d27eed	Fixed bug in eta decay (+2 squashed commits) Squashed commits: [b67caf2] Fix build [365ceaa] Fixed bug in eta decay	2015-10-31 12:54:27 -04:00
terrytangyuan	888edba03f	Added test for eta decay (+3 squashed commits) Squashed commits: [9109887] Added test for eta decay(+1 squashed commit) Squashed commits: [1336bd4] Added tests for eta decay (+2 squashed commit) Squashed commits: [91aac2d] Added tests for eta decay (+1 squashed commit) Squashed commits: [3ff48e7] Added test for eta decay [6bb1eed] Rewrote Rd files [bf0dec4] Added learning_rates for diff eta in each boosting round	2015-10-31 12:36:29 -04:00
sinhrks	1f19b78287	Python: adjusts plot_importance ylim	2015-10-25 03:16:53 +09:00
Tianqi Chen	d4d36eed45	Merge pull request #528 from terrytangyuan/test More Unit Tests for Python Package	2015-10-22 08:39:32 -07:00
terrytangyuan	ec2cdafec5	Added fixed random seed for tests (+1 squashed commit) Squashed commits: [76e3664] Added fixed random seed for tests	2015-10-21 23:38:41 -05:00
terrytangyuan	755072e378	Fix failed tests (+2 squashed commits) Squashed commits: [962e1e4] Fix failed tests [21ca3fb] Removed one unnecessary line	2015-10-21 23:15:34 -05:00
sinhrks	6f046327ac	Allow plot function to handle XGBModel	2015-10-22 01:00:54 +09:00
terrytangyuan	fc5036a630	Deleted redundant blank lines	2015-10-04 23:29:40 -05:00
terrytangyuan	9d627e2567	DOC: Updated contributors.md	2015-10-04 23:26:46 -05:00
terrytangyuan	5dd23a2195	TST: Added test for parameter tuning using GridSearchCV	2015-10-04 23:16:00 -05:00
terrytangyuan	956e50686e	TST: Added test for early stopping	2015-10-04 23:15:25 -05:00
terrytangyuan	412310ed04	Added test for regression using Boston Housing dataset	2015-10-04 23:04:23 -05:00

73 Commits