xgboost

Author	SHA1	Message	Date
Johan Manders	f116722e68	Merge pull request #3 from dmlc/master Getting latest version from dmlc	2015-10-17 14:41:13 +02:00
Tianqi Chen	8e4dc43368	Merge pull request #540 from JohanManders/quansie-python-training-patch-1 Update training.py and sklearn.py for evals_result	2015-10-16 20:42:29 -07:00
Johan Manders	00387cb645	Removed the last few trailing whitespaces	2015-10-14 14:26:18 +02:00
Johan Manders	0f8f8e05b2	One line was too long	2015-10-14 14:18:31 +02:00
Johan Manders	82c2ba4c44	Removed trailing whitespaces and Change Error to XGBoostError	2015-10-14 14:17:57 +02:00
Johan Manders	edf4595bc1	Added evals result demos	2015-10-14 13:45:59 +02:00
Johan Manders	f1e1cc28ff	Access xgboost eval metrics by using sklearn	2015-10-14 13:43:14 +02:00
Johan Manders	122ec48a89	Update evals_result.py	2015-10-14 13:40:20 +02:00
Johan Manders	6e2bdcbbbc	Demo for accessing eval metrics in xgboost	2015-10-14 13:22:39 +02:00
Johan Manders	67f3c687b8	Added Johan Manders to the list, asked by Tianqi Chen	2015-10-14 13:06:14 +02:00
Johan Manders	9c8420a4dc	Updated the documentation a bit Will upload some demos for guide-python later.	2015-10-14 12:53:42 +02:00
Johan Manders	e960a09ff4	Made eval_results for sklearn output the same structure as in the new training.py Changed the name of eval_results to evals_result, so that the naming is the same in training.py and sklearn.py Made the structure of evals_result the same as in training.py, the names of the keys are different: In sklearn.py you cannot name your evals_result, but they are automatically called 'validation_0', 'validation_1' etc. The dict evals_result will output something like: {'validation_0': {'logloss': ['0.674800', '0.657121']}, 'validation_1': {'logloss': ['0.63776', '0.58372']}} In training.py you can name your multiple evals_result with a watchlist like: watchlist = [(dtest,'eval'), (dtrain,'train')] The dict evals_result will output something like: {'train': {'logloss': ['0.68495', '0.67691']}, 'eval': {'logloss': ['0.684877', '0.676767']}} You can access the evals_result using the evals_result() function.	2015-10-14 12:51:46 +02:00
Johan Manders	e339cdec52	Too many branches and unused key	2015-10-12 16:47:24 +02:00
Johan Manders	40566cdbba	update sklearn.py because evals_result in training.py changed Because I changed the training.py, the sklearn.py had to be changed also to be able to read all the data from evals_result.	2015-10-12 16:31:23 +02:00
quansie	30d0d5fb96	Merge pull request #2 from quansie/quansie-python-training-patch-1 Removed extra spaces	2015-10-12 14:28:50 +02:00
quansie	b758a13813	Removed extra spaces	2015-10-12 14:26:23 +02:00
quansie	541580d157	Update training.py	2015-10-12 14:19:25 +02:00
quansie	8a484e990e	Merge pull request #1 from quansie/quansie-python-training-patch-1 training.py - pass all eval_metric information to evals_result	2015-10-12 14:11:34 +02:00
quansie	1ca737ed55	Update training.py Made changes to training.py to make sure all eval_metric information get passed to evals_result. Previous version lost and mislabeled data in evals_result when using more than one eval_metric. Structure of eval_metric is now: eval_metric[evals][eval_metric] = list of metrics Example: >>> dtrain = xgb.DMatrix('agaricus.txt.train', silent=True) >>> dtest = xgb.DMatrix('agaricus.txt.test', silent=True) >>> param = [('max_depth', 2), ('objective', 'binary:logistic'), ('bst:eta', 0.01), ('eval_metric', 'logloss'), ('eval_metric', 'error')] >>> watchlist = [(dtest,'eval'), (dtrain,'train')] >>> num_round = 3 >>> evals_result = {} >>> bst = xgb.train(param, dtrain, num_round, watchlist, evals_result=evals_result) >>> print(evals_result['eval']['logloss']) >>> print(evals_result) Prints: ['0.684877', '0.676767', '0.668817'] {'train': {'logloss': ['0.684954', '0.676917', '0.669036'], 'error': ['0.04652', '0.04652', '0.04652']}, 'eval': {'logloss': ['0.684877', '0.676767', '0.668817'], 'error': ['0.042831', '0.042831', '0.042831']}}	2015-10-11 01:09:05 +02:00
Tong He	e9edb03eff	Merge pull request #533 from kferris10/master Switch default missing values from 0 to NA in R package	2015-10-08 10:47:28 -07:00
kferris	d5a34339e5	Updated Changes	2015-10-08 13:22:23 -04:00
kferris	32ca060094	Fix merge conflicts	2015-10-08 08:58:27 -04:00
Tong He	81d4d4d2c1	Update utils.R	2015-10-07 18:26:33 -07:00
kferris	7a94bdb60c	Switch missing values from 0 to NA in R package	2015-10-07 18:51:47 -04:00
terrytangyuan	1080dc256a	Fix Travis build	2015-10-05 00:46:56 -05:00
terrytangyuan	fc5036a630	Deleted redundant blank lines	2015-10-04 23:29:40 -05:00
terrytangyuan	9d627e2567	DOC: Updated contributors.md	2015-10-04 23:26:46 -05:00
terrytangyuan	5dd23a2195	TST: Added test for parameter tuning using GridSearchCV	2015-10-04 23:16:00 -05:00
terrytangyuan	956e50686e	TST: Added test for early stopping	2015-10-04 23:15:25 -05:00
terrytangyuan	412310ed04	Added test for regression using Boston Housing dataset	2015-10-04 23:04:23 -05:00
terrytangyuan	d20bfb12e4	Added assertions for classification tests	2015-10-04 23:01:07 -05:00
terrytangyuan	3dbd4af263	TST: Added tests for multi-class classification	2015-10-04 22:57:13 -05:00
terrytangyuan	7b9b4f821b	TST: Added tests for binary classification	2015-10-04 22:53:31 -05:00
terrytangyuan	1411d3f37f	TST: Added test for custom_objective function in cv	2015-10-04 22:45:10 -05:00
terrytangyuan	dfb89e3442	TST: Added test for show_stdv when using cv	2015-10-04 22:42:39 -05:00
terrytangyuan	0c360fe55f	TST: Added test for fpreproc	2015-10-04 22:30:45 -05:00
Tianqi Chen	3109069019	Merge pull request #525 from sinhrks/df_columns Python supports pd.DataFrame with non-str columns	2015-10-04 10:01:09 -07:00
sinhrks	dbcb4c8729	Support non-str column names	2015-10-04 13:30:01 +09:00
Tianqi Chen	2859c190cd	Merge pull request #522 from sinhrks/pandas python DMatrix now accepts pandas DataFrame	2015-10-02 10:19:14 -07:00
Tianqi Chen	9c39f69559	Merge pull request #524 from sinhrks/cv_pandas Python CV returns pd.DataFrame or np.ndarray	2015-10-02 10:18:13 -07:00
sinhrks	b958c55ac6	CV returns ndarray or DataFrame	2015-10-02 22:38:03 +09:00
sinhrks	b943becc61	python DMatrix now accepts pandas DataFrame	2015-10-01 22:52:32 +09:00
Tianqi Chen	db490d1c75	Merge pull request #503 from sinhrks/feature_types Python: Add feature_types to DMatrix	2015-09-29 14:14:48 -07:00
sinhrks	f6f3473d17	Change to properties	2015-09-28 22:36:39 +09:00
sinhrks	db692a30e5	Add feature_types	2015-09-28 22:25:35 +09:00
Tianqi Chen	b0591c8042	Merge pull request #514 from nerdcha/master Fix makefile typo	2015-09-21 15:05:20 -07:00
Jamie Hall	f5920f8cbd	Fix makefile typo	2015-09-22 07:18:15 +10:00
Tianqi Chen	05b242d542	Merge pull request #511 from nerdcha/master Use homebrew gcc if available	2015-09-20 17:18:38 -07:00
Jamie Hall	6c3e4d7d0d	Use homebrew gcc if available	2015-09-21 08:55:42 +10:00
Tianqi Chen	f28459497d	fix pylint in setup	2015-09-18 20:22:54 -07:00


2226 Commits
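
Commits 1ca737ed55 and e960a09ff4 above describe the nested layout of evals_result: evaluation-set name first, then metric name, then a list with one value per boosting round, stored as strings. The sketch below only illustrates how such a dict could be consumed. It does not call xgboost; the dict literal is copied from the example output in the 1ca737ed55 message, and the helpers to_floats and best_round are hypothetical names introduced here, not part of the library.

```python
# Nested dict copied from the example output quoted in commit 1ca737ed55:
# evals_result[eval_name][metric_name] -> list of per-round values (strings).
evals_result = {
    'train': {'logloss': ['0.684954', '0.676917', '0.669036'],
              'error': ['0.04652', '0.04652', '0.04652']},
    'eval': {'logloss': ['0.684877', '0.676767', '0.668817'],
             'error': ['0.042831', '0.042831', '0.042831']},
}


def to_floats(result):
    """Hypothetical helper: convert every per-round metric string to a float."""
    return {
        eval_name: {
            metric: [float(value) for value in values]
            for metric, values in metrics.items()
        }
        for eval_name, metrics in result.items()
    }


def best_round(values):
    """Hypothetical helper: 0-based index of the smallest value.

    Assumes lower is better, which holds for logloss and error.
    Ties resolve to the earliest round.
    """
    return min(range(len(values)), key=values.__getitem__)


history = to_floats(evals_result)
print(history['eval']['logloss'])
print(best_round(history['eval']['logloss']))
```

According to the e960a09ff4 message, the same access pattern should apply to the sklearn wrapper, except that the outer keys are generated automatically as 'validation_0', 'validation_1', and so on, rather than taken from a watchlist.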