xgboost

Author	SHA1	Message	Date
Philip Hyunsu Cho	b40959042c	Document 0.72.1 version (#3458 )	2018-07-08 15:42:09 -07:00
kodonnell	6bed54ac39	python sklearn api: defaulting to best_ntree_limit if defined, otherwise current behaviour (#3445 ) * python sklearn api: defaulting to best_ntree_limit if defined, otherwise current behaviour * Fix whitespace	2018-07-08 14:35:52 -07:00
ngoyal2707	cb017d0c9a	[jvm-packages] removed old group_data from spark api (#3451 )	2018-07-07 22:21:01 -07:00
Nan Zhu	aa90e5c6ce	[jvm-packages] disable booster setup for xgboost4j-spark (#3456 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * disable booster setup in spark * check in parameter conversion * fix compilation issue * update exception type	2018-07-07 21:57:24 -07:00
Philip Hyunsu Cho	66e74d2223	Fix get_uint_info() (#3442 ) * Add regression test	2018-07-05 20:06:59 -07:00
Philip Hyunsu Cho	48d6e68690	Add callback interface to re-direct console output (#3438 ) * Add callback interface to re-direct console output * Exempt TrackerLogger from custom logging * Fix lint	2018-07-05 11:32:30 -07:00
Philip Hyunsu Cho	45bf4fbffb	Add a notice for binary PyPI wheel (#3443 )	2018-07-05 08:28:43 -07:00
Tianqi Chen	01aff45f26	Update README.md	2018-07-04 13:09:32 -07:00
Tianqi Chen	e62639c59b	[DOCS] Update link to readme (#3437 )	2018-07-04 12:24:33 -07:00
Yanbo Liang	aec6299c49	[jvm-packages] Expose nativeBooster for XGBoostClassificationModel and XGBoostRegressionModel. (#3428 )	2018-07-01 15:06:16 -07:00
Nikita Titov	295252249e	fixed MinGW missed dll (#3430 )	2018-07-01 16:43:33 +00:00
liuliang01	0cf88d036f	Add qid like ranklib format (#2749 ) * add qid for https://github.com/dmlc/xgboost/issues/2748 * change names * change spaces * change qid to bst_uint type * change qid type to size_t * change qid first to SIZE_MAX * change qid type from size_t to uint64_t * update dmlc-core * fix qids name error * fix group_ptr_ error * Style fix * Add qid handling logic to SparsePage * New MetaInfo format + backward compatibility fix Old MetaInfo format (1.0) doesn't contain qid field. We still want to be able to read from MetaInfo files saved in old format. Also, define a new format (2.0) that contains the qid field. This way, we can distinguish files that contain qid and those that do not. * Update MetaInfo test * Simply group assignment logic * Explicitly set qid=nullptr in NativeDataIter NativeDataIter's callback does not support qid field. Users of NativeDataIter will need to call setGroup() function separately to set group information. * Save qids_ in SaveBinary() * Upgrade dmlc-core submodule * Add a test for reading qid * Add contributor * Check the size of qids_ * Document qid format	2018-06-30 20:24:03 +00:00
Oliver Laslett	18813a26ab	allow arbitrary cross validation fold indices (#3353 ) * allow arbitrary cross validation fold indices - use training indices passed to `folds` parameter in `training.cv` - update doc string * add tests for arbitrary fold indices	2018-06-30 19:23:49 +00:00
Mike Liu	594bcea83e	Save and load model in sklearn API (#3192 ) * Add (load\|save)_model to XGBModel * Add docstring * Fix docstring * Fix mixed use of space and tab * Add a test * Fix Flake8 style errors	2018-06-30 19:21:49 +00:00
Rory Mitchell	24fde92660	Build universal wheels using GPU CI (#3424 )	2018-06-29 13:45:24 +00:00
Yun Ni	30d10ab035	Convert handle == nullptr from SegFault to user-friendly error. (#3021 ) * Convert SegFault to user-friendly error. * Apply the change to DMatrix API as well	2018-06-29 06:30:26 +00:00
cinqS	8bec8d5e9a	Better doc for save_model() / load_model() (#3143 ) Be clear that they do not save Python-specific attributes	2018-06-29 04:24:33 +00:00
pdesahb	12e34f32e2	Fix tweedie handling of base_score (#3295 ) * fix tweedie margin calculations * add entry to contributors	2018-06-28 15:43:05 +00:00
Henry Gouk	64b8cffde3	Refactor of FastHistMaker to allow for custom regularisation methods (#3335 ) * Refactor to allow for custom regularisation methods * Implement compositional SplitEvaluator framework * Fixed segfault when no monotone_constraints are supplied. * Change pid to parentID * test_monotone_constraints.py now passes * Refactor ColMaker and DistColMaker to use SplitEvaluator * Performance optimisation when no monotone_constraints specified * Fix linter messages * Fix a few more linter errors * Update the amalgamation * Add bounds check * Add check for leaf node * Fix linter error in param.h * Fix clang-tidy errors on CI * Fix incorrect function name * Fix clang-tidy error in updater_fast_hist.cc * Enable SSE2 for Win32 R MinGW Addresses https://github.com/dmlc/xgboost/pull/3335#issuecomment-400535752 * Add contributor	2018-06-28 07:37:25 +00:00
Philip Hyunsu Cho	cafc621914	Do not unzip google test archive if exists (#3416 )	2018-06-28 04:10:39 +00:00
Philip Hyunsu Cho	e2743548ed	Fix wget for google tests in tests (#3414 ) CI tests were failing because wget prompts "the user" for a response whenever the google test archive is already on the disk. Fix: Use `-nc` option to skip download when the archive already exists	2018-06-27 22:12:56 +00:00
Rory Mitchell	a0a1df1aba	Refactor python tests (#3410 ) * Add unit test utility * Refactor updater tests. Add coverage for histmaker.	2018-06-27 11:20:27 +12:00
Adam Johnston	0988fb191f	[jvm-packages] avoid use of Seq.apply in buildGroups (#3413 )	2018-06-26 16:00:28 -07:00
ngoyal2707	5cd851ccef	added code for instance based weighing for rank objectives (#3379 ) * added code for instance based weighing for rank objectives * Fix lint	2018-06-22 15:10:59 -07:00
Nan Zhu	d062c6f61b	[jvm-packages] Maven central release stuffs (#3401 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * maven central release	2018-06-22 06:41:28 -07:00
PSEUDOTENSOR / Jonathan McKinney	9ac163d0bb	Allow import via python datatable. (#3272 ) * Allow import via python datatable. * Write unit tests * Refactor dt API functions * Refactor python code * Lint fixes * Address review comments	2018-06-20 13:16:18 -07:00
James	eecf341ea7	[jvm-packages] Added latest version number example (#3374 ) * Added latest version number example * Added latest version number example	2018-06-18 22:09:39 -07:00
Thejaswi	0e78034607	Shared memory atomics while building histogram (#3384 ) * Use shared memory atomics for building histograms, whenever possible	2018-06-19 16:03:09 +12:00
Yanbo Liang	2c4359e914	[jvm-packages] XGBoost Spark integration refactor (#3387 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * [jvm-packages] XGBoost Spark integration refactor. (#3313) * XGBoost Spark integration refactor. * Make corresponding update for xgboost4j-example * Address comments. * [jvm-packages] Refactor XGBoost-Spark params to make it compatible with both XGBoost and Spark MLLib (#3326) * Refactor XGBoost-Spark params to make it compatible with both XGBoost and Spark MLLib * Fix extra space. * [jvm-packages] XGBoost Spark supports ranking with group data. (#3369) * XGBoost Spark supports ranking with group data. * Use Iterator.duplicate to prevent OOM. * Update CheckpointManagerSuite.scala * Resolve conflicts	2018-06-18 15:39:18 -07:00
Tong He	e6696337e4	Fix CRAN check for lintr (#3372 ) * fix CRAN check * Update submodules dmlc-core and rabit * Add kintr to rmingw test	2018-06-18 12:53:52 -07:00
Bruce Qu	578a0c7ddb	params confusion fixed (#3386 )	2018-06-15 13:17:35 -07:00
Gorkem Ozkaya	34e3edfb1a	Update index.md (#3228 )	2018-06-07 21:51:06 -07:00
ngoyal2707	902ecbade8	added python doc string for nthreads to dmatrix (#3363 )	2018-06-08 14:16:30 +12:00
Rory Mitchell	a96039141a	Dmatrix refactor stage 1 (#3301 ) * Use sparse page as singular CSR matrix representation * Simplify dmatrix methods * Reduce statefullness of batch iterators * BREAKING CHANGE: Remove prob_buffer_row parameter. Users are instead recommended to sample their dataset as a preprocessing step before using XGBoost.	2018-06-07 10:25:58 +12:00
Andy Adinets	286dccb8e8	GPU binning and compression. (#3319 ) * GPU binning and compression. - binning and index compression are done inside the DeviceShard constructor - in case of a DMatrix with multiple row batches, it is first converted into a single row batch	2018-06-05 17:15:13 +12:00
Rory Mitchell	3f7696ff53	Cleanup old artefacts in Jenkins (#3361 )	2018-06-05 15:16:37 +12:00
Philip Hyunsu Cho	bd01acdfbc	Save outputs in high precision in CLI prediction (#3356 ) Currently, `CLIPredict()` saves prediction results in default 6-digit precision which causes precision loss. This PR sets precision to a level so that the conversion back to `bst_float` is lossless. Related: #3298.	2018-06-03 14:15:47 -07:00
Nan Zhu	f66731181f	Update 0.8 version num (#3358 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * update 0.80	2018-06-02 07:06:01 -07:00
Philip Hyunsu Cho	1214081f99	Release version 0.72 (#3337 ) v0.72	2018-06-01 16:00:31 -07:00
Ryota Suzuki	b7cbec4d4b	Fix print.xgb.Booster for R (#3338 ) * Fix print.xgb.Booster valid_handle should be TRUE when x$handle is NOT null * Update xgb.Booster.R Modify is.null.handle to return TRUE for NULL handle	2018-05-29 11:44:55 -07:00
Kristian Gampong	a510e68dda	Add validate_features option for Booster predict (#3323 ) * Add validate_features option for Booster predict * Fix trailing whitespace in docstring	2018-05-29 11:40:49 -07:00
Yanbo Liang	b018ef104f	Remove output_margin from XGBClassifier.predict_proba argument list. (#3343 )	2018-05-28 10:30:21 -07:00
trivialfis	34aeee2961	Fix test_param.cc header path (#3317 )	2018-05-28 10:26:29 -07:00
Dave Challis	8efbadcde4	Point rabit submodule at latest commit from master. (#3330 )	2018-05-28 10:21:10 -07:00
pdavalo	480e3fd764	Sklearn: validation set weights (#2354 ) * Add option to use weights when evaluating metrics in validation sets * Add test for validation-set weights functionality * simplify case with no weights for test sets * fix lint issues	2018-05-23 17:06:20 -07:00
Philip Hyunsu Cho	71e226120a	For CRAN submission, remove all #pragma's that suppress compiler warnings (#3329 ) * For CRAN submission, remove all #pragma's that suppress compiler warnings A few headers in dmlc-core contain #pragma's that disable compiler warnings, which is against the CRAN submission policy. Fix the problem by removing the offending #pragma's as part of the command `make Rbuild`. This addresses issue #3322. * Fix script to improve Cygwin/MSYS compatibility We need this to pass rmingw CI test * Remove remove_warning_suppression_pragma.sh from packaged tarball	2018-05-23 09:58:39 -07:00
Thejaswi	d367e4fc6b	Fix for issue 3306. (#3324 )	2018-05-23 13:42:20 +12:00
Sergei Lebedev	8f6aadd4b7	[jvm-packages] Fixed CheckpointManagerSuite for Scala 2.10 (#3332 ) As before, the compilation error is caused by mixing positional and labelled arguments.	2018-05-19 18:28:11 -07:00
Rory Mitchell	3ee725e3bb	Add cuda forwards compatibility (#3316 )	2018-05-17 10:59:22 +12:00
Rory Mitchell	f8b7686719	Add cuda 8/9.1 centos 6 builds, test GPU wheel on CPU only container. (#3309 ) * Add cuda 8/9.1 centos 6 builds, test GPU wheel on CPU only container. * Add Google test	2018-05-17 10:57:01 +12:00

1 2 3 4 5 ...

3327 Commits