xgboost

Author	SHA1	Message	Date
ras44	54980b8959	Fix typo in xgboost_R.h (#4432 )	2019-05-02 19:18:34 +08:00
James Lamb	5e97de6a41	fixed typos in R package docs (#4345 ) * fixed typos in R package docs * updated verbosity parameter in xgb.train docs	2019-04-21 15:54:11 +08:00
Jiaming Yuan	207f058711	Refactor CMake scripts. (#4323 ) * Refactor CMake scripts. * Remove CMake CUDA wrapper. * Bump CMake version for CUDA. * Use CMake to handle Doxygen. * Split up CMakeList. * Export install target. * Use modern CMake. * Remove build.sh * Workaround for gpu_hist test. * Use cmake 3.12. * Revert machine.conf. * Move CLI test to gpu. * Small cleanup. * Support using XGBoost as submodule. * Fix windows * Fix cpp tests on Windows * Remove duplicated find_package.	2019-04-15 10:08:12 -07:00
Jean-Francois Zinque	956e73f183	Fix matrix attributes not sliced (#4311 )	2019-04-10 11:14:44 -07:00
Jiaxiang Li	1ca5698221	Make the train and test input with same colnames. (#4329 ) Fix the bug report of https://github.com/dmlc/xgboost/issues/4328. I am the beginner of the Git so just try my best to follows the guide, https://xgboost.readthedocs.io/en/latest/contribute.html#r-package. I find there is no `dev` branch, so I pull this fix from my master branch to the original master branch.	2019-04-04 15:59:27 -07:00
Jiaming Yuan	29a1356669	Deprecate `reg:linear' in favor of` reg:squarederror'. (#4267 ) * Deprecate `reg:linear' in favor of `reg:squarederror'. * Replace the use of `reg:linear'. * Replace the use of `silent`.	2019-03-17 17:55:04 +08:00
Tong He	259fb809e9	fix R-devel errors (#4251 )	2019-03-12 10:06:54 -07:00
Jiaming Yuan	617f572c0f	Update R contribute link. (#4236 )	2019-03-09 01:50:07 +08:00
Jiaming Yuan	2e618af743	Fix cpplint. (#4157 ) * Add comment after #endif. * Add missing headers.	2019-02-18 00:16:29 +08:00
Kodi Arfer	6a569b8cd9	Avoid generating NaNs in UnwoundPathSum (#3943 ) * Avoid generating NaNs in UnwoundPathSum. Kudos to Jakub Zakrzewski for tracking down the bug. * Add a test	2019-01-03 15:04:46 -08:00
Jiaming Yuan	e0a279114e	Unify logging facilities. (#3982 ) * Unify logging facilities. * Enhance `ConsoleLogger` to handle different verbosity. * Override macros from `dmlc`. * Don't use specialized gamma when building with GPU. * Remove verbosity cache in monitor. * Test monitor. * Deprecate `silent`. * Fix doc and messages. * Fix python test. * Fix silent tests.	2018-12-14 19:29:58 +08:00
Tong He	84a3af8dc0	Fix CRAN check warnings/notes (#3988 ) * fix * reorder declaration to match initialization	2018-12-12 08:23:20 -06:00
Bruno Tremblay	32de54fdee	Update R-package/R/xgb.ggplot.R (#3820 ) Changed width parameter of var important ggplot from 0.05 to 0.5 to make it more visible when displaying more variables.	2018-10-23 20:52:33 -07:00
Philip Hyunsu Cho	b38c636d05	Fix #3523 : Fix CustomGlobalRandomEngine for R (#3781 ) Symptom Apple Clang's implementation of `std::shuffle` expects doesn't work correctly when it is run with the random bit generator for R package: ```cpp CustomGlobalRandomEngine::result_type CustomGlobalRandomEngine::operator()() { return static_cast<result_type>( std::floor(unif_rand() * CustomGlobalRandomEngine::max())); } ``` Minimial reproduction of failure (compile using Apple Clang 10.0): ```cpp std::vector<int> feature_set(100); std::iota(feature_set.begin(), feature_set.end(), 0); // initialize with 0, 1, 2, 3, ..., 99 std::shuffle(feature_set.begin(), feature_set.end(), common::GlobalRandom()); // This returns 0, 1, 2, ..., 99, so content didn't get shuffled at all!!! ``` Note that this bug is platform-dependent; it does not appear when GCC or upstream LLVM Clang is used. Diagnosis Apple Clang's `std::shuffle` expects 32-bit integer inputs, whereas `CustomGlobalRandomEngine::operator()` produces 64-bit integers. Fix Have `CustomGlobalRandomEngine::operator()` produce 32-bit integers. Closes #3523.	2018-10-15 09:39:13 -07:00
Tong He	0b7fd74138	fix R check warning (#3728 )	2018-09-27 17:53:49 -07:00
Philip Hyunsu Cho	fbe9d41dd0	Disable flaky tests in R-package/tests/testthat/test_update.R (#3723 )	2018-09-26 14:21:41 -07:00
jakehoare	7707982a85	Amend xgb.createFolds to handle classes of a single element. (#3630 ) * Amend xgb.createFolds to handle classes of a single element. * Fix variable name	2018-09-12 09:23:05 -05:00
Vadim Khotilovich	ad3a0bbab8	Add the missing max_delta_step (#3668 ) * add max_delta_step to SplitEvaluator * test for max_delta_step * missing x2 factor for L1 term * remove gamma from ElasticNet	2018-09-12 08:43:41 -05:00
Philip Hyunsu Cho	c87153ed32	Fix CRAN check by removing reference to std::cerr (#3660 ) * Fix CRAN check by removing reference to std::cerr * Mask tests that fail on 32-bit Windows R	2018-09-05 11:44:00 -07:00
Andrew Thia	9254c58e4d	[TREE] add interaction constraints (#3466 ) * add interaction constraints * enable both interaction and monotonic constraints at the same time * fix lint * add R test, fix lint, update demo * Use dmlc::JSONReader to express interaction constraints as nested lists; Use sparse arrays for bookkeeping * Add Python test for interaction constraints * make R interaction constraints parameter based on feature index instead of column names, fix R coding style * Fix lint * Add BlueTea88 to CONTRIBUTORS.md * Short circuit when no constraint is specified; address review comments * Add tutorial for feature interaction constraints * allow interaction constraints to be passed as string, remove redundant column_names argument * Fix typo * Address review comments * Add comments to Python test	2018-09-04 09:35:39 -07:00
Vadim Khotilovich	5b662cbe1c	[R] R-interface for SHAP interactions (#3636 ) * add R-interface for SHAP interactions * update docs for new roxygen version	2018-08-30 19:06:21 -05:00
Jakob Richter	725f4c36f2	replace nround with nrounds to match actual parameter (#3592 )	2018-08-15 11:13:53 -07:00
Philip Hyunsu Cho	96826a3515	Release version 0.80 (#3541 ) * Up versions * Write release note for 0.80	2018-08-13 01:38:37 -07:00
Philip Hyunsu Cho	109473dae2	Fix #3545 : XGDMatrixCreateFromCSCEx silently discards empty trailing rows (#3553 ) * Fix #3545: XGDMatrixCreateFromCSCEx silently discards empty trailing rows Description: The bug is triggered when 1. The data matrix has empty rows at the bottom. More precisely, the rows `n-k+1`, `n-k+2`, ..., `n` of the matrix have missing values in all dimensions (`n` number of instances, `k` number of trailing rows) 2. The data matrix is given as Compressed Sparse Column (CSC) format. Diagnosis: When the CSC matrix is converted to Compressed Sparse Row (CSR) format (this is common format used for DMatrix), the trailing empty rows are silently ignored. More specifically, the row pointer (`offset`) of the newly created CSR matrix does not take account of these rows. Fix: Modify the row pointer. * Add regression test	2018-08-05 10:15:42 -07:00
Brandon Greenwell	b5fad42da2	Issue warning when requesting bivariate plotting (#3516 )	2018-07-27 16:15:37 -07:00
Henry Gouk	64b8cffde3	Refactor of FastHistMaker to allow for custom regularisation methods (#3335 ) * Refactor to allow for custom regularisation methods * Implement compositional SplitEvaluator framework * Fixed segfault when no monotone_constraints are supplied. * Change pid to parentID * test_monotone_constraints.py now passes * Refactor ColMaker and DistColMaker to use SplitEvaluator * Performance optimisation when no monotone_constraints specified * Fix linter messages * Fix a few more linter errors * Update the amalgamation * Add bounds check * Add check for leaf node * Fix linter error in param.h * Fix clang-tidy errors on CI * Fix incorrect function name * Fix clang-tidy error in updater_fast_hist.cc * Enable SSE2 for Win32 R MinGW Addresses https://github.com/dmlc/xgboost/pull/3335#issuecomment-400535752 * Add contributor	2018-06-28 07:37:25 +00:00
Tong He	e6696337e4	Fix CRAN check for lintr (#3372 ) * fix CRAN check * Update submodules dmlc-core and rabit * Add kintr to rmingw test	2018-06-18 12:53:52 -07:00
Ryota Suzuki	b7cbec4d4b	Fix print.xgb.Booster for R (#3338 ) * Fix print.xgb.Booster valid_handle should be TRUE when x$handle is NOT null * Update xgb.Booster.R Modify is.null.handle to return TRUE for NULL handle	2018-05-29 11:44:55 -07:00
Philip Hyunsu Cho	71e226120a	For CRAN submission, remove all #pragma's that suppress compiler warnings (#3329 ) * For CRAN submission, remove all #pragma's that suppress compiler warnings A few headers in dmlc-core contain #pragma's that disable compiler warnings, which is against the CRAN submission policy. Fix the problem by removing the offending #pragma's as part of the command `make Rbuild`. This addresses issue #3322. * Fix script to improve Cygwin/MSYS compatibility We need this to pass rmingw CI test * Remove remove_warning_suppression_pragma.sh from packaged tarball	2018-05-23 09:58:39 -07:00
Tong He	098075b81b	CRAN Submission for 0.71.1 (#3311 ) * fix for CRAN manual checks * fix for CRAN manual checks * pass local check * fix variable naming style * Adding Philip's record	2018-05-14 17:32:39 -07:00
Brandon Greenwell	d13f1a0f16	Fix typo (#3305 )	2018-05-09 10:18:36 -07:00
Thomas J. Leeper	c2b647f26e	fix typo in README (#3263 )	2018-04-22 09:24:38 -04:00
Philip Hyunsu Cho	230cb9b787	Release version 0.71 (#3200 )	2018-04-11 21:43:32 +09:00
Tong He	ace4016c36	Replace cBind by cbind (#3203 ) * modify test_helper.R * fix noLD * update desc * fix solaris test * fix desc * improve fix * fix url * change Matrix cBind to cbind * fix * fix error in demo * fix examples	2018-03-28 10:05:47 -07:00
Yuan (Terry) Tang	92782a8406	Change DESCRIPTION to more modern look (#3179 ) So other things can be added in comment field, such as ORCID.	2018-03-23 10:45:10 -04:00
Arjan van der Velde	04221a7469	rank_metric: add AUC-PR (#3172 ) * rank_metric: add AUC-PR Implementation of the AUC-PR calculation for weighted data, proposed by Keilwagen, Grosse and Grau (https://doi.org/10.1371/journal.pone.0092209) * rank_metric: fix lint warnings * Implement tests for AUC-PR and fix implementation * add aucpr to documentation for other languages	2018-03-23 10:43:47 -04:00
Vadim Khotilovich	706be4e5d4	Additional improvements for gblinear (#3134 ) * fix rebase conflict * [core] additional gblinear improvements * [R] callback for gblinear coefficients history * force eta=1 for gblinear python tests * add top_k to GreedyFeatureSelector * set eta=1 in shotgun test * [core] fix SparsePage processing in gblinear; col-wise multithreading in greedy updater * set sorted flag within TryInitColData * gblinear tests: use scale, add external memory test * fix multiclass for greedy updater * fix whitespace * fix typo	2018-03-13 01:27:13 -05:00
Vadim Khotilovich	9ffe8596f2	[core] fix slow predict-caching with many classes (#3109 ) * fix prediction caching inefficiency for multiclass * silence some warnings * redundant if * workaround for R v3.4.3 bug; fixes #3081	2018-02-15 18:31:42 -06:00
Tong He	98be9aef9a	A fix for CRAN submission of version 0.7-0 (#3061 ) * modify test_helper.R * fix noLD * update desc * fix solaris test * fix desc * improve fix * fix url	2018-01-27 17:06:28 -08:00
Vadim Khotilovich	526801cdb3	[R] fix for the 32 bit windows issue (#2994 ) * [R] disable thred_local for 32bit windows * [R] require C++11 and GNU make in DESCRIPTION * [R] enable 32+64 build and check in appveyor	2017-12-31 14:18:50 -08:00
Vadim Khotilovich	76f8f51438	[R] AppVeyor CI for R package (#2954 ) * [R] fix finding R.exe with cmake on WIN when it is in PATH * [R] appveyor config for R package * [R] wrap the lines to make R check happier * [R] install only binary dep-packages in appveyor * [R] for MSVC appveyor, also build a binary for R package and keep as an artifact	2017-12-17 16:37:45 -06:00
Vadim Khotilovich	e8a6597957	[R] maintenance Nov 2017; SHAP plots (#2888 ) * [R] fix predict contributions for data with no colnames * [R] add a render parameter for xgb.plot.multi.trees; fixes #2628 * [R] update Rd's * [R] remove unnecessary dep-package from R cmake install * silence type warnings; readability * [R] silence complaint about incomplete line at the end * [R] initial version of xgb.plot.shap() * [R] more work on xgb.plot.shap * [R] enforce black font in xgb.plot.tree; fixes #2640 * [R] if feature names are available, check in predict that they are the same; fixes #2857 * [R] cran check and lint fixes * remove tabs * [R] add references; a test for plot.shap	2017-12-05 09:45:34 -08:00
Scott Lundberg	78c4188cec	SHAP values for feature contributions (#2438 ) * SHAP values for feature contributions * Fix commenting error * New polynomial time SHAP value estimation algorithm * Update API to support SHAP values * Fix merge conflicts with updates in master * Correct submodule hashes * Fix variable sized stack allocation * Make lint happy * Add docs * Fix typo * Adjust tolerances * Remove unneeded def * Fixed cpp test setup * Updated R API and cleaned up * Fixed test typo	2017-10-12 12:35:51 -07:00
Vadim Khotilovich	74db9757b3	[R package] GPU support (#2732 ) * [R] MSVC compatibility * [GPU] allow seed in BernoulliRng up to size_t and scale to uint32_t * R package build with cmake and CUDA * R package CUDA build fixes and cleanups * always export the R package native initialization routine on windows * update the install instructions doc * fix lint * use static_cast directly to set BernoulliRng seed * [R] demo for GPU accelerated algorithm * tidy up the R package cmake stuff * R pack cmake: installs main dependency packages if needed * [R] version bump in DESCRIPTION * update NEWS * added short missing/sparse values explanations to FAQ	2017-09-28 18:15:28 -05:00
Bernie Gray	cd7659937b	[R] many minor changes to increase the robustness of the R code (#2404 ) * many minor changes to increase robustness of R code * fixing which mistake in xgb.model.dt.tree.R and a few cosmetics	2017-06-15 22:56:23 -05:00
Vadim Khotilovich	c82276386d	[R] xgb.importance: fix for multiclass gblinear, new 'trees' parameter (#2388 )	2017-06-07 13:13:21 -05:00
Michaël Benesty	8e2a1ff2bf	Improve setinfo documentation on R package (#2357 )	2017-05-30 20:08:31 +02:00
davidt0x	b29b7d1d76	Fixed loop bound in create.new.tree.features (#2328 ) for loop in create.new.tree.features was referencing length(trees) as the upper bound of the loop. trees is a base R dataset and not the model that the code is generating. Changed loop boundary to model$niter which should be the number of trees.	2017-05-30 17:50:33 +02:00
Vadim Khotilovich	da1629e848	[gbtree] fix update process to work with multiclass and multitree; fixes #2315 (#2332 )	2017-05-21 23:47:57 -05:00
Vadim Khotilovich	b52db87d5c	adding feature contributions to R and gblinear (#2295 ) * [gblinear] add features contribution prediction; fix DumpModel bug * [gbtree] minor changes to PredContrib * [R] add feature contribution prediction to R * [R] bump up version; update NEWS * [gblinear] fix the base_margin issue; fixes #1969 * [R] list of matrices as output of multiclass feature contributions * [gblinear] make order of DumpModel coefficients consistent: group index changes the fastest	2017-05-21 07:41:51 -04:00

... 3 4 5 6 7 ...

951 Commits