xgboost

Author	SHA1	Message	Date
fuhaoda	dd60fc23e6	Simplify INI-style config reader using C++11 STL (#4478 ) * simplify the config.h file * revise config.h * revised config.h * revise format * revise format issues * revise whitespace issues * revise whitespace namespace format issues * revise namespace format issues * format issues * format issues * format issues * format issues * Revert submodule changes * minor change * Update src/common/config.h Co-Authored-By: Philip Hyunsu Cho <chohyu01@cs.washington.edu> * address format issue from trivialfis * Use correct cub submodule	2019-05-30 11:57:56 -07:00
Bryan Woods	278562db13	Add support for cross-validation using query ID (#4474 ) * adding support for matrix slicing with query ID for cross-validation * hail mary test of unrar installation for windows tests * trying to modify tests to run in Github CI * Remove dependency on wget and unrar * Save error log from R test * Relax assertion in test_training * Use int instead of bool in C function interface * Revise R interface * Add XGDMatrixSliceDMatrixEx and keep old XGDMatrixSliceDMatrix for API compatibility	2019-05-23 10:45:02 -07:00
Jiaxiang Li	2e052e74b6	Update CONTRIBUTORS.md (#4335 )	2019-04-05 10:45:23 -07:00
Matthew Jones	92b7577c62	[REVIEW] Enable Multi-Node Multi-GPU functionality (#4095 ) * Initial commit to support multi-node multi-gpu xgboost using dask * Fixed NCCL initialization by not ignoring the opg parameter. - it now crashes on NCCL initialization, but at least we're attempting it properly * At the root node, perform a rabit::Allreduce to get initial sum_gradient across workers * Synchronizing in a couple of more places. - now the workers don't go down, but just hang - no more "wild" values of gradients - probably needs syncing in more places * Added another missing max-allreduce operation inside BuildHistLeftRight * Removed unnecessary collective operations. * Simplified rabit::Allreduce() sync of gradient sums. * Removed unnecessary rabit syncs around ncclAllReduce. - this improves performance _significantly_ (7x faster for overall training, 20x faster for xgboost proper) * pulling in latest xgboost * removing changes to updater_quantile_hist.cc * changing use_nccl_opg initialization, removing unnecessary if statements * added definition for opaque ncclUniqueId struct to properly encapsulate GetUniqueId * placing struct defintion in guard to avoid duplicate code errors * addressing linting errors * removing * removing additional arguments to AllReduer initialization * removing distributed flag * making comm init symmetric * removing distributed flag * changing ncclCommInit to support multiple modalities * fix indenting * updating ncclCommInitRank block with necessary group calls * fix indenting * adding print statement, and updating accessor in vector * improving print statement to end-line * generalizing nccl_rank construction using rabit * assume device_ordinals is the same for every node * test, assume device_ordinals is identical for all nodes * test, assume device_ordinals is unique for all nodes * changing names of offset variable to be more descriptive, editing indenting * wrapping ncclUniqueId GetUniqueId() and aesthetic changes * adding synchronization, and tests for distributed * adding to tests * fixing broken #endif * fixing initialization of gpu histograms, correcting errors in tests * adding to contributors list * adding distributed tests to jenkins * fixing bad path in distributed test * debugging * adding kubernetes for distributed tests * adding proper import for OrderedDict * adding urllib3==1.22 to address ordered_dict import error * added sleep to allow workers to save their models for comparison * adding name to GPU contributors under docs	2019-03-02 10:03:22 +13:00
Sam Wilkinson	a2dc929598	Update CONTRIBUTORS.md (#3999 )	2018-12-15 18:10:52 +08:00
Chen Qin	87f49995be	update rabit (#3835 )	2018-10-30 09:15:19 -07:00
Philip Hyunsu Cho	4302fc4027	Update committer list (#3788 ) * Update committer list * Update CONTRIBUTORS.md * Minor format fix	2018-10-14 23:41:03 -07:00
weitian	9504f411c1	[jvm-packages] For training data with group, empty RDD partition threw exception (#3749 ) (#3750 )	2018-10-09 09:03:22 -07:00
Andrew Thia	9254c58e4d	[TREE] add interaction constraints (#3466 ) * add interaction constraints * enable both interaction and monotonic constraints at the same time * fix lint * add R test, fix lint, update demo * Use dmlc::JSONReader to express interaction constraints as nested lists; Use sparse arrays for bookkeeping * Add Python test for interaction constraints * make R interaction constraints parameter based on feature index instead of column names, fix R coding style * Fix lint * Add BlueTea88 to CONTRIBUTORS.md * Short circuit when no constraint is specified; address review comments * Add tutorial for feature interaction constraints * allow interaction constraints to be passed as string, remove redundant column_names argument * Fix typo * Address review comments * Add comments to Python test	2018-09-04 09:35:39 -07:00
liuliang01	0cf88d036f	Add qid like ranklib format (#2749 ) * add qid for https://github.com/dmlc/xgboost/issues/2748 * change names * change spaces * change qid to bst_uint type * change qid type to size_t * change qid first to SIZE_MAX * change qid type from size_t to uint64_t * update dmlc-core * fix qids name error * fix group_ptr_ error * Style fix * Add qid handling logic to SparsePage * New MetaInfo format + backward compatibility fix Old MetaInfo format (1.0) doesn't contain qid field. We still want to be able to read from MetaInfo files saved in old format. Also, define a new format (2.0) that contains the qid field. This way, we can distinguish files that contain qid and those that do not. * Update MetaInfo test * Simply group assignment logic * Explicitly set qid=nullptr in NativeDataIter NativeDataIter's callback does not support qid field. Users of NativeDataIter will need to call setGroup() function separately to set group information. * Save qids_ in SaveBinary() * Upgrade dmlc-core submodule * Add a test for reading qid * Add contributor * Check the size of qids_ * Document qid format	2018-06-30 20:24:03 +00:00
pdesahb	12e34f32e2	Fix tweedie handling of base_score (#3295 ) * fix tweedie margin calculations * add entry to contributors	2018-06-28 15:43:05 +00:00
Henry Gouk	64b8cffde3	Refactor of FastHistMaker to allow for custom regularisation methods (#3335 ) * Refactor to allow for custom regularisation methods * Implement compositional SplitEvaluator framework * Fixed segfault when no monotone_constraints are supplied. * Change pid to parentID * test_monotone_constraints.py now passes * Refactor ColMaker and DistColMaker to use SplitEvaluator * Performance optimisation when no monotone_constraints specified * Fix linter messages * Fix a few more linter errors * Update the amalgamation * Add bounds check * Add check for leaf node * Fix linter error in param.h * Fix clang-tidy errors on CI * Fix incorrect function name * Fix clang-tidy error in updater_fast_hist.cc * Enable SSE2 for Win32 R MinGW Addresses https://github.com/dmlc/xgboost/pull/3335#issuecomment-400535752 * Add contributor	2018-06-28 07:37:25 +00:00
Andy Adinets	286dccb8e8	GPU binning and compression. (#3319 ) * GPU binning and compression. - binning and index compression are done inside the DeviceShard constructor - in case of a DMatrix with multiple row batches, it is first converted into a single row batch	2018-06-05 17:15:13 +12:00
Philip Hyunsu Cho	1214081f99	Release version 0.72 (#3337 )	2018-06-01 16:00:31 -07:00
Tong He	098075b81b	CRAN Submission for 0.71.1 (#3311 ) * fix for CRAN manual checks * fix for CRAN manual checks * pass local check * fix variable naming style * Adding Philip's record	2018-05-14 17:32:39 -07:00
Andrew Hannigan	5c9f0ff9d9	Check existance of seed/nthread keys before checking their value. (#2669 )	2017-09-27 03:05:59 -04:00
Nan Zhu	1190dc62a7	Update CONTRIBUTORS.md (#2719 )	2017-09-17 15:07:57 -07:00
Juang, Yi-Lin	812300bb7f	Update CONTRIBUTORS.md (#2350 )	2017-05-27 08:38:32 -07:00
gaw89	0f3a404d91	Sklearn kwargs (#2338 ) * Added kwargs support for Sklearn API * Updated NEWS and CONTRIBUTORS * Fixed CONTRIBUTORS.md * Added clarification of *kwargs and test for proper usage Fixed lint error * Fixed more lint errors and clf assigned but never used * Fixed more lint errors * Fixed more lint errors * Fixed issue with changes from different branch bleeding over * Fixed issue with changes from other branch bleeding over * Added note that kwargs may not be compatible with Sklearn * Fixed linting on kwargs note	2017-05-23 21:47:53 -05:00
Rory Mitchell	8ab5d4611c	[GPU-Plugin] (#2227 ) * Add fast histogram algorithm * Fix Linux build * Add 'gpu_id' parameter	2017-04-25 16:37:10 -07:00
Tianqi Chen	d281c6aafa	Update CONTRIBUTORS.md	2017-04-22 08:46:31 -07:00
AbdealiJK	6f16f0ef58	Use bst_float consistently throughout (#1824 ) * Fix various typos * Add override to functions that are overridden gcc gives warnings about functions that are being overridden by not being marked as oveirridden. This fixes it. * Use bst_float consistently Use bst_float for all the variables that involve weight, leaf value, gradient, hessian, gain, loss_chg, predictions, base_margin, feature values. In some cases, when due to additions and so on the value can take a larger value, double is used. This ensures that type conversions are minimal and reduces loss of precision.	2016-11-30 10:02:10 -08:00
Adam Pocock	445029bb82	[jvm-packages] XGBoost4j Windows fixes (#1639 ) * Changes for Mingw64 compilation to ensure long is a consistent size. Mainly impacts the Java API which would not compile, but there may be silent errors on Windows with large datasets before this patch (as long is 32-bits when compiled with mingw64 even in 64-bit mode). * Adding ifdefs to ensure it still compiles on MacOS * Makefile and create_jni.bat changes for Windows. * Switching XGDMatrixCreateFromCSREx JNI call to use size_t cast * Fixing lint error, adding profile switching to jvm-packages build to make create-jni.bat get called, adding myself to Contributors.Md	2016-10-18 08:35:25 -04:00
Baltazar Bieniek	7addebb2ea	Updated - fix merged (#1425 ) https://github.com/dmlc/xgboost/pull/1417	2016-08-02 14:46:45 -07:00
convexquad	313764b3be	Expose predictLeaf functionality in Scala XGBoostModel (#1351 )	2016-07-12 06:55:24 -04:00
Yuan (Terry) Tang	98d8a8b871	Added contributor	2016-01-12 09:25:32 -06:00
Yuan (Terry) Tang	d1439a10a8	Update CONTRIBUTORS.md	2016-01-10 12:16:02 -06:00
Sam Thomson	2e9e6c82f9	grammar/style fixes for "Introduction to Boosted Trees" docs	2015-11-17 13:26:33 -08:00
Far0n	8676a1bf56	Update CONTRIBUTORS.md	2015-11-02 21:27:05 +01:00
Yuan (Terry) Tang	1dcedb23ec	Update CONTRIBUTORS.md	2015-10-28 22:57:41 -04:00
Tianqi Chen	4b4ade8342	Update CONTRIBUTORS.md	2015-10-22 08:40:36 -07:00
Tianqi Chen	d4d36eed45	Merge pull request #528 from terrytangyuan/test More Unit Tests for Python Package	2015-10-22 08:39:32 -07:00
tqchen	eee3046624	[DOC] Add contributor	2015-10-20 19:44:06 -07:00
Johan Manders	67f3c687b8	Added Johan Manders to the list, asked by Tianqi Chen	2015-10-14 13:06:14 +02:00
terrytangyuan	9d627e2567	DOC: Updated contributors.md	2015-10-04 23:26:46 -05:00
Huayi Zhang	c49c6565e5	Add contributor	2015-09-18 10:35:41 +08:00
terrytangyuan	fbf2a5feed	DOC: Updated CONTRIBUTORS.md	2015-09-07 22:49:10 -04:00
phunterlau	db444c4a08	update with comments on PR #450 , fixed styles and updated CHANGES and CONTRIBUTORS	2015-08-20 10:10:34 -07:00
sinhrks	d24b36adf9	ENH: Add visualization to python package	2015-08-16 00:57:21 +09:00
tqchen	e8de5da3a5	Document refactor change badge	2015-08-02 19:01:38 -07:00
Tianqi Chen	2a01c5c865	Update CONTRIBUTORS.md	2015-07-30 22:26:10 -07:00
Tianqi Chen	80b6ec4478	update more contributor names	2015-07-21 21:31:39 -07:00
Tianqi Chen	9203d26a2f	Update CONTRIBUTORS.md	2015-07-21 08:13:07 -07:00
Tianqi Chen	4cf116ceb6	Update CONTRIBUTORS.md	2015-07-20 22:58:10 -07:00
Tianqi Chen	41f30c288e	Update CONTRIBUTORS.md	2015-07-20 22:56:29 -07:00
tqchen	d18492e751	add list of contributors	2015-07-20 22:48:45 -07:00

46 Commits