xgboost

Author	SHA1	Message	Date
Vadim Khotilovich	00eda28b3c	MinGW: shared library prefix and appveyor CI (#2539 ) * for MinGW, drop the 'lib' prefix from shared library name * fix defines for 'g++ 4.8 or higher' to include g++ >= 5 * fix compile warnings * [Appveyor] add MinGW with python; remove redundant jobs * [Appveyor] also do python build for one of msvc jobs	2017-07-25 01:06:47 -05:00
PSEUDOTENSOR / Jonathan McKinney	6b375f6ad8	Multi-threaded XGDMatrixCreateFromMat for faster DMatrix creation (#2530 ) * Multi-threaded XGDMatrixCreateFromMat for faster DMatrix creation from numpy arrays for python interface.	2017-07-21 14:43:17 +12:00
Maurus Cuelenaere	6bd1869026	Add prediction of feature contributions (#2003 ) * Add prediction of feature contributions This implements the idea described at http://blog.datadive.net/interpreting-random-forests/ which tries to give insight into how a prediction is composed of its feature contributions and a bias. * Support multi-class models * Calculate learning_rate per-tree instead of using the one from the first tree * Do not rely on node.base_weight * learning_rate having the same value as the node mean value (aka leaf value, if it were a leaf); instead calculate them (lazily) on-the-fly * Add simple test for contributions feature * Check against param.num_nodes instead of checking for non-zero length * Loop over all roots instead of only the first	2017-05-14 00:58:10 -05:00
Preston Parry	1ab8088a09	Removes extraneous log (#2186 ) This log appears to fire every time I ask the python package to make a prediction. It's the only log that fires from XGBoost. When we're getting predictions on millions of items a day in production, this log seems out of place.	2017-04-11 17:38:29 -07:00
Huffers	d45cf240a9	Remove xgboost's thread_local and switch to dmlc::ThreadLocalStore (#2121 ) * Remove xgboost's own version of thread_local and switch to dmlc::ThreadLocalStore (#2109) * Update dmlc-core	2017-03-27 09:09:18 -07:00
Tianqi Chen	d581a3d0e7	[UPDATE] Update rabit and threadlocal (#2114 ) * [UPDATE] Update rabit and threadlocal * minor fix to make build system happy * upgrade requirement to g++4.8 * upgrade dmlc-core * update travis	2017-03-16 18:48:37 -07:00
Oleg Sofrygin	9d19e13ed0	adding a copy of base_margin to slice, fixes a bug where base_margin was not copied during cross-validation (#2007 )	2017-03-16 10:36:57 -07:00
Tianqi Chen	fd19b7a188	Automatically remove nan from input data when it is sparse. (#2062 ) * [DATALoad] Automatically remove Nan when load from sparse matrix * add log	2017-02-25 08:59:17 -08:00
Simon DENEL	7078c41dad	Changing omp_get_num_threads to omp_get_max_threads (#1831 ) * Updating dmlc-core * Changing omp_get_num_threads to omp_get_max_threads	2016-12-04 11:26:45 -08:00
AbdealiJK	6f16f0ef58	Use bst_float consistently throughout (#1824 ) * Fix various typos * Add override to functions that are overridden gcc gives warnings about functions that are being overridden but not being marked as overridden. This fixes it. * Use bst_float consistently Use bst_float for all the variables that involve weight, leaf value, gradient, hessian, gain, loss_chg, predictions, base_margin, feature values. In some cases, when due to additions and so on the value can take a larger value, double is used. This ensures that type conversions are minimal and reduces loss of precision.	2016-11-30 10:02:10 -08:00
AbdealiJK	97371ff7e5	c_api.cc: Bring back silent argument (#1794 ) In ecb3a271bed151252fb048528ce5a90ad75bb68f the silent argument in XGDMatrixCreateFromFile of c_api.cc was always overridden to be false. This disabled the functionality to hide log messages. This commit reverts that part to enable the hiding of log messages.	2016-11-20 22:04:36 -08:00
wl2776	6b5a23ccd5	fix build in MSVC 2013 (#1757 )	2016-11-10 12:34:30 -08:00
AbdealiJK	b94fcab4dc	Add dump_format=json option (#1726 ) * Add format to the params accepted by DumpModel Currently, only the text format is supported when trying to dump a model. The plan is to add more such formats like JSON which are easy to read and/or parse by machines. And to make the interface for this even more generic to allow other formats to be added. Hence, we make some modifications to make these functions generic and accept a new parameter "format" which signifies the format of the dump to be created. * Fix typos and errors in docs * plugin: Mention all the register macros available Document the register macros currently available to the plugin writers so they know what exactly can be extended using hooks. * sparse_page_source: Use same arg name in .h and .cc * gbm: Add JSON dump The dump_format argument can be used to specify what type of dump file should be created. Add functionality to dump gblinear and gbtree into a JSON file. The JSON file has an array, each item is a JSON object for the tree. For gblinear: - The item is the bias and weights vectors For gbtree: - The item is the root node. The root node has an attribute "children" which holds the children nodes. This happens recursively. * core.py: Add arg dump_format for get_dump()	2016-11-04 09:55:25 -07:00
Vadim Khotilovich	3efff6d052	fix for VX (#1614 )	2016-09-27 15:19:20 -07:00
Vadim Khotilovich	693ddb860e	More robust DMatrix creation from a sparse matrix (#1606 ) * [CORE] DMatrix from sparse w/ explicit #col #row; safer arg types * [python-package] c-api change for _init_from_csr _init_from_csc * fix spaces * [R-package] adopt the new XGDMatrixCreateFromCSCEx interface * [CORE] redirect old sparse creators to new ones	2016-09-25 10:01:22 -07:00
Tianqi Chen	ecec5f7959	[CORE] Refactor cache mechanism (#1540 )	2016-09-02 20:39:07 -07:00
Tianqi Chen	df38f251be	Fix warnings from g++5 or higher (#1510 )	2016-08-26 16:14:10 -07:00
RAMitchell	93196eb811	cmake build system (#1314 ) * Changed c api to compile under MSVC * Include functional.h header for MSVC * Add cmake build	2016-07-02 19:07:35 -07:00
Vadim Khotilovich	9a48a40cf1	Fixes for multiple and default metric (#1239 ) * fix multiple evaluation metrics * create DefaultEvalMetric only when really necessary * py test for #1239 * make travis happy	2016-06-04 22:17:35 -07:00
Vadim Khotilovich	ea9285dd4f	methods to delete an attribute and get names of available attributes	2016-05-14 18:19:18 -05:00
Vadim Khotilovich	811c6ef58b	obey the lint	2016-04-26 22:11:19 -05:00
Vadim Khotilovich	0527b17c9d	avoid collecting duplicate parameters in Booster::cfg_	2016-04-25 22:08:53 -05:00
Wojciech Migda	6a5eb47789	XGBoosterCreate api unified to use const DMatrix[] argument	2016-03-26 19:42:58 +01:00
tqchen	86871d4be9	[JVM] Add Iterator loading API	2016-03-04 17:37:46 -08:00
tqchen	ecb3a271be	[PYTHON-DIST] Distributed xgboost python training API.	2016-02-29 16:54:13 -08:00
tqchen	4a16b729fc	[PYTHON] Simplify training logic, update rabit lib	2016-02-28 13:20:55 -08:00
tqchen	2dc6c2dc52	[R] enable R compile [R] Enable R build for windows and linux	2016-01-16 10:24:02 -08:00
tqchen	d75e3ed05d	[LIBXGBOOST] pass demo running.	2016-01-16 10:24:01 -08:00

28 Commits