xgboost

Author	SHA1	Message	Date
liuliang01	0cf88d036f	Add qid like ranklib format (#2749 ) * add qid for https://github.com/dmlc/xgboost/issues/2748 * change names * change spaces * change qid to bst_uint type * change qid type to size_t * change qid first to SIZE_MAX * change qid type from size_t to uint64_t * update dmlc-core * fix qids name error * fix group_ptr_ error * Style fix * Add qid handling logic to SparsePage * New MetaInfo format + backward compatibility fix Old MetaInfo format (1.0) doesn't contain qid field. We still want to be able to read from MetaInfo files saved in old format. Also, define a new format (2.0) that contains the qid field. This way, we can distinguish files that contain qid and those that do not. * Update MetaInfo test * Simply group assignment logic * Explicitly set qid=nullptr in NativeDataIter NativeDataIter's callback does not support qid field. Users of NativeDataIter will need to call setGroup() function separately to set group information. * Save qids_ in SaveBinary() * Upgrade dmlc-core submodule * Add a test for reading qid * Add contributor * Check the size of qids_ * Document qid format	2018-06-30 20:24:03 +00:00
Rory Mitchell	a96039141a	Dmatrix refactor stage 1 (#3301 ) * Use sparse page as singular CSR matrix representation * Simplify dmatrix methods * Reduce statefullness of batch iterators * BREAKING CHANGE: Remove prob_buffer_row parameter. Users are instead recommended to sample their dataset as a preprocessing step before using XGBoost.	2018-06-07 10:25:58 +12:00
Rory Mitchell	ccf80703ef	Clang-tidy static analysis (#3222 ) * Clang-tidy static analysis * Modernise checks * Google coding standard checks * Identifier renaming according to Google style	2018-04-19 18:57:13 +12:00
EvanChong	790da458e7	Sync number of features after loaded matrix in different workers. (#2722 )	2017-11-29 11:19:12 -08:00
Xiaoguang Sun	2ae56ca84f	Use int32_t explicitly when serializing version (#2389 ) Use int32_t explicitly when serializing version field of dmatrix in binary format. On ILP64 architectures, although very little, size of int is 64 bits.	2017-06-07 10:03:42 -07:00
AbdealiJK	b045ccd764	data.cc: Remove redundant ftype variable	2016-12-04 11:25:57 -08:00
AbdealiJK	6f16f0ef58	Use bst_float consistently throughout (#1824 ) * Fix various typos * Add override to functions that are overridden gcc gives warnings about functions that are being overridden by not being marked as oveirridden. This fixes it. * Use bst_float consistently Use bst_float for all the variables that involve weight, leaf value, gradient, hessian, gain, loss_chg, predictions, base_margin, feature values. In some cases, when due to additions and so on the value can take a larger value, double is used. This ensures that type conversions are minimal and reduces loss of precision.	2016-11-30 10:02:10 -08:00
yuanbowen	5898f1c59e	[DATA] fix instance weights loading	2016-05-23 18:40:41 +08:00
tqchen	ecb3a271be	[PYTHON-DIST] Distributed xgboost python training API.	2016-02-29 16:54:13 -08:00
tqchen	413f119c7e	Update dmlc-core	2016-02-10 13:11:21 -08:00
tqchen	63c4ad7617	[APPROX] Make global proposal default, add group ptr solution	2016-02-10 11:19:10 -08:00
tqchen	ce4d59ed69	[TREE] Enable global proposal for faster speed	2016-02-10 11:19:10 -08:00
Ubuntu	c36195795a	increase shard	2016-02-10 11:17:18 -08:00
Ubuntu	46be6181b5	[DIST] fix distirbuted setting	2016-02-10 11:17:18 -08:00
tqchen	b27b51f60e	[PLUGIN] Add densify parser	2016-02-10 11:17:18 -08:00
tqchen	634db18a0f	[TRAVIS] cleanup travis script	2016-01-16 10:25:12 -08:00
tqchen	fd173e260f	[FIX] change evaluation to more precision	2016-01-16 10:25:12 -08:00
tqchen	67fbf8d264	[TEST] add partial load option	2016-01-16 10:25:12 -08:00
tqchen	6de1c86d18	[LZ4] enable 16 bit index	2016-01-16 10:25:11 -08:00
tqchen	96f4542a67	[PLUGIN] Add plugin system	2016-01-16 10:25:11 -08:00
tqchen	36c389ac46	[DATA] Isolate the format of page file	2016-01-16 10:25:11 -08:00
tqchen	2dc6c2dc52	[R] enable R compile [R] Enable R build for windows and linux	2016-01-16 10:24:02 -08:00
tqchen	72347e2d45	[DATA] Make it fully compatible with rank	2016-01-16 10:24:01 -08:00
tqchen	ef1021e759	[IO] Enable external memory	2016-01-16 10:24:01 -08:00
tqchen	d75e3ed05d	[LIBXGBOOST] pass demo running.	2016-01-16 10:24:01 -08:00
tqchen	dedd87662b	[OBJ] Add basic objective function and registry	2016-01-16 10:24:01 -08:00
tqchen	46bcba7173	[DATA] basic data refactor done, basic version of csr source.	2016-01-16 10:24:00 -08:00
tqchen	3d708e4788	latest data	2016-01-16 10:24:00 -08:00

28 Commits