xgboost

Author	SHA1	Message	Date
Philip Hyunsu Cho	45bf4fbffb	Add a notice for binary PyPI wheel (#3443 )	2018-07-05 08:28:43 -07:00
Tianqi Chen	01aff45f26	Update README.md	2018-07-04 13:09:32 -07:00
Tianqi Chen	e62639c59b	[DOCS] Update link to readme (#3437 )	2018-07-04 12:24:33 -07:00
Yanbo Liang	aec6299c49	[jvm-packages] Expose nativeBooster for XGBoostClassificationModel and XGBoostRegressionModel. (#3428 )	2018-07-01 15:06:16 -07:00
Nikita Titov	295252249e	fixed MinGW missed dll (#3430 )	2018-07-01 16:43:33 +00:00
liuliang01	0cf88d036f	Add qid like ranklib format (#2749 ) * add qid for https://github.com/dmlc/xgboost/issues/2748 * change names * change spaces * change qid to bst_uint type * change qid type to size_t * change qid first to SIZE_MAX * change qid type from size_t to uint64_t * update dmlc-core * fix qids name error * fix group_ptr_ error * Style fix * Add qid handling logic to SparsePage * New MetaInfo format + backward compatibility fix Old MetaInfo format (1.0) doesn't contain qid field. We still want to be able to read from MetaInfo files saved in old format. Also, define a new format (2.0) that contains the qid field. This way, we can distinguish files that contain qid and those that do not. * Update MetaInfo test * Simply group assignment logic * Explicitly set qid=nullptr in NativeDataIter NativeDataIter's callback does not support qid field. Users of NativeDataIter will need to call setGroup() function separately to set group information. * Save qids_ in SaveBinary() * Upgrade dmlc-core submodule * Add a test for reading qid * Add contributor * Check the size of qids_ * Document qid format	2018-06-30 20:24:03 +00:00
Oliver Laslett	18813a26ab	allow arbitrary cross validation fold indices (#3353 ) * allow arbitrary cross validation fold indices - use training indices passed to `folds` parameter in `training.cv` - update doc string * add tests for arbitrary fold indices	2018-06-30 19:23:49 +00:00
Mike Liu	594bcea83e	Save and load model in sklearn API (#3192 ) * Add (load\|save)_model to XGBModel * Add docstring * Fix docstring * Fix mixed use of space and tab * Add a test * Fix Flake8 style errors	2018-06-30 19:21:49 +00:00
Rory Mitchell	24fde92660	Build universal wheels using GPU CI (#3424 )	2018-06-29 13:45:24 +00:00
Yun Ni	30d10ab035	Convert handle == nullptr from SegFault to user-friendly error. (#3021 ) * Convert SegFault to user-friendly error. * Apply the change to DMatrix API as well	2018-06-29 06:30:26 +00:00
cinqS	8bec8d5e9a	Better doc for save_model() / load_model() (#3143 ) Be clear that they do not save Python-specific attributes	2018-06-29 04:24:33 +00:00
pdesahb	12e34f32e2	Fix tweedie handling of base_score (#3295 ) * fix tweedie margin calculations * add entry to contributors	2018-06-28 15:43:05 +00:00
Henry Gouk	64b8cffde3	Refactor of FastHistMaker to allow for custom regularisation methods (#3335 ) * Refactor to allow for custom regularisation methods * Implement compositional SplitEvaluator framework * Fixed segfault when no monotone_constraints are supplied. * Change pid to parentID * test_monotone_constraints.py now passes * Refactor ColMaker and DistColMaker to use SplitEvaluator * Performance optimisation when no monotone_constraints specified * Fix linter messages * Fix a few more linter errors * Update the amalgamation * Add bounds check * Add check for leaf node * Fix linter error in param.h * Fix clang-tidy errors on CI * Fix incorrect function name * Fix clang-tidy error in updater_fast_hist.cc * Enable SSE2 for Win32 R MinGW Addresses https://github.com/dmlc/xgboost/pull/3335#issuecomment-400535752 * Add contributor	2018-06-28 07:37:25 +00:00
Philip Hyunsu Cho	cafc621914	Do not unzip google test archive if exists (#3416 )	2018-06-28 04:10:39 +00:00
Philip Hyunsu Cho	e2743548ed	Fix wget for google tests in tests (#3414 ) CI tests were failing because wget prompts "the user" for a response whenever the google test archive is already on the disk. Fix: Use `-nc` option to skip download when the archive already exists	2018-06-27 22:12:56 +00:00
Rory Mitchell	a0a1df1aba	Refactor python tests (#3410 ) * Add unit test utility * Refactor updater tests. Add coverage for histmaker.	2018-06-27 11:20:27 +12:00
Adam Johnston	0988fb191f	[jvm-packages] avoid use of Seq.apply in buildGroups (#3413 )	2018-06-26 16:00:28 -07:00
ngoyal2707	5cd851ccef	added code for instance based weighing for rank objectives (#3379 ) * added code for instance based weighing for rank objectives * Fix lint	2018-06-22 15:10:59 -07:00
Nan Zhu	d062c6f61b	[jvm-packages] Maven central release stuffs (#3401 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * maven central release	2018-06-22 06:41:28 -07:00
PSEUDOTENSOR / Jonathan McKinney	9ac163d0bb	Allow import via python datatable. (#3272 ) * Allow import via python datatable. * Write unit tests * Refactor dt API functions * Refactor python code * Lint fixes * Address review comments	2018-06-20 13:16:18 -07:00
James	eecf341ea7	[jvm-packages] Added latest version number example (#3374 ) * Added latest version number example * Added latest version number example	2018-06-18 22:09:39 -07:00
Thejaswi	0e78034607	Shared memory atomics while building histogram (#3384 ) * Use shared memory atomics for building histograms, whenever possible	2018-06-19 16:03:09 +12:00
Yanbo Liang	2c4359e914	[jvm-packages] XGBoost Spark integration refactor (#3387 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * [jvm-packages] XGBoost Spark integration refactor. (#3313) * XGBoost Spark integration refactor. * Make corresponding update for xgboost4j-example * Address comments. * [jvm-packages] Refactor XGBoost-Spark params to make it compatible with both XGBoost and Spark MLLib (#3326) * Refactor XGBoost-Spark params to make it compatible with both XGBoost and Spark MLLib * Fix extra space. * [jvm-packages] XGBoost Spark supports ranking with group data. (#3369) * XGBoost Spark supports ranking with group data. * Use Iterator.duplicate to prevent OOM. * Update CheckpointManagerSuite.scala * Resolve conflicts	2018-06-18 15:39:18 -07:00
Tong He	e6696337e4	Fix CRAN check for lintr (#3372 ) * fix CRAN check * Update submodules dmlc-core and rabit * Add kintr to rmingw test	2018-06-18 12:53:52 -07:00
Bruce Qu	578a0c7ddb	params confusion fixed (#3386 )	2018-06-15 13:17:35 -07:00
Gorkem Ozkaya	34e3edfb1a	Update index.md (#3228 )	2018-06-07 21:51:06 -07:00
ngoyal2707	902ecbade8	added python doc string for nthreads to dmatrix (#3363 )	2018-06-08 14:16:30 +12:00
Rory Mitchell	a96039141a	Dmatrix refactor stage 1 (#3301 ) * Use sparse page as singular CSR matrix representation * Simplify dmatrix methods * Reduce statefullness of batch iterators * BREAKING CHANGE: Remove prob_buffer_row parameter. Users are instead recommended to sample their dataset as a preprocessing step before using XGBoost.	2018-06-07 10:25:58 +12:00
Andy Adinets	286dccb8e8	GPU binning and compression. (#3319 ) * GPU binning and compression. - binning and index compression are done inside the DeviceShard constructor - in case of a DMatrix with multiple row batches, it is first converted into a single row batch	2018-06-05 17:15:13 +12:00
Rory Mitchell	3f7696ff53	Cleanup old artefacts in Jenkins (#3361 )	2018-06-05 15:16:37 +12:00
Philip Hyunsu Cho	bd01acdfbc	Save outputs in high precision in CLI prediction (#3356 ) Currently, `CLIPredict()` saves prediction results in default 6-digit precision which causes precision loss. This PR sets precision to a level so that the conversion back to `bst_float` is lossless. Related: #3298.	2018-06-03 14:15:47 -07:00
Nan Zhu	f66731181f	Update 0.8 version num (#3358 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * update 0.80	2018-06-02 07:06:01 -07:00
Philip Hyunsu Cho	1214081f99	Release version 0.72 (#3337 ) v0.72	2018-06-01 16:00:31 -07:00
Ryota Suzuki	b7cbec4d4b	Fix print.xgb.Booster for R (#3338 ) * Fix print.xgb.Booster valid_handle should be TRUE when x$handle is NOT null * Update xgb.Booster.R Modify is.null.handle to return TRUE for NULL handle	2018-05-29 11:44:55 -07:00
Kristian Gampong	a510e68dda	Add validate_features option for Booster predict (#3323 ) * Add validate_features option for Booster predict * Fix trailing whitespace in docstring	2018-05-29 11:40:49 -07:00
Yanbo Liang	b018ef104f	Remove output_margin from XGBClassifier.predict_proba argument list. (#3343 )	2018-05-28 10:30:21 -07:00
trivialfis	34aeee2961	Fix test_param.cc header path (#3317 )	2018-05-28 10:26:29 -07:00
Dave Challis	8efbadcde4	Point rabit submodule at latest commit from master. (#3330 )	2018-05-28 10:21:10 -07:00
pdavalo	480e3fd764	Sklearn: validation set weights (#2354 ) * Add option to use weights when evaluating metrics in validation sets * Add test for validation-set weights functionality * simplify case with no weights for test sets * fix lint issues	2018-05-23 17:06:20 -07:00
Philip Hyunsu Cho	71e226120a	For CRAN submission, remove all #pragma's that suppress compiler warnings (#3329 ) * For CRAN submission, remove all #pragma's that suppress compiler warnings A few headers in dmlc-core contain #pragma's that disable compiler warnings, which is against the CRAN submission policy. Fix the problem by removing the offending #pragma's as part of the command `make Rbuild`. This addresses issue #3322. * Fix script to improve Cygwin/MSYS compatibility We need this to pass rmingw CI test * Remove remove_warning_suppression_pragma.sh from packaged tarball	2018-05-23 09:58:39 -07:00
Thejaswi	d367e4fc6b	Fix for issue 3306. (#3324 )	2018-05-23 13:42:20 +12:00
Sergei Lebedev	8f6aadd4b7	[jvm-packages] Fixed CheckpointManagerSuite for Scala 2.10 (#3332 ) As before, the compilation error is caused by mixing positional and labelled arguments.	2018-05-19 18:28:11 -07:00
Rory Mitchell	3ee725e3bb	Add cuda forwards compatibility (#3316 )	2018-05-17 10:59:22 +12:00
Rory Mitchell	f8b7686719	Add cuda 8/9.1 centos 6 builds, test GPU wheel on CPU only container. (#3309 ) * Add cuda 8/9.1 centos 6 builds, test GPU wheel on CPU only container. * Add Google test	2018-05-17 10:57:01 +12:00
Tong He	098075b81b	CRAN Submission for 0.71.1 (#3311 ) * fix for CRAN manual checks * fix for CRAN manual checks * pass local check * fix variable naming style * Adding Philip's record	2018-05-14 17:32:39 -07:00
Nan Zhu	49b9f39818	[jvm-packages] update xgboost4j cross build script to be compatible with older glibc (#3307 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * static glibc glibc++ * update to build with glib 2.12 * remove unsupported flags * update version number * remove properties * remove unnecessary command * update poms	2018-05-10 06:39:44 -07:00
Philip Hyunsu Cho	9a8211f668	Update dmlc-core submodule (#3221 ) * Update dmlc-core submodule * Fix dense_parser to work with the latest dmlc-core * Specify location of Google Test * Add more source files in dmlc-minimum to get latest dmlc-core working * Update dmlc-core submodule	2018-05-09 18:55:29 -07:00
mallniya	039dbe6aec	freebsd support in libpath.py (#3247 )	2018-05-09 16:13:30 -07:00
Clive Chan	0c0a78c255	Suggest git submodule update instead of delete + reclone (#3214 )	2018-05-09 14:39:17 -07:00
Will Storey	747381b520	Improve .gitignore patterns (#3184 ) * Adjust xgboost entries in .gitignore They were overly broad. In particularly this was inconvenient when working with tools such as fzf that use the .gitignore to decide what to include. As written, we'd not look into /include/xgboost. * Make cosmetic improvements to .gitignore * Remove dmlc-core from .gitignore This seems unnecessary and has the drawback that tools that use .gitignore to know files to skip mean they won't look here, and being able to inspect the submodule files with them is useful.	2018-05-09 14:31:59 -07:00

1 2 3 4 5 ...

3321 Commits