xgboost

Author	SHA1	Message	Date
Tianqi Chen	2fb19eb448	Add appveyor badge	2016-11-10 12:49:33 -08:00
Zhongxiao Ma	55bfc29942	keep builtin evaluations while using customized evaluation function (#1624 ) * keep builtin evaluations while using customized evaluation function * fix concat bytes to str	2016-11-10 12:40:48 -08:00
Morten Hustveit	8b9d9669bb	Have ConsoleLogger log to stderr instead of stdout (#1714 ) On Unix systems, it's common for programs to read their input from stdin, and write their output to stdout. Messages should be written to stderr, where they won't corrupt a program's output, and where they can be seen by the user even if the output is being redirected. This is mostly a problem when XGBoost is being used from Python or from another program.	2016-11-10 12:39:52 -08:00
wl2776	6b5a23ccd5	fix build in MSVC 2013 (#1757 )	2016-11-10 12:34:30 -08:00
RAMitchell	e3a7f85f15	GPU plug-in improvements + basic Windows continuous integration (#1752 ) * GPU Plugin: Reduce memory, improve performance, fix gcc compiler bug, add out of memory exceptions * Add basic Windows continuous integration for cmake VS2013, VS2015	2016-11-10 12:34:09 -08:00
joandre	91b75f9b41	Fix a small typo in GeneralParams class. Change customEval parameter name from "custom_obj" to "custom_eval". (#1741 )	2016-11-06 12:44:49 -05:00
Tony DiFranco	2ad0948444	Tweedie Regression Post-Rebase (#1737 ) * add support for tweedie regression * added back readme line that was accidentally deleted * fixed linting errors * add support for tweedie regression * added back readme line that was accidentally deleted * fixed linting errors * rebased with upstream master and added R example * changed parameter name to tweedie_variance_power * linting error fix * refactored tweedie-nloglik metric to be more like the other parameterized metrics * added upper and lower bound check to tweedie metric * add support for tweedie regression * added back readme line that was accidentally deleted * fixed linting errors * added upper and lower bound check to tweedie metric * added back readme line that was accidentally deleted * rebased with upstream master and added R example * rebased again on top of upstream master * linting error fix * added upper and lower bound check to tweedie metric * rebased with master * lint fix * removed whitespace at end of line 186 - elementwise_metric.cc	2016-11-05 17:02:32 -07:00
AbdealiJK	52b9867be5	Add docs for update_seq (#1735 ) * Fix typos and messages in docs * parameter.md: Add docs for updater_seq Mention the updater_seq parameter which sets the order of the tree updaters to run and also specifies which ones to run. This can be useful when pruning is not required or even a custom plugin is being built along with xgboost.	2016-11-04 16:07:29 -07:00
AbdealiJK	b94fcab4dc	Add dump_format=json option (#1726 ) * Add format to the params accepted by DumpModel Currently, only the text format is supported when trying to dump a model. The plan is to add more such formats like JSON which are easy to read and/or parse by machines. And to make the interface for this even more generic to allow other formats to be added. Hence, we make some modifications to make these functions generic and accept a new parameter "format" which signifies the format of the dump to be created. * Fix typos and errors in docs * plugin: Mention all the register macros available Document the register macros currently available to the plugin writers so they know what exactly can be extended using hooks. * sparce_page_source: Use same arg name in .h and .cc * gbm: Add JSON dump The dump_format argument can be used to specify what type of dump file should be created. Add functionality to dump gblinear and gbtree into a JSON file. The JSON file has an array, each item is a JSON object for the tree. For gblinear: - The item is the bias and weights vectors For gbtree: - The item is the root node. The root node has an attribute "children" which holds the children nodes. This happens recursively. * core.py: Add arg dump_format for get_dump()	2016-11-04 09:55:25 -07:00
Alireza Bagheri Garakani	9c693f0f5f	scale_pos_weight default value (#1712 ) Should say 1 (not 0)	2016-11-03 12:52:26 -07:00
David Lichtenberg	8156b71912	Typo in OSX installation instructions (#1718 ) The `cd ..;` in the one liner takes you up a directory instead of into the xgboost directory. This will cause that step of the installation to fail. It seems like you are meant to enter the xgboost directory as you did in the instructions for installing xgboost without openmp.	2016-11-03 12:52:16 -07:00
AbdealiJK	378eb7d7c8	Fix typos and messages in docs (#1723 )	2016-10-30 22:52:19 -07:00
Nan Zhu	6082184cd1	[jvm-packages] update API docs (#1713 ) * add back train method but mark as deprecated * fix scalastyle error * update java doc * update	2016-10-27 18:53:22 -07:00
Nan Zhu	d321375df5	[jvm-packages] Fix mis configure of nthread (#1709 ) * add back train method but mark as deprecated * fix scalastyle error * change class to object in examples * fix compilation error * fix mis configuration	2016-10-27 12:10:35 -04:00
Nan Zhu	f12074d355	[jvm-packages] release blog (#1706 )	2016-10-26 21:35:42 -04:00
Nan Zhu	f801c22710	[jvm-packages] change class to object in examples (#1703 ) * change class to object in examples * fix compilation error	2016-10-26 14:54:56 -04:00
Nan Zhu	016ab89484	[jvm-packages] Parameter tuning tool for XGBoost (#1664 )	2016-10-23 16:58:18 -04:00
RAMitchell	ac41845d4b	Add GPU accelerated tree construction plugin (#1679 )	2016-10-20 20:14:47 -07:00
Eric Liu	9b2e41340b	make DMatrix._init_from_npy2d only copy data when necessary (#1637 ) * make DMatrix._init_from_npy2d only copy data when necessary When creating DMatrix from a 2d ndarray, it can unnecessarily copy the input data. This can be problematic when the data is already very large--running out of memory. The copy is temporary (going out of scope at the end of this function) but it still adds to peak memory usage. ``numpy.array`` copies its input no matter what by default. By adding ``copy=False``, it will only do so when necessary. Since XGDMatrixCreateFromMat is readonly on the input buffer, this copy is not needed. Also added comments explaining when a copy can happen (if data ordering/layout is wrong or if type is not 32-bit float). * remove whitespace	2016-10-20 09:30:52 -07:00
Jan Gorecki	e79a803a30	simplify installation of R pkg devel version (#1653 )	2016-10-18 10:24:01 -07:00
Liam Huang	001d8c4023	correct CalcDCG in rank_metric.cc and rank_obj.cc (#1642 ) * correct CalcDCG in rank_metric.cc DCG use log base-2, however `std::log` returns log base-e. * correct CalcDCG in rank_obj.cc DCG use log base-2, however `std::log` returns log base-e. * use std::log2 instead of std::log make it more elegant * use std::log2 instead of std::log make it more elegant	2016-10-18 10:23:41 -07:00
ziguang1216	94a9e3222e	[python-package] Fix the issue #1439 (#1666 ) Fix 1439 Fix python_wrapper: when an eval set name contains '-', the early_stop maximize variable can't be set to True properly Change-Id: Ib0595afd4ae7b445a84c00a3a8faeccc506c6d13	2016-10-18 10:22:51 -07:00
EQGM	d3fc815b45	fix the problem that there is no libxgboost.dll (#1674 ) fix the problem that there is no libxgboost.dll built with Visual Studio.	2016-10-18 09:56:48 -07:00
saihttam	4b9d488387	Add option on OSX to use macports (#1675 )	2016-10-18 09:56:00 -07:00
Adam Pocock	445029bb82	[jvm-packages] XGBoost4j Windows fixes (#1639 ) * Changes for Mingw64 compilation to ensure long is a consistent size. Mainly impacts the Java API which would not compile, but there may be silent errors on Windows with large datasets before this patch (as long is 32-bits when compiled with mingw64 even in 64-bit mode). * Adding ifdefs to ensure it still compiles on MacOS * Makefile and create_jni.bat changes for Windows. * Switching XGDMatrixCreateFromCSREx JNI call to use size_t cast * Fixing lint error, adding profile switching to jvm-packages build to make create-jni.bat get called, adding myself to Contributors.Md	2016-10-18 08:35:25 -04:00
Jiading Gai	be90deb9b6	Fix a bug to handle Executable and Library with same name (xgboost) correctly. (#1669 ) add_library(libxgboost SHARED ${SOURCES}) builds a library named liblibxgboost.so; However, simply changing it to add_library(xgboost ...) won't work, as add_executable(xgboost ...) and add_library(xgboost ...) will then have the same target name. This patch correctly handles the same-name situation through SET_TARGET_PROPERTIES.	2016-10-15 18:29:40 -07:00
Nan Zhu	f5c776f64f	[jvm-packages] add apache maven repo url and bump up default spark version to 2.0.1 (#1650 ) * add apache maven repo url and bump up default spark version to 2.0.1	2016-10-13 08:55:03 -04:00
Nan Zhu	813a53882a	[jvm-packages] deprecate Flaky test (#1662 ) * deprecate flaky test	2016-10-13 07:21:24 -04:00
Yuan (Terry) Tang	63829d656c	Fix mknfold using new StratifiedKFold API (#1660 )	2016-10-12 14:43:37 -07:00
Nan Zhu	b56c6097d9	[jvm-packages] add Spark and XGBoost tutorial (#1649 ) * add back train method but mark as deprecated * add Spark and XGBoost tutorial * fix scalastyle error	2016-10-11 09:41:24 -07:00
Tianqi Chen	8a7a6dba71	Update .travis.yml	2016-10-09 20:37:57 -07:00
Jonathan Rahn	c8ae52f17a	add scikit-learn v0.18 compatibility (#1636 ) * add scikit-learn v0.18 compatibility import KFold & StratifiedKFold from sklearn.model_selection instead of sklearn.cross_validation * change DeprecationWarning to ImportError DeprecationWarning isn't an exception, so it should work the other way around.	2016-10-09 20:37:28 -07:00
Yuan (Terry) Tang	a64fd74421	Fix wrong expected feature types (#1646 )	2016-10-08 21:16:29 -07:00
Kirill Sevastyanenko	485b6c86cc	rm redundant lines in travis.yml (#1633 )	2016-10-08 10:48:58 -07:00
Vadim Khotilovich	f9648ac320	[R-package] store numeric attributes with higher precision (#1628 )	2016-10-03 11:01:17 -07:00
Nan Zhu	1673bcbe7e	[jvm-packages] separate classification and regression model and integrate with ML package (#1608 )	2016-09-30 11:49:03 -04:00
Shengwen Yang	3b9987ca9c	Fix the issue 1474 (#1615 ) * Fix 1474 * Fix crash issue when saving and loading poisson model * Rollback the wrong fix	2016-09-29 19:29:47 -07:00
Vadim Khotilovich	3efff6d052	fix for VX (#1614 )	2016-09-27 15:19:20 -07:00
Nan Zhu	37bc122c90	[jvm-packages] Robust dmatrix creation (#1613 ) * add back train method but mark as deprecated * robust matrix creation in jvm	2016-09-26 13:35:04 -04:00
phoenixbai	915ac0b8fe	the fix of missing value assignment for name_ variable in EvalRankList method (#1558 )	2016-09-26 08:57:17 -05:00
Vadim Khotilovich	693ddb860e	More robust DMatrix creation from a sparse matrix (#1606 ) * [CORE] DMatrix from sparse w/ explicit #col #row; safer arg types * [python-package] c-api change for _init_from_csr _init_from_csc * fix spaces * [R-package] adopt the new XGDMatrixCreateFromCSCEx interface * [CORE] redirect old sparse creators to new ones	2016-09-25 10:01:22 -07:00
Guido Tapia	e06f6a0df7	Update README.md - added windows binaries (#1600 ) Added a link to the nightly windows binaries hosted on Guido Tapia's (my) blog	2016-09-21 23:14:07 -07:00
Guido Tapia	b0bfddba72	Update build.md - added link to nightly windows binaries (#1601 ) Apologies for 2 PRs, was easier using githubs interface rather than doing it through git	2016-09-21 23:13:56 -07:00
chanis	62830be29d	[python-package] modify libpath.py and fix typos (#1594 ) * Update Makefile * Update Makefile * modify __init__.py * modified libpath.py and fixed typos	2016-09-21 10:12:19 -07:00
Vlad Sandulescu	9f8116416b	Added KDD Cup 2016 competition (#1596 ) merged thanks	2016-09-21 11:47:01 -04:00
reg.zhuce	3ee145b8dc	[jvm-packages] IndexOutOfBoundsException (#1589 ) ml.dmlc.xgboost4j.scala.spark.XGBoost.scala:51 values is empty when we meet it at first time, so values(0) throw an IndexOutOfBoundsException. It should be dVector.values(i) instead of values(i).	2016-09-20 09:13:47 -04:00
chanis	d8876b0b73	[python-package] modify __init__.py (#1587 ) * Update Makefile * Update Makefile * modify __init__.py	2016-09-19 09:43:36 -07:00
Manuel Schiller	d3c4d19c91	fix spelling mistake (#1584 )	2016-09-18 09:52:01 -07:00
Xin Yin	7245145712	[jvm-packages] Fixed the sanity check for parameter 'nthread' against 'spark.task.cpus'. (#1582 )	2016-09-16 11:31:35 -04:00
chanis	4041c39090	fix Makefile (#1579 ) * Update Makefile * Update Makefile	2016-09-15 10:44:49 -07:00


2885 Commits
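
Note on commit 001d8c4023 (correct CalcDCG): the message points out that DCG is defined with a base-2 logarithm, whereas `std::log` is the natural logarithm. The following is a minimal Python sketch of that difference, assuming the common DCG form with gain `2^rel - 1` and discount `log2(rank + 1)`. The function names `dcg` and `dcg_natural_log` are hypothetical and are not taken from the XGBoost sources.

```python
import math


def dcg(relevances):
    """DCG with the base-2 discount: sum of (2^rel - 1) / log2(i + 2).

    `i` is the zero-based position, so the first item is divided by log2(2) = 1.
    """
    return sum((2 ** rel - 1) / math.log2(i + 2)
               for i, rel in enumerate(relevances))


def dcg_natural_log(relevances):
    """Same sum, but with the natural log in the discount (the mistake described)."""
    return sum((2 ** rel - 1) / math.log(i + 2)
               for i, rel in enumerate(relevances))


if __name__ == "__main__":
    ranking = [3, 2, 0, 1]  # illustrative relevance labels, not real data
    print("base-2 DCG:     ", dcg(ranking))
    print("natural-log DCG:", dcg_natural_log(ranking))
```

Because log2(x) = ln(x) / ln(2), the natural-log variant equals the base-2 DCG divided by ln(2), so the two differ by a constant factor on every ranking. That factor cancels when DCG is divided by the ideal DCG, but the raw DCG value itself is affected.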