Go to file

Maurus Cuelenaere 6bd1869026 Add prediction of feature contributions (#2003 )

* Add prediction of feature contributions

This implements the idea described at http://blog.datadive.net/interpreting-random-forests/
which tries to give insight in how a prediction is composed of its feature contributions
and a bias.

* Support multi-class models

* Calculate learning_rate per-tree instead of using the one from the first tree

* Do not rely on node.base_weight * learning_rate having the same value as the node mean value (aka leaf value, if it were a leaf); instead calculate them (lazily) on-the-fly

* Add simple test for contributions feature

* Check against param.num_nodes instead of checking for non-zero length

* Loop over all roots instead of only the first

2017-05-14 00:58:10 -05:00

amalgamation

Histogram Optimized Tree Grower (#1940 )

2017-01-13 09:25:55 -08:00

demo

Update README.md (#2202 )

2017-04-17 15:28:37 -07:00

dmlc-core @ b5bec5481d

Remove xgboost's thread_local and switch to dmlc::ThreadLocalStore (#2121 )

2017-03-27 09:09:18 -07:00

doc

[R] maintenance Apr 2017 (#2237 )

2017-05-01 22:51:34 -07:00

include/xgboost

Add prediction of feature contributions (#2003 )

2017-05-14 00:58:10 -05:00

jvm-packages

Removed 'flink.suffix' and added 'flink.version' (#2277 )

2017-05-10 08:42:40 -07:00

make

config.mk: Set TEST_COVER to 0 by default (#1853 )

2016-12-11 19:48:15 +01:00

plugin

[GPU Plugin] Fast histogram speed improvements. Updated benchmarks. (#2258 )

2017-05-08 09:21:38 -07:00

python-package

Add prediction of feature contributions (#2003 )

2017-05-14 00:58:10 -05:00

R-package

Fix typo (#2264 )

2017-05-07 16:54:48 -07:00

rabit @ a764d45cfb

[UPDATE] Update rabit and threadlocal (#2114 )

2017-03-16 18:48:37 -07:00

src

Add prediction of feature contributions (#2003 )

2017-05-14 00:58:10 -05:00

tests

Add prediction of feature contributions (#2003 )

2017-05-14 00:58:10 -05:00

.gitignore

[jvm-packages] fix the persistence of XGBoostEstimator (#2265 )

2017-05-08 21:58:06 -07:00

.gitmodules

[REFACTOR] cleanup structure

2016-01-16 10:24:00 -08:00

.travis.yml

new thread local requires xcode8

2017-03-17 09:40:34 -07:00

appveyor.yml

GPU plug-in improvements + basic Windows continuous integration (#1752 )

2016-11-10 12:34:09 -08:00

build.sh

Minor fix on installation guide and (the probably deprecated) build script

2016-02-24 12:46:37 +08:00

CMakeLists.txt

[GPU Plugin] Fast histogram speed improvements. Updated benchmarks. (#2258 )

2017-05-08 09:21:38 -07:00

CONTRIBUTORS.md

[GPU-Plugin] (#2227 )

2017-04-25 16:37:10 -07:00

ISSUE_TEMPLATE.md

issue template (#1475 )

2016-08-17 22:50:37 -07:00

LICENSE

update year in LICENSE, conf.py and README.md files

2016-03-15 16:51:34 +03:00

Makefile

ENH more makefile updates (#2133 )

2017-03-22 16:22:15 -05:00

NEWS.md

[R] maintenance Apr 2017 (#2237 )

2017-05-01 22:51:34 -07:00

README.md

[GPU-Plugin] (#2227 )

2017-04-25 16:37:10 -07:00

README.md

eXtreme Gradient Boosting

Documentation | Resources | Installation | Release Notes | RoadMap

XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. It implements machine learning algorithms under the Gradient Boosting framework. XGBoost provides a parallel tree boosting (also known as GBDT, GBM) that solve many data science problems in a fast and accurate way. The same code runs on major distributed environment (Hadoop, SGE, MPI) and can solve problems beyond billions of examples.

What's New

Ask a Question

For reporting bugs please use the xgboost/issues page.
For generic questions or to share your experience using XGBoost please use the XGBoost User Group

Help to Make XGBoost Better

XGBoost has been developed and used by a group of active community members. Your help is very valuable to make the package better for everyone.

Check out call for contributions and Roadmap to see what can be improved, or open an issue if you want something.
Contribute to the documents and examples to share your experience with other users.
Add your stories and experience to Awesome XGBoost.
Please add your name to CONTRIBUTORS.md and after your patch has been merged.
- Please also update NEWS.md on changes and improvements in API and docs.

License

Reference

Tianqi Chen and Carlos Guestrin. XGBoost: A Scalable Tree Boosting System. In 22nd SIGKDD Conference on Knowledge Discovery and Data Mining, 2016
XGBoost originates from research project at University of Washington, see also the Project Page at UW.

Languages

C++ 45.5%

Python 20.3%

Cuda 15.2%

R 6.8%

Scala 6.4%

Other 5.6%