2014-02-08 13:00:49 -08:00
2014-02-08 11:24:35 -08:00
2014-02-07 22:38:26 -08:00
2014-02-06 16:06:18 -08:00
2014-02-06 09:28:03 -08:00
2014-02-07 22:43:13 -08:00
2014-02-08 13:00:49 -08:00

xgboost

Creater: Tianqi Chen: tianqi.tchen AT gmail

General Purpose Gradient Boosting Library

Goal: A stand-alone efficient library to do learning via boosting in functional space

Existing interesting features: (1) Sparse feature format, handling of missing features. This allows efficient categorical feature encoding as indicators. The speed of booster only depens on number of existing features. (2) Layout of gradient boosting algorithm to support generic tasks, see project wiki.

Planned key components:

(1) Gradient boosting models: - regression tree - linear model/lasso (2) Objectives to support tasks: - regression - classification - ranking - matrix factorization - structured prediction (3) OpenMP implementation(optional)

File extension convention: (1) .h are interface, utils and data structures, with detailed comment; (2) .cpp are implementations that will be compiled, with less comment; (3) .hpp are implementations that will be included by .cpp, with less comment

See also: https://github.com/tqchen/xgboost/wiki

Description
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Readme 33 MiB
Languages
C++ 45.5%
Python 20.3%
Cuda 15.2%
R 6.8%
Scala 6.4%
Other 5.6%