
Philip Hyunsu Cho 437b368b1f Update dmlc-core submodule (#3546)

This brings many goodies, including:

* Ability to specify delimiter and weight_column for CSV files:
```python
dtrain = xgboost.DMatrix('train.csv?format=csv&label_column=0&weight_column=1&delimiter= ')
```
* Ability to choose between 0-based and 1-based indexing for LIBSVM/LIBFM files:
```python
dtrain = xgboost.DMatrix('train.libsvm?indexing_mode=1')    # use 1-based indexing
dtest = xgboost.DMatrix('test.libsvm')                      # use 0-based indexing (default)
dtest2 = xgboost.DMatrix('test2.libsvm?indexing_mode=-1')  # use heuristic to detect 0-based / 1-based
```
* Fix a bug in float parsing (issue dmlc/dmlc-core#440)
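
The options above are passed as a query string appended to the data file path. As a rough sketch of how such URIs could be assembled programmatically, the helper below (`data_uri` is a hypothetical name, not part of XGBoost or dmlc-core) joins keyword arguments into that form. It assumes values are passed through as-is, without URL encoding, matching the examples in this commit message.

```python
def data_uri(path, **params):
    """Append parser options to a data file path as '?key=value&key=value'.

    Hypothetical helper for illustration; values are not URL-encoded.
    """
    if not params:
        return path
    query = "&".join(f"{key}={value}" for key, value in params.items())
    return f"{path}?{query}"


# Reproduce the URIs used in the examples above.
csv_uri = data_uri("train.csv", format="csv", label_column=0,
                   weight_column=1, delimiter=" ")
svm_uri = data_uri("train.libsvm", indexing_mode=1)
auto_uri = data_uri("test2.libsvm", indexing_mode=-1)

# With xgboost installed, the strings would be passed on unchanged, e.g.:
#   dtrain = xgboost.DMatrix(csv_uri)
```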

2018-08-01 15:15:40 -07:00

.github

[DOCS] Update link to readme (#3437)

2018-07-04 12:24:33 -07:00

amalgamation

Refactor of FastHistMaker to allow for custom regularisation methods (#3335)

2018-06-28 07:37:25 +00:00

cmake

Enable building with sanitizers. (#3525)

2018-07-31 17:25:47 +12:00

cub @ b20808b1b0

Update cub submodule again (fixes GPU build) (#2599)

2017-08-13 22:14:40 +12:00

demo

fix typo (#3188)

2018-03-21 19:24:29 -04:00

dmlc-core @ f2afdc7788

Update dmlc-core submodule (#3546)

2018-08-01 15:15:40 -07:00

doc

Enable building with sanitizers. (#3525)

2018-07-31 17:25:47 +12:00

include/xgboost

Add callback interface to re-direct console output (#3438)

2018-07-05 11:32:30 -07:00

jvm-packages

[jvm-packages] consider spark.task.cpus when controlling parallelism (#3530)

2018-07-31 06:19:45 -07:00

make

Not use -msse2 on power or arm arch. close #2446 (#2475)

2017-07-06 20:06:55 -04:00

plugin

Dmatrix refactor stage 1 (#3301)

2018-06-07 10:25:58 +12:00

python-package

Fix bug of using list(x) function when x is string (#3432)

2018-07-30 07:36:34 -07:00

R-package

Issue warning when requesting bivariate plotting (#3516)

2018-07-27 16:15:37 -07:00

rabit @ 87143deb4c

Fix CRAN check for lintr (#3372)

2018-06-18 12:53:52 -07:00

src

Fix typo in ElasticNet threshold function (#3527)

2018-07-30 14:08:14 +12:00

tests

Added finding quantiles on GPU. (#3393)

2018-07-27 14:03:16 +12:00

.clang-tidy

Fix model saving for 'count:possion': max_delta_step as Booster attribute (#3515)

2018-07-27 09:55:54 -07:00

.editorconfig

Added configuration for python into .editorconfig (#3494)

2018-07-23 00:24:10 -07:00

.gitignore

Improve .gitignore patterns (#3184)

2018-05-09 14:31:59 -07:00

.gitmodules

Upgrading to NCCL2 (#3404)

2018-07-10 00:42:15 -07:00

.travis.yml

Clang-tidy static analysis (#3222)

2018-04-19 18:57:13 +12:00

appveyor.yml

Dynamically allocate GPU histogram memory (#3519)

2018-07-28 21:22:41 +12:00

build.sh

Suggest git submodule update instead of delete + reclone (#3214)

2018-05-09 14:39:17 -07:00

CITATION

simplify software citation (#2912)

2017-12-01 02:58:13 -08:00

CMakeLists.txt

Enable building with sanitizers. (#3525)

2018-07-31 17:25:47 +12:00

CONTRIBUTORS.md

Add qid like ranklib format (#2749)

2018-06-30 20:24:03 +00:00

Jenkinsfile

Upgrade cuda version to 9.2 for CI workflows (#3460)

2018-07-08 23:04:51 -07:00

LICENSE

update year in LICENSE, conf.py and README.md files

2016-03-15 16:51:34 +03:00

Makefile

Add callback interface to re-direct console output (#3438)

2018-07-05 11:32:30 -07:00

NEWS.md

Document 0.72.1 version (#3458)

2018-07-08 15:42:09 -07:00

README.md

Update README.md

2018-07-04 13:09:32 -07:00

README.md

eXtreme Gradient Boosting

Community | Documentation | Resources | Contributors | Release Notes

XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. It implements machine learning algorithms under the Gradient Boosting framework. XGBoost provides parallel tree boosting (also known as GBDT or GBM) that solves many data science problems in a fast and accurate way. The same code runs on major distributed environments (Hadoop, SGE, MPI) and can solve problems beyond billions of examples.
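
For readers new to tree boosting, the idea can be sketched in a few lines of plain Python. This is a toy illustration only, not XGBoost's implementation: it assumes squared-error loss, one-dimensional inputs, and depth-1 trees (stumps), and it leaves out regularisation, second-order gradients, and any parallel or distributed construction. All function names are made up for the example.

```python
def fit_stump(xs, residuals):
    """Return (threshold, left_value, right_value) with the lowest squared error."""
    best = None
    for t in sorted(set(xs))[:-1]:  # candidate split: x <= t goes left
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        lv, rv = sum(left) / len(left), sum(right) / len(right)
        err = (sum((r - lv) ** 2 for r in left)
               + sum((r - rv) ** 2 for r in right))
        if best is None or err < best[0]:
            best = (err, t, lv, rv)
    return best[1:]


def predict(base, stumps, lr, x):
    """Base prediction plus the shrunken sum of all stump outputs."""
    return base + lr * sum(lv if x <= t else rv for t, lv, rv in stumps)


def boost(xs, ys, rounds=50, lr=0.3):
    """Repeatedly fit a stump to the residuals of the current ensemble."""
    base = sum(ys) / len(ys)
    stumps = []
    for _ in range(rounds):
        residuals = [y - predict(base, stumps, lr, x) for x, y in zip(xs, ys)]
        stumps.append(fit_stump(xs, residuals))
    return base, stumps


xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.0, 1.2, 0.9, 3.1, 2.9, 3.0]
base, stumps = boost(xs, ys)

# Training error of the mean-only baseline versus the boosted ensemble.
baseline_mse = sum((y - base) ** 2 for y in ys) / len(ys)
boosted_mse = sum((y - predict(base, stumps, 0.3, x)) ** 2
                  for x, y in zip(xs, ys)) / len(xs)
```

Each round fits a small tree to what the current ensemble still gets wrong and adds a shrunken copy of it, so the training error cannot increase from one round to the next.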

License

Contribute to XGBoost

XGBoost has been developed and used by a group of active community members. Your help is very valuable to make the package better for everyone. Check out the Community Page.

Reference

Tianqi Chen and Carlos Guestrin. XGBoost: A Scalable Tree Boosting System. In 22nd SIGKDD Conference on Knowledge Discovery and Data Mining, 2016.
XGBoost originates from a research project at the University of Washington.

Languages

C++ 45.5%

Python 20.3%

Cuda 15.2%

R 6.8%

Scala 6.4%

Other 5.6%