sriramch fed665ae8a - training with external memory part 1 of 2 (#4486 )

* - training with external memory part 1 of 2
   - this pr focuses on computing the quantiles using multiple gpus on a
     dataset that uses the external cache capabilities
   - there will be a follow-up pr soon after this that will support creation
     of histogram indices on large datasets as well
   - both of these changes are required to support training with external memory
   - the sparse pages in dmatrix are taken in batches and the cut matrices
     are incrementally built
   - also snuck in some (perf) changes related to sketch aggregation amongst multiple
     features across multiple sparse page batches. instead of the summary being aggregated
     inside each device and merged later, it is aggregated in-place when the device
     is working on different rows but the same feature
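
The batch-wise idea described in this commit message can be illustrated with a small, single-process sketch. It is not the code from this PR: the function names (`merge_summary`, `cuts_from_summary`, `build_cuts`), the `max_size` pruning rule, and the plain list-of-rows batch layout are all assumptions made for illustration, and a simple rank-based pruning stands in for the weighted quantile sketch that the library itself uses. What it shows is one per-feature summary being updated in place as each batch of rows arrives, with the cut points derived only at the end.

```python
def merge_summary(summary, values, max_size):
    """Merge one batch of raw values into a sorted (value, weight) summary.

    Hypothetical helper: the summary is pruned to at most `max_size` entries
    by keeping the entries that cross evenly spaced cumulative-weight ranks.
    """
    merged = sorted(summary + [(v, 1) for v in values])
    collapsed = []
    for v, w in merged:
        if collapsed and collapsed[-1][0] == v:
            collapsed[-1] = (v, collapsed[-1][1] + w)  # fold duplicate values
        else:
            collapsed.append((v, w))
    if len(collapsed) <= max_size:
        return collapsed

    total = sum(w for _, w in collapsed)
    pruned, cum, acc, target = [], 0, 0, 1
    for v, w in collapsed:
        cum += w
        acc += w
        if cum * max_size >= target * total:
            pruned.append((v, acc))  # this entry absorbs the weight skipped so far
            acc = 0
            while cum * max_size >= target * total:
                target += 1
    if acc:  # defensive: keep any weight left after the last rank
        pruned.append((collapsed[-1][0], acc))
    return pruned


def cuts_from_summary(summary, n_bins):
    """Pick up to n_bins - 1 cut values at evenly spaced weight ranks."""
    total = sum(w for _, w in summary)
    cuts, cum, k = [], 0, 1
    for v, w in summary:
        cum += w
        while k < n_bins and cum * n_bins >= k * total:
            if not cuts or cuts[-1] != v:
                cuts.append(v)
            k += 1
    return cuts


def build_cuts(batches, n_features, n_bins=4, max_size=64):
    """Consume row batches one at a time, updating each feature's summary in place."""
    summaries = [[] for _ in range(n_features)]
    for batch in batches:  # each batch is a list of rows (lists of floats)
        for f in range(n_features):
            column = [row[f] for row in batch]
            summaries[f] = merge_summary(summaries[f], column, max_size)
    return [cuts_from_summary(s, n_bins) for s in summaries]


if __name__ == "__main__":
    batches = [[[1.0], [2.0], [3.0], [4.0]], [[5.0], [6.0], [7.0], [8.0]]]
    print(build_cuts(batches, n_features=1))
```

Because only the bounded summaries are kept between batches, memory use in this sketch depends on `max_size` and the number of features rather than on the total number of rows.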

2019-05-30 08:18:34 +12:00

.github

Enable auto-locking of issues closed long ago (#3821 )

2018-10-23 19:21:58 -07:00

amalgamation

Refactor fast-hist, add tests for some updaters. (#3836 )

2018-11-07 21:15:07 +13:00

cmake

[CI] Add Windows GPU to Jenkins CI pipeline (#4463 )

2019-05-14 04:45:06 +00:00

cub @ b20808b1b0

Update cub submodule again (fixes GPU build) (#2599 )

2017-08-13 22:14:40 +12:00

demo

Add native support for Dask (#4473 )

2019-05-27 13:29:28 +12:00

dev

[RFC] Version 0.90 release candidate (#4475 )

2019-05-20 01:02:44 -07:00

dmlc-core @ 3943914eed

[CI] Add Python and C++ tests for Windows GPU target (#4469 )

2019-05-16 01:06:46 +00:00

doc

Fix dask API sphinx docstrings (#4507 )

2019-05-28 16:39:26 +12:00

include/xgboost

De-duplicate GPU parameters. (#4454 )

2019-05-29 11:55:57 +08:00

jvm-packages

[jvm-packages] Add back reg:linear for scala. (#4490 )

2019-05-23 15:02:08 -07:00

make

Not use -msse2 on power or arm arch. close #2446 (#2475 )

2017-07-06 20:06:55 -04:00

plugin

Replaced std::vector with HostDeviceVector in MetaInfo and SparsePage. (#3446 )

2018-08-30 14:28:47 +12:00

python-package

Fix dask API sphinx docstrings (#4507 )

2019-05-28 16:39:26 +12:00

R-package

Add support for cross-validation using query ID (#4474 )

2019-05-23 10:45:02 -07:00

rabit @ a429748e24

[jvm-packages] allow partial evaluation of dataframe before prediction (#4407 )

2019-04-26 21:02:40 -07:00

src

- training with external memory part 1 of 2 (#4486 )

2019-05-30 08:18:34 +12:00

tests

- training with external memory part 1 of 2 (#4486 )

2019-05-30 08:18:34 +12:00

.clang-tidy

Fix clang-tidy warnings. (#4149 )

2019-03-13 02:25:51 +08:00

.editorconfig

Added configuration for python into .editorconfig (#3494 )

2018-07-23 00:24:10 -07:00

.gitignore

Add native support for Dask (#4473 )

2019-05-27 13:29:28 +12:00

.gitmodules

Upgrading to NCCL2 (#3404 )

2018-07-10 00:42:15 -07:00

.travis.yml

[CI] Refactor Jenkins CI pipeline + migrate all Linux tests to Jenkins (#4401 )

2019-04-26 18:39:12 -07:00

appveyor.yml

[CI] Fix Windows tests (#4403 )

2019-04-25 20:25:43 -07:00

CITATION

simplify software citation (#2912 )

2017-12-01 02:58:13 -08:00

CMakeLists.txt

[RFC] Version 0.90 release candidate (#4475 )

2019-05-20 01:02:44 -07:00

CONTRIBUTORS.md

Add support for cross-validation using query ID (#4474 )

2019-05-23 10:45:02 -07:00

Jenkinsfile

Add support for cross-validation using query ID (#4474 )

2019-05-23 10:45:02 -07:00

Jenkinsfile-win64

[CI] Add Python and C++ tests for Windows GPU target (#4469 )

2019-05-16 01:06:46 +00:00

LICENSE

Include full text of Apache 2.0 license (#3698 )

2018-09-12 20:46:55 -07:00

Makefile

[CI] Refactor Jenkins CI pipeline + migrate all Linux tests to Jenkins (#4401 )

2019-04-26 18:39:12 -07:00

NEWS.md

[RFC] Version 0.90 release candidate (#4475 )

2019-05-20 01:02:44 -07:00

README.md

Added travis logo (#4344 )

2019-04-08 21:20:15 -07:00

README.md

eXtreme Gradient Boosting

Community | Documentation | Resources | Contributors | Release Notes

XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. It implements machine learning algorithms under the Gradient Boosting framework. XGBoost provides parallel tree boosting (also known as GBDT or GBM) that solves many data science problems quickly and accurately. The same code runs on major distributed environments (Hadoop, SGE, MPI) and can solve problems with billions of examples and beyond.
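
As a rough illustration of what "gradient boosting" means here, the following is a minimal pure-Python sketch, not XGBoost's implementation or API. It assumes a single numeric feature, squared-error loss, and depth-1 trees (stumps); the names `fit_stump`, `boost`, and `predict` and the default `learning_rate` are hypothetical. Each round fits a stump to the current residuals (the negative gradient of the squared error) and adds a shrunken copy of it to the ensemble.

```python
def fit_stump(xs, residuals):
    """Return (threshold, left_value, right_value) minimizing squared error."""
    best = None
    for t in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= t]
        right = [r for x, r in zip(xs, residuals) if x > t]
        if not right:
            continue  # a split must leave rows on both sides
        lmean = sum(left) / len(left)
        rmean = sum(right) / len(right)
        err = sum((r - lmean) ** 2 for r in left) + sum((r - rmean) ** 2 for r in right)
        if best is None or err < best[0]:
            best = (err, t, lmean, rmean)
    if best is None:  # all x values identical: fall back to a constant
        mean = sum(residuals) / len(residuals)
        return (max(xs), mean, mean)
    return best[1:]


def boost(xs, ys, n_rounds=50, learning_rate=0.3):
    """Fit an additive ensemble of stumps; returns (base, learning_rate, stumps)."""
    base = sum(ys) / len(ys)
    preds = [base] * len(ys)
    stumps = []
    for _ in range(n_rounds):
        # For squared error, the negative gradient is simply the residual.
        residuals = [y - p for y, p in zip(ys, preds)]
        t, lv, rv = fit_stump(xs, residuals)
        stumps.append((t, lv, rv))
        preds = [p + learning_rate * (lv if x <= t else rv) for x, p in zip(xs, preds)]
    return base, learning_rate, stumps


def predict(model, x):
    """Sum the base score and the shrunken contribution of every stump."""
    base, learning_rate, stumps = model
    return base + sum(learning_rate * (lv if x <= t else rv) for t, lv, rv in stumps)


if __name__ == "__main__":
    model = boost([1.0, 2.0, 3.0, 4.0], [0.0, 0.0, 1.0, 1.0])
    print(round(predict(model, 1.0), 3), round(predict(model, 4.0), 3))
```

The library adds much more on top of this basic loop, such as regularized objectives, deeper trees, and parallel and distributed tree construction, none of which this sketch attempts to reproduce.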

License

Contribute to XGBoost

XGBoost has been developed and used by a group of active community members. Your help is very valuable in making the package better for everyone. Check out the Community Page.

Reference

Tianqi Chen and Carlos Guestrin. XGBoost: A Scalable Tree Boosting System. In 22nd SIGKDD Conference on Knowledge Discovery and Data Mining, 2016.
XGBoost originated from a research project at the University of Washington.