Jiaming Yuan
f2f7dd87b8
Use view for SparsePage exclusively. ( #6590 )
2021-01-11 18:04:55 +08:00
Igor Rukhovich
5c8ccf4455
Improved InitSampling function speed by 2.12 times ( #6410 )
...
* Improved InitSampling function speed by 2.12 times
* Added explicit conversion
2020-12-15 20:59:24 -08:00
ShvetsKS
956beead70
Thread local memory allocation for BuildHist ( #6358 )
...
* thread mem locality
* fix apply
* cleanup
* fix lint
* fix tests
* simple try
* fix
* fix
* apply comments
* fix comments
* fix
* apply simple comment
Co-authored-by: ShvetsKS <kirill.shvets@intel.com>
2020-11-25 17:50:12 +03:00
Jiaming Yuan
44a9d69efb
Small cleanup to evaluator. ( #6400 )
2020-11-20 09:33:51 +08:00
Jiaming Yuan
d711d648cb
Fix label errors in graph visualization ( #6369 )
2020-11-11 17:44:59 -08:00
Igor Moura
5e1e972aea
Clean up warnings ( #6325 )
2020-10-30 23:50:29 +08:00
Sergio Gavilán
b181a88f9f
Reduced some C++ compiler warnings ( #6197 )
...
* Removed some warnings
* Rebase with master
* Fixed C++ Google Test errors introduced by the refactoring done to remove warnings
* Undo renaming path -> path_
* Fix style check
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-10-29 12:36:00 -07:00
vcarpani
671971e12e
Compiler warnings ( #6286 )
...
* Fix warnings for json.h
* Fix warnings for metric.h
* Fix warnings for updater_quantile_hist.cc.
* Fix warnings for updater_histmaker.cc.
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-10-28 13:46:15 -07:00
Igor Moura
d1254808d5
Clean up C++ warnings ( #6213 )
2020-10-19 23:02:33 +08:00
Jiaming Yuan
bed7ae4083
Loop over thrust::reduce. ( #6229 )
...
* Check input chunk size of dqdm.
* Add doc for current limitation.
2020-10-14 10:40:56 +13:00
Jiaming Yuan
b5b24354b8
More categorical tests and disable shap sparse test. ( #6219 )
...
* Fix tree load with 32 category.
2020-10-10 16:12:37 +08:00
Jiaming Yuan
70ce5216b5
Add high level tests for categorical data. ( #6179 )
...
* Fix unique.
2020-10-09 09:27:23 +08:00
vcarpani
6bc9747df5
Reduce compile warnings ( #6198 )
...
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-10-08 23:14:59 +08:00
ShvetsKS
a4ce0eae43
CPU predict performance improvement ( #6127 )
...
Co-authored-by: ShvetsKS <kirill.shvets@intel.com>
2020-10-08 15:50:21 +03:00
Jiaming Yuan
444131a2e6
Add categorical data support to GPU Hist. ( #6164 )
2020-09-29 11:27:25 +08:00
Jiaming Yuan
7065779afa
Improve JSON format for categorical features. ( #6128 )
...
* Gather categories for all nodes.
2020-09-21 15:35:05 +08:00
Philip Hyunsu Cho
9e955fb9b0
[R] Check warnings explicitly for model compatibility tests ( #6114 )
...
* [R] Check warnings explicitly for model compatibility tests
* Address reviewer's feedback
2020-09-15 10:49:48 -07:00
Jiaming Yuan
e5d40b39cd
[Breaking] Don't save leaf child count in JSON. ( #6094 )
...
The field is deprecated and not used anywhere in XGBoost.
2020-09-08 11:11:13 +08:00
Jiaming Yuan
2fcc4f2886
Unify evaluation functions. ( #6037 )
2020-08-26 14:23:27 +08:00
Jiaming Yuan
80c8547147
Make binary bin search reusable. ( #6058 )
...
* Move binary search row to hist util.
* Remove dead code.
2020-08-26 05:05:11 +08:00
Jiaming Yuan
20c95be625
Expand categorical node. ( #6028 )
...
Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>
2020-08-25 18:53:57 +08:00
Jiaming Yuan
a144daf034
Limit tree depth for GPU hist. ( #6045 )
2020-08-22 19:34:52 +08:00
Qi Zhang
989ddd036f
Swap byte-order in binary serializer to support big-endian arch ( #5813 )
...
* fixed some endian issues
* Use dmlc::ByteSwap() to simplify code
* Fix lint check
* [CI] Add test for s390x
* Download latest CMake on s390x
* Fix a bug in my code
* Save magic number in dmatrix with byteswap on big-endian machine
* Save version in binary with byteswap on big-endian machine
* Load scalar with byteswap in MetaInfo
* Add a debugging message
* Handle arrays correctly when byteswapping
* EOF can also be 255
* Handle magic number in MetaInfo carefully
* Skip Tree.Load test for big-endian, since the test manually builds little-endian binary model
* Handle missing packages in Python tests
* Don't use boto3 in model compatibility tests
* Add s390 Docker file for local testing
* Add model compatibility tests
* Add R compatibility test
* Revert "Add R compatibility test"
This reverts commit c2d2bdcb7dbae133cbb927fcd20f7e83ee2b18a8.
Co-authored-by: Qi Zhang <q.zhang@ibm.com>
Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>
2020-08-18 14:47:17 -07:00
Jiaming Yuan
4d99c58a5f
Feature weights ( #5962 )
2020-08-18 19:55:41 +08:00
Jiaming Yuan
0b2a26fa74
Remove skmaker. ( #5971 )
2020-08-09 15:23:31 +08:00
boxdot
d268a2a463
Thread-safe prediction by making the prediction cache thread-local. ( #5853 )
...
Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com>
2020-07-30 12:33:50 +08:00
Jiaming Yuan
e4a273e1da
Fix evaluate root split. ( #5948 )
2020-07-29 19:33:29 +08:00
Philip Hyunsu Cho
ace7fd328b
[R] Add a compatibility layer to load Booster object from an old RDS file ( #5940 )
...
* [R] Add a compatibility layer to load Booster from an old RDS
* Modify QuantileHistMaker::LoadConfig() to be backward compatible with 1.1.x
* Add a big warning about compatibility in QuantileHistMaker::LoadConfig()
* Add testing suite
* Discourage use of saveRDS() in CRAN doc
2020-07-26 00:06:49 -07:00
Jiaming Yuan
a4de2f68e4
Use cudaOccupancyMaxPotentialBlockSize to calculate the block size. ( #5926 )
2020-07-23 14:24:42 +08:00
Philip Hyunsu Cho
4af857f95d
Add explicit template specialization for portability ( #5921 )
...
* Add explicit template specializations
* Adding Specialization for FileAdapterBatch
2020-07-22 12:31:17 -07:00
Andy Adinets
ac3f0e78dc
Split Features into Groups to Compute Histograms in Shared Memory ( #5795 )
2020-07-07 15:04:35 +12:00
Philip Hyunsu Cho
1d22a9be1c
Revert "Reorder includes. ( #5749 )" ( #5771 )
...
This reverts commit d3a0efbf162f3dceaaf684109e1178c150b32de3.
2020-06-09 10:29:28 -07:00
Jiaming Yuan
d3a0efbf16
Reorder includes. ( #5749 )
...
* Reorder includes.
* R.
2020-06-03 17:30:47 +12:00
ShvetsKS
cd3d14ad0e
Add float32 histogram ( #5624 )
...
* new single_precision_histogram param was added.
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com>
Co-authored-by: fis <jm.yuan@outlook.com>
2020-06-03 11:24:53 +08:00
Rory Mitchell
f779980f7e
gpu_hist performance tweaks ( #5707 )
...
* Remove device vectors
* Remove allreduce synchronize
* Remove double buffer
2020-05-29 16:48:53 +12:00
Andy Adinets
646def51e0
C++14 for xgboost ( #5664 )
2020-05-21 12:26:40 +12:00
ShvetsKS
dd01e4ba8d
Distributed optimizations for 'hist' method with CPUs ( #5557 )
...
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com>
2020-05-20 06:03:03 +03:00
Jiaming Yuan
535479e69f
Add JSON schema to model dump. ( #5660 )
2020-05-15 10:18:43 +08:00
Oleksandr Kuvshynov
4e64e2ef8e
Skip missing lookup if nothing is missing in CPU hist partition kernel. ( #5644 )
...
* [xgboost] skip missing lookup if nothing is missing
2020-05-12 05:50:08 +03:00
Rory Mitchell
fcf57823b6
Reduce device synchronisation ( #5631 )
...
* Reduce device synchronisation
* Initialise pinned memory
2020-05-07 21:19:46 +12:00
Jiaming Yuan
eaf2a00b5c
Enhance nvtx support. ( #5636 )
2020-05-06 22:54:24 +08:00
Rory Mitchell
b9649e7b8e
Refactor gpu_hist split evaluation ( #5610 )
...
* Refactor
* Rewrite evaluate splits
* Add more tests
2020-04-30 08:58:12 +12:00
Jiaming Yuan
c90457f489
Refactor the CLI. ( #5574 )
...
* Enable parameter validation.
* Enable JSON.
* Catch `dmlc::Error`.
* Show help message.
2020-04-26 10:56:33 +08:00
Andy Adinets
73142041b9
For histograms, opting into maximum shared memory available per block. ( #5491 )
2020-04-21 14:56:42 +12:00
Rory Mitchell
b2827a80e1
Use non-synchronising scan ( #5560 )
2020-04-20 15:51:34 +12:00
Rory Mitchell
d6d1035950
gpu_hist performance fixes ( #5558 )
...
* Remove unnecessary cuda API calls
* Fix histogram memory growth
2020-04-19 12:21:13 +12:00
Jiaming Yuan
c245eb8755
Fix R interaction constraints ( #5543 )
...
* Unify the parsing code.
* Cleanup.
2020-04-18 06:53:51 +08:00
ShvetsKS
a2d86b8e4b
Optimizations for RNG in InitData kernel ( #5522 )
...
* optimizations for subsampling in InitData
Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com>
2020-04-16 18:24:32 +03:00
Rory Mitchell
e268fb0093
Use thrust functions instead of custom functions ( #5544 )
2020-04-16 21:41:16 +12:00
Rory Mitchell
ca4e05660e
Purge device_helpers.cuh ( #5534 )
...
* Simplifications with caching_device_vector
* Purge device helpers
2020-04-15 21:51:56 +12:00