xgboost

Author	SHA1	Message	Date
Jiaming Yuan	5384ed85c8	Use caching allocator from RMM, when RMM is enabled (#6131 )	2020-09-17 21:51:49 -07:00
neko	6bc9b9dc4f	Fix doc for CMake requirement. (#6123 )	2020-09-16 17:59:43 +08:00
Philip Hyunsu Cho	9e955fb9b0	[R] Check warnings explicitly for model compatibility tests (#6114 ) * [R] Check warnings explicitly for model compatibility tests * Address reviewer's feedback	2020-09-15 10:49:48 -07:00
Philip Hyunsu Cho	33577ef5d3	Add MAPE metric (#6119 )	2020-09-14 18:45:27 -07:00
Rory Mitchell	47350f6acb	Allow kwargs in dask predict (#6117 )	2020-09-15 13:04:03 +12:00
Jiaming Yuan	b5f52f0b1b	Validate weights are positive values. (#6115 )	2020-09-15 09:03:55 +08:00
Jiaming Yuan	c6f2b8c841	Upgrade gputreeshap. (#6099 ) * Upgrade gputreeshap. Co-authored-by: Rory Mitchell <r.a.mitchell.nz@gmail.com>	2020-09-15 12:57:22 +12:00
Vitalie Spinu	1453bee3e7	[R] Remove stringi dependency (#6109 ) * [R] Fix empty empty tests and a test warnings * [R] Remove stringi dependency (fix #5905) * Fix R lint check * [R] Fix automatic conversion to factor in R < 4.0.0 in xgb.model.dt.tree * Add `R` Makefile variable Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-09-12 13:18:08 -07:00
Jiaming Yuan	07945290a2	Remove unused RABIT targets. (#6110 ) * Remove rabit mock. * Remove rabit base.	2020-09-11 14:09:44 +08:00
Jiaming Yuan	c92d751ad1	Enable building rabit on Windows (#6105 )	2020-09-11 11:54:46 +08:00
Jiaming Yuan	08bdb2efc8	Fix dask doc. [skip ci] (#6108 )	2020-09-11 10:56:12 +08:00
Bobby Wang	00b0ad1293	[Doc] add doc for kill_spark_context_on_worker_failure parameter (#6097 ) * [Doc] add doc for kill_spark_context_on_worker_failure parameter * resolve comments	2020-09-09 21:28:44 -07:00
Philip Hyunsu Cho	d0ccb13d09	Work around a compiler bug in MacOS AppleClang 11 (#6103 ) * Workaround a compiler bug in MacOS AppleClang * [CI] Run C++ test with MacOS Catalina + AppleClang 11.0.3 * [CI] Migrate cmake_test on MacOS from Travis CI to GitHub Actions * Install OpenMP runtime * [CI] Use CMake to locate lz4 lib	2020-09-09 21:21:55 -07:00
Philip Hyunsu Cho	9338582d79	[CI] Fix CTest by running it in a correct directory (#6104 ) * [CI] Fix CTest by running it in a correct directory * [CI] Do not run dmlc-core unit tests with sanitizer	2020-09-09 10:31:09 -07:00
Jiaming Yuan	3dcd85fab5	Refactor rabit tests (#6096 ) * Merge rabit tests into XGBoost. * Run them On CI. * Simplification for CMake scripts.	2020-09-09 12:30:29 +08:00
Jiaming Yuan	318bffaa10	Fix custom obj link. [skip ci] (#6100 )	2020-09-09 10:55:38 +08:00
Jiaming Yuan	b0001a6e29	Correct style warnings from clang-tidy for rabit. (#6095 )	2020-09-08 12:13:58 +08:00
Hristo Iliev	da61d9460b	[jvm-packages] Add getNumFeature method (#6075 ) * Add getNumFeature to the Java API * Add getNumFeature to the Scala API * Add unit tests for getNumFeature Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-09-07 20:57:46 -07:00
Jiaming Yuan	93e9af43bb	Unify set index data. (#6062 )	2020-09-08 11:38:41 +08:00
Jiaming Yuan	e5d40b39cd	[Breaking] Don't save leaf child count in JSON. (#6094 ) The field is deprecated and not used anywhere in XGBoost.	2020-09-08 11:11:13 +08:00
Jiaming Yuan	5994f3b14c	Don't link imported target. (#6093 )	2020-09-07 02:51:09 -07:00
Philip Hyunsu Cho	974ba12f38	Fix CMake build with BUILD_STATIC_LIB option (#6090 ) * Fix CMake build with BUILD_STATIC_LIB option * Disable BUILD_STATIC_LIB option when R/JVM pkg is enabled * Add objxgboost to install target only when BUILD_STATIC_LIB=ON	2020-09-07 02:38:29 -07:00
Daniel Steinberg	68c55a37d9	Add cache name back to external_memory.py files. (#6088 )	2020-09-06 16:01:09 +08:00
Boris Feld	24ca9348f7	Fix typo in xgboost.callback.early_stop docstring (#6071 )	2020-09-06 13:37:07 +08:00
Rory Mitchell	2e907abdb8	Updates to GPUTreeShap (#6087 ) * Extract paths on device * Update GPUTreeShap	2020-09-06 13:39:08 +12:00
Bobby Wang	0e2d5669f6	[jvm-packages] cancel job instead of killing SparkContext (#6019 ) * cancel job instead of killing SparkContext This PR changes the default behavior that kills SparkContext. Instead, This PR cancels jobs when coming across task failed. That means the SparkContext is still alive even some exceptions happen. * add a parameter to control if killing SparkContext * cancel the jobs the failed task belongs to * remove the jobId from the map when one job failed. * resolve comments	2020-09-02 14:20:59 -07:00
Tong He	3912f3de06	Updates from 1.2.0 cran submission (#6077 ) * update for 1.2.0 cran submission * recover cmakelists * fix unittest from the shap PR * trigger CI	2020-09-02 20:50:23 +08:00
Philip Hyunsu Cho	9be969cc7a	Add release note for 1.2.0 in NEWS.md (#6063 ) * Update query_contributors.py to account for pagination * Add the release note for 1.2.0 * Add release note for patch releases * Apply suggestions from code review * Fix typo Co-authored-by: Jiaming Yuan <jm.yuan@outlook.com> Co-authored-by: John Zedlewski <904524+JohnZed@users.noreply.github.com>	2020-09-02 00:49:02 -07:00
Anthony D'Amato	ada964f16e	Clean the way deterministic paritioning is computed (#6033 ) We propose to only use the rowHashCode to compute the partitionKey, adding the FeatureValue hashCode does not bring more value and would make the computation slower. Even though a collision would appear at 0.2% with MurmurHash3 this is bearable for partitioning, this won't have any impact on the data balancing.	2020-08-30 14:38:23 -07:00
ShvetsKS	c1ca872d1e	Modin DF support (#6055 ) * Modin DF support * mode change * tests were added, ci env was extended * mode change * Remove redundant installation of modin * Add a pytest skip marker for modin * Install Modin[ray] from PyPI * fix interfering * avoid extra conversion * delete cv test for modin * revert cv function Co-authored-by: ShvetsKS <kirill.shvets@intel.com> Co-authored-by: Hyunsu Cho <chohyu01@cs.washington.edu>	2020-08-29 22:33:30 +03:00
FelixYBW	3a990433f9	set maxBins to 256. Align with c code in src/tree/param.h (#6066 )	2020-08-28 15:06:11 +03:00
Rory Mitchell	9bddecee05	Update GPUTreeShap (#6064 ) * Update GPUTreeShap * Update src/CMakeLists.txt Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu> Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-08-27 12:01:53 -07:00
Jiaming Yuan	2fcc4f2886	Unify evaluation functions. (#6037 )	2020-08-26 14:23:27 +08:00
Jiaming Yuan	80c8547147	Make binary bin search reusable. (#6058 ) * Move binary search row to hist util. * Remove dead code.	2020-08-26 05:05:11 +08:00
Philip Hyunsu Cho	9c14e430af	[CI] Improve JVM test in GitHub Actions (#5930 ) * [CI] Improve JVM test in GitHub Actions * Use env var for Wagon options [skip ci] * Move the retry flag to pom.xml [skip ci] * Export env var RABIT_MOCK to run Spark tests [skip ci] * Correct location of env var * Re-try up to 5 times [skip ci] * Don't run distributed training test on Windows * Fix typo * Update main.yml	2020-08-25 10:14:46 -07:00
Jiaming Yuan	81d8dd79ca	Bump header version. (#6056 )	2020-08-26 00:29:00 +08:00
Jiaming Yuan	20c95be625	Expand categorical node. (#6028 ) Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>	2020-08-25 18:53:57 +08:00
Rory Mitchell	9a4e8b1d81	GPUTreeShap (#6038 )	2020-08-25 12:47:41 +12:00
Philip Hyunsu Cho	b3193052b3	Bump version to 1.3.0 snapshot in master (#6052 )	2020-08-23 17:13:46 -07:00
Philip Hyunsu Cho	4729458a36	[jvm-packages] [doc] Update install doc for JVM packages (#6051 )	2020-08-23 14:14:53 -07:00
Philip Hyunsu Cho	cfced58c1c	[CI] Port CI fixes from the 1.2.0 branch (#6050 ) * Fix a unit test on CLI, to handle RC versions * [CI] Use mgpu machine to run gpu hist unit tests * [CI] Build GPU-enabled JAR artifact and deploy to xgboost-maven-repo	2020-08-22 23:24:46 -07:00
Jiaming Yuan	a144daf034	Limit tree depth for GPU hist. (#6045 )	2020-08-22 19:34:52 +08:00
Jiaming Yuan	b9ebbffc57	Fix plotting test. (#6040 ) Previously the test loads a model generated by `test_basic.py`, now we generate the model explicitly. * Cleanup saved files for basic tests.	2020-08-22 13:18:48 +08:00
Jiaming Yuan	7a46515d3d	Remove win2016 jvm github action test. (#6042 )	2020-08-20 19:39:46 -07:00
Jiaming Yuan	7be2e04bd4	Fix scikit learn cls doc. (#6041 )	2020-08-20 19:23:06 -07:00
Philip Hyunsu Cho	1fd29edf66	[CI] Migrate linters to GitHub Actions (#6035 ) * [CI] Move lint to GitHub Actions * [CI] Move Doxygen to GitHub Actions * [CI] Move Sphinx build test to GitHub Actions * [CI] Reduce workload for Windows R tests * [CI] Move clang-tidy to Build stage	2020-08-19 12:33:51 -07:00
ShvetsKS	24f2e6c97e	Optimize DMatrix build time. (#5877 ) Co-authored-by: SHVETS, KIRILL <kirill.shvets@intel.com>	2020-08-20 01:37:03 +08:00
Jiaming Yuan	29b7fea572	Optimize cpu sketch allreduce for sparse data. (#6009 ) * Bypass RABIT serialization reducer and use custom allgather based merging.	2020-08-19 10:03:45 +08:00
Jiaming Yuan	90355b4f00	Make JSON the default full serialization format. (#6027 )	2020-08-19 09:57:43 +08:00
Anthony D'Amato	f58e41bad8	Fix deterministic partitioning with dataset containing Double.NaN (#5996 ) The functions featureValueOfSparseVector or featureValueOfDenseVector could return a Float.NaN if the input vectore was containing any missing values. This would make fail the partition key computation and most of the vectors would end up in the same partition. We fix this by avoid returning a NaN and simply use the row HashCode in this case. We added a test to ensure that the repartition is indeed now uniform on input dataset containing values by checking that the partitions size variance is below a certain threshold. Signed-off-by: Anthony D'Amato <anthony.damato@hotmail.fr>	2020-08-18 18:55:37 -07:00

1 2 3 4 5 ...

5079 Commits