xgboost

Author	SHA1	Message	Date
Philip Hyunsu Cho	e4894111ba	Update dmlc-core submodule (#6745 )	2021-03-07 00:30:26 -08:00
Bobby Wang	49c22c23b4	[jvm-packages] fix early stopping doesn't work even without custom_eval setting (#6738 ) * [jvm-packages] fix early stopping doesn't work even without custom_eval setting * remove debug info * resolve comment	2021-03-06 20:19:40 -08:00
Philip Hyunsu Cho	5ae7f9944b	[CI] Clear R package cache (#6746 )	2021-03-06 08:37:16 -08:00
Jiaming Yuan	f20074e826	Check for invalid data. (#6742 )	2021-03-04 14:37:20 +08:00
Jiaming Yuan	a9b4a95225	Fix learning rate scheduler with cv. (#6720 ) * Expose more methods in cvpack and packed booster. * Fix cv context in deprecated callbacks. * Fix document.	2021-02-28 13:57:42 +08:00
kangsheng89	9c8523432a	fix relocatable include in CMakeList (#6734 ) (#6737 )	2021-02-27 19:17:29 +08:00
Roffild	1fa6793a4e	Tests for regression metrics with weights. (#6729 )	2021-02-25 22:08:14 +08:00
Jiaming Yuan	9da2287ab8	[breaking] Save booster feature info in JSON, remove feature name generation. (#6605 ) * Save feature info in booster in JSON model. * [breaking] Remove automatic feature name generation in `DMatrix`. This PR is to enable reliable feature validation in Python package.	2021-02-25 18:54:16 +08:00
capybara	b6167cd2ff	[dask] Use client to persist collections (#6722 ) Co-authored-by: fis <jm.yuan@outlook.com>	2021-02-25 16:40:38 +08:00
Louis Desreumaux	9b530e5697	Improve OpenMP exception handling (#6680 )	2021-02-25 13:56:16 +08:00
Jiaming Yuan	c375173dca	Support pylint 2.7.0 (#6726 )	2021-02-25 12:49:58 +08:00
Honza Sterba	17913713b5	[jvm] Add ability to load booster direct from byte array (#6655 ) * Add ability to load booster direct from byte array * fix compiler error * move InputStream to byte-buffer conversion - move it from Booster to XGBoost facade class	2021-02-23 11:28:27 -08:00
Jiaming Yuan	872e559b91	Use inplace predict for sklearn. (#6718 ) * Use inplace predict for sklearn when possible.	2021-02-22 12:27:04 +08:00
Benjamin Lehmann	25077564ab	Fixes small typo in sklearn documentation (#6717 ) Replaces "dowm" with "down" on parameter n_jobs	2021-02-20 07:36:06 +08:00
Jiaming Yuan	bdedaab8d1	Fix pylint. (#6714 )	2021-02-19 11:53:27 +08:00
ShvetsKS	9f15b9e322	Optimize CPU prediction (#6696 ) Co-authored-by: Shvets Kirill <kirill.shvets@intel.com>	2021-02-16 14:41:22 +08:00
James Lamb	dc97b5f19f	[dask] remove outdated comment (#6699 )	2021-02-15 18:49:11 +08:00
Roffild	4c5d2608e0	[python-package] Fix class Booster: feature_types = None (#6705 )	2021-02-13 17:50:23 +08:00
ShvetsKS	9a0399e898	Removed unnecessary PredictBatch calls (#6700 ) Co-authored-by: Shvets Kirill <kirill.shvets@intel.com>	2021-02-10 20:15:14 +08:00
Ali	9b267a435e	Bail out early if libxgboost exists in python setup (#6694 ) Skip `copy_tree` when existing build is found.	2021-02-10 10:50:10 +08:00
Jiaming Yuan	e8c5c53e2f	Use `Predictor` for `dart`. (#6693 ) * Use normal predictor for dart booster. * Implement `inplace_predict` for dart. * Enable `dart` for dask interface now that it's thread-safe. * categorical data should be working out of box for dart now. The implementation is not very efficient as it has to pull back the data and apply weight for each tree, but still a significant improvement over previous implementation as now we no longer binary search for each sample. * Fix output prediction shape on dataframe.	2021-02-09 23:30:19 +08:00
Jiaming Yuan	dbf7e9d3cb	Remove R cache in github action. (#6695 ) The cache stores outdated packages with wrong linkage. Right now there's no way to clear the cache.	2021-02-09 18:53:20 +08:00
Jiaming Yuan	1335db6113	[dask] Improve documents. (#6687 ) * Add tag for versions. * use autoclass in sphinx build. Made some class methods to be private to avoid exporting documents.	2021-02-09 09:20:58 +08:00
Jiaming Yuan	5d48d40d9a	Fix DMatrix slice with feature types. (#6689 )	2021-02-09 08:13:51 +08:00
Jiaming Yuan	218a5fb6dd	Simplify Span checks. (#6685 ) * Stop printing out message. * Remove R specialization. The printed message is not really useful anyway, without a reproducible example there's no way to fix it. But if there's a reproducible example, we can always obtain these information by a debugger. Removing the `printf` function avoids creating the context in kernel.	2021-02-09 08:12:58 +08:00
Jiaming Yuan	4656b09d5d	[breaking] Add prediction fucntion for DMatrix and use inplace predict for dask. (#6668 ) * Add a new API function for predicting on `DMatrix`. This function aligns with rest of the `XGBoosterPredictFrom` functions on semantic of function arguments. Purge `ntree_limit` from libxgboost, use iteration instead. * [dask] Use `inplace_predict` by default for dask sklearn models. * [dask] Run prediction shape inference on worker instead of client. The breaking change is in the Python sklearn `apply` function, I made it to be consistent with other prediction functions where `best_iteration` is used by default.	2021-02-08 18:26:32 +08:00
Jiaming Yuan	dbb5208a0a	Use __array_interface__ for creating DMatrix from CSR. (#6675 ) * Use __array_interface__ for creating DMatrix from CSR. * Add configuration.	2021-02-05 21:09:47 +08:00
Jiaming Yuan	1e949110da	Use generic dispatching routine for array interface. (#6672 )	2021-02-05 09:23:38 +08:00
Jiaming Yuan	a4101de678	Fix divide by 0 in feature importance when no split is found. (#6676 )	2021-02-05 03:39:30 +08:00
Jiaming Yuan	72892cc80d	[dask] Disable gblinear and dart. (#6665 )	2021-02-04 09:13:09 +08:00
Jiaming Yuan	9d62b14591	Fix document. [skip ci] (#6669 )	2021-02-02 20:43:31 +08:00
Jiaming Yuan	411592a347	Enhance inplace prediction. (#6653 ) * Accept array interface for csr and array. * Accept an optional proxy dmatrix for metainfo. This constructs an explicit `_ProxyDMatrix` type in Python. * Remove unused doc. * Add strict output.	2021-02-02 11:41:46 +08:00
Jiaming Yuan	87ab1ad607	[dask] Accept `Future` of model for prediction. (#6650 ) This PR changes predict and inplace_predict to accept a Future of model, to avoid sending models to workers repeatably. * Document is updated to reflect functionality additions in recent changes.	2021-02-02 08:45:52 +08:00
Jiaming Yuan	a9ec0ea6da	Align device id in predict transform with predictor. (#6662 )	2021-02-02 08:33:29 +08:00
Jiaming Yuan	d8ec7aad5a	[dask] Add a 1 line sample to infer output shape. (#6645 ) * [dask] Use a 1 line sample to infer output shape. This is for inferring shape with direct prediction (without DaskDMatrix). There are a few things that requires known output shape before carrying out actual prediction, including dask meta data, output dataframe columns. * Infer output shape based on local prediction. * Remove set param in predict function as it's not thread safe nor necessary as we now let dask to decide the parallelism. * Simplify prediction on `DaskDMatrix`.	2021-01-30 18:55:50 +08:00
Jiaming Yuan	c3c8e66fc9	Make prediction functions thread safe. (#6648 )	2021-01-28 23:29:43 +08:00
Philip Hyunsu Cho	0f2ed21a9d	[Breaking] Change default evaluation metric for binary:logitraw objective to logloss (#6647 )	2021-01-29 00:12:12 +09:00
Jiaming Yuan	d167892c7e	[dask] Ensure model can be pickled. (#6651 )	2021-01-28 21:47:57 +08:00
Philip Hyunsu Cho	0ad6e18a2a	[CI] Do not mix up stashed executable built for ARM and x86_64 platforms (#6646 )	2021-01-27 23:57:26 +09:00
Philip Hyunsu Cho	55ee2bd77f	[CI] Add ARM64 test to Jenkins pipeline (#6643 ) * Add ARM64 test to Jenkins pipeline * Check for bundled libgomp * Use a separate test suite for ARM64 * Ensure that x86 jobs don't run on ARM workers	2021-01-27 21:51:17 +09:00
Jiaming Yuan	1b70a323a7	Improve string view to reduce string allocation. (#6644 )	2021-01-27 19:08:52 +08:00
Jiaming Yuan	bc08e0c9d1	Remove `experimental_json_serialization` from tests. (#6640 )	2021-01-27 17:44:49 +08:00
Jiaming Yuan	8968ca7c0a	Disable s390x and arm64 tests on travis for now. (#6641 )	2021-01-27 16:21:40 +08:00
Jiaming Yuan	d19a0ddacf	Move sdist test to action. (#6635 ) * Move x86 linux and osx sdist test to action. * Add Windows.	2021-01-26 08:25:59 +08:00
Jiaming Yuan	740d042255	Add base_margin for evaluation dataset. (#6591 ) * Add base margin to evaluation datasets. * Unify the code base for evaluation matrices.	2021-01-26 02:11:02 +08:00
Jiaming Yuan	4bf23c2391	Specify shape in prediction contrib and interaction. (#6614 )	2021-01-26 02:08:22 +08:00
Jiaming Yuan	8942c98054	Define metainfo and other parameters for all DMatrix interfaces. (#6601 ) This PR ensures all DMatrix types have a common interface. * Fix logic in avoiding duplicated DMatrix in sklearn. * Check for consistency between DMatrix types. * Add doc for bounds.	2021-01-25 16:06:06 +08:00
Jiaming Yuan	561809200a	Fix document for tree methods. (#6633 )	2021-01-25 15:52:08 +08:00
Adam Pocock	fec66d033a	[jvm-packages] JVM library loader extensions (#6630 ) * [java] extending the library loader to use both OS and CPU architecture. * Simplifying create_jni.py's architecture detection. * Tidying up the architecture detection in create_jni.py	2021-01-25 15:51:39 +08:00
Jiaming Yuan	a275f40267	[dask] Rework base margin test. (#6627 )	2021-01-22 17:49:13 +08:00

1 2 3 4 5 ...

5344 Commits