xgboost

Author	SHA1	Message	Date
Philip Hyunsu Cho	91cc14ea70	Add another contributor for rabit update	2018-11-04 10:29:21 -08:00
Philip Hyunsu Cho	78ec77fa97	Release 0.81 version (#3864 ) * Release 0.81 version * Update NEWS.md v0.81	2018-11-04 05:49:11 -08:00
Philip Hyunsu Cho	c22e90d5d2	Correct typo	2018-11-04 05:22:53 -08:00
Philip Hyunsu Cho	6da462234e	Move MinGW-w64 + Python section to the end, since it's 'advanced' (#3863 )	2018-11-04 05:12:27 -08:00
Philip Hyunsu Cho	a650131fc3	Update doc: colsample_bylevel now works for tree_method=hist (#3862 ) This feature was introduced by #3635	2018-11-04 02:25:25 -08:00
Philip Hyunsu Cho	91537e7353	Fix #3342 and h2oai/h2o4gpu#625 : Save predictor parameters in model file (#3856 ) * Fix #3342 and h2oai/h2o4gpu#625: Save predictor parameters in model file This allows pickled models to retain predictor attributes, such as 'predictor' (whether to use CPU or GPU) and 'n_gpu' (number of GPUs to use). Related: h2oai/h2o4gpu#625 Closes #3342. TODO. Write a test. * Fix lint * Do not load GPU predictor into CPU-only XGBoost * Add a test for pickling GPU predictors * Make sample data big enough to pass multi GPU test * Update test_gpu_predictor.cu	2018-11-03 21:45:38 -07:00
Philip Hyunsu Cho	e04ab56b57	Fix #3747 : Add coef_ and intercept_ as properties of sklearn wrapper (#3855 ) * Fix #3747: Add coef_ and intercept_ as properties of sklearn wrapper Scikit-learn expects linear learners to expose `coef_` and `intercept_` as properties. Closes #3747. * Fix lint	2018-11-02 01:44:37 -07:00
Philip Hyunsu Cho	ad68865d6b	[Blocking] Fix #3840 : Clean up logic for parsing tree_method parameter (#3849 ) * Clean up logic for converting tree_method to updater sequence * Use C++11 enum class for extra safety Compiler will give warnings if switch statements don't handle all possible values of C++11 enum class. Also allow enum class to be used as DMLC parameter. * Fix compiler error + lint * Address reviewer comment * Better docstring for DECLARE_FIELD_ENUM_CLASS * Fix lint * Add C++ test to see if tree_method is recognized * Fix clang-tidy error * Add test_learner.h to R package * Update comments * Fix lint error	2018-11-01 19:33:35 -07:00
Philip Hyunsu Cho	583c88bce7	[jvm-packages] Require vanilla Apache Spark (#3854 )	2018-11-01 19:15:40 -07:00
Philip Hyunsu Cho	2febc105a4	[jvm-packages] Fix JVM doc build (#3853 ) To get around of the bug https://issues.apache.org/jira/browse/SUREFIRE-1588, set useSystemClassLoader=false.	2018-11-01 15:16:08 -07:00
Jonathan Friedman	45d321da28	Fix typo in docs (#3852 ) Fix typo in docs	2018-11-01 13:03:59 -07:00
Philip Hyunsu Cho	411df9f878	Test wheels on CUDA 10.0 container for compatibility (#3838 )	2018-11-01 08:34:47 -07:00
Rory Mitchell	42200ec03e	Allow XGBRanker sklearn interface to use other xgboost ranking objectives (#3848 )	2018-11-01 13:34:25 +13:00
Chen Qin	87f49995be	update rabit (#3835 )	2018-10-30 09:15:19 -07:00
Zhao Hang	e3c1afac6b	Update parameter.rst (#3843 )	2018-10-31 00:19:45 +13:00
Matthew Tovbin	d81fedb955	[jvm-packages] RabitTracker for Scala: allow specifying host ip from the xgboost-tracker.properties file (#3833 )	2018-10-26 22:01:36 -07:00
Nan Zhu	5fbe230636	[jvm-packages] documenting tracker (#3831 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * fix scalastyle error * fix scalastyle error * documenting tracker * Make it a separate note	2018-10-25 18:53:46 -07:00
Philip Hyunsu Cho	d83c818000	Recommend pickling as the way to save XGBClassifier / XGBRegressor / XGBRanker (#3829 ) The `save_model()` and `load_model()` method only saves the part of the model that's common to all language interfaces and do not preserve Python-specific attributes, such as `feature_names`. More crucially, label encoder is not preserved either; this is needed for the scikit-learn wrapper, since you may have string labels. Fix: Explicitly recommend pickling as the way to save scikit-learn model objects.	2018-10-25 11:12:41 -07:00
Andy Adinets	2a59ff2f9b	Multi-GPU support in GPUPredictor. (#3738 ) * Multi-GPU support in GPUPredictor. - GPUPredictor is multi-GPU - removed DeviceMatrix, as it has been made obsolete by using HostDeviceVector in DMatrix * Replaced pointers with spans in GPUPredictor. * Added a multi-GPU predictor test. * Fix multi-gpu test. * Fix n_rows < n_gpus. * Reinitialize shards when GPUSet is changed. * Tests range of data. * Remove commented code. * Remove commented code.	2018-10-23 22:59:11 -07:00
Bruno Tremblay	32de54fdee	Update R-package/R/xgb.ggplot.R (#3820 ) Changed width parameter of var important ggplot from 0.05 to 0.5 to make it more visible when displaying more variables.	2018-10-23 20:52:33 -07:00
Philip Hyunsu Cho	02130af47d	Enable auto-locking of issues closed long ago (#3821 ) * Enable auto-locking of issues closed long ago Issues that were closed more than 90 days ago will be locked automatically so that no additional comments would be allowed. We will use a bot to do this: https://probot.github.io/apps/lock/ Background: As a maintainer, I often see people leaving comments to old issue posts that were closed long ago. Those comments are hard to discover and assist with, since they get buried under list of other active issues. With the change, users who want to follow up with an old issue would be asked to file a new issue. * Exempt `feature-request` from auto locking * Disable comment to avoid triggering notification	2018-10-23 19:21:58 -07:00
Nan Zhu	4ae225a08d	[Blocking][jvm-packages] fix the early stopping feature (#3808 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * fix scalastyle error * fix scalastyle error * temp * add method for classifier and regressor * update tutorial * address the comments * update	2018-10-23 14:53:13 -07:00
Philip Hyunsu Cho	e26b5d63b2	[jvm-packages] Upgrade Scala to 2.11.12 to address CVE-2017-15288 (#3816 ) A privilege escalation vulnerability (CVE-2017-15288) has been identified in the Scala compilation daemon. See https://nvd.nist.gov/vuln/detail/CVE-2017-15288 Fix: Upgrade Scala to 2.11.12.	2018-10-22 10:15:30 -07:00
Philip Hyunsu Cho	abf2f661be	Fix #3708 : Use dmlc::TemporaryDirectory to handle temporaries in cross-platform way (#3783 ) * Fix #3708: Use dmlc::TemporaryDirectory to handle temporaries in cross-platform way Also install git inside NVIDIA GPU container * Update dmlc-core	2018-10-18 10:16:04 -07:00
Philip Hyunsu Cho	55ee9a92a1	Fix Python environment for distributed unit tests (#3806 )	2018-10-18 00:12:02 -07:00
Philip Hyunsu Cho	b38c636d05	Fix #3523 : Fix CustomGlobalRandomEngine for R (#3781 ) Symptom Apple Clang's implementation of `std::shuffle` expects doesn't work correctly when it is run with the random bit generator for R package: ```cpp CustomGlobalRandomEngine::result_type CustomGlobalRandomEngine::operator()() { return static_cast<result_type>( std::floor(unif_rand() * CustomGlobalRandomEngine::max())); } ``` Minimial reproduction of failure (compile using Apple Clang 10.0): ```cpp std::vector<int> feature_set(100); std::iota(feature_set.begin(), feature_set.end(), 0); // initialize with 0, 1, 2, 3, ..., 99 std::shuffle(feature_set.begin(), feature_set.end(), common::GlobalRandom()); // This returns 0, 1, 2, ..., 99, so content didn't get shuffled at all!!! ``` Note that this bug is platform-dependent; it does not appear when GCC or upstream LLVM Clang is used. Diagnosis Apple Clang's `std::shuffle` expects 32-bit integer inputs, whereas `CustomGlobalRandomEngine::operator()` produces 64-bit integers. Fix Have `CustomGlobalRandomEngine::operator()` produce 32-bit integers. Closes #3523.	2018-10-15 09:39:13 -07:00
Philip Hyunsu Cho	4302fc4027	Update committer list (#3788 ) * Update committer list * Update CONTRIBUTORS.md * Minor format fix	2018-10-14 23:41:03 -07:00
Rory Mitchell	f00fd87b36	Address #2754 , accuracy issues with gpu_hist (#3793 ) * Address windows compilation error * Do not allow divide by zero in weight calculation * Update tests	2018-10-15 17:50:31 +13:00
trivialfis	516457fadc	Add basic unittests for gpu-hist method. (#3785 ) * Split building histogram into separated class. * Extract `InitCompressedRow` definition. * Basic tests for gpu-hist. * Document the code more verbosely. * Removed `HistCutUnit`. * Removed some duplicated copies in `GPUHistMaker`. * Implement LCG and use it in tests.	2018-10-15 15:47:00 +13:00
trivialfis	184efff9f9	Remove NoConstraint. (#3792 )	2018-10-15 15:43:06 +13:00
Rory Mitchell	5d6baed998	Allow sklearn grid search over parameters specified as kwargs (#3791 )	2018-10-14 12:44:53 +13:00
Juzer Shakir	1db28b8718	Typo fixed (#3784 ) The word 'make' was been repeated twice, fixed to single.	2018-10-10 10:23:27 -07:00
KOLANICH	5480e05173	Added some instructions on using MinGW-built XGBoost with python. (#3774 ) * Added some instructions on using MinGW-built XGBoost with python. * Changes according to the discussion and some additions * Fixed wording and removed redundancy. * Even more fixes * Fixed links. Removed redundancy. * Some fixes according to the discussion * fixes * Some fixes * fixes	2018-10-09 09:07:00 -07:00
weitian	9504f411c1	[jvm-packages] For training data with group, empty RDD partition threw exception (#3749 ) (#3750 )	2018-10-09 09:03:22 -07:00
Philip Hyunsu Cho	ca33bf6476	Document gblinear parameters: feature_selector and top_k (#3780 )	2018-10-08 22:41:54 -07:00
Philip Hyunsu Cho	133b8d94df	Fix Jenkins syntax (#3777 )	2018-10-08 14:56:42 -07:00
Philip Hyunsu Cho	11eaf3eed1	Retry Jenkins CI tests up to 3 times to improve reliability (redux) (#3776 )	2018-10-08 11:39:00 -07:00
Philip Hyunsu Cho	6d42e56c85	Retry Jenkins CI tests up to 3 times to improve reliability (redux) (#3775 )	2018-10-08 11:24:01 -07:00
Philip Hyunsu Cho	7a7269e983	Retry Jenkins CI tests up to 3 times to improve reliability (#3769 )	2018-10-08 09:55:39 -07:00
Philip Hyunsu Cho	ea99b53d8e	Document behavior of get_fscore() for zero-importance features (#3763 )	2018-10-08 01:52:25 -07:00
Philip Hyunsu Cho	10cd7c8447	Fix #3714 : preserve feature names when slicing DMatrix (#3766 ) * Fix #3714: preserve feature names when slicing DMatrix * Add test	2018-10-08 01:04:33 -07:00
Philip Hyunsu Cho	813d2436d3	Produce xgboost.so for XGBoost-R on Mac OSX, so that `make install` works (#3767 ) * Produce xgboost.so for XGBoost-R on Mac OSX, so that `make install` works * Modernize R build instructions * Fix crossref	2018-10-07 14:09:54 -07:00
Philip Hyunsu Cho	c23783a0d1	Add notes to doc (#3765 )	2018-10-07 14:09:09 -07:00
Philip Hyunsu Cho	91903ac5d4	Fix broken doc build due to Matplotlib 3.0 release (#3764 )	2018-10-07 13:34:37 -07:00
Philip Hyunsu Cho	ae7e58b96e	Test wheel compatibility on CPU containers, for all pull requests (#3762 ) * Test wheel compatibility on CPU containers, for all pull requests * Run wheel test only when multi-GPU flag is not set	2018-10-06 20:18:58 -07:00
Saumya Bhatnagar	e0fd60f4e5	[doc] Fix link in rank demo README.md . (#3759 )	2018-10-06 12:12:54 -07:00
trivialfis	4b892c2b30	Remove obsoleted QuantileHistMaker. (#3761 ) Fix #3755.	2018-10-06 11:22:15 -07:00
Nan Zhu	785094db53	[jvm-packages] fix issue when spark job execution thread cannot return before we execute first() (#3758 ) * add back train method but mark as deprecated * add back train method but mark as deprecated * add back train method but mark as deprecated * add back train method but mark as deprecated * fix scalastyle error * fix scalastyle error * fix scalastyle error * fix scalastyle error * sparjJobThread * update * fix issue when spark job execution thread cannot return before we execute first()	2018-10-05 22:20:50 -07:00
zengxy	9e73087324	[jvm-packages] support specified feature names when getModelDump and getFeatureScore (#3733 ) * [jvm-packages] support specified feature names for jvm when get ModelDump and get FeatureScore (#3725) * typo and style fix	2018-10-04 09:05:42 -07:00
Rory Mitchell	34522d56f0	Allow plug-ins to be built by cmake (#3752 ) * Remove references to AVX code. * Allow plugins to be built by cmake	2018-10-04 22:03:52 +13:00

1 2 3 4 5 ...

3506 Commits