Nan Zhu
016ab89484
[jvm-packages] Parameter tuning tool for XGBoost ( #1664 )
2016-10-23 16:58:18 -04:00
Nan Zhu
813a53882a
[jvm-packages] deprecate Flaky test ( #1662 )
...
* deprecate flaky test
2016-10-13 07:21:24 -04:00
Nan Zhu
1673bcbe7e
[jvm-packages] separate classification and regression model and integrate with ML package ( #1608 )
2016-09-30 11:49:03 -04:00
Nan Zhu
4ad648e856
[jvm-packages] predictLeaf with Dataframe ( #1576 )
...
* add back train method but mark as deprecated
* predictLeaf with Dataset
* fix
* fix
2016-09-15 06:15:47 -04:00
Nan Zhu
bb388cbb31
default eval func ( #1574 )
2016-09-14 13:26:16 -04:00
Nan Zhu
fb02797e2a
[jvm-packages] Integration with Spark Dataframe/Dataset ( #1559 )
...
* bump up to scala 2.11
* framework of data frame integration
* test consistency between RDD and DataFrame
* order preservation
* test order preservation
* example code and fix makefile
* improve type checking
* improve APIs
* user docs
* work around travis CI's limitation on log length
* adjust test structure
* integrate with Spark -1 .x
* spark 2.x integration
* remove spark 1.x implementation but provide instructions on how to downgrade
2016-09-11 15:02:58 -04:00
Nan Zhu
3f198b9fef
[jvm-packages] allow training with missing values in xgboost-spark ( #1525 )
...
* allow training with missing values in xgboost-spark
* fix compilation error
* fix bug
2016-08-29 21:45:49 -04:00
Nan Zhu
74db1e8867
[jvm-packages] remove APIs with DMatrix from xgboost-spark ( #1519 )
...
* test consistency of prediction functions between DMatrix and RDD
* remove APIs with DMatrix from xgboost-spark
* fix compilation error in xgboost4j-example
* fix test cases
2016-08-28 21:25:49 -04:00
Nan Zhu
6d65aae091
[jvm-packages] test consistency of prediction functions with DMatrix and RDD ( #1518 )
...
* test consistency of prediction functions between DMatrix and RDD
* fix the failed test cases
2016-08-28 20:27:03 -04:00
Nan Zhu
d7f79255ec
improve test of save/load model ( #1515 )
2016-08-27 17:16:22 -04:00
Nan Zhu
bd5b07873e
[jvm-packages] create dmatrix with specified missing value ( #1272 )
...
* create dmatrix with specified missing value
* update dmlc-core
* support for predict method in spark package
repartitioning
work around
* add more elements to work around training set empty partition issue
2016-06-21 17:35:17 -04:00
Nan Zhu
c85b9012c6
[jvm-packages] xgboost4j-spark external memory ( #1219 )
...
* implement external memory support for XGBoost4J
* remove extra space
* enable external memory for prediction
* update doc
2016-05-22 14:01:28 -04:00
CodingCat
d8535313eb
allow empty partitions
2016-03-23 12:30:06 -04:00
CodingCat
55ab1c6a22
adjust numWorkers for test
2016-03-18 10:34:36 -04:00
CodingCat
f2ef958ebb
support kryo serialization
2016-03-13 11:55:14 -04:00
CodingCat
16b9e92328
force the user to set number of workers
2016-03-12 13:33:57 -05:00
CodingCat
43d7a85bc9
change the API name since we support not only HDFS and local file system
2016-03-11 10:05:32 -05:00
CodingCat
d47df5c1d8
allow the user to specify the worker number and avoid unnecessary shuffle
2016-03-10 06:58:30 -05:00
CodingCat
e0a3f1c000
nthread no larger than spark.task.cpus
2016-03-10 05:51:07 -05:00
CodingCat
909c6af330
add test resources manually
2016-03-08 18:43:30 -05:00
CodingCat
fa03aaeb63
revise current API
2016-03-08 17:18:55 -05:00
CodingCat
6499422e90
fix the merge
2016-03-06 15:22:05 -05:00
CodingCat
130ca7b00c
test case for XGBoostSpark
2016-03-05 19:41:26 -05:00
CodingCat
f0647ec76d
test resources
2016-03-05 18:18:07 -05:00
CodingCat
5c1af13f84
distributed in RDD
2016-03-05 17:50:40 -05:00