From 82470ef96b77aef0494087bbeaec6e4a1b5e83d8 Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:28:23 +0200 Subject: [PATCH 1/9] Update README.md --- README.md | 52 +++++++--------------------------------------------- 1 file changed, 7 insertions(+), 45 deletions(-) diff --git a/README.md b/README.md index ee620d6cd..f11dd74ec 100644 --- a/README.md +++ b/README.md @@ -1,46 +1,8 @@ -xgboost: eXtreme Gradient Boosting -====== -An optimized general purpose gradient boosting (tree) library. +This is a Fork of XGBoost from https://github.com/tqchen/xgboost +In the main repo you already find 2 windows project for the porting of the executable and the python library. +Here you have +1) a c# dll wrapper, meaning the passage unmanaged to managed code +2) the c# Higgs Kaggle demo, instead of the python one -Contributors: https://github.com/tqchen/xgboost/graphs/contributors - -Turorial and Documentation: https://github.com/tqchen/xgboost/wiki - -Questions and Issues: [https://github.com/tqchen/xgboost/issues](https://github.com/tqchen/xgboost/issues?q=is%3Aissue+label%3Aquestion) - -Notes on the Code: [Code Guide](src) - -Features -====== -* Sparse feature format: - - Sparse feature format allows easy handling of missing values, and improve computation efficiency. -* Push the limit on single machine: - - Efficient implementation that optimizes memory and computation. -* Speed: XGBoost is very fast - - IN [demo/higgs/speedtest.py](demo/kaggle-higgs/speedtest.py), kaggle higgs data it is faster(on our machine 20 times faster using 4 threads) than sklearn.ensemble.GradientBoostingClassifier -* Layout of gradient boosting algorithm to support user defined objective -* Python interface, works with numpy and scipy.sparse matrix - -Build -====== -* Simply type make -* If your compiler does not come with OpenMP support, it will fire an warning telling you that the code will compile into single thread mode, and you will get single thread xgboost -* You may get a error: -lgomp is not found - - You can type ```make no_omp=1```, this will get you single thread xgboost - - Alternatively, you can upgrade your compiler to compile multi-thread version -* Possible way to build using Visual Studio (not tested): - - In principle, you can put src/xgboost.cpp and src/io/io.cpp into the project, and build xgboost. - - For python module, you need python/xgboost_wrapper.cpp and src/io/io.cpp to build a dll. - -Version -====== -* This version is named xgboost-unity, the code has been refactored from 0.2x to be cleaner and more flexibility -* This version of xgboost is not compatible with 0.2x, due to huge amount of changes in code structure - - This means the model and buffer file of previous version can not be loaded in xgboost-unity -* For legacy 0.2x code, refer to [Here](https://github.com/tqchen/xgboost/releases/tag/v0.22) -* Change log in [CHANGES.md](CHANGES.md) - -XGBoost in Graphlab Create -====== -* XGBoost is adopted as part of boosted tree toolkit in Graphlab Create (GLC). Graphlab Create is a powerful python toolkit that allows you to data manipulation, graph processing, hyper-parameter search, and visualization of big data in one framework. Try the Graphlab Create in http://graphlab.com/products/create/quick-start-guide.html -* Nice blogpost by Jay Gu using GLC boosted tree to solve kaggle bike sharing challenge: http://blog.graphlab.com/using-gradient-boosted-trees-to-predict-bike-sharing-demand +next steps: +I will upload a c# cv implementation for the demo very soon From a68f6680a04086911341080bec8808e35017118c Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:29:03 +0200 Subject: [PATCH 2/9] Update README.md --- README.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index f11dd74ec..ecee80f4a 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,6 @@ This is a Fork of XGBoost from https://github.com/tqchen/xgboost -In the main repo you already find 2 windows project for the porting of the executable and the python library. + +In the main repo you already find 2 windows projects for the porting of the executable and the python library. Here you have 1) a c# dll wrapper, meaning the passage unmanaged to managed code 2) the c# Higgs Kaggle demo, instead of the python one From 0f28ee4a8eba6b3c20df5bddbf6193a13c8fd0c6 Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:30:48 +0200 Subject: [PATCH 3/9] Update README.md --- README.md | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index ecee80f4a..614aacdd8 100644 --- a/README.md +++ b/README.md @@ -1,9 +1,11 @@ This is a Fork of XGBoost from https://github.com/tqchen/xgboost In the main repo you already find 2 windows projects for the porting of the executable and the python library. -Here you have -1) a c# dll wrapper, meaning the passage unmanaged to managed code -2) the c# Higgs Kaggle demo, instead of the python one + +Here (https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper) you have: + + 1) a c# dll wrapper, meaning the passage unmanaged to managed code + 2) the c# Higgs Kaggle demo, instead of the python one next steps: -I will upload a c# cv implementation for the demo very soon + I will upload a c# cv implementation for the demo very soon From a7b512a1c896c99464833662b7c94260599fddbc Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:31:16 +0200 Subject: [PATCH 4/9] Update README.md --- README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/README.md b/README.md index 614aacdd8..6b6a3f65e 100644 --- a/README.md +++ b/README.md @@ -5,7 +5,10 @@ In the main repo you already find 2 windows projects for the porting of the exec Here (https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper) you have: 1) a c# dll wrapper, meaning the passage unmanaged to managed code + 2) the c# Higgs Kaggle demo, instead of the python one + next steps: + I will upload a c# cv implementation for the demo very soon From e26c072e8339e48f9c4a8f38f9c219e49f15a2d1 Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:39:20 +0200 Subject: [PATCH 5/9] Update README.md --- README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 6b6a3f65e..98c1b053e 100644 --- a/README.md +++ b/README.md @@ -2,11 +2,11 @@ This is a Fork of XGBoost from https://github.com/tqchen/xgboost In the main repo you already find 2 windows projects for the porting of the executable and the python library. -Here (https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper) you have: +Here you have: - 1) a c# dll wrapper, meaning the passage unmanaged to managed code + 1) a c# dll wrapper, meaning the passage unmanaged to managed code, in https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper - 2) the c# Higgs Kaggle demo, instead of the python one + 2) the c# Higgs Kaggle demo, instead of the python one in https://github.com/giuliohome/xgboost/tree/master/windows/kaggle_higgs_demo next steps: From e4ad70e21cd8a9a46494150fac3a9e6a48b81adb Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:41:34 +0200 Subject: [PATCH 6/9] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 98c1b053e..4733a3142 100644 --- a/README.md +++ b/README.md @@ -6,7 +6,7 @@ Here you have: 1) a c# dll wrapper, meaning the passage unmanaged to managed code, in https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper - 2) the c# Higgs Kaggle demo, instead of the python one in https://github.com/giuliohome/xgboost/tree/master/windows/kaggle_higgs_demo + 2) the c# Higgs Kaggle demo, instead of the python one (actually you will get a higher score with the c# version, due to some changes I've made) in https://github.com/giuliohome/xgboost/tree/master/windows/kaggle_higgs_demo next steps: From 41eef462f06dfcb83acb391a46abf13e588442f0 Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 15:49:34 +0200 Subject: [PATCH 7/9] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 4733a3142..5810720ea 100644 --- a/README.md +++ b/README.md @@ -4,7 +4,7 @@ In the main repo you already find 2 windows projects for the porting of the exec Here you have: - 1) a c# dll wrapper, meaning the passage unmanaged to managed code, in https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper + 1) a c# dll wrapper, meaning the passage from unmanaged to managed code, in https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper 2) the c# Higgs Kaggle demo, instead of the python one (actually you will get a higher score with the c# version, due to some changes I've made) in https://github.com/giuliohome/xgboost/tree/master/windows/kaggle_higgs_demo From 71e5b4c413e3452dac88ed8d61941de21fd2a9fc Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 16:13:20 +0200 Subject: [PATCH 8/9] Update README.md --- README.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/README.md b/README.md index 5810720ea..580c100c1 100644 --- a/README.md +++ b/README.md @@ -7,6 +7,8 @@ Here you have: 1) a c# dll wrapper, meaning the passage from unmanaged to managed code, in https://github.com/giuliohome/xgboost/tree/master/windows/xgboost_sharp_wrapper 2) the c# Higgs Kaggle demo, instead of the python one (actually you will get a higher score with the c# version, due to some changes I've made) in https://github.com/giuliohome/xgboost/tree/master/windows/kaggle_higgs_demo + Start the demo from the root folder like this: + bin\x64\Debug\kaggle_higgs_demo.exe training_path.csv test_path.csv sharp_pred.csv next steps: From 04fc25615c2b90342a1da210a6db6fac4a3b4c1e Mon Sep 17 00:00:00 2001 From: giuliohome Date: Sun, 31 Aug 2014 16:28:49 +0200 Subject: [PATCH 9/9] Update README.md --- README.md | 5 +---- 1 file changed, 1 insertion(+), 4 deletions(-) diff --git a/README.md b/README.md index 580c100c1..c00dd5a6b 100644 --- a/README.md +++ b/README.md @@ -10,7 +10,4 @@ Here you have: Start the demo from the root folder like this: bin\x64\Debug\kaggle_higgs_demo.exe training_path.csv test_path.csv sharp_pred.csv - -next steps: - - I will upload a c# cv implementation for the demo very soon + 3) 5 fold cv implementation in c# for the demo: you see inline cv ams while training (of course on a completely separate set)