* Fix #3545: XGDMatrixCreateFromCSCEx silently discards empty trailing rows

Description: The bug is triggered when both of the following hold:
1. The data matrix has empty rows at the bottom. More precisely, rows `n-k+1`, `n-k+2`, ..., `n` of the matrix have missing values in all dimensions (`n` is the number of instances, `k` the number of empty trailing rows).
2. The data matrix is given in Compressed Sparse Column (CSC) format.

Diagnosis: When the CSC matrix is converted to Compressed Sparse Row (CSR) format (the format commonly used for DMatrix), the empty trailing rows are silently ignored. More specifically, the row pointer (`offset`) of the newly created CSR matrix does not account for these rows.

Fix: Modify the row pointer so that it covers the empty trailing rows.

* Add regression test
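
The diagnosis above can be illustrated with a minimal sketch in plain Python. This is not the XGBoost C++ implementation; the function name `csc_to_csr` and its parameters are hypothetical and chosen only for illustration. It shows a CSC to CSR conversion in which the row pointer is sized from the declared number of rows, so empty trailing rows keep their entries in the row pointer.

```python
def csc_to_csr(col_ptr, row_idx, values, num_row):
    """Convert CSC arrays to CSR arrays (hypothetical helper, for illustration).

    num_row is the declared number of rows. It is passed explicitly because
    it cannot be recovered from row_idx when the trailing rows are empty.
    """
    num_col = len(col_ptr) - 1

    # Count the stored entries in each row.
    counts = [0] * num_row
    for r in row_idx:
        counts[r] += 1

    # The row pointer always has num_row + 1 entries. Empty rows, including
    # trailing ones, simply repeat the previous offset.
    row_ptr = [0] * (num_row + 1)
    for i in range(num_row):
        row_ptr[i + 1] = row_ptr[i] + counts[i]

    # Scatter each CSC entry into its slot in the CSR arrays.
    col_out = [0] * len(values)
    val_out = [0.0] * len(values)
    next_slot = list(row_ptr[:-1])
    for c in range(num_col):
        for k in range(col_ptr[c], col_ptr[c + 1]):
            r = row_idx[k]
            dst = next_slot[r]
            col_out[dst] = c
            val_out[dst] = values[k]
            next_slot[r] += 1
    return row_ptr, col_out, val_out


# A 4 x 2 matrix whose last two rows are empty:
#   row 0: [1.0,  . ]
#   row 1: [ . , 2.0]
#   row 2: [ . ,  . ]
#   row 3: [ . ,  . ]
col_ptr = [0, 1, 2]
row_idx = [0, 1]
values = [1.0, 2.0]

row_ptr, col_out, val_out = csc_to_csr(col_ptr, row_idx, values, num_row=4)
num_rows_in_csr = len(row_ptr) - 1

# Inferring the row count from the largest row index seen would report
# only 2 rows, which is the kind of silent loss described above.
rows_if_inferred = max(row_idx) + 1
```

In this sketch the CSR result still reports four rows, whereas a row count inferred from the stored indices alone would report two.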
XGBoost R Package for Scalable GBM
Resources
- XGBoost R Package Online Documentation
- See it for detailed documentation, examples, and tutorials.
Installation
The package is available on CRAN. For the stable version (pre-compiled for Windows and OS X), please install from CRAN:
install.packages('xgboost')
For more detailed installation instructions, please see here.
Examples
- Please see the walk-through example.
- See also the example scripts for the Kaggle Higgs Challenge, including a speed test script on that dataset, and the scripts for the Otto challenge, including an R Markdown document.
Development
- See the R Package section of the contributors guide.