This commit is contained in:
tqchen 2015-04-19 14:05:33 -07:00
parent 0220a22ca4
commit 6f14405b09
3 changed files with 3 additions and 3 deletions

View File

@ -6,7 +6,7 @@ List of Documentations
* [External Memory Version](external_memory.md)
* [Text input format](input_format.md)
* [Notes on the Code](../src)
* [Parameters](parameter.md)
* List of all parameters and their usage: [Parameters](parameter.md)
* Learning about the model: [Introduction to Boosted Trees](http://homes.cs.washington.edu/~tqchen/pdf/BoostedTree.pdf)
How to get started

View File

@ -14,7 +14,7 @@ train.txt
Each line represent a single instance, and in the first line '1' is the instance label,'101' and '102' are feature indices, '1.2' and '0.03' are feature values. In the binary classification case, '1' is used to indicate positive samples, and '0' is used to indicate negative samples. We also support probability values in [0,1] as label, to indicate the probability of the instance being positive.
## Group Input Format
As XGBoost supports accomplishing [ranking task](https://github.com/tqchen/xgboost/wiki/Ranking), we support the group input format. In ranking task, instances are categorized into different groups in real world scenarios, for example, in the learning to rank web pages scenario, the web page instances are grouped by their queries. Except the instance file mentioned in the group input format, XGBoost need an file indicating the group information. For example, if the instance file is the "train.txt" shown above,
As XGBoost supports accomplishing [ranking task](../demo/rank), we support the group input format. In ranking task, instances are categorized into different groups in real world scenarios, for example, in the learning to rank web pages scenario, the web page instances are grouped by their queries. Except the instance file mentioned in the group input format, XGBoost need an file indicating the group information. For example, if the instance file is the "train.txt" shown above,
and the group file is as below:
train.txt.group

View File

@ -62,7 +62,7 @@ dtrain = xgb.DMatrix( data, label=label, missing = -999.0, weight=w)
=
#### Setting Parameters
XGBoost use list of pair to save [parameters](https://github.com/tqchen/xgboost/wiki/Parameters). Eg
XGBoost use list of pair to save [parameters](parameter.md). Eg
* Booster parameters
```python
param = {'bst:max_depth':2, 'bst:eta':1, 'silent':1, 'objective':'binary:logistic' }