refactor vignette

2015-02-12 00:06:13 +01:00
parent adf8b6553d
commit 97cb8bf637
4 changed files with 41 additions and 583 deletions
--- a/R-package/vignettes/discoverYourData.Rmd
+++ b/R-package/vignettes/discoverYourData.Rmd
@@ -18,7 +18,7 @@ The purpose of this Vignette is to show you how to use **Xgboost** to discover a

 You may know **Xgboost** as a state of the art tool to build some kind of Machine learning models. It has been [used](https://github.com/tqchen/xgboost) to win several [Kaggle](http://www.kaggle.com) competition. 

-During these competition, the purpose is to make prediction. This Vignette is not about showing you how to predict anything. The purpose of this document is to explain *how to use **Xgboost** to understand the link between the features of your data and an outcome*.
+During these competition, the purpose is to make prediction. This Vignette is not about showing you how to predict anything (see [Xgboost presentation](www.somewhere.org)). The purpose of this document is to explain how to use **Xgboost** to understand the *link* between the *features* of your data and an *outcome*.

 For the purpose of this tutorial we will first load the required packages.

@@ -30,12 +30,12 @@ require(Matrix)
 require(data.table)
 if (!require(vcd)) install.packages('vcd') 
 ```
-> **VCD** is used for one of its embedded dataset only (and not for its own functions).
+> **VCD** package is used for one of its embedded dataset only (and not for its own functions).

 Preparation of the dataset
 ==========================

-According to its documentation, **Xgboost** works only on `numeric` variables.
+**Xgboost** works only on `numeric` variables.

 Sometimes the dataset we have to work on have *categorical* data. 

@@ -44,11 +44,11 @@ A *categorical* variable is one which have a fixed number of different values. B
 > In **R**, *categorical* variable is called `factor`.
 > Type `?factor` in console for more information.

-In this demo we will see how to transform a dense dataframe with *categorical* variables to a sparse matrix before analyzing it in **Xgboost**.
+In this demo we will see how to transform a dense dataframe (dense = few zero in the matrix) with *categorical* variables to a very sparse matrix (sparse = lots of zero in the matrix) of `numeric` features before analyzing these data in **Xgboost**.

 The method we are going to see is usually called [one hot encoding](http://en.wikipedia.org/wiki/One-hot).

-The first step is to load Arthritis dataset in memory and create a copy of the dataset with `data.table` package (`data.table` is 100% compliant with **R** dataframe but its syntax is a lot more consistent and its performance are really good).
+The first step is to load Arthritis dataset in memory and wrap the dataset with `data.table` package (`data.table` is 100% compliant with **R** dataframe but its syntax is a lot more consistent and its performance are really good).

 ```{r, results='hide'}
 data(Arthritis)
@@ -71,16 +71,17 @@ str(df)
 > `ordinal` variable is a categorical variable with values wich can be ordered
 > Here: `None` > `Some` > `Marked`.

-Let's add some new categorical features to see if it helps. 
+Let's add some new *categorical* features to see if it helps.

-Of course these feature are highly correlated to the Age feature. Usually it's not a good thing in ML, but tree algorithms (including boosted trees) are able to select the best features, even in case of highly correlated features.
+Of course these feature are highly correlated to the Age feature. Usually it's not a good thing in Machine Learning. Fortunately, tree algorithms (including boosted trees) are very robust in this specific case.

 ```{r}
 df[,AgeDiscret:= as.factor(round(Age/10,0))][1:10]
 ```

-> For the first feature we create groups of age by rounding the real age. 
-> Note that we transform it to `factor` so the algorithm treat them as independant values.
+> For the first feature we create groups of age by rounding the real age.
+> Note that we transform it to `factor` so the algorithm treat these age groups as independant values.
+> Therefore, 20 is not closer to 30 than 60. To make it short, the distance between ages is lost in this transformation.

 Following is an even stronger simplification of the real age with an arbitrary split at 30 years old. I choose this value **based on nothing**. We will see later if simplifying the information based on arbitrary values is a good strategy (I am sure you already have an idea of how well it will work!).

@@ -103,11 +104,10 @@ print(levels(df[,Treatment]))

 Next step, we will transform the categorical data to dummy variables.
 This is the [one hot encoding](http://en.wikipedia.org/wiki/One-hot) part.
-The purpose is to transform each value of each *categorical* feature in a binary feature.

-For example, the column Treatment will be replaced by two columns, Placebo, and Treated. Each of them will be *binary*. For example an observation which had the value Placebo in column Treatment before the transformation will have, after the transformation, the value 1 in the new column Placebo and the value 0 in the new column  Treated.
+The purpose is to transform each value of each *categorical* feature in a binary feature `{0, 1}`.

-> Formulae `Improved~.-1` used below means transform all *categorical* features but column Improved to binary values.
+For example, the column Treatment will be replaced by two columns, Placebo, and Treated. Each of them will be *binary*. Therefore, an observation which has the value Placebo in column Treatment before the transformation will have after the transformation the value `1` in the new column Placebo and the value `0` in the new column  Treated.

 Column Improved is excluded because it will be our output column, the one we want to predict.

@@ -116,10 +116,12 @@ sparse_matrix <- sparse.model.matrix(Improved~.-1, data = df)
 print(sparse_matrix[1:10,])
 ```

-Create the output vector (not as a sparse `Matrix`):
+> Formulae `Improved~.-1` used above means transform all *categorical* features but column Improved to binary values.

-1. Set, for all rows, field in Y column to 0; 
-2. set Y to 1 when Improved == Marked; 
+Create the output `numeric` vector (not as a sparse `Matrix`):
+
+1. Set, for all rows, field in Y column to `0`; 
+2. set Y to `1` when Improved == Marked; 
 3. Return Y column.

 ```{r}
@@ -129,7 +131,7 @@ output_vector = df[,Y:=0][Improved == "Marked",Y:=1][,Y]
 Build the model
 ===============

-The code below is very usual. For more information, you can look at the documentation of `xgboost()` function.
+The code below is very usual. For more information, you can look at the documentation of `xgboost` function (or to the vignette [Xgboost presentation](www.somewhere.org)).

 ```{r}
 bst <- xgboost(data = sparse_matrix, label = output_vector, max.depth = 4,
@@ -137,7 +139,7 @@ bst <- xgboost(data = sparse_matrix, label = output_vector, max.depth = 4,

 ```

-You can see plenty of `train-error: 0.XXXXX` lines followed by a number. It decreases. Each line shows how well your model explains your data. Lower is better. 
+You can see plenty of `train-error: 0.XXXXX` lines followed by a number. It decreases. Each line shows how well your model explains your data. Lower is better.

 A model which fits too well may [overfit](http://en.wikipedia.org/wiki/Overfitting) (meaning it copy paste too much the past, and is not that good to predict the future). 

@@ -151,7 +153,7 @@ Feature importance
 Measure feature importance
 --------------------------

-In the code below, `sparse_matrix@Dimnames[[2]]` represents the column names of the sparse matrix. These names are the values of the feature (because one binary column == one value of one *categorical* feature)
+In the code below, `sparse_matrix@Dimnames[[2]]` represents the column names of the sparse matrix. These names are the original values of the feature (remember, one binary column == one value of one *categorical* feature).

 ```{r}
 importance <- xgb.importance(sparse_matrix@Dimnames[[2]], model = bst)
@@ -161,9 +163,9 @@ print(importance)
 > The column `Gain` provide the information we are looking for.
 > As you can see, features are classified by `Gain`.

-`Gain` is the improvement in accuracy brought by a feature to the branches it is on. The idea is that before adding a new split on a feature X to the branch there was some wrongly classified elements, after adding the split on this feature, there are two new branches, and each of these branch is more accurate (one branch saying if your observation is on this branch then it should be classified as 1, and the other branch saying the exact opposite, both new branch being more accurate than the one before the insertion of the feature).
+`Gain` is the improvement in accuracy brought by a feature to the branches it is on. The idea is that before adding a new split on a feature X to the branch there was some wrongly classified elements, after adding the split on this feature, there are two new branches, and each of these branch is more accurate (one branch saying if your observation is on this branch then it should be classified as 1, and the other branch saying the exact opposite, both new branches being more accurate than the one before the split).

-`Cover` measure the relative quantity of observations concerned by a feature.
+`Cover` measures the relative quantity of observations concerned by a feature.

 `Frequence` is a simpler way to measure the `Gain`. It just counts the number of times a feature is used in all generated trees. You should not use it (unless you know why you want to use it).

@@ -172,13 +174,13 @@ Plotting the feature importance

 All these things are nice, but it would be even better to plot the result. Fortunately, such function already exists.

-```{r}
+```{r, fig.width=8, fig.height=5, fig.align='center'}
 xgb.plot.importance(importance_matrix = importance)
 ```

-Feature have been automatically divided in 2 clusters: the interesting features... and the others.
+Feature have automatically been divided in 2 clusters: the interesting features... and the others.

-> Depending of the case you may have more than two clusters. 
+> Depending of the dataset and the learning parameters you may have more than two clusters. 
 > Default value is to limit them to 10, but you can increase this limit. Look at the function documentation for more information.

 According to the plot above, the most important feature in this dataset to predict if the treatment will work is :
@@ -188,7 +190,7 @@ According to the plot above, the most important feature in this dataset to predi
 * the sex is third but already included in the not interesting feature ; 
 * then we see our generated features (AgeDiscret). We can see that their contribution is very low.

-Does these results make sense?
+Do these results make sense?
 ------------------------------

 Let's check some **Chi2** between each of these features and the outcome.
@@ -214,7 +216,11 @@ c2 <- chisq.test(df$AgeCat, df$Y)
 print(c2)
 ```

-The perfectly random split I did between young and old at 30 years old have a low correlation of **`r round(c2$statistic, 2)`**. It's a result we may expect as may be in my mind > 30 years is being old (I am 32 and starting feeling old, this may explain that), but  for the illness we are studying, the age to be vulnerable is not the same. Don't let your *gut* lower the quality of your model. In *data science* expression, there is the word *science* :-)
+The perfectly random split I did between young and old at 30 years old have a low correlation of **`r round(c2$statistic, 2)`**. It's a result we may expect as may be in my mind > 30 years is being old (I am 32 and starting feeling old, this may explain that), but  for the illness we are studying, the age to be vulnerable is not the same. 
+
+Morality: don't let your *gut* lower the quality of your model. 
+
+In *data science* expression, there is the word *science* :-)

 Conclusion
 ==========
@@ -238,12 +244,12 @@ Special Note: What about Random forest?

 As you may know, [Random Forest](http://en.wikipedia.org/wiki/Random_forest) algorithm is cousin with boosting and both are part of the [ensemble leanrning](http://en.wikipedia.org/wiki/Ensemble_learning) family.

-Both trains several decision trees for one dataset. The *main* difference is that in Random Forest, trees are independant and in boosting tree N+1 focus its learning on what has no been well modeled by tree N (and so on...).
+Both trains several decision trees for one dataset. The *main* difference is that in Random Forest, trees are independant and in boosting tree N+1 focus its learning on the loss (= what has no been well modeled by tree N).

-This difference have an impact on a corner case in feature importance analysis: the *correlated features*.
+This difference have an impact on feature importance analysis: the *correlated features*.

 Imagine two features perfectly correlated, feature `A` and feature `B`. For one specific tree, if the algorithm needs one of them, it will choose randomly (true in both boosting and random forest).

-However, in Random Forest this choice will be done plenty of times, because trees are independant. So the **importance** of a specific feature is diluted among features `A` and `B`. So you won't easily know they are important to predict what you want to predict.
+However, in Random Forest this random choice will be done for each tree, because each tree is independant from the others. Therefore, approximatively, depending of your parameters, 50% of the trees will choose feature `A` and the other 50% will choose feature `B`. So the **importance** of the information contained in `A` and `B` (which is the same, because they are perfectly correlated) is diluted in `A` and `B`. So you won't easily know this information is important to predict what you want to predict! It is even worse when you have 10 correlated features...

-In boosting, when as aspect of your dataset have been learned by the algorithm, there is no more need to refocus on it. Therefore, all the importace will be on `A` or `B`. You will know that one of them is important, it is up to you to search for correlated features.
+In boosting, when a specific link between feature and outcome have been learned by the algorithm, it will try to not refocus on it (in theory it is what happens, reality is never that simple). Therefore, all the importance will be on `A` or on `B`. You will know that one feature have an important role in the link between your dataset and the outcome. It is still up to you to search for the correlated features to the one detected as important if you need all of them.
--- a/R-package/vignettes/framed.sty
+++ b/R-package/vignettes/framed.sty
@@ -1,548 +0,0 @@
-% framed.sty   v 0.96    2011/10/22
-% Copyright (C) 1992-2011 by Donald Arseneau  (asnd@triumf.ca)
-% These macros may be freely transmitted, reproduced, or modified
-% for any purpose provided that this notice is left intact.
-% 
-%====================== Begin Instructions =======================
-%
-%                       framed.sty
-%                       ~~~~~~~~~~
-% Create framed, shaded, or differently highlighted regions that can 
-% break across pages.  The environments defined are 
-%   framed     - ordinary frame box (\fbox) with edge at margin
-%   oframed    - framed with open top/bottom at page breaks
-%   shaded     - shaded background (\colorbox) bleeding into margin
-%   shaded*    - shaded background (\colorbox) with edge at margin
-%   snugshade  - shaded with tight fit around text (esp. in lists)
-%   snugshade* - like snugshade with shading edge at margin
-%   leftbar    - thick vertical line in left margin
-% 
-% to be used like
-% \begin{framed}
-%  copious text
-% \end{framed}
-%
-% But the more general purpose of this package is to facilitate the
-% definition of new environments that take multi-line material,
-% wrap it with some non-breakable formatting (some kind of box or
-% decoration) and allow page breaks in the material.  Such environments
-% are defined to declare (or use) \FrameCommand for applying the boxy 
-% decoration, and \MakeFramed{settings} ... \endMakeFramed wrapped
-% around the main text argument (environment body).
-%
-% The "framed" environment uses "\fbox", by default, as its "\FrameCommand"
-% with the additional settings "\fboxrule=\FrameRule" and "\fboxsep=\FrameSep".
-% You can change these lengths (using "\setlength") and you can change 
-% the definition of "\FrameCommand" to use much fancier boxes.  
-%
-% In fact, the "shaded" environment just redefines \FrameCommand to be
-% "\colorbox{shadecolor}" (and you have to define the color `"shadecolor"':
-% "\definecolor{shadecolor}...").
-%
-% Although the intention is for other packages to define the varieties
-% of decoration, a command "\OpenFbox" is defined for frames with open 
-% tops or bottoms, and used for the "oframed" environment.  This facility
-% is based on a more complex and capable command "\CustomFBox" which can
-% be used for a wider range of frame styles.  One such style of a title-bar
-% frame with continuation marks is provided as an example.  It is used by
-% the "titled-frame" environment.  To make use of "titled-frame" in your
-% document, or the "\TitleBarFrame" command in your own environment
-% definitions, you must define the colors TFFrameColor (for the frame)
-% and a contrasting TFTitleColor (for the title text).
-%
-% A page break is allowed, and even encouraged, before the framed
-% environment.  If you want to attach some text (a box title) to the
-% frame, then the text should be inserted by \FrameCommand so it cannot
-% be separated from the body.
-%
-% The contents of the framed regions are restricted: 
-% Floats, footnotes, marginpars and head-line entries will be lost.
-% (Some of these may be handled in a later version.)
-% This package will not work with the page breaking of multicol.sty,
-% or other systems that perform column-balancing.
-%
-% The MakeFramed environment does the work.  Its `settings' argument
-% should contain any adjustments to the text width (via a setting of
-% "\hsize"). Here, the parameter "\width" gives the measured extra width
-% added by the frame, so a common setting is "\advance\hsize-\width"
-% which reduces the width of the text just enough that the outer edge
-% of the frame aligns with the margins.  The `settings' should also
-% include a `restore' command -- "\@parboxrestore" or "\FrameRestore"
-% or something similar; for instance, the snugshade environment uses
-% settings to eliminate list indents and vertical space, but uses
-% "\hspace" in "\FrameCommand" to reproduce the list margin ouside the
-% shading.
-%
-% There are actually four variants of "\FrameCommand" to allow different
-% formatting for each part of an environment broken over pages.  Unbroken
-% text is adorned by "\FrameCommand", whereas split text first uses 
-% "\FirstFrameCommand", possibly followed by "\MidFrameCommand", and
-% finishing with "\LastFrameCommand".  The default definitions for 
-% these three just invokes "\FrameCommand", so that all portions are
-% framed the same way.  See the oframe environment for use of distinct
-% First/Mid/Last frames.
-%
-% Expert commands:
-% \MakeFramed, \endMakeFramed: the "MakeFramed" environment
-% \FrameCommand: command to draw the frame around its argument
-% \FirstFrameCommand: the frame for the first part of a split environment
-% \LastFrameCommand: for the last portion
-% \MidFrameCommand: for any intermediate segments
-% \FrameRestore: restore some text settings, but fewer than \@parboxrestore
-% \FrameRule: length register; \fboxrule for default "framed".
-% \FrameSep: length register; \fboxsep for default "framed".
-% \FrameHeightAdjust: macro; height of frame above baseline at top of page
-% \OuterFrameSep: vertical space before and after the framed env. Defaults to "\topsep"
-%
-% This is still a `pre-production' version because I can think of many
-% features/improvements that should be made.  Also, a detailed manual needs
-% to be written.  Nevertheless, starting with version 0.5 it should be bug-free.
-%
-% ToDo: 
-% Test more varieties of list 
-% Improve and correct documentation
-% Propagation of \marks
-% Handle footnotes (how??) floats (?) and marginpars.
-% Stretchability modification.
-% Make inner contents height/depth influence placement.
-%======================== End Instructions ========================
-
-\ProvidesPackage{framed}[2011/10/22 v 0.96: 
-   framed or shaded text with page breaks]
-
-\newenvironment{framed}% using default \FrameCommand
-  {\MakeFramed {\advance\hsize-\width \FrameRestore}}%
-  {\endMakeFramed}
-
-\newenvironment{shaded}{%
-  \def\FrameCommand{\fboxsep=\FrameSep \colorbox{shadecolor}}%
-  \MakeFramed {\FrameRestore}}%
- {\endMakeFramed}
-
-\newenvironment{shaded*}{%
-  \def\FrameCommand{\fboxsep=\FrameSep \colorbox{shadecolor}}%
-  \MakeFramed {\advance\hsize-\width \FrameRestore}}%
- {\endMakeFramed}
-
-\newenvironment{leftbar}{%
-  \def\FrameCommand{\vrule width 3pt \hspace{10pt}}%
-  \MakeFramed {\advance\hsize-\width \FrameRestore}}%
- {\endMakeFramed}
-
-% snugshde:  Shaded environment that 
-%  -- uses the default \fboxsep instead of \FrameSep
-%  -- leaves the text indent unchanged (shading bleeds out)
-%  -- eliminates possible internal \topsep glue (\@setminipage)
-%  -- shrinks inside the margins for lists
-%  An \item label will tend to hang outside the shading, thanks to
-%  the small \fboxsep.
-
-\newenvironment{snugshade}{%
-  \def\FrameCommand##1{\hskip\@totalleftmargin \hskip-\fboxsep
-  \colorbox{shadecolor}{##1}\hskip-\fboxsep
-      % There is no \@totalrightmargin, so:
-      \hskip-\linewidth \hskip-\@totalleftmargin \hskip\columnwidth}%
-  \MakeFramed {\advance\hsize-\width 
-    \@totalleftmargin\z@ \linewidth\hsize
-    \@setminipage}%
- }{\par\unskip\@minipagefalse\endMakeFramed}
-
-\newenvironment{snugshade*}{%
-  \def\FrameCommand##1{\hskip\@totalleftmargin 
-  \colorbox{shadecolor}{##1}%
-      % There is no \@totalrightmargin, so:
-      \hskip-\linewidth \hskip-\@totalleftmargin \hskip\columnwidth}%
-  \MakeFramed {\advance\hsize-\width 
-    \@totalleftmargin\z@ \linewidth\hsize
-    \advance\labelsep\fboxsep
-    \@setminipage}%
- }{\par\unskip\@minipagefalse\endMakeFramed}
-
-\newenvironment{oframed}{% open (top or bottom) framed
-  \def\FrameCommand{\OpenFBox\FrameRule\FrameRule}%
-  \def\FirstFrameCommand{\OpenFBox\FrameRule\z@}%
-  \def\MidFrameCommand{\OpenFBox\z@\z@}%
-  \def\LastFrameCommand{\OpenFBox\z@\FrameRule}%
-  \MakeFramed {\advance\hsize-\width \FrameRestore}%
-  }{\endMakeFramed}
-
-% A simplified entry to \CustomFBox with two customized parameters:
-% the thicknesses of the top and bottom rules.  Perhaps we want to
-% use less \fboxsep on the open edges?
-
-\def\OpenFBox#1#2{\fboxsep\FrameSep
-   \CustomFBox{}{}{#1}{#2}\FrameRule\FrameRule}
-
-% \CustomFBox is like an amalgamation of \fbox and \@frameb@x,
-% so it can be used by an alternate to \fbox or \fcolorbox, but
-% it has more parameters for various customizations.
-% Parameter #1 is inserted (in vmode) right after the top rule 
-% (useful for a title or assignments), and #2 is similar, but
-% inserted right above the bottom rule.
-% The thicknesses of the top, bottom, left, and right rules are
-% given as parameters #3,#4,#5,#6 respectively.  They should be
-% \fboxrule or \z@ (or some other thickness).
-% The text argument is #7.
-% An instance of this can be used for the frame of \fcolorbox by
-% locally defining \fbox before \fcolorbox; e.g.,
-% \def\fbox{\CustomFBox{}{}\z@\z@\fboxrule\fboxrule}\fcolorbox
-%
-% Do we need to use different \fboxsep on different sides too?
-%
-\long\def\CustomFBox#1#2#3#4#5#6#7{%
-  \leavevmode\begingroup
-  \setbox\@tempboxa\hbox{%
-    \color@begingroup
-      \kern\fboxsep{#7}\kern\fboxsep
-    \color@endgroup}%
-  \hbox{%
-    % Here we calculate and shift for the depth.  Done in
-    % a group because one of the arguments might be \@tempdima
-    % (we could use \dimexpr instead without grouping).
-    \begingroup
-      \@tempdima#4\relax
-      \advance\@tempdima\fboxsep
-      \advance\@tempdima\dp\@tempboxa
-    \expandafter\endgroup\expandafter
-    \lower\the\@tempdima\hbox{%
-      \vbox{%
-        \hrule\@height#3\relax
-        #1%
-        \hbox{%
-          \vrule\@width#5\relax
-          \vbox{%
-            \vskip\fboxsep % maybe these should be parameters too
-            \copy\@tempboxa
-            \vskip\fboxsep}%
-          \vrule\@width#6\relax}%
-        #2%
-        \hrule\@height#4\relax}%
-    }%
-  }%
-  \endgroup
-}
-
-
-% A particular type of titled frame with continuation marks.  
-% Parameter #1 is the title, repeated on each page.
-\newenvironment{titled-frame}[1]{%
-  \def\FrameCommand{\fboxsep8pt\fboxrule2pt
-     \TitleBarFrame{\textbf{#1}}}%
-  \def\FirstFrameCommand{\fboxsep8pt\fboxrule2pt
-     \TitleBarFrame[$\blacktriangleright$]{\textbf{#1}}}%
-  \def\MidFrameCommand{\fboxsep8pt\fboxrule2pt
-     \TitleBarFrame[$\blacktriangleright$]{\textbf{#1\ (cont)}}}%
-  \def\LastFrameCommand{\fboxsep8pt\fboxrule2pt
-     \TitleBarFrame{\textbf{#1\ (cont)}}}%
-  \MakeFramed{\advance\hsize-20pt \FrameRestore}}%
-%  note: 8 + 2 + 8 + 2 = 20.  Don't use \width because the frame title
-%  could interfere with the width measurement.
- {\endMakeFramed}
-
-% \TitleBarFrame[marker]{title}{contents}
-% Frame with a label at top, optional continuation marker at bottom right.
-% Frame color is TFFrameColor and title color is a contrasting TFTitleColor; 
-% both need to be defined before use.  The frame itself use \fboxrule and
-% \fboxsep.  If the title is omitted entirely, the title bar is omitted
-% (use a blank space to force a blank title bar).
-% 
-\newcommand\TitleBarFrame[3][]{\begingroup
-  \ifx\delimiter#1\delimiter
-    \let\TF@conlab\@empty
-  \else
-    \def\TF@conlab{% continuation label
-     \nointerlineskip
-     \smash{\rlap{\kern\wd\@tempboxa\kern\fboxrule\kern\fboxsep #1}}}%
-  \fi
-  \let\TF@savecolor\current@color
-  \textcolor{TFFrameColor}{%
-  \CustomFBox
-    {\TF@Title{#2}}{\TF@conlab}%
-    \fboxrule\fboxrule\fboxrule\fboxrule
-    {\let\current@color\TF@savecolor\set@color #3}%
-  }\endgroup
-}
-
-% The title bar for \TitleBarFrame
-\newcommand\TF@Title[1]{%
-  \ifx\delimiter#1\delimiter\else
-  \kern-0.04pt\relax
-  \begingroup
-  \setbox\@tempboxa\vbox{%
-   \kern0.8ex
-   \hbox{\kern\fboxsep\textcolor{TFTitleColor}{#1}\vphantom{Tj)}}%
-   \kern0.8ex}%
-  \hrule\@height\ht\@tempboxa
-  \kern-\ht\@tempboxa
-  \box\@tempboxa
-  \endgroup
-  \nointerlineskip 
-  \kern-0.04pt\relax
-  \fi
-}
-
-\chardef\FrameRestore=\catcode`\| % for debug
-\catcode`\|=\catcode`\% % (debug: insert space after backslash)
-
-\newlength\OuterFrameSep \OuterFrameSep=\maxdimen \relax
-
-\def\MakeFramed#1{\par
- % apply default \OuterFrameSep = \topsep
- \ifdim\OuterFrameSep=\maxdimen \OuterFrameSep\topsep \fi
- % measure added width and height; call result \width and \height
- \fb@sizeofframe\FrameCommand
- \let\width\fb@frw \let\height\fb@frh
- % insert pre-penalties and skips
- \begingroup
- \skip@\lastskip
- \if@nobreak\else 
-    \penalty9999 % updates \page parameters
-    \ifdim\pagefilstretch=\z@ \ifdim\pagefillstretch=\z@
-       % not infinitely stretchable, so encourage a page break here
-       \edef\@tempa{\the\skip@}%
-       \ifx\@tempa\zero@glue \penalty-30
-       \else \vskip-\skip@ \penalty-30 \vskip\skip@
-    \fi\fi\fi
-    \penalty\z@
-    % Give a stretchy breakpoint that will always be taken in preference
-    % to the \penalty 9999 used to update page parameters.  The cube root
-    % of 10000/100 indicates a multiplier of 0.21545, but the maximum 
-    % calculated badness is really 8192, not 10000, so the multiplier
-    % is 0.2301. 
-    \advance\skip@ \z@ plus-.5\baselineskip
-    \advance\skip@ \z@ plus-.231\height
-    \advance\skip@ \z@ plus-.231\skip@
-    \advance\skip@ \z@ plus-.231\OuterFrameSep
-    \vskip-\skip@ \penalty 1800 \vskip\skip@
- \fi
- \addvspace{\OuterFrameSep}%
- \endgroup
- % clear out pending page break
- \penalty\@M \vskip 2\baselineskip \vskip\height
- \penalty9999 \vskip -2\baselineskip \vskip-\height
- \penalty9999 % updates \pagetotal
-|\message{After clearout, \pagetotal=\the\pagetotal, \pagegoal=\the\pagegoal. }%
- \fb@adjheight 
- \setbox\@tempboxa\vbox\bgroup
-   #1% Modifications to \hsize (can use \width and \height)
-   \textwidth\hsize \columnwidth\hsize
-}
-
-\def\endMakeFramed{\par
-     \kern\z@
-     \hrule\@width\hsize\@height\z@ % possibly bad
-     \penalty-100 % (\hrule moves depth into height)
- \egroup
-%%%   {\showoutput\showbox\@tempboxa}%
- \begingroup 
-   \fb@put@frame\FrameCommand\FirstFrameCommand
- \endgroup
- \@minipagefalse % In case it was set and not cleared
-}
-
-% \fb@put@frame takes the contents of \@tempboxa and puts all, or a piece,
-% of it on the page with a frame (\FrameCommand, \FirstFrameCommand,
-% \MidFrameCommand, or \LastFrameCommand).  It recurses until all of 
-% \@tempboxa has been used up. (\@tempboxa must have zero depth.)
-% #1 = attempted framing command, if no split
-% #2 = framing command if split
-% First iteration: Try to fit with \FrameCommand. If it does not fit,
-% split for \FirstFrameCommand.
-% Later iteration: Try to fit with \LastFrameCommand. If it does not
-% fit, split for \MidFrameCommand.
-\def\fb@put@frame#1#2{\relax
- \ifdim\pagegoal=\maxdimen \pagegoal\vsize \fi
-|   \message{=============== Entering putframe ====================^^J
-|     \pagegoal=\the\pagegoal,  \pagetotal=\the\pagetotal. }%
- \ifinner
-   \fb@putboxa#1%
-   \fb@afterframe
- \else
-  \dimen@\pagegoal \advance\dimen@-\pagetotal % natural space left on page
-  \ifdim\dimen@<2\baselineskip % Too little room on page
-|   \message{Page has only \the\dimen@\space room left; eject. }%
-    \eject \fb@adjheight \fb@put@frame#1#2%
-  \else % there's appreciable room left on the page
-     \fb@sizeofframe#1%
-|    \message{\string\pagetotal=\the\pagetotal,
-|        \string\pagegoal=\the\pagegoal,
-|        \string\pagestretch=\the\pagestretch,
-|        \string\pageshrink=\the\pageshrink,
-|        \string\fb@frh=\the\fb@frh. \space}
-|    \message{^^JBox of size \the\ht\@tempboxa\space}%
-     \begingroup % temporarily set \dimen@ to be...
-     \advance\dimen@.8\pageshrink  % maximum space available on page
-     \advance\dimen@-\fb@frh\relax % max space available for frame's contents
-%%% LOOKS SUBTRACTED AND ADDED, SO DOUBLE ACCOUNTING!
-     \expandafter\endgroup
-     % expand \ifdim, then restore \dimen@ to real room left on page
-     \ifdim\dimen@>\ht\@tempboxa % whole box does fit
-|       \message{fits in \the\dimen@. }%
-        % ToDo: Change this to use vsplit anyway to capture the marks
-        % MERGE THIS WITH THE else CLAUSE!!!
-        \fb@putboxa#1%
-        \fb@afterframe
-     \else % box must be split
-|       \message{must be split to fit in \the\dimen@. }%
-        % update frame measurement to use \FirstFrameCommand or \MidFrameCommand
-        \fb@sizeofframe#2%
-        \setbox\@tempboxa\vbox{% simulate frame and flexiblity of the page:
-           \vskip \fb@frh \@plus\pagestretch \@minus.8\pageshrink
-           \kern137sp\kern-137sp\penalty-30
-           \unvbox\@tempboxa}%
-        \edef\fb@resto@set{\boxmaxdepth\the\boxmaxdepth 
-                           \splittopskip\the\splittopskip}%
-        \boxmaxdepth\z@ \splittopskip\z@
-|       \message{^^JPadded box of size \the\ht\@tempboxa\space split to \the\dimen@}%
-        % Split box here
-        \setbox\tw@\vsplit\@tempboxa to\dimen@
-|       \toks99\expandafter{\splitfirstmark}%
-|       \toks98\expandafter{\splitbotmark}%
-|       \message{Marks are: \the\toks99, \the\toks98. }%
-        \setbox\tw@\vbox{\unvbox\tw@}% natural-sized
-|       \message{Natural height of split box is \the\ht\tw@, leaving 
-|          \the\ht\@tempboxa\space remainder. }%
-        % If the split-to size > (\vsize-\topskip), then set box to full size.
-        \begingroup
-        \advance\dimen@\topskip
-        \expandafter\endgroup
-        \ifdim\dimen@>\pagegoal
-|         \message{Frame is big -- Use up the full column. }%
-          \dimen@ii\pagegoal
-          \advance\dimen@ii -\topskip
-          \advance\dimen@ii \FrameHeightAdjust\relax
-        \else  % suspect this is implemented incorrectly:
-          % If the split-to size > feasible room_on_page, rebox it smaller.
-          \advance\dimen@.8\pageshrink
-          \ifdim\ht\tw@>\dimen@
-|           \message{Box too tall; rebox it to \the\dimen@. }%
-            \dimen@ii\dimen@
-          \else % use natural size
-            \dimen@ii\ht\tw@
-          \fi
-        \fi
-        % Re-box contents to desired size \dimen@ii
-        \advance\dimen@ii -\fb@frh
-        \setbox\tw@\vbox to\dimen@ii \bgroup
-        % remove simulated frame and page flexibility:
-        \vskip -\fb@frh \@plus-\pagestretch \@minus-.8\pageshrink
-        \unvbox\tw@ \unpenalty\unpenalty
-        \ifdim\lastkern=-137sp % whole box went to next page
-|          \message{box split at beginning! }%
-           % need work here???
-           \egroup \fb@resto@set \eject % (\vskip for frame size was discarded) 
-           \fb@adjheight
-           \fb@put@frame#1#2% INSERTED ???
-        \else % Got material split off at the head
-           \egroup \fb@resto@set
-           \ifvoid\@tempboxa % it all fit after all
-|             \message{box split at end! }%
-              \setbox\@tempboxa\box\tw@
-              \fb@putboxa#1%
-              \fb@afterframe
-           \else % it really did split
-|             \message{box split as expected. Its reboxed height is \the\ht\tw@. }%
-              \ifdim\wd\tw@>\z@
-                \wd\tw@\wd\@tempboxa
-                \centerline{#2{\box\tw@}}%  ??? \centerline bad idea
-              \else
-|               \message{Zero width means likely blank. Don't frame it (guess)}%
-                \box\tw@
-              \fi
-              \hrule \@height\z@ \@width\hsize
-              \eject
-              \fb@adjheight
-              \fb@put@frame\LastFrameCommand\MidFrameCommand
-  \fi\fi\fi\fi\fi
-}
-
-\def\fb@putboxa#1{%
-  \ifvoid\@tempboxa
-    \PackageWarning{framed}{Boxa is void -- discard it. }%
-  \else
-|   \message{Frame and place boxa. }%
-|   %{\showoutput\showbox\@tempboxa}%
-    \centerline{#1{\box\@tempboxa}}%
-  \fi
-}
-
-\def\fb@afterframe{%
-    \nointerlineskip \null %{\showoutput \showlists}
-    \penalty-30 \vskip\OuterFrameSep \relax
-}
-
-% measure width and height added by frame (#1 = frame command)
-% call results \fb@frw and \fb@frh
-% todo: a mechanism to handle wide frame titles 
-\newdimen\fb@frw
-\newdimen\fb@frh
-\def\fb@sizeofframe#1{\begingroup
- \setbox\z@\vbox{\vskip-5in \hbox{\hskip-5in 
-   #1{\hbox{\vrule \@height 4.7in \@depth.3in \@width 5in}}}%
-   \vskip\z@skip}%
-|  \message{Measuring frame addition for \string#1 in \@currenvir\space 
-|    gives ht \the\ht\z@\space and wd \the\wd\z@. }%
-|  %{\showoutput\showbox\z@}%
- \global\fb@frw\wd\z@ \global\fb@frh\ht\z@
- \endgroup
-}
-
-\def\fb@adjheight{%
-  \vbox to\FrameHeightAdjust{}% get proper baseline skip from above.
-  \penalty\@M \nointerlineskip
-  \vskip-\FrameHeightAdjust
-  \penalty\@M} % useful for tops of pages
-
-\edef\zero@glue{\the\z@skip}
-
-\catcode`\|=\FrameRestore
-
-% Provide configuration commands:
-\providecommand\FrameCommand{%
- \setlength\fboxrule{\FrameRule}\setlength\fboxsep{\FrameSep}%
- \fbox}
-\@ifundefined{FrameRule}{\newdimen\FrameRule \FrameRule=\fboxrule}{}
-\@ifundefined{FrameSep} {\newdimen\FrameSep  \FrameSep =3\fboxsep}{}
-\providecommand\FirstFrameCommand{\FrameCommand}
-\providecommand\MidFrameCommand{\FrameCommand}
-\providecommand\LastFrameCommand{\FrameCommand}
-
-% Height of frame above first baseline when frame starts a page:
-\providecommand\FrameHeightAdjust{6pt}
-
-% \FrameRestore has parts of \@parboxrestore, performing a similar but 
-% less complete restoration of the default layout.  See how it is used in
-% the "settings" argument of \MakeFrame.  Though not a parameter, \hsize 
-% should be set to the desired total line width available inside the
-% frame before invoking \FrameRestore.  
-\def\FrameRestore{%
-   \let\if@nobreak\iffalse
-   \let\if@noskipsec\iffalse
-   \let\-\@dischyph
-   \let\'\@acci\let\`\@accii\let\=\@acciii
-   %  \message{FrameRestore:
-   %    \@totalleftmargin=\the \@totalleftmargin,
-   %    \rightmargin=\the\rightmargin, 
-   %    \@listdepth=\the\@listdepth.  }%
-   % Test if we are in a list (or list-like paragraph)
-   \ifnum \ifdim\@totalleftmargin>\z@ 1\fi  
-          \ifdim\rightmargin>\z@ 1\fi
-          \ifnum\@listdepth>\z@ 1\fi 0>\z@
-     %     \message{In a list: \linewidth=\the\linewidth, \@totalleftmargin=\the\@totalleftmargin,
-     %       \parshape=\the\parshape, \columnwidth=\the\columnwidth, \hsize=\the\hsize, 
-     %       \labelwidth=\the\labelwidth. }%
-     \@setminipage % snug fit around the item.  I would like this to be non-global.
-     % Now try to propageate changes of width from \hsize to list parameters.
-     % This is deficient, but a more advanced way to indicate modification to text 
-     % dimensions is not (yet) provided; in particular, no separate left/right
-     % adjustment.
-     \advance\linewidth-\columnwidth \advance\linewidth\hsize
-     \parshape\@ne \@totalleftmargin \linewidth
-   \else % Not in list
-     \linewidth=\hsize
-     %\message{No list, set \string\linewidth=\the\hsize. }%
-   \fi
-   \sloppy
-}
-
-%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
--- a/R-package/vignettes/vignette.css
+++ b/R-package/vignettes/vignette.css
@@ -116,7 +116,7 @@ li ul, li ul {
 }

 pre {
-    padding: 0px 24px;
+    padding: 0px 10px;
    max-width: 800px;
    white-space: pre-wrap;
 }
@@ -145,18 +145,18 @@ aside {
 blockquote {
    font-size:14px;
    border-left:.5em solid #606AAA;
-    background: #f5f5f5;
-    color:#bfbfbf;
-    padding: 5px;
+    background: #F8F8F8;
+    padding: 5px ;
    margin-left:25px;
    max-width: 500px;
 }

 blockquote  cite {
    font-size:14px;
-    line-height:20px;
+    line-height:10px;
    color:#bfbfbf;
 }
+
 blockquote cite:before {
    content: '\2014 \00A0';
 }
--- a/R-package/vignettes/xgboostPresentation.Rmd
+++ b/R-package/vignettes/xgboostPresentation.Rmd
@@ -243,7 +243,7 @@ Feature importance

 Finally, you can check which features are the most important.

-```{r featureImportance, message=T, warning=F}
+```{r featureImportance, message=T, warning=F, fig.width=8, fig.height=5, fig.align='center'}
 importance_matrix <- xgb.importance(feature_names = train$data@Dimnames[[2]], model = bst)
 print(importance_matrix)
 xgb.plot.importance(importance_matrix)