[doc] improve xgboost4j-spark-gpu doc [skip ci] (#7793)

Co-authored-by: Sameer Raheja <sameerz@users.noreply.github.com>
Bobby Wang
2022-04-12 12:02:16 +08:00
committed by GitHub
parent 118192f116
commit 4b00c64d96
2 changed files with 39 additions and 34 deletions

@@ -127,6 +127,11 @@ Now, we have a DataFrame containing only two columns, "features" which contains
"sepal length", "sepal width", "petal length" and "petal width" and "classIndex" which has Double-typed
labels. A DataFrame like this (containing vector-represented features and numeric labels) can be fed to XGBoost4J-Spark's training engine directly.

.. note::

  From version 1.6.0 onward, there is no need to assemble feature columns. Instead, users can pass an array of
  feature column names to ``setFeaturesCol(value: Array[String])`` and XGBoost4J-Spark will assemble them.

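
The note above could look roughly like the following in practice. This is a minimal sketch, not the tutorial's own code: it assumes XGBoost4J-Spark 1.6.0+ is on the classpath, that the raw Iris columns are numeric, and that the label column is named "classIndex" as in the text. The object name ``FeatureColumnsSketch`` and the parameter values are illustrative assumptions.

.. code-block:: scala

  import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier

  // Hypothetical wrapper object, used only for illustration.
  object FeatureColumnsSketch {
    // Raw feature column names from the Iris example; no VectorAssembler step.
    val featureNames: Array[String] =
      Array("sepal length", "sepal width", "petal length", "petal width")

    // Training parameters are assumed values, not recommendations.
    val params: Map[String, Any] = Map(
      "objective" -> "multi:softprob",
      "num_class" -> 3,
      "num_round" -> 100,
      "num_workers" -> 2)

    // Pass the array of column names instead of a single assembled vector column.
    def buildClassifier(): XGBoostClassifier =
      new XGBoostClassifier(params)
        .setFeaturesCol(featureNames)
        .setLabelCol("classIndex")
  }

Calling ``fit`` on the resulting classifier with a DataFrame that still holds the four raw columns and "classIndex" would then take the place of the assembled "features" column described above. Whether this path is available may depend on which XGBoost4J-Spark package is in use.
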
Dealing with missing values
~~~~~~~~~~~~~~~~~~~~~~~~~~~