GBDT binary classification prediction V2

更新时间:
复制 MD 格式

The GBDT Binary Classification Prediction V2 component applies a trained gradient boosting decision tree (GBDT) model to new data and returns binary classification results. Use it downstream of the GBDT binary classification V2 training component, then connect the output to Binary classification evaluation to measure model performance.

Supported computing resources

This component runs on MaxCompute and Flink.

How it works

A GBDT model is an ensemble of decision trees, each acting as a weak learner. Trees are built sequentially: each tree optimizes the objective function on the basis of the previous decision tree, using a CART structure and a configurable step size. The final model combines all trees to produce predictions.

The figure below shows the basic recursive structure of gradient boosting.

image

In most cases, image is a CART decision tree, image are the parameters of the decision tree, and image is the step size.

Configure the component in the PAI console

Input ports

Both input ports are required before the pipeline can run.

Input portRecommended upstream componentRequired
InputGBDT binary classification V2Yes
Predicted Data TableRead TableYes

Parameters

Fields Information tab

ParameterRequiredDescriptionDefault value
Prediction result column nameYesName of the prediction result column.prediction_result
predictionDetailColNoName of the prediction details column.prediction_detail
Reserved ColumnsNoColumns from the input table to carry through to the output. All columns are carried through by default.N/A

Tuning tab

ParameterRequiredDescriptionDefault value
Number of InstancesNoNumber of compute instances for the job. Increase this value when the job runs slowly on large datasets.Auto-calculated based on input data
Memory Per InstanceNoMemory per instance, in MB. Valid values: 100–65536. Increase this value when the job fails with out-of-memory errors.Auto-calculated based on input data

Output ports

PortRecommended downstream component
OutputBinary classification evaluation

Related topics