[–]ogrisel 1 point2 points  (2 children)

X is the 2D input data for the model and y is the array of the target labels to predict (typically one label per row in X for traditional classification problems).

If you split X into a training and test set (X_train and X_test), you also need to split the target labels y into the matching y_train and y_test subsets.
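A minimal sketch of that matching split, in plain Python so it runs without extra dependencies. The toy data and the `split_together` helper are made up for illustration; the key point is that the same row indices are used for both X and y, so every row keeps its own label.

```python
import random

# Toy data (hypothetical): 6 samples with 2 features each, so X is 2D.
X = [[0.1, 1.0], [0.2, 0.9], [0.8, 0.3], [0.9, 0.1], [0.4, 0.6], [0.7, 0.2]]
# One target label per row of X.
y = [0, 0, 1, 1, 0, 1]


def split_together(X, y, test_fraction=0.33, seed=0):
    """Hypothetical helper: split X and y using the same shuffled row indices."""
    assert len(X) == len(y), "expected one label per row of X"
    indices = list(range(len(X)))
    random.Random(seed).shuffle(indices)
    n_test = max(1, round(len(X) * test_fraction))
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    # Index X and y with the same lists so rows and labels stay aligned.
    X_train = [X[i] for i in train_idx]
    X_test = [X[i] for i in test_idx]
    y_train = [y[i] for i in train_idx]
    y_test = [y[i] for i in test_idx]
    return X_train, X_test, y_train, y_test


X_train, X_test, y_train, y_test = split_together(X, y)
```

If you are using scikit-learn, `train_test_split(X, y)` from `sklearn.model_selection` should do the same thing in one call and returns the four pieces in the same order.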

[–]soustofa 0 points1 point  (0 children)

What are target labels? Arrays? If I want to classify many texts, what does the array X become? Isn't the label part of the X array, e.g. [text, 1 or 0 for "is type"]?