sklearn
pipeline
data
X_train, X_test, Y_train, Y_test, indices_train, indices_test = train_test_split(X, Y, indices, test_size=0.33)マルチラベルをバイナリエンコード
import pandas as pd
df = pd.DataFrame(
{"col1": [["aaa","bbb","ccc"], ["aaa"],["ccc","ddd"]]}
)
df

StratifiedKFold
Last updated