Automate Machine Learning Workflows with Pipelines

FelixLarry · Sep-06-2022, 09:37 PM

# Create a pipeline that standardizes (prepares) the data then evaluates a model
import pandas as pd
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import KFold, cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
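# load the data; column names are passed explicitly, so the file is read as having no header row
filename = 'pima-indians-diabetes.data.csv'
names = ['preg', 'plas', 'pres', 'skin', 'test', 'mass', 'pedi', 'age', 'class']
dataframe = pd.read_csv(filename, names=names)
# split into the 8 input features (X) and the class label (y)
array = dataframe.values
X = array[:, 0:8]
y = array[:, 8]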
# create pipeline: each step is a (name, estimator) pair, applied in order
model = Pipeline([
    ('standardize', StandardScaler()),
    ('lda', LinearDiscriminantAnalysis()),
])
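# evaluate the pipeline with 10-fold cross-validation;
# the scaler is refit on the training part of every fold
seed = 7
kfold = KFold(n_splits=10, shuffle=True, random_state=seed)
results = cross_val_score(model, X, y, cv=kfold)
# report the mean accuracy over the 10 folds
print(results.mean())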
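
The main reason to wrap the scaler and the model in one Pipeline is that cross_val_score then refits every step on the training rows of each fold, so the held-out rows never influence the scaling statistics. Below is a rough, standard-library-only sketch of that per-fold procedure. It is not scikit-learn's implementation: the helper names (fit_scaler, apply_scaler, fit_centroids, predict, cross_validate) are made up for illustration, a toy nearest-centroid classifier stands in for LinearDiscriminantAnalysis, and the data is synthetic rather than the Pima dataset.

```python
import random
from statistics import mean, pstdev


def fit_scaler(rows):
    """Return per-column (mean, std) computed from the given rows only."""
    columns = list(zip(*rows))
    # Population std, like StandardScaler; fall back to 1.0 for constant columns.
    return [(mean(col), pstdev(col) or 1.0) for col in columns]


def apply_scaler(rows, stats):
    """Standardize rows using previously fitted statistics."""
    return [[(v - m) / s for v, (m, s) in zip(row, stats)] for row in rows]


def fit_centroids(rows, labels):
    """Toy classifier (hypothetical stand-in for LDA): one mean vector per class."""
    centroids = {}
    for label in set(labels):
        members = [r for r, l in zip(rows, labels) if l == label]
        centroids[label] = [mean(col) for col in zip(*members)]
    return centroids


def predict(centroids, row):
    """Pick the class whose centroid is nearest in squared Euclidean distance."""
    def dist(center):
        return sum((a - b) ** 2 for a, b in zip(row, center))
    return min(centroids, key=lambda label: dist(centroids[label]))


def cross_validate(X, y, n_splits=5, seed=7):
    """Shuffle the row indices, split them into folds, and refit every step per fold."""
    indices = list(range(len(X)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::n_splits] for i in range(n_splits)]
    scores = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in indices if i not in held_out]
        X_train = [X[i] for i in train_idx]
        y_train = [y[i] for i in train_idx]
        # Fit the scaler on the training rows only, then reuse it on the test rows.
        stats = fit_scaler(X_train)
        centroids = fit_centroids(apply_scaler(X_train, stats), y_train)
        X_test = apply_scaler([X[i] for i in test_idx], stats)
        hits = sum(predict(centroids, row) == y[i] for row, i in zip(X_test, test_idx))
        scores.append(hits / len(test_idx))
    return scores


# Synthetic two-class data, invented for this sketch (not the Pima dataset).
rng = random.Random(0)
X = [[rng.gauss(0, 1), rng.gauss(50, 10)] for _ in range(50)]
X += [[rng.gauss(3, 1), rng.gauss(80, 10)] for _ in range(50)]
y = [0] * 50 + [1] * 50

scores = cross_validate(X, y)
print(sum(scores) / len(scores))
```

If the scaler were instead fitted once on all rows before splitting, the test rows of each fold would leak into the mean and standard deviation used for training, which tends to make the cross-validation estimate look better than it should.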

