Channel: Data Science, Analytics and Big Data discussions - Latest topics

Need Help Improving Accuracy of Model

@kakashi wrote:

These are the steps I followed:

  1. Loaded the dataset and separated the features from the target variable.
  2. Separated the numeric, categorical, and ordinal variables.
  3. Imputed numeric variables with the median (using groupby) and categorical/ordinal variables with the most frequent value.
  4. Encoded categorical variables with one-hot encoding (pd.get_dummies) and ordinal variables with a label encoder.
  5. Used GridSearchCV to tune the hyperparameters of LogisticRegression, RandomForest, SVM, KNN, and XGBoost. The highest accuracy was 0.784 with XGBoost; LogisticRegression, SVM, and RandomForest each gave 0.77.

I'm new to data science; I have completed DataCamp courses and read Analytics Vidhya blog posts. I have spent a considerable amount of time on this problem, but the accuracy is not increasing. I tried standardizing and scaling the features for the algorithms that need it, and I tried using only a subset of the most significant features. Any help will be appreciated.
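For reference, the steps above could be sketched roughly as follows. This is a minimal illustration, not the actual notebook: the data is synthetic, every column name (`age`, `city`, `grade`, `target`) is hypothetical, only LogisticRegression is tuned, and the grid of `C` values is an arbitrary choice. It also swaps in scikit-learn's `OrdinalEncoder` with an explicit category order where the steps mention a label encoder, so that the ordinal ranking is preserved; that substitution is an assumption.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.preprocessing import OrdinalEncoder

# Synthetic stand-in data; all column names are hypothetical.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "age": rng.normal(40, 10, n),                    # numeric
    "city": rng.choice(["A", "B", "C"], n),          # categorical
    "grade": rng.choice(["low", "mid", "high"], n),  # ordinal
    "target": rng.integers(0, 2, n),
})
# Knock out some values so there is something to impute.
df.loc[rng.choice(n, 20, replace=False), "age"] = np.nan
df.loc[rng.choice(n, 10, replace=False), "city"] = np.nan

# Step 1: separate features and target.
X = df.drop(columns="target")
y = df["target"]

# Step 3: per-group median for the numeric column (grouping by "grade" is
# an assumption), with the overall median as a fallback; most frequent
# value for the categorical column.
group_median = X.groupby("grade")["age"].transform("median")
X["age"] = X["age"].fillna(group_median).fillna(X["age"].median())
X["city"] = X["city"].fillna(X["city"].mode()[0])

# Step 4: one-hot encode the categorical column, ordinal-encode the ordered one.
X = pd.get_dummies(X, columns=["city"], dtype=float)
encoder = OrdinalEncoder(categories=[["low", "mid", "high"]])
X["grade"] = encoder.fit_transform(X[["grade"]]).ravel()

# Step 5: tune one model with 5-fold cross-validated accuracy.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.1, 1, 10]},
    scoring="accuracy",
    cv=5,
)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```

Because the target here is random noise, the printed score says nothing about the real problem; the sketch only shows the shape of the workflow. One thing it makes visible is that the imputation runs on the full dataset before cross-validation, so the imputed values are computed with knowledge of the held-out folds.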

Posts: 2

Participants: 2
