Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Viewing all articles
Browse latest Browse all 4448

Why 0 or 1 for labeling categorical data?

$
0
0

@shashankjakka wrote:

I was trying KMeans for clustering problem.I have a Gender feature and it also has nan’s.
So I forgot to take care of the nan values and used label encoder and got values like 1029 for male 1028 for female and with this data I got a silhouette_score of about 0.95!
Later when I realised this and corrected it to 1 and 0 the silhouette_score dropped to 0.70!

Both the times I used the RobustScaler to scale the data.

What could have happend here?

Posts: 1

Participants: 1

Read full topic


Viewing all articles
Browse latest Browse all 4448

Trending Articles