Channel: Data Science, Analytics and Big Data discussions - Latest topics
Viewing all articles
Browse latest Browse all 4448

Noob needs help with data preprocessing, feature selection and feature engineering


@kakashi wrote:

I have completed all the courses on DataCamp. I can use NumPy, pandas, scikit-learn, matplotlib, and Bokeh, i.e. I know the syntax and what each function does. But I’m still clueless about when to use what: for example, when to scale or normalize, when to impute the mean or median for missing values (and how that will affect the algorithm), or which algorithm to prefer when there are multiple choices. In short, how do I develop intuition? Whenever I download a data set, I’m clueless as to how I should prepare the data for training. Also, how much should I know about the algorithms? I only have a high-level understanding of most of them; do I need to understand them at a mathematical level? I’m in my final year of engineering, so I can spare time for learning. By next June I want to be more than comfortable with data analytics in Python and have basic knowledge of Hadoop.
I’m in desperate need of guidance. Any help will be appreciated. Thank you.

Posts: 2

Participants: 2


