Necessary mathematical skills for Machine Learning and Data analysi
@NSS wrote: I would like to know what are the necessary mathemical skills that a person must have to enter into machine learning and data science. Please answer keeping in mind that I would want to...
View ArticleTransformation for continuous predictors
@NSS wrote: Hi all,I aim to apply regression methods for now. So i have a highly skewed predictor( be it left skewed or right skewed). So regression method will be best suited for which case ? 1-...
View ArticleDetermining significance of predictors
@NSS wrote: Hello all,I would like to know the importance and relative importance of predictors(say 100 in number) without having to build a model like random forest. Is it possible ? If yes, how ?...
View ArticleProblem in aggregate function
@vijaypalmanit wrote: can some one explain me why these two works aggregate(subset(iris,select=-Species),subset(iris,select=Species),mean) aggregate(iris[-5],iris[5],mean) but not this...
View ArticleOutlier Treatment
@shashwat.2014 wrote: While applying outlier treatment methods in different situations, I realised that I had no measure to compare the different methods? Can anyone suggest a means to compare the...
View ArticleThe seer's accuracy
@dsav wrote: Friends, I am new here and want to look at the data (train.csv and test.csv) for the seer's accuracy competition. Can anyone having it please send it to me? dsav2233@gmail.com Thanks in...
View ArticleMini Datahack 28 May solution
@paramjeet_sharma wrote: Hey did anyone got the solution for the timeseries datahack which which happened on 28th May? Posts: 1 Participants: 1 Read full topic
View ArticleIoT Analytics - Time series
@innamurisrikanth wrote: Hi All, We will receive data of Car parking, lighting and environment for every minute from sensors. We want to fit the best curve for the real-time data. What are the best...
View ArticleUnable to impute missing values through MICE for categorical variables
@shashwat.2014 wrote: I was using MICE package to impute missing values in categorical variables....
View ArticleMIssing at random and missing not at random
@syed.danish wrote: Hi,While handling missing values, I came across two types of missing values : missing at random and missing not at random. Can somebody please explain these two types of missing...
View ArticleCreating dummy variables in python
@syed.danish wrote: Hi, Currently I am working on megastar contest on AV. Below is the structure of data set : Screenshot from 2016-06-02 10-06-49.png1366x768 107 KB I want to create new variables for...
View ArticleBetter way to replace expressions in Regex
@syed.danish wrote: Hi,I am using following technique to replace expressions in regex : def parse (x): x=re.sub('\?',',',str(x)) x=re.sub('&',',',str(x)) x=re.sub('\+',',',str(x))...
View ArticleMeaning and usage of Residual Deviance in Analysis of Deviance Table after...
@shashwat.2014 wrote: Hi,I was executing logistic regression in the Loan Prediction problem. After executing logistic regression, I came across an information that the a larger difference between null...
View ArticleAre there plans to introduce IPython for use in discussion forums
@anantguptadbl wrote: IPython will be very useful for sharing code snippets quickly. Not sure whether AnalyticsVidhya is thinking on the same lines Posts: 1 Participants: 1 Read full topic
View ArticleMulti-class Multi-label text classification
@Niranjanp wrote: Hello, I have been working on text classification problem which has three outcome variables and they are multi-class variables. The dataset description as follows. Dataset is about...
View ArticlePython, R vs 19 Data Science Tools for people who aren’t so good at Programming
@ancientkittens wrote: Hi, After reading through the article "19 Data Science Tools for people who aren’t so good at Programming", it got me thinking about what data scientist prefer to use - learning...
View ArticleAny reference for end to end process for forecasting/timeseries modelling?
@shivanihmcl wrote: Any reference/book/cases for end to end process for forecasting/timeseries modelling? Posts: 2 Participants: 2 Read full topic
View ArticleVideos from PyCon 2016 now available
@kunal wrote: Thought, I'll share the link to videos as they get released. We would plan to release a summary on the blog later in any case. But, if you can't wait for the summary and want to see the...
View ArticleMismatch in levels of categorical variable in train and test data
@syed.danish wrote: Hi,I have a data set divided into two parts train and test , I want to know how one should handle extra levels present in test or /and train data. Three cases are possible : Case 1...
View ArticleUnable to remove errors while installing spark R
@shashwat.2014 wrote: Hi everyone, I was installing spark R on windows 10, and during the process, I am not able to remove one error even after constant struggle and searches. I followed the spark R...
View Article