Multicollinearity in Random Forest
@asha_vish wrote: Is Random Forest affected by multicollinearity, and do we need to remove the multicollinear variables? I fed all variables into RF and retained the ones which had a higher variable...
What are limitations of tree diagram?
@sid100158 wrote: I am currently studying tree diagrams, a method for calculating the probabilities of combined events. Here is the tree diagram for the...
Getting NA values while converting a variable into Date and Time format
@harry wrote: I am currently solving the time series question of the Analytics Vidhya hackathon, in which there are two variables: one is Datetime and the other is count. I am currently trying to convert the...
Seasonal Parameter in ARIMA and ADF test
@shan4224 wrote: Hi. While performing ARIMA in R on the AirPassengers data, we can write: fit <- arima(log(AirPassengers), c(0, 1, 1), seasonal = list(order = c(0, 1, 1), period = 12)) We can get...
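As a rough illustration of the two middle 1s in that specification: with order (0, 1, 1) and seasonal order (0, 1, 1) at period 12, the model works on a series that has been differenced once at lag 1 and once at lag 12. The Python sketch below covers only that differencing step, not the moving-average fitting that R's arima also performs. The helper `difference` and the toy series are hypothetical, made up for this example.

```python
def difference(series, lag=1):
    """Return series[t] - series[t - lag] for every t >= lag."""
    return [series[t] - series[t - lag] for t in range(lag, len(series))]

# Hypothetical monthly data: a linear trend plus a fixed 12-month pattern.
seasonal_pattern = [5, 3, 8, 6, 9, 12, 15, 14, 10, 7, 4, 6]
series = [2 * t + seasonal_pattern[t % 12] for t in range(48)]

# Seasonal difference (D = 1, period = 12) removes the repeating pattern.
seasonally_differenced = difference(series, lag=12)

# Ordinary difference (d = 1) removes what is left of the trend.
fully_differenced = difference(seasonally_differenced, lag=1)

print(seasonally_differenced[:3])
print(fully_differenced[:3])
```

On this artificial series the seasonal difference leaves a constant and the ordinary difference then leaves zeros; real data such as AirPassengers would leave a noisy remainder for the MA terms to model.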
Modelling a Stationary series with no dependence among values
@shan4224 wrote: Hello. While forecasting time series, there can be two possibilities: a) A strictly stationary series with no dependence among the values. Here we can model the residuals as white...
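One way to see what "no dependence among the values" looks like in practice is to check that the sample autocorrelations at non-zero lags stay close to zero. The sketch below is a minimal Python illustration on simulated Gaussian noise. The `autocorrelation` helper, the seed and the series length are all assumptions chosen for the example.

```python
import random

def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at the given lag."""
    n = len(series)
    mean = sum(series) / n
    variance = sum((x - mean) ** 2 for x in series)
    covariance = sum(
        (series[t] - mean) * (series[t - lag] - mean) for t in range(lag, n)
    )
    return covariance / variance

rng = random.Random(0)  # fixed seed so the run is repeatable
white_noise = [rng.gauss(0.0, 1.0) for _ in range(2000)]

# Lag 0 is always 1; the other lags should be small for independent values.
acf = [autocorrelation(white_noise, lag) for lag in range(6)]
band = 2 / len(white_noise) ** 0.5  # rough 95% band for white noise

print([round(r, 3) for r in acf])
print("approximate band: +/-", round(band, 3))
```

Values well outside the band at several lags would suggest that the series still has dependence left to model.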
Learning Analytics from Private Institute
@harmeet1991.singh wrote: Hi Team, I currently work in a pre-sales department with 2+ years of experience, and I now want to switch to an analytics profile. For that, I am considering learning VBA and...
Slack URL does not work in my PC!
@rahul29 wrote: Hi, I'm unable to access the Slack page; not sure about others. Here is the error message I'm getting while trying to open it: Your connection is not private. Attackers might be trying to...
Data set for mini data hack
@nidhi_Singh wrote: Can anyone please share the data set for the mini data hack at k.nidhi101@gmail.com?
Need dataset of Mini DataHack
@Raghuvaran_Raghu wrote: I couldn't participate in the Mini DataHack, but I would like to solve it. I couldn't download the dataset as the contest is not active. Can anyone please help me download the...
Skills needed for an internship in Analytics field
@Sandeep_Singh wrote: Hi everyone. I am an Economics student doing my Master's in Economics. What skills/tools must I know to land an internship at some of the leading analytics firms in...
Label Encoding vs One Hot Encoding in Machine Learning Model
@supra_minion wrote: Hello, I am working on a data set comprising multiple variables, including 10 categorical (2-level) variables and 5 categorical (3-level) variables. I read about dealing with them for...
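To make the two encodings in the topic title concrete, here is a minimal pure-Python sketch on a made-up 3-level variable. The function names and the `colours` data are hypothetical and are not taken from the post; libraries such as scikit-learn provide their own encoders.

```python
def label_encode(values):
    """Map each distinct category to an integer code (sorted order)."""
    codes = {category: i for i, category in enumerate(sorted(set(values)))}
    return [codes[v] for v in values], codes

def one_hot_encode(values):
    """Map each value to a 0/1 vector with one position per category."""
    categories = sorted(set(values))
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return rows, categories

colours = ["red", "green", "blue", "green"]  # hypothetical 3-level variable

labels, codes = label_encode(colours)
one_hot, categories = one_hot_encode(colours)

print(codes)       # integer code assigned to each category
print(labels)      # one integer per observation
print(categories)  # column order of the one-hot vectors
print(one_hot)     # one 0/1 vector per observation
```

Label encoding gives the categories an arbitrary numeric order, which some models may treat as meaningful. For a 2-level variable the single 0/1 label column carries the same information as the one-hot pair, so the choice matters mainly for variables with 3 or more levels.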
Xgb.importance() unable to plot sparse.model.matrix()!
@sunnysingha wrote: Hi, after successfully building an XGBoost model on sparse.model.matrix(~.-1, data=train_set), I get the error below on the R console while attempting to plot the feature importance matrix:...
ARIMA ACF residuals error in R
@nehak wrote: I am trying to run ARIMA in R. I have done auto ARIMA and tried other fits as well. The issue is that when I run the ACF of the residuals, it shows significant spikes. I did log...
Data cleaning before using arima
@nehak wrote: I want to understand whether data cleaning, as in removing outliers, is done before applying ARIMA. Are univariate and bivariate analysis done before applying ARIMA?
What does stemmer do in python nltk library
@pagal_guy wrote: Hello, from nltk.stem.porter import * stemmer = PorterStemmer() While participating in a Kaggle competition, I came across the above library, used as shown below in one...
What are some good resources for text analytics in R
@pagal_guy wrote: Hello, I am looking to learn text analytics with R, such as web scraping and then combining the results in a data frame for further analysis in R. But I am unable to find any good or...
Sharing the approach for MiniHack-Time Series Problem!
@rahul29 wrote: Hi all the top performers, I request you all to share the approach/strategy you used during the competition, or even afterwards, if you gained some insights from it. It would be a great...
OneHotEncoding in Python
@shan4224 wrote: Hi. I am new to Python. Is OneHotEncoding similar to the creation of dummy variables? I tried the following code in Python. OneHotEncoding: from sklearn.preprocessing import...
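On the question of whether the two are similar: both produce 0/1 indicator columns. A common difference in usage is that one-hot encoding usually keeps one column per category, while dummy coding in regression settings often drops one reference level to avoid redundant columns. The pure-Python sketch below illustrates that difference. The `indicator_columns` helper and the `sizes` data are hypothetical, and this is not the code from the post.

```python
def indicator_columns(values, drop_first=False):
    """Build 0/1 indicator rows; optionally drop the first (reference) level."""
    categories = sorted(set(values))
    kept = categories[1:] if drop_first else categories
    rows = [[1 if v == c else 0 for c in kept] for v in values]
    return kept, rows

sizes = ["small", "large", "medium", "small"]  # hypothetical 3-level variable

one_hot_names, one_hot_rows = indicator_columns(sizes)
dummy_names, dummy_rows = indicator_columns(sizes, drop_first=True)

print(one_hot_names, one_hot_rows)  # k columns, one per category
print(dummy_names, dummy_rows)      # k - 1 columns, reference level dropped
```

In the dropped-level version, a row of all zeros stands for the reference category. In real projects the same choice is exposed by library options, for example the `drop_first` argument of pandas `get_dummies` and the `drop` argument of scikit-learn's `OneHotEncoder`.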
Splitting between train/test for customer churn survival models
@girish wrote: I am a bit confused about how data can be split between train/test and "live" data for predicting churn using survival models (the package I am playing with is RandomForestSRC). Goal of...
High survival probabilities for un-censored test observations when using...
@girish wrote: I am using survival analysis to predict the probability of customer churn. I am using the RandomForestSRC package. My test data has some rows that are censored and some that aren't. When I...