Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles

Multicollinearity in Random Forest

@asha_vish wrote: Is Random Forest affected by multicollinearity, and do we need to remove the multicollinear variables? I fed all variables into RF and retained the ones which had a higher variable...

What are limitations of tree diagram?

@sid100158 wrote: I am currently studying the tree diagram, which is a method for calculating the probabilities of combined events. Here is the tree diagram for the...

Getting NA values while converting a variable into Date and Time format

@harry wrote: I am currently solving a time series question from the Analytics Vidhya hackathon in which there are two variables: one is Datetime and the other is count. I am currently trying to convert the...
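
The excerpt is cut off before it shows the conversion call, so the actual cause is unknown. As a hedged illustration only: one common reason a date conversion yields missing values is a format string that does not match the text. The sketch below shows the effect in Python, with `None` standing in for NA. The timestamps and the helper `parse_or_none` are hypothetical and not taken from the hackathon data.

```python
from datetime import datetime


def parse_or_none(text, fmt):
    """Return a datetime if text matches fmt, otherwise None (a stand-in for NA)."""
    try:
        return datetime.strptime(text, fmt)
    except ValueError:
        return None


# Hypothetical day-first timestamps, not taken from the hackathon data.
raw = ["25-08-2012 00:00", "26-08-2012 13:00"]

# A month-first format does not match these strings (there is no month 25),
# so every value is lost.
wrong = [parse_or_none(s, "%m-%d-%Y %H:%M") for s in raw]

# A day-first format matches, so every value converts.
right = [parse_or_none(s, "%d-%m-%Y %H:%M") for s in raw]
```

If a conversion returns only missing values, comparing a few raw strings against the format string character by character is usually the quickest check.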

Seasonal Parameter in ARIMA and ADF test

@shan4224 wrote: Hi. While performing ARIMA in R on the AirPassengers data, we can write: fit <- arima(log(AirPassengers), c(0, 1, 1), seasonal = list(order = c(0, 1, 1), period = 12)) We can get...

Modelling a Stationary series with no dependence among values

@shan4224 wrote: Hello. While forecasting a time series, there can be two possibilities: a) A strictly stationary series with no dependence among the values. Here we can model the residuals as white...

Learning Analytics from Private Institute

@harmeet1991.singh wrote: Hi Team, I currently work in a pre-sales department with 2+ years of experience, and I now want to switch to an analytics profile. For that I am considering learning VBA and...

Slack URL does not work in my PC!

@rahul29 wrote: Hi, I'm unable to access the Slack page; not sure about others. Here is the error message I'm getting while trying to open it: Your connection is not private. Attackers might be trying to...

Data set for mini data hack

@nidhi_Singh wrote: Can anyone please share the data set for the mini data hack at k.nidhi101@gmail.com?

Need dataset of Mini DataHack

@Raghuvaran_Raghu wrote: I couldn't participate in Mini DataHack, but I would like to solve it. I couldn't download the dataset as the contest is not active. Can anyone please help me to download the...

Skills needed for an internship in Analytics field

@Sandeep_Singh wrote: Hi everyone. I am an Economics student doing my Master's in Economics. What skills/tools must I know that could get me an internship at some of the leading analytics firms in...

Label Encoding vs One Hot Encoding in Machine Learning Model

@supra_minion wrote: Hello, I am working on a data set comprising multiple variables, including 10 categorical (2-level) variables and 5 categorical (3-level) variables. I read about dealing with them for...
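
To make the two encodings in the topic title concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the poster's code or any library's API: the helpers `label_encode` and `one_hot_encode` and the sample variable `size` are hypothetical.

```python
def label_encode(values):
    """Map each distinct category to an integer code, in sorted order."""
    codes = {cat: i for i, cat in enumerate(sorted(set(values)))}
    return [codes[v] for v in values], codes


def one_hot_encode(values):
    """Give each distinct category its own 0/1 indicator column."""
    cats = sorted(set(values))
    return [[1 if v == c else 0 for c in cats] for v in values], cats


# Hypothetical 3-level variable, not from the poster's data set.
size = ["low", "high", "medium", "low"]

labels, mapping = label_encode(size)   # one integer column
rows, columns = one_hot_encode(size)   # one 0/1 column per level
```

Label encoding produces a single column whose integer codes imply an order (here alphabetical, so "high" < "low" < "medium"), which a model may treat as meaningful. One-hot encoding avoids that at the cost of one column per level. For a 2-level variable a single 0/1 label column already carries the same information as the one-hot pair.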

xgb.importance() unable to plot sparse.model.matrix()!

@sunnysingha wrote: Hi, after successfully building an XGBoost model on sparse.model.matrix(~.-1, data=train_set), I get the error below on the R console while attempting to plot the feature importance matrix: ...

ARIMA ACF residuals error in R

@nehak wrote: I am trying to run ARIMA in R. I have done auto ARIMA and tried other fits as well. The issue is that when I run the ACF of the residuals, it shows significant spikes. I did log...

Data cleaning before using arima

@nehak wrote: I want to understand whether data cleaning, as in removing outliers, is done before applying ARIMA. Are univariate and bivariate analyses done before applying ARIMA?

What does stemmer do in python nltk library

@pagal_guy wrote: Hello,
from nltk.stem.porter import *
stemmer = PorterStemmer()
While participating in a Kaggle competition I came across the above library for doing the things as shown below in one...

What are some good resources for text analytics in R

@pagal_guy wrote: Hello, I am looking to learn text analytics with R, like web scraping and then combining the results in a data frame for further analysis in R. But I am unable to find any good or...

Sharing the approach for MiniHack-Time Series Problem!

@rahul29 wrote: Hi all the top performers, I request you all to share the approach/strategy used during the competition, or even after it if you have got some insights from it. It would be a great...

OneHotEncoding in Python

@shan4224 wrote: Hi. I am new to Python. Is OneHotEncoding similar to the creation of dummy variables? I tried the following code in Python. OneHotEncoding: from sklearn.preprocessing import...
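
The poster's code is truncated, so the following is only a hedged sketch of the general relationship, not a reconstruction of it. One-hot encoding and dummy variables both use 0/1 indicator columns; in common usage the difference is that dummy coding drops one level as a baseline. The helper `indicator_columns`, its `drop_first` parameter, and the sample data are hypothetical and written in plain Python rather than with scikit-learn.

```python
def indicator_columns(values, drop_first=False):
    """Build 0/1 indicator columns, optionally dropping the first level."""
    cats = sorted(set(values))
    if drop_first:
        cats = cats[1:]  # the dropped level becomes the implicit baseline
    return {c: [1 if v == c else 0 for v in values] for c in cats}


# Hypothetical 3-level variable used only for illustration.
colour = ["red", "green", "blue", "green"]

one_hot = indicator_columns(colour)                   # k columns
dummies = indicator_columns(colour, drop_first=True)  # k - 1 columns
```

With the baseline dropped, a row of all zeros identifies the omitted level ("blue" here), so no information is lost. This form is often preferred for linear models because the full set of k indicator columns sums to a constant.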

Splitting between train/test for customer churn survival models

@girish wrote: I am a bit confused about how data can be split between train/test and "live" data for predicting churn using survival models (the package I am playing with is randomForestSRC). Goal of...

High survival probabilities for un-censored test observations when using...

@girish wrote: I am using survival analysis to predict the probability of customer churn. I am using the randomForestSRC package. My test data has some rows that are censored and some that aren't. When I...
