Techniques to collapse the categorical variables in R
@karthe1 wrote: Hi all, I have close to 200 categories in a variable. What are all the techniques available in R or in statistics in general to collapse the values?I generally use collapsing by the...
View ArticleSurvival analysis model
@nehak wrote: Hi i am looking for a statistical model to predict when a customer can do next transaction.For example i have a group of customers and their lifetime transactions(a particular...
View ArticleRFM with Random Forest
@nehak wrote: i am building random forest based on rfm as classifier(1,0) -dependent variable.should i keep r ,f and m later as independent variables while running random forest along with other...
View ArticleWhat is the difference between softprob and softmax in Xgboost
@hackers wrote: Hello, While trying to implement Xgboost in R,I came across the code: param <- list("objective" = "multi:softprob", # multiclass classification "num_class" = 12, # number of classes...
View ArticleDashboards using Python
@mtare wrote: Hi team,Do we have any libraries in Python that can be used to build interactive analytical dashboards/data apps(equivalent to shiny in R) ?I already know a few like Bokeh / Plotly , but...
View ArticleHow to create confusion matrix for xgboost in R
@hackers wrote: Hello, I am using the below code for xgboost: bst.cv <- xgb.cv(param=param, data=data.matrix(X[,-1]), label=y, nfold=10, nrounds=nround.cv, prediction=TRUE, verbose=FALSE) I came...
View ArticleMore than one dependent variable in predicitive modelling
@karthe1 wrote: Hi all, Can we have more than one dependent / target variable in predictive modelling tasks? If yes, what is the approach to be followed to predict the values for every target...
View ArticleRelation between hypothesis testing and null hypothesis
@hinduja1234 wrote: I am currently studying a different type of hypothesis which helps in finding the importance of the variable.I have studied the hypothesis testing and null hypothesis. Hypothesis...
View ArticleHow F-static help in finding the importance of the variable?
@harry wrote: I am currently studying about different types of measure which help in selecting the importance of the variable while studying it I find two measure first is R^2 and other is F-static.I...
View ArticleHow to show predictions for models fitted using caret in R
@pagal_guy wrote: Hello, I am using caret for modeling so that I can use cross validation.Below is my code for rpart: Control <- trainControl(method='cv', number=5,repeats = 10,returnResamp='none')...
View ArticleHow to view the confusion matrix having data counts using caret in R
@pagal_guy wrote: Hello, I am using caret for training models but when I try to see the confusion matrix: What to do so that I can see the counts as is normally shown in confusion matrix.r Posts: 1...
View ArticlePGPM or PGDM Full time from Great Lakes
@erajan119 wrote: Hi Kunal, I have work experience of 2.4 years as a Data Analyst and given cmat this year hence i want to apply for data science program in great lakes should i go for PGDM 2 year or...
View ArticleHow to impute missing values for a variable like Gender?
@karthe1 wrote: Hi all, How should one think about missing values for the variables like Gender, marital status etc (especially the dichotomous variables )? I understand that Mode imputation might...
View ArticlePropensity scoring in R
@nehak wrote: I have 1 lakh members who are purchasing a particular product.i want to test on other members who can be the best fit in order to advertise my product to them so that for that particular...
View ArticleHow to resolve error:non numeric argument to binary operator while applying...
@pagal_guy wrote: Hello, I am using the below code to run lda from the caret package in R: ldaModel <- train(df_train[,-16], df_train[,16], method='lda', trControl=Control, metric = "Accuracy",...
View ArticleHow to decide the kernal type in SVM
@sonam_gupta wrote: I am working on a data classification problem. There are few doubts I am facing. How to decide that whether our data is Linearly or Non-Linearly separable with examples? How to...
View ArticleHow to merge two CSV file by a specific column using R
@sid100158 wrote: I am currently solving one classification problem in which I have three different CSV files in which there is more than one column which is common between two files when I have used...
View ArticleLast Man Standing - Less than 12 Hrs to GO!
@amit_sharma wrote: Analytics Vidhya is back with its Signature Hackathon and it is going to be our biggest ever! The best data scientists across the globe have already registered! Now its your turn....
View ArticleErrors in Logistic Regression
@sonam_gupta wrote: In Linear Regression, we get to know the errors bases on the residuals.How to find Error in Logistic Regression? Posts: 2 Participants: 2 Read full topic
View ArticleConfusion regarding choice of book
@ayushgg92 wrote: Can someone suggest me a good book on Machine learning/data mining using R assuming I don't have much background knowledge in statistics Posts: 5 Participants: 4 Read full topic
View Article