Quantcast
Browsing all 4448 articles
Browse latest View live

Techniques to collapse the categorical variables in R

@karthe1 wrote: Hi all, I have close to 200 categories in a variable. What are all the techniques available in R or in statistics in general to collapse the values?I generally use collapsing by the...

View Article


Survival analysis model

@nehak wrote: Hi i am looking for a statistical model to predict when a customer can do next transaction.For example i have a group of customers and their lifetime transactions(a particular...

View Article


RFM with Random Forest

@nehak wrote: i am building random forest based on rfm as classifier(1,0) -dependent variable.should i keep r ,f and m later as independent variables while running random forest along with other...

View Article

What is the difference between softprob and softmax in Xgboost

@hackers wrote: Hello, While trying to implement Xgboost in R,I came across the code: param <- list("objective" = "multi:softprob", # multiclass classification "num_class" = 12, # number of classes...

View Article

Dashboards using Python

@mtare wrote: Hi team,Do we have any libraries in Python that can be used to build interactive analytical dashboards/data apps(equivalent to shiny in R) ?I already know a few like Bokeh / Plotly , but...

View Article


Image may be NSFW.
Clik here to view.

How to create confusion matrix for xgboost in R

@hackers wrote: Hello, I am using the below code for xgboost: bst.cv <- xgb.cv(param=param, data=data.matrix(X[,-1]), label=y, nfold=10, nrounds=nround.cv, prediction=TRUE, verbose=FALSE) I came...

View Article

More than one dependent variable in predicitive modelling

@karthe1 wrote: Hi all, Can we have more than one dependent / target variable in predictive modelling tasks? If yes, what is the approach to be followed to predict the values for every target...

View Article

Relation between hypothesis testing and null hypothesis

@hinduja1234 wrote: I am currently studying a different type of hypothesis which helps in finding the importance of the variable.I have studied the hypothesis testing and null hypothesis. Hypothesis...

View Article


How F-static help in finding the importance of the variable?

@harry wrote: I am currently studying about different types of measure which help in selecting the importance of the variable while studying it I find two measure first is R^2 and other is F-static.I...

View Article


How to show predictions for models fitted using caret in R

@pagal_guy wrote: Hello, I am using caret for modeling so that I can use cross validation.Below is my code for rpart: Control <- trainControl(method='cv', number=5,repeats = 10,returnResamp='none')...

View Article

Image may be NSFW.
Clik here to view.

How to view the confusion matrix having data counts using caret in R

@pagal_guy wrote: Hello, I am using caret for training models but when I try to see the confusion matrix: What to do so that I can see the counts as is normally shown in confusion matrix.r Posts: 1...

View Article

PGPM or PGDM Full time from Great Lakes

@erajan119 wrote: Hi Kunal, I have work experience of 2.4 years as a Data Analyst and given cmat this year hence i want to apply for data science program in great lakes should i go for PGDM 2 year or...

View Article

How to impute missing values for a variable like Gender?

@karthe1 wrote: Hi all, How should one think about missing values for the variables like Gender, marital status etc (especially the dichotomous variables )? I understand that Mode imputation might...

View Article


Propensity scoring in R

@nehak wrote: I have 1 lakh members who are purchasing a particular product.i want to test on other members who can be the best fit in order to advertise my product to them so that for that particular...

View Article

Image may be NSFW.
Clik here to view.

How to resolve error:non numeric argument to binary operator while applying...

@pagal_guy wrote: Hello, I am using the below code to run lda from the caret package in R: ldaModel <- train(df_train[,-16], df_train[,16], method='lda', trControl=Control, metric = "Accuracy",...

View Article


How to decide the kernal type in SVM

@sonam_gupta wrote: I am working on a data classification problem. There are few doubts I am facing. How to decide that whether our data is Linearly or Non-Linearly separable with examples? How to...

View Article

How to merge two CSV file by a specific column using R

@sid100158 wrote: I am currently solving one classification problem in which I have three different CSV files in which there is more than one column which is common between two files when I have used...

View Article


Last Man Standing - Less than 12 Hrs to GO!

@amit_sharma wrote: Analytics Vidhya is back with its Signature Hackathon and it is going to be our biggest ever! The best data scientists across the globe have already registered! Now its your turn....

View Article

Errors in Logistic Regression

@sonam_gupta wrote: In Linear Regression, we get to know the errors bases on the residuals.How to find Error in Logistic Regression? Posts: 2 Participants: 2 Read full topic

View Article

Confusion regarding choice of book

@ayushgg92 wrote: Can someone suggest me a good book on Machine learning/data mining using R assuming I don't have much background knowledge in statistics Posts: 5 Participants: 4 Read full topic

View Article
Browsing all 4448 articles
Browse latest View live