Brand assortment
@dada_kishore wrote: Hi, I want to see the bias of different brands at a store level. For instance there are 100 stores and there are three brands in each store. Sales ($) and Sales (#), Profit, Sell...
View ArticleWhy 0 or 1 for labeling categorical data?
@shashankjakka wrote: I was trying KMeans for clustering problem.I have a Gender feature and it also has nan’s. So I forgot to take care of the nan values and used label encoder and got values like...
View ArticlePlease suggest some good institute in Pune for DATA Analytics course
@y_shinde wrote: Hi I am Yogesh from Pune. I have total 15 yrs of experience in SQL Database and MSBI Skills. Please guide me for choosing the best institute in Pune for Data Analytics course. I am...
View ArticleItem tagging using R or Python
@hemantsain55 wrote: i want to tag categories to its menuname. i have a csv containing menu item name and in other csv i have a column containing some strings, i want to pick that strings from...
View ArticleChurn Prediction 'Bank of Gardenia' Hackathon
@dhar_arka wrote: What is interpreted by Responder (target) variable as 1 or 0? Does 1 mean customer has churned out or opened a savings account?? The question is not clear. What are we supposed to...
View ArticleBlack Friday Hackathon
@harshit08173 wrote: I have recently done a few regression courses online and wanted to practice by solving some real world problem .As many of the AV’s article suggested black friday and bigmart...
View ArticleKernel SVM for a classifier model
@PSATHYAM wrote: Hi, I summited a kernel SVM model to classify gender based on voice parameters in below link, kindly skim through it and let me know any scope of improvement or glitch in approach. It...
View ArticleRandom Forest -decide variable Importance
@wprashik18 wrote: Hi, I am using random forest for regression and getting different result for variable importance every time I run the same model. I am not able to decide which variable should I...
View ArticleMultilogloss metric in NaiveBayes
@Rudra11 wrote: How to compute multilogloss in Naive Bayes I had built a model for predicting multi class problem. I want to evaluate through multilogloss metric but I am getting output as zero but...
View ArticleWhy Gradient Descent for Optimization?
@TarunSingh wrote: Hello folks, I have a question regarding the optimization technique used for updating the weights. People generally use gradient descent for the optimization whether its SGD or...
View ArticleEmail classification
@hemantsain55 wrote: hello, what is the best approach to classifying emails into different categories, i have a train set containing all the category label i want to build a model on the top of it so...
View ArticleHow to tune warm_start parameter of RandomForest?
@mohdsanadzakirizvi wrote: The RF documentation in sklearn has an interesting parameter warm_start that is used to “reuse the solution of the previous call to fit and add more estimators to the...
View ArticleMulti class classification
@shashankjakka wrote: Hi I have a Multi class classification problem with 3 classes.Class 1 is being differentiated from other classes but I have a problem separating Class 2 from Class 0.(Most of the...
View ArticleJoin slack chat
@ASHISH_17 wrote: Hi, Can someone send me the link to join slack chat for Churn Prediction Hackathon? Thanks Posts: 1 Participants: 1 Read full topic
View ArticleBig change in accuracy xgboost classification
@vijaykmathew wrote: #why there is big change in accuracy changing learning_rate =0.1 to 0.7 xgb2 = xgb.XGBClassifier( learning_rate =0.1, n_estimators=1000, max_depth=5, min_child_weight=12,...
View ArticleLocality Sensitive Hashing for Cosine Similarity
@burhanusman wrote: Has anyone come across good resources/blogs/books that explain how to use LSH for finding cosine similarity between lot many items? Posts: 1 Participants: 1 Read full topic
View ArticleCategorical variable with large level
@sree1986 wrote: Hi I am working on a logistic regression based binary classification problem where I need predict customer churn. Some categorical variables in the data-set have a large no of levels...
View ArticleFirst and second transaction dates using R from transaction table
@chaituchaitanya28 wrote: Hello Team, I have a transaction table which has multiple transactions with customer IDs, Now for each customer i want to find out first order date and second order date...
View ArticleClustering using Kohonen Self Organising Maps
@surya.ssh wrote: Do we have check correlation between variables before passing as a input to SOM Model or Normalising the variable would do the trick ?? How to select the variables for clustering...
View ArticleVariable Selection in R (Similar to Proc Varclus in SAS)
@Reshuditit wrote: Hi All. I want to replicate the variable selection process in R similar to that of Proc Varclus. I tried a lot to search but not getting a proper way. People ave suggested Varclus...
View Article