How many clusters are enough in k-means clustering?
@p22 wrote: Is there a set formula for this, or does it depend? Posts: 5 Participants: 1
Which tool do you generally use to build Deep Learning models? (specific to...
@jalFaizy wrote: Hi @everyone, So I have a fair share of experience in working with tools/libraries to build deep learning models. A few of the tools that existed before are extinct as of now (such as...
How to solve this Data Analysis problem? Need guidance & suggestions
@deva123 wrote: I tried to solve this using the sqldf package in R, but after the 1st question I couldn't figure out how to solve the next questions. I don't know how to extract and separate data from columns (events...
What CSV file should we submit as solution to contests?
@prashant09 wrote: Hello. I am new here. I am participating in my first contest here (“Identify the digits”). I see that in the submission section, only CSV files are allowed to be submitted as...
Descriptive statistics of time series data
@mukundgnana wrote: My task is to summarise the descriptive statistics of time series data (mean, SD, standard error). It is fairly straightforward for a stationary series. But how do we find out...
Code not working and scipy version not compatible with scikit.ml 0.0.5
@aqsdmcet wrote: Dear developers, I am a data science student. I am working on web page categorization with http://scikit.ml/ . Question: I am not able to execute MLkNN code on the link...
MovieLens algo query
@ehsanarif wrote: Can someone help me decide which algorithm to use for the MovieLens dataset? I have transformed the features with OHE (dummy variable method), and I have to predict the movie ratings by the user...
Time series - can I use any other factor like any numeric factor except date...
@jatin_raina wrote: I have a dataset regarding fees charged for hotel rooms for a period of days, mostly 7: the majority of the values have the hotel booked for 7 days, and a few have 3 or 2. I kind...
Shift to data analytics
@shrutya wrote: Hi, I work in retail banking at a reputed public bank. I want to do a course in data science and machine learning. I have a great deal of interest in the same and am very good...
Shift to Data Science from Software Testing Background
@rharsh29 wrote: Hi, I have 8+ years of experience in Software Testing and have been on a career break for the past 1.5 years. Is it possible for me to take some data science course and become a data scientist? Thanks!...
Which algorithms are good for multi-class classification problems?
@swapna26 wrote: When we have more than 20 classes, which algorithms are best, and what is the procedure? Please help. Thanks in advance. –swapna Posts: 6 Participants: 4
How to calculate inter-model correlation for an ensemble in R?
@Saikat_Ghosh wrote: Analytics Vidhya – 15 Feb 17 How to build Ensemble Models in machine learning? (with code in R) This guide explains ensemble modeling to combine various machine learning models /...
Which is the best technique for giving a rating or score in resume analysis?
@premsheth wrote: Hi friends, I am trying to develop a model for resume analysis that gives a score to each resume with respect to the job description. Can anyone tell me which technique is good? I did some...
How to create Target Encoding or Mean Encoding in R?
@Satish_Chilloji wrote: In a recent Analytics Vidhya Hackathon, I came to know of a concept called Target Encoding for handling categorical variables having too many levels (e.g. ZIP code, Phone...
How can max pooling help in making convolutional features as "translational...
@albela_angur wrote: Hi all, I have read that a CNN model has translation invariance. On exploring more about it, I got to know that the pooling operation in a CNN model does this. Could someone...
Noobie to analytics
@muditrai1 wrote: I want to start my career in analytics, but I am very confused about which combination of tools and languages to learn at this initial stage. I see lots of options, but things are vague in...
Best open source data analytics tool
@anoopellickal wrote: Hi, I am searching for the best data analytics tools available now for free. I am new to this and need help and advice from the experts. I have gone through R and Power BI, Posts: 1...
XGBoost only predicts NAN after removing all NANs from the training data in...
@kateb4 wrote: Hi, I asked a question on StackOverflow, but they did not answer my question, so I decided to try it here. Hello! I’m trying to get my code to work, it used to give no errors, until I...
How to convert data in "Practice Problem: Time Series" to ts format in R?
@Pranov_Mishra wrote: Hi all, I am working on the dataset (in the practice problem in AV) which has hourly values over a period of 2 years. I had difficulty in converting the data to timeseries...
Overfitting with R xgboost
@sandoz wrote: Below are the 3 R instructions I use with XGBoost: params <- list(booster = "gbtree", objective = "binary:logistic",eta=0.3, gamma=5, max_depth = 3, min_child_weight=1, subsample=1,...