Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles

How many clusters are enough in k-means clustering?

@p22 wrote: Is there a set formula for this, or does it depend? Posts: 5 Participants: 1


Which tool do you generally use to build Deep Learning models? (specific to...

@jalFaizy wrote: Hi @everyone, So I have a fair share of experience in working with tools/libraries to build deep learning models. A few of the tools that existed before are extinct as of now (such as...


How to solve this Data Analysis problem ...? need guidance & Suggestions

@deva123 wrote: I tried to solve this using the sqldf package in R, but after the 1st question I couldn’t figure out how to solve the next questions. I don’t know how to extract and separate data from columns ( events...

What CSV file should we submit as solution to contests?

@prashant09 wrote: Hello. I am new here. I am participating in my first contest here (“Identify the digits”). I see that in the submission section, only CSV files are allowed to be submitted as...

Descriptive statistics of time series data

@mukundgnana wrote: My task is to summarise the descriptive statistics of time series data (mean, SD, standard error). It is fairly straightforward for a stationary series. But how do we find out...


Code not working and scipy version not compatible with scikit.ml 0.0.5

@aqsdmcet wrote: Dear developers, I am a data science student. I am working on web page categorization with http://scikit.ml/ . Question: I am not able to execute the MLkNN code on the link...

MovieLens algo query

@ehsanarif wrote: Can someone help me decide which algo to use for the MovieLens dataset? I have transformed the features with OHE (dummy variable method), and I have to predict the movie ratings by the user...

Time series - can I use any other factor like any numeric factor except date...

@jatin_raina wrote: I have a dataset regarding fees charged for hotel rooms over a period of days, mostly 7: the majority of the values have the hotel booked for 7 days, and a few have 3 or 2. I kind...


Shift to data analytics

@shrutya wrote: Hi, I work in retail banking at a reputed public bank. I want to do a course in data science and machine learning. I have a great deal of interest in the same and am very good...


Shift to Data Science from Software Testing Background

@rharsh29 wrote: Hi, I have 8+ years of experience in software testing and have been on a career break for the last 1.5 years. Is it possible for me to take a data science course and become a data scientist? Thanks!...

Which algorithms are good for Multi class classification problems?

@swapna26 wrote: When we have more than 20 classes, which algorithms are best and what is the procedure? Please help, thanks in advance… –swapna Posts: 6 Participants: 4


How to calculate inter-model correlation for an ensemble in R?

@Saikat_Ghosh wrote: Analytics Vidhya – 15 Feb 17 How to build Ensemble Models in machine learning? (with code in R) This guide explains ensemble modeling to combine various machine learning models /...

Which is best technique for giving rating or scoring for resume analysis?

@premsheth wrote: Hi friends, I am trying to develop a model for resume analysis that gives a score to each resume with respect to the job description. Can anyone tell me which technique is good? I did some...


How to create Target Encoding or Mean Encoding in R?

@Satish_Chilloji wrote: In a recent Analytics Vidhya Hackathon, I came to know of a concept called Target Encoding for handling categorical variables that have too many levels (e.g. ZIP code, Phone...
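
For readers unfamiliar with the idea named in this post, here is a minimal sketch of mean (target) encoding: each category level is replaced by the average of the target for that level. It is written in Python rather than R, and the function name `target_encode`, the `smoothing` parameter, and the toy data are all assumptions made for illustration; they do not come from the original thread. In practice the means would usually be computed on training folds only, to avoid leaking the target.

```python
from collections import defaultdict


def target_encode(categories, targets, smoothing=0.0):
    """Map each category level to the (optionally smoothed) mean of its targets.

    smoothing > 0 pulls levels with few rows toward the global mean.
    This is a hypothetical helper, not a library API.
    """
    global_mean = sum(targets) / len(targets)
    sums = defaultdict(float)
    counts = defaultdict(int)
    for level, target in zip(categories, targets):
        sums[level] += target
        counts[level] += 1
    return {
        level: (sums[level] + smoothing * global_mean) / (counts[level] + smoothing)
        for level in counts
    }


# Toy data (assumed): a high-cardinality column and a binary target.
zip_codes = ["A", "A", "B", "B", "B", "C"]
clicked = [1, 0, 1, 1, 1, 0]

encoding = target_encode(zip_codes, clicked)
encoded_column = [encoding[z] for z in zip_codes]
```

With `smoothing=0.0` each level simply gets its raw target mean; a positive value is one common way to keep rare levels from being encoded with extreme values.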

How can max pooling help in making convolutional features as "translational...

@albela_angur wrote: Hi all, I have read that a CNN model has translation invariance. On exploring more about it, I got to know that the pooling operation in a CNN model does this. Could someone...
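
As a rough illustration of the claim in this post, the sketch below applies non-overlapping max pooling to a 1-D signal and compares the output before and after shifting a single activation. The helper `max_pool_1d`, the window size, and the toy signals are assumptions for illustration only. The point it tries to show is that pooling gives only local invariance: a shift that stays inside one pooling window leaves the output unchanged, while a shift across a window boundary does not.

```python
def max_pool_1d(signal, window):
    """Non-overlapping max pooling over a 1-D list (hypothetical helper)."""
    return [
        max(signal[i:i + window])
        for i in range(0, len(signal) - window + 1, window)
    ]


# One strong activation, then the same activation shifted to the right.
original = [0, 0, 9, 0, 0, 0, 0, 0]
shifted_within = [0, 0, 0, 9, 0, 0, 0, 0]  # still inside the first window
shifted_across = [0, 0, 0, 0, 9, 0, 0, 0]  # moved into the second window

pooled_original = max_pool_1d(original, 4)
pooled_within = max_pool_1d(shifted_within, 4)
pooled_across = max_pool_1d(shifted_across, 4)

print(pooled_original, pooled_within, pooled_across)
```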


Noobie to analytics

@muditrai1 wrote: I want to start my career in analytics, but I am very confused about what combination of tools and languages to learn at this initial stage. I see lots of options, but things are vague in...

Best open source data analytics tool

@anoopellickal wrote: Hi, I am searching for the best data analytics tools currently available for free. I am new to this and need help and advice from the experts. I have gone through R and Power BI, Posts: 1...


XGBoost only predicts NAN after removing all NANs from the training data in...

@kateb4 wrote: Hi, I asked a question on StackOverflow, but they did not answer my question, so I decided to try it here. Hello! I’m trying to get my code to work, it used to give no errors, until I...

How to convert data in "Practice Problem: Time Series" to ts format in R?

@Pranov_Mishra wrote: Hi all, I am working on the dataset (in the practice problem in AV) which has hourly values over a period of 2 years. I had difficulty in converting the data to timeseries...

Overfitting with R xgboost

@sandoz wrote: Below are the 3 R instructions I use with XGBoost: params <- list(booster = "gbtree", objective = "binary:logistic",eta=0.3, gamma=5, max_depth = 3, min_child_weight=1, subsample=1,...
