When is normalization of data required
@rahul29 wrote: Hi All, Is it always advisable to have normalized data passed to models? can anybody explain it properly that when to do normalization or scaling of features in different scenarios...
View ArticleAnalytics course for Sales Professional
@rizvin7 wrote: I am a Sales professional with over 9 years experience, currently working for an online portal. I am keen on pursuing a course and eventually a career in Analytics and Business...
View ArticleNeed your guidance regarding career in Data science
@SachinN09 wrote: Hi Kunal/Analyticsvidhya Team, Analytics Vidhya is a great website with lots of crisp information in this ocean of Big data career information. First of all thank you for such a...
View ArticleChoosing a sub domain in Data Science
@bhavyaghai wrote: Hi Everyone, I am a final year master's student in IT and looking forward to pursue MS in Data Science starting fall 2016. I possess working knowledge of R, Machine learning &...
View ArticleWhat do you mean by in-memory data processing when it comes to Qlikview
@deepak.anandani88 wrote: Hello, I have read many discussion, everywhere its mention that Qlikview is in memory BI tool, what does that actually mean, Is tableau not an in-memory tool..? Posts: 1...
View ArticleDownload Analytics and Big Data Salary Report 2016
@Sukanya wrote: The job of a Data Scientist is predicted to gain a notable status as many companies are realizing its importance in their day to day decisions. It is said to be the best job for 2016...
View ArticleHow to resolve error while using truncatedSVD in python
@pagal_guy wrote: hello, I am trying to solve the https://www.kaggle.com/c/home-depot-product-search-relevance problem and in one place I am using: rfr = RandomForestRegressor(n_estimators = 500,...
View ArticleEvaluation Score - Date your Data Contest
@sonny wrote: Team, Is the Evaluation AUC Score for "Date your data" contest on the entire TEST dataset or a partial set of the TEST Dataset?If yes, what is the split? Posts: 1 Participants: 1 Read...
View ArticleMachine Unlearning
@Bolaka wrote: Everyone has tried their best at making models learn classes or numbers. But how to make a machine learning model "unlearn" some instances? Let me put forth my ideas on the same and I...
View ArticleA look into the Hackathon
@ravi1989 wrote: Hi all, I was looking out the problem statement of the hackathon that is been conducted from feb 26-feb 28th. I find that the training dataset contains more entries than the testing...
View ArticlePost-Graduate Diploma in Statistical Methods and Analytics
@bhaskarjit wrote: Can I become a data scientist if I enroll for the course "Post-Graduate Diploma in Statistical Methods and Analytics" offered by Indian Statistical Institute? link to course:...
View ArticleHow to merge the data files in r or sas?
@shyam wrote: hi guys i need a help in this date with your date hackathon regarding how would we merge information of student and internship in train/test data set using primary key in R or SAS...
View ArticleSolution submission walk through
@arfat wrote: Good Morning,This is my first hackathon in analytics. I registered for "Date your data", can you please walk through on how to submit the solution, for suppose i use "R" then after...
View ArticleDate your data: How to upload the code file
@urveshdev wrote: I'm not able to find the place where I can upload my code file. As mentioned in the competition, it is mandatory while setting final submission but I don't see any option to do that....
View ArticleTwo different results on confusion matrix using random Forest on loan...
@Surya1987 wrote:...
View ArticleSteps after Imputation
@Surya1987 wrote: I have created 5 imputed datasets on loan prediction data using Amelia Package.How to proceed further using them in the model. Should I combine the results generated by all to meet...
View ArticleHow to convert a CSCmatrix or dgCMatrix into dataframe?
@Rohit_Nair wrote: Can we convert CSCmatrix or dgCMatrix into dataframe ? i read many forum pages .but couldnt understand, Can anyone help me with this ? class(objTrain_hashed)[1] "CSCMatrix"...
View ArticleDate your data-How to improve model accuracy
@sowmiyanm wrote: Can anyone give any pointers on how to proceed further for improving accuracy for this hackathon (Mine is only 0.54)Steps followed by me so far:1. Joined Internship data to train...
View ArticleDownload - Free Tutorial to learn Data Science in R from Scratch
@Manish wrote: Hello, On users demand, I have uploaded the pdf version of Tutorial on Data Science in R. I hope this tutorial enhances and improve your R and Machine Learning skills. To get maximum...
View ArticleWhat is the standard way for doing validation for a Machine Learning model?
@jalFaizy wrote: Can you really believe in your model after training? Not until you have validated it. So how would one approach this problem? Also suggest some resources for this validation pipeline...
View Article