How to find the model works better from the RMSE and CV score?
@santanu13 wrote: How to find the model works better from the RMSE and CV score? I have included all the model in bigmart sales problem but can’t understand the technique. Posts: 2 Participants: 2...
View ArticleFast AI vs Keras
@prithu007 wrote: Hi All, Wanted to know your opinion about Fast AI as a Deep Learning Framework by itself? It is built on top of PyTorch as against Keras which is built on top of TensorFlow. I just...
View ArticleHow to learn big data analytics with R?
@vamsi115 wrote: I am working as a Data Scientist from 2 years and we work on only structured data. I have decent experience with extracting data from SQL Server and Oracle, writing queries. I am...
View ArticleDifficulty implementing EDA
@abhik95 wrote: Recently, I was suggested to have practical knowledge on EDA through the case study BigMart Sales Prediction using R. But I cannot decide which variables should I choose for...
View ArticleTransit Time for Logistics
@luvsuneja wrote: Hi Friends, I am interning for a logistics corporation. Please, could you share some literature on predicting transit times for deliveries? Thanks in advance Posts: 1 Participants: 1...
View ArticleUsing Mean To Forecast Employee Salary in Time Series
@krishna42 wrote: Hello All, I was reading an article regarding Time Series Analysis and came across suggestion that it is not wise to use MEAN of salaries to forecast future salaries if TREND is...
View ArticlePanel Data Regression Analysis - Whether to Use Panel Structure?
@jiselllla wrote: Hi guys, a very conceptual question while planning a research paper. GOAL: I am investigating the effects of enrollment in insurance programs on health inequality. DEFINITIONS: For...
View ArticleLinear Regression
@vishal62 wrote: I was going through few linear regression models. Can someone explain the relevance of " random_state" during the splitting of the data to trained set and test test. X_train, X_test,...
View ArticleTime series analysis with categorical values
@aashaybane wrote: Hi all , I have a brewery sales data set , in which i have to forecast the sale of beer by the flavor , there are almost 20 to 25 different flavor and 5 to 6 different region , how...
View ArticleHow do I create a Survival Object with the Surv() in R?
@chirantan05 wrote: My dataset consists of more than 90% of Censored observations and 10 % of events in the observed duration. The problem I encountered while applying Surv(Time,Event) was that the...
View ArticleSuggestions for Quicker kind of company on Analytics
@s4sathish wrote: Would it be suggestable to implement analytical services for Quikr kind of company? If so 1. What kind of statistical methods can we use. 2 How to deal with the data…? Posts: 1...
View ArticleWhat exactly does XGBoost gblinear+reg:linear do?
@gaurav655 wrote: I would like to know what exactly does XGBoost gblinear+reg:linear do? And what parameters should I use it with gblinear+reg:linear, like is it advisable to use the lasso with it?...
View ArticleRandomForestClassifier .fit fails with Memory Error on ec2 but runs without...
@ankit481 wrote: stackoverflow.com RandomForestClassifier .fit fails with Memory Error on ec2 but runs without error locally python, pandas, amazon-ec2, scikit-learn asked by Jackie Weiser on 06:27PM...
View ArticleMissing Value Treatment Techniques
@aruna9 wrote: How to Decide Which technique needs to use for Missing Value Treatment in R. For example where to use Mean and Median imputation, where to use Knn Imputation and so on. Posts: 1...
View ArticleMissing value Imputation in R
@akamboj wrote: I’m working on the classification problem having close to 6L observations and 60 variables. Variables also having missing values, I have removed the variables with a large number of...
View ArticleI analyzed more than 40,000 YouTube trending videos. Here are the results....
@ammar1 wrote: Hello Link to analysis results I used Python with packages like Pandas and Matplotlib to analyze a dataset that contains data about more than 40,000 trending videos collected over ~7...
View ArticleWhat are some of the Data Science modelling techniques used in Risk Analytics?
@ashwin5392 wrote: I would be very interested to learn some of the modelling techniques associated with Risk Analytics in the finance domain. Posts: 1 Participants: 1 Read full topic
View ArticleCorrelation in R
@abhik95 wrote: How to determine correlation between multiple categorical variables and a numerical variable in R. I am looking for a solution without using any regression techniques. Posts: 1...
View ArticleApplications of ML algorithms in businness
@AarushiS wrote: Hi everyone, I have recently started exploring Machine learning algorithms and participating in competitions. I was wondering if anyone could tell me where in business are these...
View ArticleCurse of Dimentionality
@AarushiS wrote: I am unable to understand the concept of ‘Curse of Dimensionality’. How can a large number of variables negatively affect the model performance? If we have more variables to...
View Article