Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

How to find the model works better from the RMSE and CV score?

@santanu13 wrote: How to find the model works better from the RMSE and CV score? I have included all the model in bigmart sales problem but can’t understand the technique. Posts: 2 Participants: 2...

View Article


Fast AI vs Keras

@prithu007 wrote: Hi All, Wanted to know your opinion about Fast AI as a Deep Learning Framework by itself? It is built on top of PyTorch as against Keras which is built on top of TensorFlow. I just...

View Article


How to learn big data analytics with R?

@vamsi115 wrote: I am working as a Data Scientist from 2 years and we work on only structured data. I have decent experience with extracting data from SQL Server and Oracle, writing queries. I am...

View Article

Difficulty implementing EDA

@abhik95 wrote: Recently, I was suggested to have practical knowledge on EDA through the case study BigMart Sales Prediction using R. But I cannot decide which variables should I choose for...

View Article

Transit Time for Logistics

@luvsuneja wrote: Hi Friends, I am interning for a logistics corporation. Please, could you share some literature on predicting transit times for deliveries? Thanks in advance Posts: 1 Participants: 1...

View Article


Using Mean To Forecast Employee Salary in Time Series

@krishna42 wrote: Hello All, I was reading an article regarding Time Series Analysis and came across suggestion that it is not wise to use MEAN of salaries to forecast future salaries if TREND is...

View Article

Panel Data Regression Analysis - Whether to Use Panel Structure?

@jiselllla wrote: Hi guys, a very conceptual question while planning a research paper. GOAL: I am investigating the effects of enrollment in insurance programs on health inequality. DEFINITIONS: For...

View Article

Linear Regression

@vishal62 wrote: I was going through few linear regression models. Can someone explain the relevance of " random_state" during the splitting of the data to trained set and test test. X_train, X_test,...

View Article


Time series analysis with categorical values

@aashaybane wrote: Hi all , I have a brewery sales data set , in which i have to forecast the sale of beer by the flavor , there are almost 20 to 25 different flavor and 5 to 6 different region , how...

View Article


How do I create a Survival Object with the Surv() in R?

@chirantan05 wrote: My dataset consists of more than 90% of Censored observations and 10 % of events in the observed duration. The problem I encountered while applying Surv(Time,Event) was that the...

View Article

Suggestions for Quicker kind of company on Analytics

@s4sathish wrote: Would it be suggestable to implement analytical services for Quikr kind of company? If so 1. What kind of statistical methods can we use. 2 How to deal with the data…? Posts: 1...

View Article

What exactly does XGBoost gblinear+reg:linear do?

@gaurav655 wrote: I would like to know what exactly does XGBoost gblinear+reg:linear do? And what parameters should I use it with gblinear+reg:linear, like is it advisable to use the lasso with it?...

View Article

Image may be NSFW.
Clik here to view.

RandomForestClassifier .fit fails with Memory Error on ec2 but runs without...

@ankit481 wrote: stackoverflow.com RandomForestClassifier .fit fails with Memory Error on ec2 but runs without error locally python, pandas, amazon-ec2, scikit-learn asked by Jackie Weiser on 06:27PM...

View Article


Missing Value Treatment Techniques

@aruna9 wrote: How to Decide Which technique needs to use for Missing Value Treatment in R. For example where to use Mean and Median imputation, where to use Knn Imputation and so on. Posts: 1...

View Article

Missing value Imputation in R

@akamboj wrote: I’m working on the classification problem having close to 6L observations and 60 variables. Variables also having missing values, I have removed the variables with a large number of...

View Article


Image may be NSFW.
Clik here to view.

I analyzed more than 40,000 YouTube trending videos. Here are the results....

@ammar1 wrote: Hello Link to analysis results I used Python with packages like Pandas and Matplotlib to analyze a dataset that contains data about more than 40,000 trending videos collected over ~7...

View Article

What are some of the Data Science modelling techniques used in Risk Analytics?

@ashwin5392 wrote: I would be very interested to learn some of the modelling techniques associated with Risk Analytics in the finance domain. Posts: 1 Participants: 1 Read full topic

View Article


Correlation in R

@abhik95 wrote: How to determine correlation between multiple categorical variables and a numerical variable in R. I am looking for a solution without using any regression techniques. Posts: 1...

View Article

Applications of ML algorithms in businness

@AarushiS wrote: Hi everyone, I have recently started exploring Machine learning algorithms and participating in competitions. I was wondering if anyone could tell me where in business are these...

View Article

Curse of Dimentionality

@AarushiS wrote: I am unable to understand the concept of ‘Curse of Dimensionality’. How can a large number of variables negatively affect the model performance? If we have more variables to...

View Article
Browsing all 4448 articles
Browse latest View live