Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

Black Friday Hackathon - Not able to run even lm algorithm due to memory...

@j.akhil.j wrote: Hi, I am working on the Black Friday problem, and after making some very basic modifications, i tried to run first pass on the data. Even though i am using lm as my algorithm, and...

View Article


Exploratory Data Analysis

@krishnamurthypranesh wrote: Hi, I’m pretty new to data science(esp. predictive modeling ). I find new information but I have no idea as to how I can use that info to improve the accuracy of my model....

View Article


Boxplots with Groupby using multiple columns

@pandafoot wrote: In the tutorial by Kunal [https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-learn-data-science-python-scratch-2/] There is a section on “Filling in Missing Values” where...

View Article

What is the hypothesis definition for this case study?

@rakshithp wrote: Hello everyone, I am almost close to submitting my code and I got an accuracy f 77%, however I am Keen to understand what is the hypothesis for this case study. And it would be...

View Article

Anybody working related to Law - Use cases in Legal Industry

@ssvbalan wrote: Hi All, Is there any one working in Law firms as Data Analyst/Business Analyst? Would like to know any use cases/General analytics work other than Data visualization? How machine...

View Article


Error in Final Submission of Code and CSV file

@rakshithp wrote: Hello, I am not able to upload my completed R code and the final submission document. I get an error message stating "Please check if your submission file has the correct headers"....

View Article

Running multiple algorithms using caretList, caretEnsemble

@j.akhil.j wrote: caretList is a great function from the Caret package to run base algorithms from the same package, and caretEnsemble is the function that helps to form a GLM based ensemble out of...

View Article

What is the difference between ROSE and SYN function in handling unbalanced...

@Suhas_naik wrote: How can we differentiate between ROSE and SYN function in handling unbalanced data in R, which one is best process to follow. Posts: 1 Participants: 1 Read full topic

View Article


Apply HMM on iris dataset?

@Devi93 wrote: can some one show me how to apply hidden markov model on iris dataset. I was just going through http://www.m-hikari.com/ams/ams-2014/ams-49-52-2014/benyacoubAMS49-52-2014.pdf when...

View Article


How can i arrange columns as per condition?

@rahulsemwal10 wrote: My data set is S1 S2 S3 S4 1 4 7 20 2 6 8 21 3 9 10 30 and i convert data as per condition that sort as per maximum difference value between each column value data after the...

View Article

Difference in train and test values

@ASHISH_17 wrote: Hello, Below is the size of train and test dataset - X_train.shape, y_train.shape, X_test.shape ((548, 7), (548,), (548, 6)) After running a Linear Regression, I am getting an error...

View Article

PGDBA course iitK +IIM +ISI

@Pvash23 wrote: I am working as ML engineer having 2 years work-ex package of 5L.should I go for this course.I am confused as now most of my work is around building pipelines and ML models(DL models...

View Article

How to merge 3 dataframes by id in R

@KumarP wrote: I have 3 dataframes(contacts, bookings,tickets). merged.data <- merge(contacts, bookings,tickets, by=c(“contact_id”, “booking_id”,“ticket_id”)) While merging data frame ,getting...

View Article


Image may be NSFW.
Clik here to view.

Risk Constraint in PortfolioAnalytics- Portfolio Optimazation(Stocks)

@sirishan wrote: Can some one help me on the below ? 1)How to add risk constraint in PortfolioAnalytics? I have tried adding the return as constraint,risk as objective function and its working fine....

View Article

Image may be NSFW.
Clik here to view.

Categorical variables and predictive modelling

@krishnamurthypranesh wrote: HI, Scenario: I have a categorical variable x1 with four levels(a, b, c and d). I run a few tests and find that one of those levels(say d) does not contribute much towards...

View Article


Data cleaning Decision Trees

@ysn_vatsal wrote: Why decision trees are not influenced by outliers or missing data? Posts: 1 Participants: 1 Read full topic

View Article

Type error in Linear Regression

@ASHISH_17 wrote: Hello, I am getting a TypeError: float() argument must be a string or a number, not ‘Timestamp’. Can’t we use date-time entries? Though I have grouped them on the basis of hourly,...

View Article


Image may be NSFW.
Clik here to view.

How to get the autoregressive coefficient for the ARMA model given a...

@Devi93 wrote: MY dataset is as below: Can anyone please explain how to get AR and MA coefficients for this dataset?? Ur help is appreciated.Thnx in advance. Posts: 1 Participants: 1 Read full topic

View Article

Estimate the number of aircrafts in air across the globe at this moment in time

@kanshou87 wrote: Am trying whether test my approach to this question is correct. There are 4,416 cities in the world, of which 125 are major cities. Assume 100 top airline companies operate in 125...

View Article

Increase sales and reduce cost

@Chaitanya333 wrote: Hi All. I have some doubts , I have a data were 9 variables are categoricsl , 4 are continous out of which 2 are cost and sales.I have to maximize sales and reduce cost. How do i...

View Article
Browsing all 4448 articles
Browse latest View live