Black Friday Hackathon - Not able to run even lm algorithm due to memory...
@j.akhil.j wrote: Hi, I am working on the Black Friday problem, and after making some very basic modifications, i tried to run first pass on the data. Even though i am using lm as my algorithm, and...
View ArticleExploratory Data Analysis
@krishnamurthypranesh wrote: Hi, I’m pretty new to data science(esp. predictive modeling ). I find new information but I have no idea as to how I can use that info to improve the accuracy of my model....
View ArticleBoxplots with Groupby using multiple columns
@pandafoot wrote: In the tutorial by Kunal [https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-learn-data-science-python-scratch-2/] There is a section on “Filling in Missing Values” where...
View ArticleWhat is the hypothesis definition for this case study?
@rakshithp wrote: Hello everyone, I am almost close to submitting my code and I got an accuracy f 77%, however I am Keen to understand what is the hypothesis for this case study. And it would be...
View ArticleAnybody working related to Law - Use cases in Legal Industry
@ssvbalan wrote: Hi All, Is there any one working in Law firms as Data Analyst/Business Analyst? Would like to know any use cases/General analytics work other than Data visualization? How machine...
View ArticleError in Final Submission of Code and CSV file
@rakshithp wrote: Hello, I am not able to upload my completed R code and the final submission document. I get an error message stating "Please check if your submission file has the correct headers"....
View ArticleRunning multiple algorithms using caretList, caretEnsemble
@j.akhil.j wrote: caretList is a great function from the Caret package to run base algorithms from the same package, and caretEnsemble is the function that helps to form a GLM based ensemble out of...
View ArticleWhat is the difference between ROSE and SYN function in handling unbalanced...
@Suhas_naik wrote: How can we differentiate between ROSE and SYN function in handling unbalanced data in R, which one is best process to follow. Posts: 1 Participants: 1 Read full topic
View ArticleApply HMM on iris dataset?
@Devi93 wrote: can some one show me how to apply hidden markov model on iris dataset. I was just going through http://www.m-hikari.com/ams/ams-2014/ams-49-52-2014/benyacoubAMS49-52-2014.pdf when...
View ArticleHow can i arrange columns as per condition?
@rahulsemwal10 wrote: My data set is S1 S2 S3 S4 1 4 7 20 2 6 8 21 3 9 10 30 and i convert data as per condition that sort as per maximum difference value between each column value data after the...
View ArticleDifference in train and test values
@ASHISH_17 wrote: Hello, Below is the size of train and test dataset - X_train.shape, y_train.shape, X_test.shape ((548, 7), (548,), (548, 6)) After running a Linear Regression, I am getting an error...
View ArticlePGDBA course iitK +IIM +ISI
@Pvash23 wrote: I am working as ML engineer having 2 years work-ex package of 5L.should I go for this course.I am confused as now most of my work is around building pipelines and ML models(DL models...
View ArticleHow to merge 3 dataframes by id in R
@KumarP wrote: I have 3 dataframes(contacts, bookings,tickets). merged.data <- merge(contacts, bookings,tickets, by=c(“contact_id”, “booking_id”,“ticket_id”)) While merging data frame ,getting...
View ArticleRisk Constraint in PortfolioAnalytics- Portfolio Optimazation(Stocks)
@sirishan wrote: Can some one help me on the below ? 1)How to add risk constraint in PortfolioAnalytics? I have tried adding the return as constraint,risk as objective function and its working fine....
View ArticleCategorical variables and predictive modelling
@krishnamurthypranesh wrote: HI, Scenario: I have a categorical variable x1 with four levels(a, b, c and d). I run a few tests and find that one of those levels(say d) does not contribute much towards...
View ArticleData cleaning Decision Trees
@ysn_vatsal wrote: Why decision trees are not influenced by outliers or missing data? Posts: 1 Participants: 1 Read full topic
View ArticleType error in Linear Regression
@ASHISH_17 wrote: Hello, I am getting a TypeError: float() argument must be a string or a number, not ‘Timestamp’. Can’t we use date-time entries? Though I have grouped them on the basis of hourly,...
View ArticleHow to get the autoregressive coefficient for the ARMA model given a...
@Devi93 wrote: MY dataset is as below: Can anyone please explain how to get AR and MA coefficients for this dataset?? Ur help is appreciated.Thnx in advance. Posts: 1 Participants: 1 Read full topic
View ArticleEstimate the number of aircrafts in air across the globe at this moment in time
@kanshou87 wrote: Am trying whether test my approach to this question is correct. There are 4,416 cities in the world, of which 125 are major cities. Assume 100 top airline companies operate in 125...
View ArticleIncrease sales and reduce cost
@Chaitanya333 wrote: Hi All. I have some doubts , I have a data were 9 variables are categoricsl , 4 are continous out of which 2 are cost and sales.I have to maximize sales and reduce cost. How do i...
View Article