Quantcast
Browsing all 4448 articles
Browse latest View live

Which course to take for Data Science?

@anurag_s wrote: I have just completed my 12 board exams and I want to do my graduation from a course which create solid foundation for future in Data Science . Which Course should i pursue ? Posts: 1...

View Article


How to generate buy sell trading signals using ggplot in r

@devrajan wrote: I have made a ggplot with my data set having close price and two sets of limits(upper limit 1=ucl1,lower limit 1=lcl1,upper limit 2= ucl2, lower limit 2=lcl2) using following commands...

View Article


Need an algorithm or direction in order to classify time series data

@prajgujarathi wrote: Hi All, I have time series data of equipment readings and I am trying to predict the type of equipment depending upon the historical/live reading. I looked into Dynamic time wrap...

View Article

Optimize `n_estimators` using `xgb.cv`

@rahul485 wrote: In this code fragment: cvresult = xgb.cv(xgb_param, xgtrain, num_boost_round=1000, nfold=cv_folds, metrics='mlogloss', early_stopping_rounds=50)...

View Article

How to decide which tool or technology to be selected for a machine learning...

@jai-prakash.yadav wrote: Hi Experts, Need suggestions how we can decide which tool or technology to be selected for a machine learning problem. Whether we plan to use sklearn or Azure ML or MATlab or...

View Article


Multinomial Classification on text

@manishceeri wrote: I am looking for a multinomial naive Bayes text classification package in R that accepts a term document matrix (from tm) as input for training and classifies new text based on...

View Article

How difficult to get a job as bigdata administrator or developer

@naveen46 wrote: I have 15 years experience from 2001 to 2008 I worked as a maintenance engineer in steel and construction domains.From 2008 to till now iam working as industrial applications...

View Article

Research/Project Ideas during time abroad - (6-9 months)

@mutu wrote: Hi I’m a young IT audit professional who is interested making a switch into data science and analytics as a career. This upcoming academic year however, I received a Fulbright scholarship...

View Article


Image may be NSFW.
Clik here to view.

Which regression algorithm to use when there is weak correlation between...

@rohit.haritash wrote: I have a regression problem in hand. Dataset have 20 predictors and 1 target. Target is continuous and predictors are both categorical and continous. I performed a correlation...

View Article


Research: "The effects of big data analytics and distinct dynamic...

@rishal wrote: Hi experts. I require your assistance. Organisations today currently exist in an era of complexity and discontinuous change. The rules of competition and survival are constantly...

View Article

How to install Boruta package for Python on windows?

@prasad_patil wrote: Hi, Does boruta package available on python?How one can install it on windows?Please share the steps Thanks. Prasad Posts: 3 Participants: 2 Read full topic

View Article

Multivariate Time Series Analysis

@lakshveer wrote: Hi, I’ve a multivariate time series data set which has 20 predictor variable. The target variable is a continuous variable. The data set is about “Employees Absenteeism” . The...

View Article

TypeError when finding mode for each Outlet_Type

@warrior wrote: When I run this code from the tutorial: #Import mode function: from scipy.stats import mode #Determing the mode for each outlet_size_mode = data.pivot_table(values='Outlet_Size',...

View Article


Spliting data for random forest

@rohit.haritash wrote: Hi My dataset have some variables with factor level more than 30. When running my model for prediction I am getting the following error in R. modelRF1 <-...

View Article

A simple question on zero values from the 4th column of train data

@vishwajitb wrote: Hi, In training dataset we have 4th column named value which has 0 values. Should we consider this as missing data and use imputation methods to fill values here? Kindly reply ASAP....

View Article


Handling missing categorical data

@jayanthd wrote: I have 2lac records in sample with 70 variables. There are 40 categorical variables in which many data are missing and have blank values. There are few categorical variables which has...

View Article

Image may be NSFW.
Clik here to view.

How to input multiple indices on pd.crosstab?

@gcowner wrote: On this tutorial: Analytics Vidhya – 14 Jan 16 A Complete Tutorial to Learn Data Science with Python from Scratch This is a complete tutorial to learn data science in python using a...

View Article


Image may be NSFW.
Clik here to view.

R/Python Script needed to assign Cluster ID to each PO records based on...

@engxiongster wrote: Dear All, I am working on a test scenario/procedure for split purchase orders such that I would like to create a flag for cluster of POs in the data tables based on multiple...

View Article

Airline Spend Analytics

@vishnu_jon wrote: Hi Experts, friends I have a business problem where I need to find out through some predictive models or techniques which Airline should I choose and what will be my max and min...

View Article

Image may be NSFW.
Clik here to view.

Stationarity : Dickey-Fuller Test

@superbromy wrote: Hi, dickey-fuller-test.png501x594 57.1 KB I can’t understand why the p-value is so small. For what I’ve understood, this time serie is clearly not stationary, no ? Posts: 2...

View Article
Browsing all 4448 articles
Browse latest View live