Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles

Necessary mathematical skills for Machine Learning and Data analysis

@NSS wrote: I would like to know what mathematical skills a person must have to enter machine learning and data science. Please answer keeping in mind that I would want to...

Transformation for continuous predictors

@NSS wrote: Hi all, I aim to apply regression methods for now. I have a highly skewed predictor (be it left skewed or right skewed). Which regression method would be best suited for each case? 1-...

Determining significance of predictors

@NSS wrote: Hello all, I would like to know the importance and relative importance of predictors (say 100 in number) without having to build a model like random forest. Is it possible? If yes, how?...

Problem in aggregate function

@vijaypalmanit wrote: Can someone explain to me why these two work: aggregate(subset(iris,select=-Species),subset(iris,select=Species),mean) and aggregate(iris[-5],iris[5],mean), but not this...

Outlier Treatment

@shashwat.2014 wrote: While applying outlier treatment methods in different situations, I realised that I had no measure to compare the different methods. Can anyone suggest a means to compare the...

The seer's accuracy

@dsav wrote: Friends, I am new here and want to look at the data (train.csv and test.csv) for the seer's accuracy competition. Can anyone who has it please send it to me? dsav2233@gmail.com Thanks in...

Mini Datahack 28 May solution

@paramjeet_sharma wrote: Hey, did anyone get the solution for the time series datahack which happened on 28th May? Posts: 1 Participants: 1

IoT Analytics - Time series

@innamurisrikanth wrote: Hi All, We will receive car parking, lighting and environment data every minute from sensors. We want to fit the best curve for the real-time data. What are the best...

Unable to impute missing values through MICE for categorical variables

@shashwat.2014 wrote: I was using the MICE package to impute missing values in categorical variables....

Missing at random and missing not at random

@syed.danish wrote: Hi, while handling missing values, I came across two types of missing values: missing at random and missing not at random. Can somebody please explain these two types of missing...

Creating dummy variables in python

@syed.danish wrote: Hi, currently I am working on the megastar contest on AV. Below is the structure of the data set: (screenshot: Screenshot from 2016-06-02 10-06-49.png) I want to create new variables for...
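
For readers following this topic, here is a minimal sketch of what creating dummy variables means, using only the standard library. The post is cut off, so the column being encoded is unknown; the `make_dummies` helper, the `is` prefix and the sample values are all hypothetical. In practice many people would reach for `pandas.get_dummies` instead.

```python
def make_dummies(values, prefix="is"):
    """One-hot encode a list of categorical values.

    Hypothetical helper for illustration: returns one dict per input row,
    with a 0/1 indicator for every distinct level seen in `values`.
    """
    levels = sorted(set(values))  # fixed column order across rows
    return [
        {f"{prefix}_{level}": int(value == level) for level in levels}
        for value in values
    ]


# Made-up sample data, not taken from the contest data set.
rows = make_dummies(["a", "b", "a"])
for row in rows:
    print(row)
```

Each distinct level becomes its own 0/1 column, which is the form most regression and tree libraries expect for categorical inputs.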

Better way to replace expressions in Regex

@syed.danish wrote: Hi, I am using the following technique to replace expressions in regex: def parse(x): x=re.sub('\?',',',str(x)) x=re.sub('&',',',str(x)) x=re.sub('\+',',',str(x))...
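
One possible way to tighten the snippet above, offered as a sketch rather than the poster's eventual solution: the three visible calls each replace a single character with a comma, so they can be folded into one character class. The post is truncated, so any further substitutions it contained are not covered here; the `_SEPARATORS` name is an assumption made for illustration.

```python
import re

# Assumption: only the three characters visible in the post (?, & and +)
# need replacing. Inside a character class they need no escaping.
_SEPARATORS = re.compile(r"[?&+]")


def parse(x):
    """Replace every '?', '&' or '+' in x with a comma in a single pass."""
    return _SEPARATORS.sub(",", str(x))


print(parse("a?b&c+d"))  # prints a,b,c,d
```

Compiling the pattern once avoids repeating the `re.sub` call per character and keeps the set of separators in one place.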

Meaning and usage of Residual Deviance in Analysis of Deviance Table after...

@shashwat.2014 wrote: Hi, I was executing logistic regression in the Loan Prediction problem. After executing logistic regression, I came across the information that a larger difference between null...

Are there plans to introduce IPython for use in discussion forums

@anantguptadbl wrote: IPython will be very useful for sharing code snippets quickly. Not sure whether AnalyticsVidhya is thinking along the same lines. Posts: 1 Participants: 1

Multi-class Multi-label text classification

@Niranjanp wrote: Hello, I have been working on a text classification problem which has three outcome variables, all of them multi-class. The dataset description is as follows. Dataset is about...

Python, R vs 19 Data Science Tools for people who aren’t so good at Programming

@ancientkittens wrote: Hi, After reading through the article "19 Data Science Tools for people who aren’t so good at Programming", it got me thinking about what data scientists prefer to use - learning...

Any reference for end to end process for forecasting/timeseries modelling?

@shivanihmcl wrote: Any reference/book/cases for end to end process for forecasting/timeseries modelling? Posts: 2 Participants: 2

Videos from PyCon 2016 now available

@kunal wrote: Thought I'd share the link to the videos as they get released. We plan to release a summary on the blog later in any case. But, if you can't wait for the summary and want to see the...

Mismatch in levels of categorical variable in train and test data

@syed.danish wrote: Hi, I have a data set divided into two parts, train and test. I want to know how one should handle extra levels present in test and/or train data. Three cases are possible: Case 1...
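
As a rough illustration of one of the situations raised here (levels that appear in test but not in train), the sketch below maps unseen test levels to a placeholder. The three cases in the post are cut off, so this covers only that one assumed case; `align_levels`, the `__other__` placeholder and the colour values are hypothetical.

```python
def align_levels(train_values, test_values, other="__other__"):
    """Map test levels never seen in train to a placeholder level.

    Hypothetical helper: a model fitted on train only knows the train
    levels, so anything else in test is collapsed into `other`.
    """
    known = set(train_values)
    return [value if value in known else other for value in test_values]


# Made-up sample data for illustration.
train = ["red", "green", "blue"]
test = ["green", "purple", "red"]
print(align_levels(train, test))
```

Whether a placeholder, the most frequent train level, or dropping the rows is the better choice depends on the model and the data, so treat this as one option among several.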

Unable to remove errors while installing spark R

@shashwat.2014 wrote: Hi everyone, I was installing spark R on Windows 10, and during the process I have not been able to get rid of one error despite constant struggle and searching. I followed the spark R...
