How to use set.seed()
@crisis1.08 wrote: How do we decide how many seeds to specify while using it in various algorithms?
How to work with strsplit? (R)
@B.Rabbit wrote: There is a feature in my data set like the one mentioned below: Income: over $150,000; $25,001 - $50,000; $75,000 - $100,000; under $25,000; ... I want to create a new feature called normalised...
NLTK Download Error
@sudheer.rao wrote: I am using Python 2.7.x (Spyder) on a Windows 8.1 system. I am able to download all content from nltk, but one file, panlex_lite (1.7 GB), is not getting downloaded to the system. I have...
Replacing a particular integer by another integer in pandas dataframe
@ravi_6767 wrote: Hi, I have a data set with a column of years (from 1996 to 2010). After exploration I found that many observations have values equal to 0, which...
How to deal with more than 1000 levels in a factor variable in R?
@shashwat.2014 wrote: Hi everyone, I am working on a dataset to predict the sales for the next 6 weeks of a retail store. You can get the datasets from here. There are two datasets 'train' and...
How to deal with large data sets while applying linear regression in R?
@shashwat.2014 wrote: Hi everyone, I am working on a dataset with 25 variables and around 1,000,000 rows. When I applied linear regression: lmao <- lm(Sales ~ ., data = combined) it gave me the error:...
Extracting the best fitted DecisionTreeClassifier after Grid Search
@syed.danish wrote: I have implemented grid search to find the best decision tree that could be fitted to my training data using the following code : parameters={'min_samples_split' :...
Recommended laptops for beginners in the field
@janakrajchadha wrote: I'm a beginner in the field of analytics and have just started working on my own projects. I have been learning the various tools and techniques for the last 5 months. Which...
Warning message when qplot function is used in R
@B.Rabbit wrote: Hello, I'm doing data exploration on the pseudo Facebook dataset from Udacity (Exploratory Data Analysis course). When I run the following piece of code: qplot(x = friend_count, data =...
Deep Learning in R
@vbnsl wrote: Hello everyone, I have been using R for the past year and am working on two projects: Facial Keypoints Detection and Classification of Music According to Mood. I have applied the basic algorithms...
Theano on Windows
@abhinavunnam wrote: Has anyone been able to use Theano on Windows 7 and also process data using the GPU? Can someone direct me to a post/blog with help on this?
Understanding reliability diagram for classification
@syed.danish wrote: Hi everybody, I was going through a research paper (attached below if needed) on predicting good probabilities, and it suggests two methods to do the same: 1. Platt scaling 2. Isotonic...
What are the broad inferences we could make by plots when exploring any dataset?
@B.Rabbit wrote: Hello, I am doing the Exploratory Data Analysis course in Udacity (R). I am applying the methods learnt in the course on various data sets (like bike sharing, Crime classification...
Unable to read data using read.csv.sql in R
@Corporate_Cowboy wrote: I am still in the learning phase and discovered the sqldf package. I was using the read.csv.sql command of the sqldf package to read my file. Though the command is running, it is not returning...
How well do we need to be skilled for getting an internship in data science?
@gokulahd wrote: I am a beginner in data science. What are the minimum skills required so that I can apply for internships? Like how much skill is required in...
Multiple Instances of R Studio getting opened
@prasadvaidya198 wrote: Hi, I have R 3.2 and RStudio (I don't know the version). When I open RStudio, multiple instances of RStudio get opened. Can anyone help me out with this? Machine...
Data source sizing greater than 2TB
@WashR wrote: Guys, good afternoon. I am new here. I am trying to do a performance comparison study between HiveQL and Spark SQL on around 2TB of data. But I am facing difficulties in getting the...
coord_cartesian() command is not working (R)
@B.Rabbit wrote: Hello, I'm doing the Exploratory Data Analysis course in Udacity (R). In lesson 4 there's a chapter on "Overlaying Summaries With Raw Data". It is required to plot a scatterplot, a...
What is the logic behind the jitter command (geom_jitter()) in R?
@B.Rabbit wrote: Hello, I'm doing the Exploratory Data Analysis course in Udacity (R). In lesson 4 there's a chapter on "Alpha and jitters", where the tutor explains that the command geom_jitter adds...
How to start a career that asks for statistical techniques?
@surajbhu wrote: Hi, I want to make a career in analytics, especially in Statistical Modelling and Machine Learning. I have 4+ years of experience in data analysis (to be more specific, it's deep diving...