Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live
↧

How to use set.seed()

@crisis1.08 wrote: How do we decide how much seeds to specify while using it in various algorithms Posts: 2 Participants: 2 Read full topic

View Article


How to work with strsplit?(R)

@B.Rabbit wrote: There is a feature in my data set like the one mentioned below:Incomeover $150,000$25,001 - $50,000$75,000 - $100,000under $25,000.....I want to create a new feature called normalised...

View Article


NLTK Download Error

@sudheer.rao wrote: I am using a Python 2.7x (Spyder) and Win 8.1 system. I am able to download all content from nltk but one file panlex_lite (1.7gb) is not getting downloaded to the system. I have...

View Article

Replacing a particular integer by another integer in pandas dataframe

@ravi_6767 wrote: Hi,I have a data-set consisting of a column consisting of years (from 1996 to 2010), after exploration I have found that there are many observations having values equal to 0, which...

View Article

Image may be NSFW.
Clik here to view.

How to deal with more than 1000 levels in a factor variable in R?

@shashwat.2014 wrote: Hi everyone, I am working on a dataset to predict the sales for the next 6 weeks of a retail store. You can get the datasets from here . There are two datasets 'train' and...

View Article


How to deal with large data sets while applying linear regression in R?

@shashwat.2014 wrote: Hi everyone, I am working on a dataset with 25 variables and around 1000000 rows. When I applied linear regression : lmao<-lm(Sales~.,data=combined)It gave me the error :...

View Article

Extracting the best fitted DecisionTreeClassifier after Grid Search

@syed.danish wrote: I have implemented grid search to find the best decision tree that could be fitted to my training data using the following code : parameters={'min_samples_split' :...

View Article

Recommended laptops for beginners in the field

@janakrajchadha wrote: I'm a beginner in the field of analytics and have just started working on my own projects. I had been learning the various tools and techniques for the last 5 months. Which...

View Article


Warning message when qplot function is used in R

@B.Rabbit wrote: Hello, I'm doing data exploration in the pseudo facebook dataset from Udacity(Exploratory data analysis course). When I run this following piece of code:qplot(x = friend_count, data =...

View Article


Deep Learning in R

@vbnsl wrote: Hello everyone,I have been using R for the past 1 year and am doing Facial Keypoints Detection and Classification of Music according to mood projects.I have applied the basic algorithms...

View Article

Theano on Windows

@abhinavunnam wrote: Has anyone been able to use Theano on Windows 7 and also been able to process data using GPU, can some one direct to a post/blog with help on this ? Posts: 2 Participants: 2 Read...

View Article

Understanding reliability diagram for classification

@syed.danish wrote: Hi everybody,I was going through a research paper(attached below if needed) on predicting good probabilities and it suggests two methods to do the same.1. Platt scaling2. Isotonic...

View Article

What are the broad inferences we could make by plots when exploring any dataset?

@B.Rabbit wrote: Hello, I am doing the Exploratory data analysis course in Udacity(R). I am applying the methods learnt in the course on various data sets(like bike sharing, Crime classification...

View Article


Unable to read data using read.csv.sql in R

@Corporate_Cowboy wrote: I am still in learning phase and discovered the sqldfI was using read.csv.sql command of the sqldf package to read my file. Though the command is running, it is not returning...

View Article

How well do we need to be skilled for getting an internship in data science?

@gokulahd wrote: i am a beginner in learning Data science in order to get intership what are the minimum skills required so that i get apply for internships .Like how much of skill i required in...

View Article


Multiple Instances of R Studio getting opened

@prasadvaidya198 wrote: Hi, I have R 3.2 and R studio ( I don't know the version).When I open R studio There are multiple instances of R studio getting opened.Can any one hep me out with this?Machine...

View Article

Data source sizing greater then 2TB

@WashR wrote: Guys, Good After noon. I am new here. I am trying to do a performance comparison study between HiveQL and Spark SQL on around 2TB of data. But I am facing difficulties in getting the...

View Article


Image may be NSFW.
Clik here to view.

Coord_cartesian( ) command is not working (R)

@B.Rabbit wrote: Hello, I'm doing the Exploratory Data Analysis course in Udacity(R). In lesson 4 there's a chapter on " Overlaying Summaries With Raw Data". It is required to plot a scatterplot, a...

View Article

What is the logic behind jitters command(geom_jitter( ) ) in R?

@B.Rabbit wrote: Hello,I'm doing the Exploratory Data Analysis course in Udacity(R). In lesson 4 there's a chapter on "Alpha and jitters", where the tutor explains that the command geom_jitter adds...

View Article

How to start career that asks for statistical techniques?

@surajbhu wrote: Hi, I want to make career in analytics, specially into Statistical Modelling and Machine learning.I have experience of 4 years+ into data analysis (to be more specific its deep diving...

View Article
Browsing all 4448 articles
Browse latest View live