Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles

Spatial clustering with profiling variables/weights

@kapooraparna wrote: Hello, I have a list of demand points in an area, with latitude, longitude, revenue potential and other variables. Tried k-means clustering to group these points using Lat, Long...



Research projects or Work experience for MS in AI

@vikas wrote: I am a third-year UG student (4-year course) at IIT Kharagpur. I am very interested in data science and want to do an MS in ML. I can stay one more year (5 years total) in the same course to...




GGPLOT2 - Don't know how to automatically pick scale for object of type...

@schella wrote: I am not sure what this error refers to: Don't know how to automatically pick scale for object of type function. I can't seem to find the reason for the error on Stack Overflow. Can someone...


Problem in solving Practice Problem: Loan Prediction - 2

@Ankitb_5 wrote: I am very new to data analysis. I have tried to solve Practice Problem: Loan Prediction - 2 but could not solve it. I also don't know how to proceed. Problem: I have test and train...


Proper use of get_dummies function in pandas

@jalFaizy wrote: I don't fully understand the use of the get_dummies function. How is it different from one-hot encoding? Also, if I use get_dummies on train data, it is not always reproducible on test...



Not able to execute the following code in Python version '3.5.1'

@nagarjunrao24 wrote: Hi guys, we are new to Python and this is our first project. We have a problem with the following code: "outlet_size_mode = data.pivot_table(values=’Outlet_Size’,...


Tableau vs Qlikview vs MSBI&power-bi – which tool should I learn?

@vijay495 wrote: Tableau vs Qlikview vs MSBI&power-bi – which tool should I learn? I am just confused. I need some guidance from you, friends: which one is best among them? Email me at...


Tableau vs Qlikview vs MSBI&power-bi – which tool should I learning and jobs...

@vijay495 wrote: Tableau vs Qlikview vs MSBI&power-bi, SAP BO – which tool should I learn? I am just confused. I need some guidance from you, friends, so let me know which one among these is best as per...




How to plot the grand median in the given plot?

@harry wrote: I am currently trying to make a plot of age against the number of friends people have. For this, I have created a bucket of 4 to know how many friends people will have depending upon the year joined to...



Gradient Boosting

@aman1391 wrote: I tried this problem and I got an accuracy of 0.7916667. Can I apply gradient boosting to this dataset? Also, what are relevant sources to study regarding gradient boosting or any...


How to create a co-occurrence matrix in Python

@pagal_guy wrote: Hello, I have been looking for a function to create a co-occurrence matrix in Python but have been unable to find a good one. Can someone please help me regarding this?
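
One possible approach to this question, sketched under assumptions: the post does not say what the input looks like, so the sketch below assumes a list of plain-text documents, splits them on whitespace, and treats two words as co-occurring when they appear in the same document. The function name `cooccurrence_matrix` and the sample `docs` are hypothetical, and only the Python standard library is used.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_matrix(documents):
    """Count how often each pair of distinct words shares a document."""
    # Unique, sorted tokens per document (assumption: whitespace tokenization).
    tokenized = [sorted(set(doc.lower().split())) for doc in documents]
    vocab = sorted({word for tokens in tokenized for word in tokens})

    pair_counts = Counter()
    for tokens in tokenized:
        # Tokens are sorted, so each pair is generated in one canonical order.
        pair_counts.update(combinations(tokens, 2))

    index = {word: i for i, word in enumerate(vocab)}
    matrix = [[0] * len(vocab) for _ in vocab]
    for (a, b), count in pair_counts.items():
        matrix[index[a]][index[b]] = count
        matrix[index[b]][index[a]] = count  # keep the matrix symmetric
    return vocab, matrix


# Hypothetical sample data, not taken from the post.
docs = ["apple banana", "apple cherry", "apple banana cherry"]
vocab, matrix = cooccurrence_matrix(docs)
for word, row in zip(vocab, matrix):
    print(word, row)
```

Rows and columns follow the order of `vocab`. A sliding window over each document, instead of whole-document co-occurrence, would be a reasonable variation if word proximity matters.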


How to create a term-document matrix in Python

@pagal_guy wrote: Hello, I am trying to replicate the below code for a single column of a dataframe in Python: # Create initial documents list: doc = [ ] doc.append( 'It is a far, far better thing I...
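
A minimal sketch of a term-document matrix, assuming the dataframe column has already been pulled out into a plain list of strings and that whitespace tokenization is good enough. The code in the post is truncated, so this is not a reconstruction of it: the function name `term_document_matrix` and the sample `docs` are hypothetical, and only the standard library is used.

```python
from collections import Counter


def term_document_matrix(documents):
    """Return (terms, matrix) where matrix[i][j] counts term i in document j."""
    # One Counter of word frequencies per document.
    counts = [Counter(doc.lower().split()) for doc in documents]
    # Vocabulary: every term seen in any document, sorted for stable ordering.
    terms = sorted(set().union(*counts))
    # Counter returns 0 for missing terms, so absent words need no special case.
    matrix = [[c[term] for c in counts] for term in terms]
    return terms, matrix


# Hypothetical sample data, not the documents from the post.
docs = ["red blue red", "blue green"]
terms, matrix = term_document_matrix(docs)
for term, row in zip(terms, matrix):
    print(term, row)
```

Each row is a term and each column is a document. Transposing the result gives the document-term layout instead.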


Probability Content for Beginners

@tillutony wrote: Hi all, what probability content should freshers who want to make a career in data science be trained on? Please suggest. Thanks, Tony



Prospects of learning bigdata and analytics for experienced person

@Pratap83 wrote: Hi all, I need some honest advice. I have 8.5 years of total experience in the IT industry. My primary skill set is Oracle PL/SQL. Apart from that I have worked on Oracle Forms & reports,...


Combine Classification Algorithms

@erigits wrote: Is it possible to combine logistic regression, SVM, and decision tree for classification?



Is there any way to calculate cv logloss for h2o.ensemble model? (just like...

@pritish.pandey wrote: Hello everyone, I ran an h2o.ensemble model like this. fit <- h2o.ensemble(x = x, y = y, training_frame = train1, family = "binomial", learner = learner, metalearner =...


Feature Selection Chi-Square

@Debanjan_Banerjee wrote: I am trying to do the feature selection for the Loan_Prediction 2. I am using R for the Chi-Square test, and for any categorical feature that has a value less than 0.05, I am...



How to connect to SOAP API using R

@gupta0710 wrote: I have to copy session name and session ID into the SOAP endpoint URL as a query parameter so that all subsequent calls to the web service use the authenticated session parameters....


How does a machine learning algorithm really learn?

@khurshidrpvv wrote: I am new to data science. I am not able to understand how our model learns from the data set. That is to say, when we perform data munging and related stuff, in the end we eliminate...


How to create a term vector in python containing all the terms from a...

@pagal_guy wrote: Hello, I am trying to create a term vector in Python so that ultimately I can create a corpus for a wordcloud. Below is my code for it: import pandas as pd import gensim from gensim...
