Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

Applying a measure of error to raw data

@fletcher wrote: Hi guys, I’m new to this so please be kind. I’d like to know if there is a way in which I can apply a measure of error to raw data. What I mean by this, is that I’d like to apply this...

View Article


Missing data and CART

@omkar18 wrote: Hi Team, I have few queries regarding ML algorithms and data, It would be great if you can provide some feedback on that. Which is best package to impute the missing data (currently...

View Article


Data transformation while predicting for new data(one hot encoding issue)

@syammohan2103 wrote: I have one hot encoded the data and trained a model. After passing the new(test) data through my transformation pipeline(including the one hot encoder) new columns are created as...

View Article

Practice Problem Intel Scene Classification Challenge

@sanjukta_mitra wrote: how to convert JPG image to pixel data in excel , so that it can be used as an input file to anaytics process Posts: 1 Participants: 1 Read full topic

View Article

Help required on Web Scraping with Python

@kommuri_suresh wrote: Hello Team, I am following Web Scraping with Python I executed your suggested code but I am not getting result. please help me. I couldn’t understand what is exactly mean of if...

View Article


Multinomial Logistic Regression from Scratch

@swati0205 wrote: Hi All, there was an interesting article on building Logistic Regression classifier from scratch https://www.analyticsvidhya.com/blog/2015/10/basics-logistic-regression/. However i...

View Article

Understaing the significance test

@neha30 wrote: I saw one question regarding Z test (Que.3) in here: A test is administered annually. The test has a mean score of 150 and a standard deviation of 20. If Ravi’s z-score is 1.50, what...

View Article

t-SNE with a classifier

@r_sangole wrote: In articles like these: Link 1 Link 2 the authors mention using the dimension reduction vectors as features for a classifier (be it RF or XGB). However, the original t-SNE...

View Article


Start a business based on data science

@ridzkish wrote: Hi guys, I am planning to launch a consultancy based on data science. Is it a good idea to go on this path? I have been working with for more than five years now and need to up the...

View Article


Training data only contains single positive label

@chankey wrote: The dataset which I have only contains the positive label. How do I make a model which can predict whether the new data is positive or negative? There’s no test data either. Just a CSV...

View Article

Barriers to data analytics in automobile industry

@rinu.nitt wrote: Hello Friends I am a research scholar.As a part of research, i am conducting a survey to identify the" Barriers to data analytics in automobile industry". Kindly share the important...

View Article

Web Scrapping using python

@purnima82 wrote: Hi I gone through some small tutorials about scrapping using python. But I have a huge doubt about legality of this technique. How can i know from which website I can scrap data? I...

View Article

Why do we need to make our data stationary?

@gurjas wrote: I have seen, before using ARIMA model, we need to convert our data into stationary model. However, the same is not true for different model like - Naive, Moving, Simple Exponential,...

View Article


Data merging in python(Data Minig)

@rock_bt wrote: Hey All, Can anyone help me to know why data value(No. of rows) is less after merging, and how i can get the all data of GEN_3 with merge one column from P_List. Note: In both the file...

View Article

Chi-Sq Test for Numeric variables

@tarunisgr8 wrote: Can we use Chi-Sq test to test whether two numeric variables? I was watching a video on Youtube by Khan Academy --> Here he has compared 2 variables - observed and expected. But...

View Article


Error converting string to float using fit_transform

@samuel20 wrote: I am geting the error mensage: array = np.array(array, dtype=dtype, order=order, copy=copy) ValueError: could not convert string to float: ‘Pass w/ Conditions’ Even using...

View Article

KMeans clustering with both numerical and categorical data in PySpark

@arin1405 wrote: I need to do KMeans clustering using both numerical and categorical data. There is KModes algorithm for clustering using only categorical data and KPrototypes algorithm for clustering...

View Article


Time Series Forecasting and reducing it to stationary series

@bhumika53 wrote: Hello, I want to ask when we make a stationary time-series from a non-stationary one, we have to remove the trend and the seasonality. That means we are left we the residuals. So, is...

View Article

Shape error while calculating R Square

@mohitlearns wrote: Hello, While running the code below…infact on running the last line of the code i am getting the following error ValueError: shapes (1,2131) and (2,) not aligned: 2131 (dim 1) != 2...

View Article

Data Science Interview Questions

@manishceeri wrote: Hi All, Following are some interview questions that I encountered. I would really appreciate for anyone to answer with proper : Theory Proof as well relevant R/Python Coding Based...

View Article
Browsing all 4448 articles
Browse latest View live