Applying a measure of error to raw data
@fletcher wrote: Hi guys, I’m new to this so please be kind. I’d like to know if there is a way in which I can apply a measure of error to raw data. What I mean by this, is that I’d like to apply this...
Missing data and CART
@omkar18 wrote: Hi Team, I have a few queries regarding ML algorithms and data. It would be great if you could provide some feedback on them. Which is the best package to impute the missing data (currently...
Data transformation while predicting for new data (one hot encoding issue)
@syammohan2103 wrote: I have one-hot encoded the data and trained a model. After passing the new (test) data through my transformation pipeline (including the one-hot encoder), new columns are created as...
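One common cause of this symptom is refitting the encoder on the new data, so that unseen categories produce extra columns. The post is cut off, so this is only a guess at the situation. A minimal sketch under that assumption: learn the category list once from the training data and reuse it for every later batch. The `SimpleOneHot` class and the city values are hypothetical and merely stand in for a library encoder.

```python
class SimpleOneHot:
    """Toy one-hot encoder whose categories are fixed at fit time (hypothetical helper)."""

    def fit(self, values):
        # Remember the sorted, de-duplicated categories seen in training.
        self.categories_ = sorted(set(values))
        return self

    def transform(self, values):
        # One column per training category; unseen values become an all-zero row.
        return [[1 if v == c else 0 for c in self.categories_] for v in values]


train = ["delhi", "mumbai", "delhi", "pune"]  # made-up training column
new = ["mumbai", "chennai"]                   # "chennai" never appeared in training

encoder = SimpleOneHot().fit(train)
train_encoded = encoder.transform(train)
new_encoded = encoder.transform(new)

# Both matrices have the same number of columns, so the model input shape is stable.
print(encoder.categories_)
print(new_encoded)
```

In scikit-learn, fitting `OneHotEncoder(handle_unknown="ignore")` on the training data and calling only `transform` on new data gives similar behaviour.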
Practice Problem Intel Scene Classification Challenge
@sanjukta_mitra wrote: How can I convert a JPG image to pixel data in Excel, so that it can be used as an input file to an analytics process?
Help required on Web Scraping with Python
@kommuri_suresh wrote: Hello Team, I am following Web Scraping with Python. I executed your suggested code but I am not getting a result. Please help me. I couldn’t understand what exactly is meant by if...
Multinomial Logistic Regression from Scratch
@swati0205 wrote: Hi All, there was an interesting article on building a Logistic Regression classifier from scratch: https://www.analyticsvidhya.com/blog/2015/10/basics-logistic-regression/. However I...
Understanding the significance test
@neha30 wrote: I saw one question regarding Z test (Que.3) in here: A test is administered annually. The test has a mean score of 150 and a standard deviation of 20. If Ravi’s z-score is 1.50, what...
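The relationship between a z-score and a raw score can be worked through with the numbers quoted in the question. A small sketch, assuming the truncated question asks for Ravi's raw score; `raw_score` is a name introduced here purely for illustration.

```python
def raw_score(mean, std_dev, z):
    # Invert z = (x - mean) / std_dev to recover the raw value x.
    return mean + z * std_dev


ravi = raw_score(mean=150, std_dev=20, z=1.50)
print(ravi)
```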
t-SNE with a classifier
@r_sangole wrote: In articles like these: Link 1 Link 2 the authors mention using the dimension reduction vectors as features for a classifier (be it RF or XGB). However, the original t-SNE...
Start a business based on data science
@ridzkish wrote: Hi guys, I am planning to launch a consultancy based on data science. Is it a good idea to go on this path? I have been working with for more than five years now and need to up the...
Training data only contains single positive label
@chankey wrote: The dataset which I have only contains the positive label. How do I make a model which can predict whether the new data is positive or negative? There’s no test data either. Just a CSV...
Barriers to data analytics in automobile industry
@rinu.nitt wrote: Hello Friends, I am a research scholar. As a part of my research, I am conducting a survey to identify the "Barriers to data analytics in automobile industry". Kindly share the important...
Web Scraping using Python
@purnima82 wrote: Hi, I have gone through some small tutorials about scraping using Python. But I have a big doubt about the legality of this technique. How can I know from which websites I can scrape data? I...
Why do we need to make our data stationary?
@gurjas wrote: I have seen that, before using an ARIMA model, we need to make our data stationary. However, the same is not true for other models like Naive, Moving, Simple Exponential,...
Data merging in Python (Data Mining)
@rock_bt wrote: Hey All, can anyone help me understand why the number of rows is lower after merging, and how I can get all the data of GEN_3 merged with one column from P_List? Note: In both the file...
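A frequent reason for losing rows in a merge is an inner join, which keeps only the keys present in both tables. The sketch below, assuming that is what happened here, contrasts an inner join with a left join in plain Python. The `id` key and all values are invented; only the table names GEN_3 and P_List come from the post.

```python
gen_3 = [  # hypothetical rows standing in for GEN_3
    {"id": 1, "value": "a"},
    {"id": 2, "value": "b"},
    {"id": 3, "value": "c"},
]
p_list = {1: "x", 3: "z"}  # hypothetical lookup standing in for P_List (id -> extra column)

# Inner join: rows whose id is missing from p_list are dropped.
inner = [{**row, "extra": p_list[row["id"]]} for row in gen_3 if row["id"] in p_list]

# Left join: every gen_3 row is kept; rows without a match get None.
left = [{**row, "extra": p_list.get(row["id"])} for row in gen_3]

print(len(inner), len(left))
```

With pandas, the left-join behaviour corresponds to passing `how="left"` to `DataFrame.merge`, whose default is an inner join.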
Chi-Sq Test for Numeric variables
@tarunisgr8 wrote: Can we use Chi-Sq test to test whether two numeric variables? I was watching a video on Youtube by Khan Academy --> Here he has compared 2 variables - observed and expected. But...
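The observed-versus-expected comparison mentioned above appears to be the chi-square goodness-of-fit statistic, which operates on counts per category and not on arbitrary numeric measurements. A minimal sketch with made-up counts; `chi_square_statistic` is an illustrative name, not a library function.

```python
def chi_square_statistic(observed, expected):
    # Sum of (O - E)^2 / E over categories; both lists hold counts.
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


observed = [18, 22, 20]  # hypothetical observed counts
expected = [20, 20, 20]  # hypothetical expected counts

stat = chi_square_statistic(observed, expected)
print(stat)
```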
Error converting string to float using fit_transform
@samuel20 wrote: I am getting the error message: array = np.array(array, dtype=dtype, order=order, copy=copy) ValueError: could not convert string to float: ‘Pass w/ Conditions’ Even using...
KMeans clustering with both numerical and categorical data in PySpark
@arin1405 wrote: I need to do KMeans clustering using both numerical and categorical data. There is KModes algorithm for clustering using only categorical data and KPrototypes algorithm for clustering...
Time Series Forecasting and reducing it to stationary series
@bhumika53 wrote: Hello, I want to ask: when we make a stationary time series from a non-stationary one, we have to remove the trend and the seasonality. That means we are left with the residuals. So, is...
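Differencing is one common way to remove trend and seasonality. The post does not say which method is in use, so the following is only a sketch on a synthetic series with invented values: a first difference removes the linear trend, and a seasonal difference with lag equal to the period removes the repeating pattern.

```python
def difference(series, lag=1):
    # Subtract the value `lag` steps earlier from each value.
    return [series[i] - series[i - lag] for i in range(lag, len(series))]


season = [5, -3, 1, -3]                              # made-up pattern with period 4
series = [2 * t + season[t % 4] for t in range(12)]  # linear trend plus seasonality

detrended = difference(series, lag=1)      # removes the linear trend
stationary = difference(detrended, lag=4)  # removes the period-4 seasonality

print(stationary)
```

Because this toy series has no noise term, nothing is left after both differences; on real data the remainder would be the irregular component.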
Shape error while calculating R Square
@mohitlearns wrote: Hello, while running the code below… in fact, on running the last line of the code, I am getting the following error: ValueError: shapes (1,2131) and (2,) not aligned: 2131 (dim 1) != 2...
Data Science Interview Questions
@manishceeri wrote: Hi All, the following are some interview questions that I encountered. I would really appreciate it if anyone could answer with proper: Theory Proof as well as relevant R/Python Coding Based...