Data insight with categorical variables
@cachu wrote: Hi, My first post and hope to learn a lot. I am working with a dataset consist of lot of categorical variables and target is numerical. I am in a fix will linear regression give...
View ArticleEquality constraints
@shruti.bothe wrote: Code: df.loc[(df[‘SMS’] <19.44) | (df[‘CALL’] <16.21) | (df[‘INTERNET’] <194.14) , ‘REGION’]=1 df.loc[(df[‘SMS’] >=19.44) | (df[‘CALL’] >=16.21) | (df[‘INTERNET’]...
View ArticleDiscussions for article "LeaRning Path on R - Step by Step Guide to Learn...
@jalFaizy wrote: Hi All, The article “LeaRning Path on R - Step by Step Guide to Learn Data Science on R” is quiet old now and you might not get a prompt response from the author. We would request you...
View ArticleHow to improve score of an Binary Classification model (Attrition) with...
@psnh wrote: Hi, I am trying to create a binary classification model (Attrition) for an imbalance data using Random Forest - 0- 84K, 1- 16K. I have tried using class_weights = ‘balanced’,...
View ArticleWhy high model accuracy compared to very low validation accuracy?
@amy.dj wrote: I’m building a sentiment analysis program in python using Keras Sequential model for deep learning my data is 20,000 tweets: positive tweets: 9152 tweets negative tweets: 10849 tweets I...
View ArticleDiscussions for article "A Complete Tutorial to Learn Data Science with...
@jalFaizy wrote: Hi All, The article “A Complete Tutorial to Learn Data Science with Python from Scratch” is quiet old now and you might not get a prompt response from the author. We would request you...
View ArticleHow to find the a single categorical variable importance in a set of all...
@akshay.kotha wrote: Hi Basically, I have the same question which was posted here: datascience.stackexchange.com Feature importance with high-cardinality categorical features for regression (numerical...
View ArticleWhere can i read top solutions (code and explanation) to all the past...
@omkarsawant30 wrote: For eg. Practice problems similar to Recommendation engine, Twitter sentiment analysis , etc Posts: 2 Participants: 2 Read full topic
View ArticleSetting cut-off on idf values while calculating cosine similarity between two...
@AD5257 wrote: I have two nodes both of which contains a list of 5 documents each. In each document there are a number of words. I want to find the distance of a keyword (which also has a list of 5...
View ArticleDiscussions for article "7 Types of Regression Techniques you should know!"
@jalFaizy wrote: Hi All, The article “7 Types of Regression Techniques you should know!” is quiet old now and you might not get a prompt response from the author. We would request you to post your...
View ArticleDeep Learning with TensorFlow - Challenges : Need Advise
@mohitvarma wrote: Hello, I am using a core i3 system with 12 GB RAM. I believe to learn Deep learning with TensorFlow I would need a system with a GPU which I dont have. Is there a way around this...
View ArticleSurvival random forest with time dependent covariates
@gileper wrote: I am trying to use one of the R package for survival random forest with time dependent covariates but I still cannot find any package that handles with obj of this kind:...
View ArticleDistribution Fitting for large scale data
@shruti.bothe wrote: What is the best way to check the best fitting distribution of the data? Are there any distribution fitting tools ? I have a set of 12 files and I need to check the distribution...
View ArticleR, machine learning
@saniyakhan wrote: I am a beginner. I am dealing with a dataset where I have categorical variables with more than 1000 and 2000 levels. I am not sure how to deal with this. Posts: 3 Participants: 2...
View ArticlePolynomial regression of second order with a single variable
@lokesh_jindal wrote: Hi, I have seen people trying to run polynomial regression of second order (quadratic regression) with a single variable by using x and x^2 as the independent variables in excel...
View ArticleError while using fileinput in shiny
@Meraki09 wrote: I am trying to develop an application with Shiny, where the user uploads a test file(csv format), and the data needs to be pre - processed and predictions are made accordingly. For...
View ArticleScala IDE for data science applications (like RStudio / Spyder / Rodeo)?
@priya27 wrote: Hi, With the ascent of Spark, Scala has picked up a huge force as programming dialect of decision for information science applications. To expand the effectiveness when dealing with...
View ArticleHelp: Create ids of an igraph graph using “internal” ids
@evt77ch wrote: I’m using igraph package in R. I’ve created a random smallworld graph using sample_smallworld function. Now I’d like to set ids of the vertices of this new graph equal to their...
View ArticleFor loop in Python
@itijain wrote: Hello, I am learning coding and the query I am asking might be a silly thing for most of you but please help me out. I would be really thankful. I have a dataframe of 3 Lakh rows and...
View ArticleData Analytics Use cases in Networking domain
@Sandeep2904 wrote: Hi All, Can anybody please suggest as to what are the applications or use cases in Data Analytics(machine learning), that can be applied in Networking domain. Regards, Sandeep....
View Article