Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

Data insight with categorical variables

@cachu wrote: Hi, My first post and hope to learn a lot. I am working with a dataset consist of lot of categorical variables and target is numerical. I am in a fix will linear regression give...

View Article


Equality constraints

@shruti.bothe wrote: Code: df.loc[(df[‘SMS’] <19.44) | (df[‘CALL’] <16.21) | (df[‘INTERNET’] <194.14) , ‘REGION’]=1 df.loc[(df[‘SMS’] >=19.44) | (df[‘CALL’] >=16.21) | (df[‘INTERNET’]...

View Article


Image may be NSFW.
Clik here to view.

Discussions for article "LeaRning Path on R - Step by Step Guide to Learn...

@jalFaizy wrote: Hi All, The article “LeaRning Path on R - Step by Step Guide to Learn Data Science on R” is quiet old now and you might not get a prompt response from the author. We would request you...

View Article

How to improve score of an Binary Classification model (Attrition) with...

@psnh wrote: Hi, I am trying to create a binary classification model (Attrition) for an imbalance data using Random Forest - 0- 84K, 1- 16K. I have tried using class_weights = ‘balanced’,...

View Article

Why high model accuracy compared to very low validation accuracy?

@amy.dj wrote: I’m building a sentiment analysis program in python using Keras Sequential model for deep learning my data is 20,000 tweets: positive tweets: 9152 tweets negative tweets: 10849 tweets I...

View Article


Image may be NSFW.
Clik here to view.

Discussions for article "A Complete Tutorial to Learn Data Science with...

@jalFaizy wrote: Hi All, The article “A Complete Tutorial to Learn Data Science with Python from Scratch” is quiet old now and you might not get a prompt response from the author. We would request you...

View Article

Image may be NSFW.
Clik here to view.

How to find the a single categorical variable importance in a set of all...

@akshay.kotha wrote: Hi Basically, I have the same question which was posted here: datascience.stackexchange.com Feature importance with high-cardinality categorical features for regression (numerical...

View Article

Where can i read top solutions (code and explanation) to all the past...

@omkarsawant30 wrote: For eg. Practice problems similar to Recommendation engine, Twitter sentiment analysis , etc Posts: 2 Participants: 2 Read full topic

View Article


Setting cut-off on idf values while calculating cosine similarity between two...

@AD5257 wrote: I have two nodes both of which contains a list of 5 documents each. In each document there are a number of words. I want to find the distance of a keyword (which also has a list of 5...

View Article


Image may be NSFW.
Clik here to view.

Discussions for article "7 Types of Regression Techniques you should know!"

@jalFaizy wrote: Hi All, The article “7 Types of Regression Techniques you should know!” is quiet old now and you might not get a prompt response from the author. We would request you to post your...

View Article

Deep Learning with TensorFlow - Challenges : Need Advise

@mohitvarma wrote: Hello, I am using a core i3 system with 12 GB RAM. I believe to learn Deep learning with TensorFlow I would need a system with a GPU which I dont have. Is there a way around this...

View Article

Survival random forest with time dependent covariates

@gileper wrote: I am trying to use one of the R package for survival random forest with time dependent covariates but I still cannot find any package that handles with obj of this kind:...

View Article

Distribution Fitting for large scale data

@shruti.bothe wrote: What is the best way to check the best fitting distribution of the data? Are there any distribution fitting tools ? I have a set of 12 files and I need to check the distribution...

View Article


R, machine learning

@saniyakhan wrote: I am a beginner. I am dealing with a dataset where I have categorical variables with more than 1000 and 2000 levels. I am not sure how to deal with this. Posts: 3 Participants: 2...

View Article

Polynomial regression of second order with a single variable

@lokesh_jindal wrote: Hi, I have seen people trying to run polynomial regression of second order (quadratic regression) with a single variable by using x and x^2 as the independent variables in excel...

View Article


Error while using fileinput in shiny

@Meraki09 wrote: I am trying to develop an application with Shiny, where the user uploads a test file(csv format), and the data needs to be pre - processed and predictions are made accordingly. For...

View Article

Scala IDE for data science applications (like RStudio / Spyder / Rodeo)?

@priya27 wrote: Hi, With the ascent of Spark, Scala has picked up a huge force as programming dialect of decision for information science applications. To expand the effectiveness when dealing with...

View Article


Help: Create ids of an igraph graph using “internal” ids

@evt77ch wrote: I’m using igraph package in R. I’ve created a random smallworld graph using sample_smallworld function. Now I’d like to set ids of the vertices of this new graph equal to their...

View Article

For loop in Python

@itijain wrote: Hello, I am learning coding and the query I am asking might be a silly thing for most of you but please help me out. I would be really thankful. I have a dataframe of 3 Lakh rows and...

View Article

Data Analytics Use cases in Networking domain

@Sandeep2904 wrote: Hi All, Can anybody please suggest as to what are the applications or use cases in Data Analytics(machine learning), that can be applied in Networking domain. Regards, Sandeep....

View Article
Browsing all 4448 articles
Browse latest View live