Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles

Random Forest Binary classification - Predict_Proba values

@psnh wrote: Hi, I am doing binary classification using Random Forest. My training dataset has 5 attributes, and the feature importance for the dataset is as follows: A - 28%, B - 27%, C - 17%, D - 17%, E...

View Article


Resources for Time Series Prediction using Deep Learning

@arman wrote: Are there any good tutorials for beginners on time-series prediction using deep learning? Thanks. Posts: 2 Participants: 2 Read full topic

View Article


How to return the original string after measuring distance

@premsheth wrote: Hi, I am stuck on a problem: I am checking whether a word is in a vector by measuring distance. I used the stringdist package; my code is below: s1 <- c("Business Head", "Recruitment...
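One way to get the original string back, rather than only a distance, is to keep each candidate paired with its score and return the best-scoring candidate. The following is a minimal sketch in Python using the standard-library `difflib`, not the R `stringdist` package from the post. The `closest_match` helper, the `min_ratio` threshold, and every entry in `titles` other than "Business Head" are hypothetical and chosen only for illustration.

```python
from difflib import SequenceMatcher


def closest_match(query, candidates, min_ratio=0.6):
    """Return (original_candidate, similarity) for the best match.

    Similarity is difflib's ratio in [0, 1]; higher means more alike.
    Returns (None, 0.0) when no candidate reaches min_ratio.
    """
    best, best_ratio = None, 0.0
    for cand in candidates:
        # Compare case-insensitively, but remember the untouched original.
        ratio = SequenceMatcher(None, query.lower(), cand.lower()).ratio()
        if ratio > best_ratio:
            best, best_ratio = cand, ratio
    if best_ratio < min_ratio:
        return None, 0.0
    return best, best_ratio


# Hypothetical lookup vector; only "Business Head" appears in the post.
titles = ["Business Head", "Sales Manager", "Data Analyst"]

match, score = closest_match("Busines Head", titles)
print(match, round(score, 2))
```

The same idea should carry over to R: compute the distances, take the index of the smallest one, and use that index to look up the original vector element.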

View Article

Please provide any good tutorial on data cleaning and preprocessing using Python

@vashishtarora wrote: Please suggest a good tutorial or video series (not a basic one) on data cleaning and preprocessing before modelling using Python. Posts: 2 Participants: 2 Read full topic

View Article


Regarding Python pivot table

@santhoshreddy wrote: [Image: image.png] Can anyone help with this? Posts: 1 Participants: 1 Read full topic

View Article


Why do different classifiers behave differently on different data?

@nileshthakkar wrote: Why does the behaviour of different classifiers differ across datasets? Based on what parameters can we choose a good classifier for a particular dataset? Posts: 1 Participants: 1 Read...

View Article


Test the best-fit distribution and return parameter and 'p' values

@shruti.bothe wrote: My dataset fits both the gamma and Weibull distributions. I need to determine which one is the better fit with respect to p-values and return the shape and scale values; the loc parameter needs to...

View Article

Module 'lmoments' has no attribute 'samlmu'

@shruti.bothe wrote: Has anyone worked with this module? Posts: 1 Participants: 1 Read full topic

View Article


Categorical Data Feature Engineering

@amitabha_joy wrote: I have a dataset for house price prediction. I am unable to encode some categorical features such as availability, society, etc. How to find correlation and find the importance of the...

View Article



McKinsey Analytics Hackathon - Healthcare Analytics (14th April 2018)

@ankit2106 wrote: Are you ready to crack a real-life business problem in the McKinsey Analytics Online Hackathon? April 14th is your opportunity to hack your way into the future, all in a limited span of...

View Article

How to impute or rbind lists in a for loop?

@premsheth wrote: Hi Friends, I have run into a problem that is probably easy for an experienced person, but I am unable to solve it. When I run the code outside the for loop, I get the desired output. s...

View Article

Using scrapy to extract webpages

@ssonia wrote: I'm getting an "invalid syntax" error when running scrapy crawl redditbot. I tried the command inside the shell as well as after exiting the shell, and I still get the same error. Can anyone let me know why? Posts: 1...

View Article

Analyse large RDBMS data schema

@satish78 wrote: Hello, my problem statement: I have a large RDBMS schema with over 100 tables and over a million records of various provider data. I want to know the best way to analyse these tables to find...

View Article


SAS or Power BI or Power BI with R?

@Shaina_M wrote: Hi, I work at a bank and as strange as it may sound, we had been using only Excel till now for analytics. Through a lot of push, our senior management has decided to invest in a new...

View Article

TensorFlow or Caffe: Which is better for running Deep learning convolutional...

@anand1 wrote: TensorFlow or Caffe: Which is better for running Deep learning convolutional neural networks Posts: 1 Participants: 1 Read full topic

View Article


Big data Hadoop question

@ganesh_sharma wrote: Ideally, what should the block size be in a Hadoop cluster? Posts: 1 Participants: 1 Read full topic

View Article

To display variables and change data types from an uploaded file

@Meraki09 wrote: I am creating an application where I upload a file, display the variables from the uploaded file, and change their data types as desired by the user. In most of the...

View Article


Improving Supervised Algos Using Clustering

@rishii.ds wrote: Hello Experts, I used RandomForestClassifier to solve a classification problem; then, to improve the accuracy, I used K-Means clustering and re-applied the...

View Article

Movie Recommendation Engine - movielens

@ssk17 wrote: Hi All, I am working on a recommendation engine for the first time in Python. I am using the MovieLens dataset: https://grouplens.org/datasets/movielens/ I am stalled here so would love...

View Article

Is Mathematics required for ML?

@crish786 wrote: I am not from a mathematics background. How can I start ML, and what level of maths do I need to learn? Posts: 1 Participants: 1 Read full topic

View Article