Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

Open 2 excel files and create a third file with values that are averaged

@shruti.bothe wrote: File 1: ID TIME A B C 1 23:00:00 4 5 10 2 2:00:00 1 4 6 3 3:00:00 3 2 7 File 2: ID TIME A B C 1 23:00:00 6 7 10 2 2:00:00 1 6 6 3 3:00:00 3 4 7 Expected output file: ID TIME A B C...

View Article


Validating custom variables to get better estimate of a parameter I am trying...

@akshay.kotha wrote: Hi, I am currently facing a challenge of comparing a metric (y) across different times (x). But there is a parameter ‘z’ which affects the calculation of ‘y’. How to bring ‘z’...

View Article


Chi-square test to find multicollinearity

@deva123 wrote: How to use chi-square test to find multicollinearity between categorical variables (features) .?. i’m using Ames house data where we have 30 to 34 categorical variable and some of...

View Article

Linear Algebra and statistics

@anil2029 wrote: Can anyone suggest some books on algebra and stats for Data Science. Posts: 2 Participants: 2 Read full topic

View Article

Projects for final year related to ML and cyber security

@AnnaList wrote: Hello, We are interested in combining cyber security and machine learning domains for our final year projects.But i don’t know where to start. It would be helpful if we get an idea !...

View Article


KeyError in Python 3

@AnnaList wrote: #Determine average visibility of a product visibility_avg = data.pivot_table(values=‘Item_Visibility’, index=‘Item_Identifier’) #Impute 0 values with mean visibility of that product:...

View Article

How to Manually Calculate Text Blob Naive Bayes Prob_Classify function

@jimbo1985 wrote: Hi everyone, I am very new to this field and I don’t have a well background. I am using Text Blob’s built-in classifier for multi-classes text classification. I think Text Blob is...

View Article

Image may be NSFW.
Clik here to view.

imbalanced dataset for deep learning

@tw349 wrote: I do have one question. More statistics than how to use KNIME. I have some data with lots of rows … maybe 62,000 rows but may only have about 913 rows that have a dependent variable...

View Article


Doubts in applying Linear Regression

@fornanthu wrote: Hi, My data, Exercise Econ Job Position 50 1 A Owner 30 0 <NA> <NA> 25 1 B Employee 44 1 B Owner 32 0 <NA> <NA> 50 0 <NA> <NA> 22 1 A Employee 14...

View Article


Make a Cluster of ID holding null values using dbscan

@siddharth_verma wrote: Greetings to all, I have a data which maps gps points for mapping an area. What I need to do is cluster postgesql query which has a edge_id column which has road_id. There are...

View Article

Recommender system

@humblefool wrote: I want to build a recommender system for event promotion website. Can someone guide me how to build it using python ,flask and my-sql .As the back-end of my website is in java i...

View Article

Unable to submit solution

@vishalmodagekar wrote: “Some error occurred. Please try after sometime.” Whenever I try to upload the solution file I get above error. Please help, it has been 2 days I’m trying but facing the same...

View Article

Best way to calculate Quartile

@heyjag wrote: Hi, There are three methods for calculating Quartile as per wikipedia --> https://en.wikipedia.org/wiki/Quartile Wanted to know which is the best method? From my understanding,...

View Article


How to do segmentation based on some filters(e.g. TRAFFIC SIGNALS) from live...

@desiredeveloper wrote: I am supposed to do traffic symbols recognition from live streaming data. Please tell me how to automate the process of segmentation. I am able to recognize the symbols using...

View Article

Use of Pretrained language model in Lord of the Machines hackathon

@yifanxie wrote: Hello, could you please clarify if pretrained word vector can be used in this competition? for example, fasttext:...

View Article


Unable to submit solution -Lord of machines

@rishi_kumar wrote: this is the error i am getting- unnamed 0 were not expected but found in your solution. can anyone please explain why this may happen? Posts: 2 Participants: 2 Read full topic

View Article

Uploading code file for final solution with pretrained word vectors

@yifanxie wrote: Hello: This is a follow-up question to a previous question to do with pretrained word vectors. When uploading our code file for our final solution, how do we make reference to...

View Article


Image may be NSFW.
Clik here to view.

Is it possible to classify text with Regex?

@vitaly1 wrote: I’m trying to create MODEL like decision tree style that receive series of STRINGs. I’m using WEKA , with J48 classifier and stringToWordVector as a filter. As I know a lot of...

View Article

Is it possible to build text classification model , that depends on previous...

@vitaly1 wrote: Suppose I have training data on xxx.arff file as the following below, @relation model1 @attribute class-att {TAG1,TAG2} @attribute classification string @data TAG1,'bla1 bla1 here some...

View Article

Solution for Lord of the machines

@Shikhar2205 wrote: Can winners share their solution so that we can know their approach to the question. It would be really helpful . Posts: 2 Participants: 2 Read full topic

View Article
Browsing all 4448 articles
Browse latest View live