Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

How to write functions for calculating Euclidean distance and mahalanobis...

@wehired wrote: Write two functions; One should return the distance measures using Euclidean distance and another one should use mahalanobis distance measure. Posts: 1 Participants: 1 Read full topic

View Article


Code Required on github for comprehensive Learning path to become a data...

@mohitlearns wrote: Hello, @analyticsvidhya I am doing the following course from Analytics Vidhya: A comprehensive Learning path to become a data scientist in 2018 However the code snippets provided...

View Article


R parallel Processing code is taking time in Server

@saiviswas wrote: I am using Parallel Processing to build multiple models for each Product when I am running code in my laptop (i5 4th generation) is taking 100 seconds to build models for 1 Product....

View Article

Laptop/hardware suggestions for Deep learning

@yaswanth912 wrote: Can someone suggest me which are the best laptops for learning/practicing deep learning. I have been researching on this and many suggest to go with desktop so that it will be...

View Article

How can I predict tsunami using machine learning? Where can I get the dataset...

@midhilesh wrote: I need to predict tsunami using Machine learning but I don’t know where can I find the dataset for that and the model for building it. Can anyone please help me regarding this issue...

View Article


Getting error while fitting a linear regression model

@mohitlearns wrote: Hello, I am trying to run the following Linear Regression code and I am getting the error as listed in the subject line. I have tried to research on the given error but I have not...

View Article

Can any one explain the below code?

@saisaranv wrote: channel_fin = ddply(channel_fin,~path,summarise, conversion= sum(convert)) can any one give me the idea to summerize a series of characters in my data …row by row , ex: Screen record...

View Article

Pls help me with the code

@saisaranv wrote: i want to perform if any cell in my data frame with characters(lowercase,uppercase),or any word then need to execute the below code . lapply(data27,lookup,key.match=data23) above...

View Article


COX ph coefficients interpretation

@omohiefe wrote: Quick question, say after running a Cox PH model I have results with a exp coefficient of 3.567, how can i interprete this as a percentage. ie a HR of 1.56 indicated group a has a 56%...

View Article


Non Linear Multiple regression

@jayanthd wrote: How to identify a multiple regression is Non Linear. There may be possibility when few independent variable are linear and other are non linear in relation to dependent variable....

View Article

How to integrate LDAP with Rshiny Dashboard application

@jai-prakash.yadav wrote: Hi All, I have a Rshiny Dashboard application and want to integrate LDAP so that user can first authenicate using LDAP and then can see dashboard. Any help/pointer in this...

View Article

Curse of Dimentionality

@AarushiS wrote: I am unable to understand the concept of ‘Curse of Dimensionality’. How can a large number of variables negatively affect the model performance? If we have more variables to...

View Article

Data Merging in Python

@rock_bt wrote: Hey all, I have a DataFrame like: df = pd.DataFrame({'B': ['B2', 'B3', 'B4', 'B5'], 'D': ['D2', 'D3', 'D4', 'D5']} And I want to create a ‘Key’ column 'key':...

View Article


Feature selection for incremental kmeans clustering

@veee wrote: Hi All, I have implemented incremental kmeans clustering [which use a cluster radius aka threshold to create clusters] and trying to identify the fraud transaction in a dataset. My...

View Article

What is weakly supervised learning?Can anyone help me with example?

@midhilesh wrote: I came across a research paper they used weakly supervised learning with CNN for object localization. I am unable to understand what basically a weakly supervised learning?...

View Article


PGDBA from IIM C + IIT K + ISI vs Continue working in analytics domain

@saurabh015 wrote: Hi, I am seeking help to decide, whether I should go for PGDBA from IIM C next year or continue working? Where am I coming from? I have been working in Mu Sigma from more than 3...

View Article

Image may be NSFW.
Clik here to view.

Relevant to bike-share dataset, why not split up the hours into 24 categories...

@nadirn wrote: I was following a 2 year old article on predicting bike share users from data using Machine Learning Analytics Vidhya – 25 Jun 15 Kaggle Bike Sharing Demand Prediction - How To Get In...

View Article


Coding Open ended questions in R - (Text Classification)

@jayanthd wrote: Hi There, I am trying to do sentiment analysis for a survey where I have more than 1000 comments. I have few themes like Industry, People, Communication and I need to identify who is...

View Article

Generating average values on dictionary of datasets

@bgarcial wrote: I have the followings pandas dataframe phreatic_level_l2n1_28w_df.head() Fecha Hora PORVL2N1 # PORVLxNx column change their name in each data frame 0 2012-01-12 01:37:47 0.65 1...

View Article

How to parse keyword which is in Sentence using NLTK?

@premsheth wrote: Hi friends, I am trying to do CV parsing from PDF file. My steps are follows: 1)convert pdf to text and list of sentences 2) extracting segment of experience from list of experience...

View Article
Browsing all 4448 articles
Browse latest View live