Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Browsing all 4448 articles
Browse latest View live

Are there any MS data science or related programs for MS/PhD Health Science...

@venkata55 wrote: Hello all, I am an experienced scientist in Health sciences/epidemiology. I am working towards career transition to data science. Are there any masters programs available? Thank you...

View Article


Can anyone suggest a good platform for storing analytical data for structured...

@sourabhadap wrote: Looking for a platform where I can fetch the data from MySQL and then store the analytical data in order to slice and dice the data which is performed in OLAP architecture , or any...

View Article


Image may be NSFW.
Clik here to view.

Where to post for errors, grammatical mistakes, etc. for articles written at AV?

@shivendra90 wrote: Just my two cents here. This is regards to this article on feature engineering. Well, the images about two automobiles being assembled; the one on the left is a 1980s image and...

View Article

Image may be NSFW.
Clik here to view.

Complete Machine Learning Guide to Parameter Tuning in Gradient Boosting...

@louis wrote: Analytics Vidhya – 21 Feb 16 Guide to Hyperparameter Tuning in Gradient Boosting (GBM) in Python This is a guide on hyperparameter tuning in gradient boosting algorithm using Python to...

View Article

Complete Machine Learning Guide to Parameter Tuning in Gradient Boosting in...

@louis wrote: Please am referring to your topic on parameter tunning using GBM. Below is your code which I edited to include my dataframe code: x_train,x_test,y_train,y_test=train_test_split(...

View Article


XGBoost: hyperparameter tuning makes AUC worse ?!

@clare_che wrote: I am using Xgboost for a Machine Learning task, my dataset is relatively small, with 3153 observations and 46 features. I follow the steps in...

View Article

Doing upsampling of data is a valid move in machine learning hackathons?

@himanshukhatri wrote: Hi folks, my problem is regarding multiclass classification when given an unbalanced data in any ML hackathon, performing upsampling or undersampling will it violate the rules...

View Article

Zero inflated reponse with Random Forest and Gradient Boosting regressors

@lucia89 wrote: Hi! I have a variable reponse with a lot of zeros and positives values in the remaining part and some categorical and quantitive variables as features. Given a Tweedie or Poisson...

View Article


Running Java code on the GPU with Aparapi

@jorba wrote: I recently found out about Aparapi and I wondered if you guys are aware of this. It basically converts any native Java code into OpenCL, so that the Java code can run on the GPU. Because...

View Article


Image may be NSFW.
Clik here to view.

R mlr Methods for defining parameters

@sijs wrote: Hi, I have a question about R mlr Methods for defining parameters from a blog post, point “4. Random Forest”: Analytics Vidhya – 8 Aug 16 Practicing Machine Learning Techniques in R with...

View Article

NLP Tokenization

@train.bi wrote: Hi Team Architecture Experience - 5 years minimum across multiple disciplines (Agile Methodologies, Scrum Framework, Risk management, Technology planning, design, development, System...

View Article

Decision Tree Pruning and other related queries

@ismail18 wrote: Hi All, I am currently doing ML practice problem where I need to predict “item_sales” (continuous variable). Feature variable are a mix of continuous and categorical variables. I am...

View Article

Time series forecasting using ML techniques

@do_it wrote: Hey please add, how to forecast time series data with machine learning models in the syllabus Posts: 2 Participants: 2 Read full topic

View Article


Factorization Machines & their application on huge datasets (with codes in...

@Prachi12 wrote: Hi This is referring to the article - https://www.analyticsvidhya.com/blog/2018/01/factorization-machines/#comment-151820. I have been facing kernel dying error while I run the...

View Article

Implement neural network using tensorflow: UnboundLocalError

@haiyang wrote: I got lost what is wrong with this: Error: UnboundLocalError: local variable ‘batch_y’ referenced before assignment here is the tf.Session with tf.Session() as sess: sess.run(init) for...

View Article


Doubt about the div() in crosstab

@palash82 wrote: I cannot understand how the div( ) is working here. Does it divides the column(because axis=0) by column sum? Gender.div(Gender.sum(1).astype(float), axis=0).plot(kind="bar",...

View Article

KNN Skill test Q.13

@chdeepak96 wrote: Could someone explain the solution [https://www.analyticsvidhya.com/blog/2017/09/30-questions-test-k-nearest-neighbors-algorithm/] 13) Which of the following value of k in k-NN...

View Article


Multivariate & Multi-class dependent text classification?

@rupesh.thakur wrote: Working on Text Classification which has two Class, Text,class1,class2 class1 = [‘a’,‘b’,‘c’] class2 can be [‘a1’,‘a1’,‘a3’] if class1 = ‘a’ class2 can be [‘b1’,‘b2’,‘b3’] if...

View Article

Feature selection with LDA

@Madhanlal wrote: Hi This is relating to the article https://www.analyticsvidhya.com/blog/2016/12/introduction-to-feature-selection-methods-with-an-example-or-how-to-select-the-right-variables/ As per...

View Article

Image may be NSFW.
Clik here to view.

How can I stack one feature-engineering based model and another one...

@shounakrockz47 wrote: I have a StackOverflow question answer dataset. ( this is a classification problem ) So , far I have created two different models. Model 1: LightGBM model optimized. Data fed...

View Article
Browsing all 4448 articles
Browse latest View live