Error in XGBoost Cross and Validation Prediction Output in R

@supra_minion wrote:

Hi

I am working on a data set in R. It required predicting a categorical variable. The output variable has two categories 1 and 0. In XGboost, I've taken num_class parameter as 2.

There are 600 rows in Training Set and 350 rows in test set.

** I am facing multiple issues.**

First Problem
After I run the Xgboost model with cross validation:

xg_model <- xgb.cv(data=data.matrix(dum_train[,-1]), label=x, objective="multi:softprob", nfold = 10, num_class=2, nrounds=200, eta=0.1, subsample=0.5, colsample_bytree=0.5,max_depth=6,min_child_weight=1,eval_metric="merror", prediction=T)

The result shows up like this:
[179] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[180] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[181] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[182] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[183] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[184] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[185] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[186] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[187] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[188] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[189] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[190] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[191] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[192] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[193] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[194] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[195] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[196] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[197] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[198] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000
[199] train-merror:0.000000+0.000000 test-merror:0.000000+0.000000

Question 1: Does this validation result suggest I am over-fitting too much ? If yes, what can I do to avoid over-fitting ?

Second Problem

After running this model, I predicted values on my test set. As mentioned above, my test set has 350 rows, I expect the predicted values from model to be 350. But, the predicted values I get is 700. Double the number of values in test set.

*Question 2: Why is this happening ? What am I doing wrong here ?

Posts: 2

Participants: 2

Read full topic

Error in XGBoost Cross and Validation Prediction Output in R

Trending Articles

Bath man appears in court charged with attempted murder of a man...

MACLEAN, Allan

Black Angus Grilled Artichokes

Practice Sheet of Right form of verbs for HSC Students

Police blotter for Jan. 12

99 God Status for Whatsapp, Facebook

Rajasthan Board 12th Science Result 2018 name wise- RBSE 12th commerce result...

Notorious Naushad of Ippa gang nabbed

Child Kidnapping: Amy McNeil was kidnapped on her way to school by 5 adults;...

Sonible Smartlimit v1.1.5-R2R

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

Arrow Flash 2 – Sinhala Dubbed – Episode 23 – 20th March 2016

[GET] AI Traffic Goldmine

[E² Plugin] HDF-Radio

Universal Multi-Patch v1.3 By RADIXX11

IWAN – Thanks and Praise ( Throw Back Thursday )

RONALD P SONDERGAARD Arrested by Miami-Dade County Corrections on Mar 03, 2017

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

HSSC Excise & Taxation Inspector Result 2017 Scorecard/ Category Wise Merit List