@ismail18 wrote:
Hi All,
I am currently working on an ML practice problem where I need to predict “item_sales” (a continuous variable). The feature variables are a mix of continuous and categorical variables. I am following these steps:
- Taking all the feature variables
- Imputing missing values in continuous variables with the mean, and in categorical variables with the mode
- One-hot encoding the categorical variables
- Fitting a decision tree regressor and getting predictions and the R² score
- Since decision trees often overfit, pruning the tree through hyperparameter tuning of max_depth, min_samples_split, etc. using GridSearchCV
- Getting an improved and more robust model (a minimal sketch of this pipeline follows the list)
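For reference, here is a minimal sketch of what I am doing. The file name and every column except item_mrp and item_sales are hypothetical placeholders standing in for my actual data:

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder
from sklearn.tree import DecisionTreeRegressor

# Placeholder columns; only "item_mrp" and "item_sales" are real names from my data.
num_cols = ["item_mrp", "item_weight"]   # continuous features
cat_cols = ["item_type", "outlet_size"]  # categorical features

df = pd.read_csv("train.csv")            # placeholder file name
X, y = df[num_cols + cat_cols], df["item_sales"]

preprocess = ColumnTransformer([
    # mean imputation for continuous features
    ("num", SimpleImputer(strategy="mean"), num_cols),
    # mode imputation followed by one-hot encoding for categorical features
    ("cat", Pipeline([
        ("impute", SimpleImputer(strategy="most_frequent")),
        ("onehot", OneHotEncoder(handle_unknown="ignore")),
    ]), cat_cols),
])

model = Pipeline([
    ("prep", preprocess),
    ("tree", DecisionTreeRegressor(random_state=42)),
])

# Pruning-related hyperparameters tuned with GridSearchCV
param_grid = {
    "tree__max_depth": [3, 5, 8, None],
    "tree__min_samples_split": [2, 10, 50],
    "tree__min_samples_leaf": [1, 5, 20],
    "tree__ccp_alpha": [0.0, 0.001, 0.01],  # cost-complexity pruning strength
}

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)
search = GridSearchCV(model, param_grid, cv=5, scoring="r2")
search.fit(X_train, y_train)
print("best params:", search.best_params_)
print("test R^2:", search.best_estimator_.score(X_test, y_test))
```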
Here are my observations and concerns:
Q1. The continuous variable “item_mrp” is getting a very high relative feature importance compared to the others. Why is that? (A snippet showing how I inspect the importances follows Q4.)
Q2. Does one-hot encoding make categorical variables less relevant compared to continuous variables?
Q3. Should I consider dimensionality reduction to improve robustness and reduce overfitting? (My data does not have many features, even after one-hot encoding.)
Q4. What can we do to build decision trees that give a high R² score but are also robust (do not overfit and perform well on unseen data)?
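For context on Q1 and Q2, this is roughly how I read the importances out of the fitted pipeline from the sketch above. It reuses the placeholder names from that sketch and assumes a recent scikit-learn where the transformers support get_feature_names_out:

```python
import pandas as pd

# Continuing the sketch above: map importances back to feature names.
best = search.best_estimator_
feat_names = best.named_steps["prep"].get_feature_names_out()
importances = pd.Series(
    best.named_steps["tree"].feature_importances_, index=feat_names
)
print(importances.sort_values(ascending=False))  # item_mrp dominates here

# Re Q2: one-hot encoding spreads a categorical variable's importance across
# its dummy columns, so I also sum the dummies back to the original column
# before comparing categoricals against continuous features.
def original_column(name):
    prefix, rest = name.split("__", 1)   # e.g. "cat__item_type_Dairy"
    if prefix == "cat":
        for col in cat_cols:
            if rest.startswith(col + "_"):
                return col
    return rest                          # continuous features pass through

print(importances.groupby(original_column).sum().sort_values(ascending=False))
```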
These questions are specifically about decision trees, so please answer accordingly. Any help is much appreciated.