@aeren wrote:
I would like to cluster mixed-type data with 50k rows and 10 features (columns). I am using R on a 64-bit PC. When I calculate the dissimilarity (distance) matrix with the "daisy" function, I get the error "Error: cannot allocate vector of size X GB".
gower_dist <- daisy(df, metric = "gower")
This is the command I use to generate the distance matrix. How can I process the data in chunks to avoid running out of RAM?
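For scale, here is a rough estimate of how much memory the full pairwise result needs. It is only a back-of-the-envelope sketch: it assumes the result is stored as a lower triangle of 8-byte doubles (as a "dist"-style object is), and it ignores any temporary copies made during the computation, so the real peak usage is probably higher. The variable names are just for illustration.

```r
# Rough memory estimate for a full pairwise dissimilarity object.
# Assumption: one 8-byte double per unordered pair of rows (lower triangle only).
n <- 50000                    # number of rows, as stated in the post
n_pairs <- n * (n - 1) / 2    # number of unordered row pairs
bytes <- n_pairs * 8          # 8 bytes per double
gib <- bytes / 1024^3         # convert bytes to GiB

cat("pairs:", format(n_pairs, big.mark = ","), "\n")
cat("approx. size in GiB:", round(gib, 2), "\n")
```

With 50k rows this comes to roughly 1.25 billion pairs, or a little over 9 GiB for the result alone, which would explain the allocation error on a machine with less free RAM than that.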