Quantcast
Channel: Data Science, Analytics and Big Data discussions - Latest topics
Viewing all articles
Browse latest Browse all 4448

How to Chunk large dissimilarity / distance matrices in R?

$
0
0

@aeren wrote:

I would like to cluster mix-type data that contains 50k rows and 10 features/columns. I am using R in my 64 bit PC. When I calculate dissimilarity / distance matrix with “daisy” function, I got “Error: cannot allocate vector of size X GB” error.

gower_dist <- daisy(df, metric = “gower”).

This is the command to generate distance matrix. How to handle this script with chunks to avoid RAM error ?

Posts: 1

Participants: 1

Read full topic


Viewing all articles
Browse latest Browse all 4448

Trending Articles