Help : Finding patterns of identical values across fields in large data

I’m looking for either a visualisation technique, or preferably an algorithm or approach ( and preferably implementable in R) for the following pattern searching problem.

I have a dataset of 1000s of customers where the customer has supplied numerical data. I want to find automatically groups of customers who have provided identical numbers in multiple fields.
I’m not looking to find where every field between customers has been filled in the same, only 2 or more of the fields.

So on the dummy data below:

Customer. f1. f2. f3 … f17. f18. f19. f20
A. 2. 5. 7. … 3. 11. 4. 8
B. 2. 6. 7 … 1. 11. 7. 5
…
C. 1. 1. 2 … 1. 11. 7. 9
…
Z. 6. 5. 8 … 3. 9. 6. 8

In the above data: customers A and B share fields f1, f3 and f18
Customers B and C share fields f17, f18 and f19
Customers A and Z share fields f2, f17 and f20

I want to automatically find / highlight these 3 scenarios .

I can obviously find these patterns by grouping by and counting, and then look for groups with a count greater than 1…but I would have to do that for all combinations of two fields, then all combinations of three fields, then four, etc

And a standard k means clustering approach doesn’t really do it.

Is there an algorithm, approach that anyone could recommend?

Regards

Andy

1 post - 1 participant

Read full topic

Help : Finding patterns of identical values across fields in large data

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

Neem Baba Extra Questions Answer Class 6 English Poorvi

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Lowe faces four theft charges

Practice Sheet of Right form of verbs for HSC Students

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

The 10 Tennessee Cities With The Largest Black Population For 2021

Materials Around Us Class 6 Worksheet Science Chapter 6

デスクトップヒープの枯渇

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Kanulanu Thaake Lyrics and translation | Manam (2014)

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Teen Shot In Miami Drive-By Dies From Injuries

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

Mahakal Attitude Status

Property developer set up cannabis factory to help pay off debts...

♡

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...