AWS Hadoop JSON Data Processing

@Rashnil wrote:

Hi friends,
I am new to big data and have recently started working on a cloud-based (AWS) Hadoop project. I have set up the AWS environment (4 t2.large EC2 instances with a 100 GB data volume per instance) and have installed the Cloudera distribution. I have tested a couple of examples (word count, CSV files, etc.).

Now, my main project is to analyze research article data in JSON files. I have around 4 million JSON files, close to 70 GB of data, with each file containing all the information for one article (i.e. around 4 million articles). The files are unrelated to each other and run to 340+ lines each in a multi-level (nested) structure. They are spread across 400 folders, with each folder containing 10,000 JSON files. I want to bring this data into a form that can be analyzed. I am a bit stuck here and not sure how to move forward.

Maybe I could convert it to CSV, but that conversion may take a long time. I am not sure whether dumping the files into HDFS and running MapReduce on top of them is a good idea, or whether I should move the data to Hive. The number of files and their total size have made me a little hesitant to move forward.
Please advise on a possible approach.
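
To make the "bring it to a form that can be analyzed" part concrete, here is a rough Python sketch of the kind of preprocessing I have in mind: merging the small JSON files of one folder into a single JSON Lines file (one article per line), since HDFS generally copes better with a few large files than with millions of small ones. The names `flatten` and `merge_folder` and the dotted-key layout are only assumptions for illustration, not a tested pipeline.

```python
import json
from pathlib import Path


def flatten(value, prefix=""):
    """Flatten nested dicts/lists into a single dict with dotted keys."""
    flat = {}
    if isinstance(value, dict):
        for key, child in value.items():
            flat.update(flatten(child, f"{prefix}{key}."))
    elif isinstance(value, list):
        for index, child in enumerate(value):
            flat.update(flatten(child, f"{prefix}{index}."))
    else:
        # Leaf value: drop the trailing dot from the accumulated prefix.
        flat[prefix[:-1]] = value
    return flat


def merge_folder(folder, out_path):
    """Write each *.json file in `folder` as one line of `out_path`.

    Returns the number of articles written.
    """
    count = 0
    with open(out_path, "w", encoding="utf-8") as out:
        for path in sorted(Path(folder).glob("*.json")):
            with open(path, encoding="utf-8") as src:
                article = json.load(src)
            out.write(json.dumps(flatten(article), ensure_ascii=False) + "\n")
            count += 1
    return count
```

Run once per folder, this would leave about 400 larger files instead of 4 million small ones, and those could then be copied to HDFS for whichever tool ends up being used.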

Looking forward to hearing from you. Thanks in advance.

