Hadoop@SG

chewren123

Member
Joined
Feb 26, 2009
Messages
441
Reaction score
4
Have you wondered how Google or Facebook handle all their data? All these Big Data !

Yes, now you can also create it too. With Hadoop.

Hadoop has generated remarkable buzz and excitement in recent years.

Hadoop, an open-source, distributed programming framework relies on parallel processing to store and analyze tremendous amounts of structured and unstructured data.


Unfortunately, after googling for days, I could not find any Singapore group who comes together to share and contribute.

Whether you are a beginner, or hiding in the closet of experience, do come out to share and contribute. We aim to create our very first Hadoop.

Please help and spread the word. Kum Sia.

For a start, do join : Hadoop @ SG


What is Hadoop ?
 

chewren123

Member
Joined
Feb 26, 2009
Messages
441
Reaction score
4
Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone
 

chewren123

Member
Joined
Feb 26, 2009
Messages
441
Reaction score
4
besides R, what are the languages that we need to know?

R is not an absolute requirement

I may prefer Python. You may not. It's a matter of preference

Most of the scripting languages like php, python, perl, ruby bash is good.

On most linux distros used to setup hadoop clusters, python, bash, ruby, perl... are already installed but nothing will prevent to roll up your own execution environment for your favorite scripting or compiled programming language.

The difference between java and scripting language, it is "Heart Beat of child nodes will not be sent to the parent nodes when we are using scripting languages".

Though Java code is most common, any programming language can be used.
 

AnimeNewbie

Suspended
Joined
Nov 1, 2003
Messages
8,050
Reaction score
1,966
I am a C++ and .NET programmer. And Hadoop does not support native and .NET, rite? I have to use Java which I do not have experience.
 

NSforSG

High Supremacy Member
Joined
Nov 17, 2010
Messages
34,250
Reaction score
2
R is not an absolute requirement

I may prefer Python. You may not. It's a matter of preference

Most of the scripting languages like php, python, perl, ruby bash is good.

On most linux distros used to setup hadoop clusters, python, bash, ruby, perl... are already installed but nothing will prevent to roll up your own execution environment for your favorite scripting or compiled programming language.

The difference between java and scripting language, it is "Heart Beat of child nodes will not be sent to the parent nodes when we are using scripting languages".

Though Java code is most common, any programming language can be used.

i see. look like i need to read up more about Hadoop first!
 
Important Forum Advisory Note
This forum is moderated by volunteer moderators who will react only to members' feedback on posts. Moderators are not employees or representatives of HWZ Forums. Forum members and moderators are responsible for their own posts. Please refer to our Community Guidelines and Standards and Terms and Conditions for more information.
Top