Monthly Archives: November 2011

Huge Data Processing Applying Hadoop Cluster – Part 4

Finely we are getting close to the project target which was finding spam tweets in twitter logs and find out if the spams occur more in the advertisements or not. In the previous posts from this series we learned how … Continue reading

Posted in Cloud Computing, Java, Software Engineering | Leave a comment

Getting Familiar with Alternative Technologies

My professional work as a software developer has been starting at the time DOS was the only available OS. I didn’t even use SQL. Pascal was the only rapid development language. The application doesn’t have any certain architecture. Everything was … Continue reading

Posted in Software Engineering, Software Market Demands | 1 Comment

Huge Data Processing Applying Hadoop Cluster – Part 3

MapReduce In the previous posts we have deployed a Hadoop cluster, then we’ve developed a Java class which can be used as an additional Hadoop command for gathering Tweets from different log files and merge them all into a single … Continue reading

Posted in Cloud Computing, Java, Software Engineering | Leave a comment

Huge Data Processing Applying Hadoop Cluster – Part 2

The previous post has been learning us how to set up and deploy a real Hadoop cluster. As I mentioned the target of this little project is to port a number of huge log files such as Twitter logs into … Continue reading

Posted in Cloud Computing, Java, Linux | 2 Comments