5 Reasons why DynamoDB is better than MongoDB

Posted on April 29, 2014 by Bhavesh Goswami | Comments(2)

If you are considering MongoDB or any other NoSQL databases, its a must that you consider DynamoDB. In the MongoDB vs DynamoDB matchup, DynamoDB has a lot of brilliant features that help ease the pain of running NoSQL clusters. Below I give five reasons to choose DynamoDB over MongoDB. Reason 1: People don’t like being woken up in the middle of the night One sure-shot way to motivate someone to rethink their priorities in life, and reconsider their choice in becoming an IT professional, is to hand them pager-duty for a MongoDB cluster. Maintaining a MongoDB cluster requires keeping the servers up and running, keeping the MongoDB processes up and running, and performance monitoring for the cluster. Check this image for example (time there are in UTC). In the middle of the night, a client’s MongoDB Cluster generated few automated CloudWatch alarms. At 4 AM the conversation between a systems engineer and me is like following: Engineer: Hey, got woken up by the pager, seems like CPU utilization is spiking, but requests are running fine. I looked around but found nothing. Can I just resolve this issue and look at it tomorrow? Me: You woke me up to just ask this?

Continue reading…

1000 jobs for BigData Analytics posted in 1 week !!

Posted on February 20, 2014 by Subhashini Tripathi | Comments(2)

I teach a BigData Analytics course in Bangalore and I routinely check up for jobs that exist in this domain on Naukri.com. (Naukri.com is the no.1 job site in the country). You must be hearing about BigData , Cloud technologies and Analytics being the ‘hottest’ jobs of the century. Quoting Harvard Business Review ,” Data Scientist: The Sexiest Job of the 21st Century. So who is a Data Scientist ? It’s a high-ranking professional with the training and curiosity to make discoveries in the world of big data. The title has been around for only a few years. (It was coined in 2008 by one of us, D.J. Patil, and Jeff Hammerbacher, then the respective leads of data and analytics efforts at LinkedIn and Facebook.) But thousands of data scientists are already working at both start-ups and well-established companies. Their sudden appearance on the business scene reflects the fact that companies are now wrestling with information that comes in varieties and volumes never encountered before. If your organization stores multiple petabytes of data, if the information most critical to your business resides in forms other than rows and columns of numbers, or if answering your biggest question would involve a

Continue reading…

Facebook Open Sources Presto SQL Query Engine

Posted on November 12, 2013 by Himanshu Sachdeva | Comments(0)

In June 2013 at Analytics @ WebScale conference, Facebook announced Presto which they were using internally to process petabytes of data. It has now been made open-source as per a recent post by Facebook Engineering. So what is Presto? Hive, which was initially developed by Facebook used MapReduce chaining to transform a query into multiple MapReduce Jobs. Presto different as it does not use MapReduce & is 10 times faster that Hive for most queries as per Facebook. Presto allows querying data where it lives, including Hive, HBase, relational databases or even proprietary data stores. You can issue SQL like queries on Presto that include left/right outer join, subqueries or even common aggregate functions. A single Presto query can combine data from multiple sources, allowing for analytics across your entire organization. Facebook uses Presto internally to interactively query over a petabyte of data by about 1000 employees running more than 30,000 queries a day. Currently its also being used by leading internet companies including Airbnb and Dropbox.   You can find more about Presto here : Presto Website Facebook Blog about Presto Gigaom Story

Career opportunities in Cloud Computing and Big Data

Posted on August 14, 2013 by CloudThat | Comments(0)

“The global analytics market is expected to reach $25 billion by 2015 and the global cloud market is expected to be ~$675 billion by 2020. Indian IT players need to capitalize on its already well established IT/BPM market presence by increasing their services portfolio beyond the typical IT offerings.” Social, Mobile, Analytics & Cloud – The Game Changers for the Indian IT Industry June 2013 – Dinodia Capital Advisors. Organizations have realized the importance of Big Data and are now looking for ways of gleaning insights from it that will be to their advantage. The large quantity, velocity and the diversity of this data has given rise to the need for Data scientists who are trained and have the ability to analyze data that is available at this magnitude. Cloud Computing on the other hand utilizes the internet to provide software and hardware capacities to businesses through third party vendors. Services involving the Cloud offer to reduce cost, complexity and save time. Players in the IT sector are seriously looking towards offering their services through the cloud as a result of the continuous and growing deployment of businesses on the cloud. This has created opportunities for firms and individuals who

Continue reading…