Hadoop and MapReduce have long been mainstays of the big data movement, but some companies now need new and faster ways to extract business value from massive — and constantly growing — datasets.
Code submitted this week for inclusion in the Hadoop stack will help speed the spread of the distributed big-data platform, according to Hortonworks co-founder Arun Murthy. The submission of the ...
Cloudera is opening its Impala open source real-time query project to public beta and adding its second subscription offering, Cloudera Enterprise RTQ, in a bid to speed up Hadoop data crunching. In ...
Hadoop was named after a toy elephant, sounds like a Dr. Seuss character, and it's the hottest thing in big-data technology. It's a software framework that makes short work of a tall task—managing ...
When Morgan tried to do some portfolio analysis 18 months ago, it found that traditional databases and grid computing just wouldn’t scale to the very large volumes of data that its data scientists ...
DataTorrent made its primary product, DataTorrent RTS, generally available today. The product is built on top of Hadoop 2.0 and allows companies to process massive amounts of big data in real time.
In a world of real-time data, why are we still so fixated on Hadoop? Hadoop, architected around batch processing, remains the poster child for big data, though its outsized reputation still outpaces ...
It’s been a big year for Apache Hadoop, the open source project that helps you split your workload among a rack of computers. The buzzword is now well known to your boss but still just a vague and ...
One big disadvantage that comes with a hybrid cloud strategy is forcing your developers to learn and understand the different techniques required by cloud providers and on-premises software vendors ...
In the 1800s, John Godfrey Saxe wrote a poem about six blind men and an elephant based on an old Indian story. In an effort to discover what the elephant is, each man touches a different part of the ...