The past year has been dominated by Big 
Data.  What it might mean and the way you might look at it. The stories 
have often revolved around Hadoop and his herd of chums. Vendors and 
analysts alike have run away and joined this ever-growing and rapidly 
moving circus. And yet, as we saw in our own EMA and 9sight Big Data Survey,
 businesses are on a somewhat different tour. Of course, they are 
walking with the elephants, but many so-called Big Data projects have 
more to do with more traditional data types, i.e. relationally 
structured, but bigger or requiring faster access. And in these 
instances, the need is for Big Analytics, rather than Big Data. The 
value comes from what you do with it, not how big it happens to be.
Which brings us to Big Blue.  I've been reading IBM's PureSystems announcement
 today.  The press release headline trumpets Big Data (as well as 
Cloud), but the focus from a data aspect is on the deep analysis of 
highly structured, relational information with a substantial upgrade of 
the PureData for Analytics System, based on Netezza technology, first 
announced less than four months ago.  The emphasis on analytics, 
relational data and the evolving technology is worth exploring.
Back
 in September 2010, when IBM announced the acquisition of Netezza, there
 was much speculation about how the Netezza products would be positioned
 within IBM's data management and data warehousing portfolios that 
included DB2 (in a number of varieties), TM1 and Informix. Would the 
Netezza technology be merged into DB2?  Would it continue as an 
independent product?  Would it, perhaps, die? I opined
 that Netezza, with its hardware-based acceleration, was a good match 
for IBM who understood the benefits of microcode and dedicated hardware 
components for specific tasks, such as the field programmable gate array
 (FPGA), used to minimize the bottleneck between disk and memory. It 
seems I was right in that; not only has Netezza survived as an 
independent platform, as the basis for the PureData System for 
Analytics, but also being integrated behind DB2 for z/OS in the IBM DB2 
Analytics Accelerator.
Today's announcement of the PureData 
System for Analytics N2001 is, at heart, a performance and efficiency 
upgrade to the original N1001 product, offering a 3x performance 
improvement and 50% greater capacity for the same power consumption. The
 improvements come from a move to smaller, higher capacity and faster 
disk drives and faster FPGAs. With a fully loaded system capable of 
handling a petabyte or more of user data (depending on compression ratio
 achieved), we are clearly talking big data. The technology is purely 
relational. And a customer example from the State University of New 
York, Buffalo quotes a reduction in run time for complex analytics on 
medical records from 27 hours to 12 minutes (the prior platform is not 
named). So, this system, like competing Analytic Appliances from other 
vendors, is fast.  Perhaps we should be using images of cheetahs?
Original article
 
Big Analytics Rather Than Big Data
Subscribe to:
Post Comments (Atom)
 






0 comments:
Post a Comment