In their zeal to collect as much operational data as possible, organizations hoping to gain an advantage through the use of big data will also need to rethink how they process, analyze and present that material.
"When all this information finally gets to the business, it is difficult for the business to understand what to glean out of the data," said Sharmila Shahani-Mulligan, CEO and co-founder of big-data startup ClearStory Data. "We know this has been a problem for several years now."
Shahani-Mulligan was one of a number of speakers at the O'Reilly Strata + Hadoop World conference Thursday in New York to offer tips on making the move from data to big data. She suggested that the executive dashboard is giving way to emerging technique of interactive storytelling, which gives data more readily apparent context and meaning.
Meanwhile, organizations should watch Google closely, advised M.C. Srivas, chief technology officer of Hadoop distributor MapR Technologies. "Google, with its vast and varied infrastructure, can provide us with a glimpse into the future of where computing is going," said Srivas, who worked at Google before co-founding MapR.
One of the basic rules to pick up from Google is that "more data beats complex algorithms," Srivas said. "This is something that Google has demonstrated again and again: The company that can process the most data will have an advantage over everybody else in the future."
A number of MapR customers are following this principle, Srivas said.
Millennial Media, a leader in the mobile advertising market, collects up to about 4TB of mobile user data each day, combining with petabytes of data on hand to build profiles of mobile users.
Cisco collects data from its firewalls worldwide, aggregating about a million events per second, all to better detect security threats. Credit agency TransHuman collects data from multiple sources to provide real-time credit scores.
But once an organization has committed to collecting more data, the question becomes what to do with it.
Visualization is a handy tool, but picking the correct visualization is vitally important, advised Miriah Meyer, an assistant professor in the University of Utah's School of Computing.
The most challenging and important step in visualization is "gaining an understanding of the user's needs and then being able to translate that into a set of visualization requirements," Meyer said.
Meyer worked with one researcher who was comparing the human genome with that of lizards. The researcher tried off-the-shelf data visualization tools, but found that they hid many pertinent details and were not intuitive to work with.
The tool Meyer helped create, called Mizzbee, allowed the researcher to get insights from the data that couldn't be gleamed from generic visualization software.
Sign up for Computerworld eNewsletters.