Big Data News – 24 Aug 2015

Top Stories

While real-time analytics is getting more affordable, it’s still not right for everything. Here are 10 ways to get the most from real time, near real time, and batch use cases.

Companies with strong revenue growth are more likely to have a chief data officer (CDO) than organizations with less impressive financial performance.

There are a lot of open source projects out there, and keeping track of them all is next to impossible. Here are five important ones in the Big Data space that you may not know about.

It appears to make sense to take a single, well-architected approach to dealing with big data. A specifically built mix of hardware and software should be better than a hand-built collection of bits cobbled together, surely?

Cybersecurity and big data analytics are two set of technologies that are frequently mentioned by CEOs and CIOs as top investment priorities. But what about marrying the two? Many organizations are not yet there. For example, a recent survey of government cybersecurity professionals found that 86 percent of respondents believe big data analytics could help improve cybersecurity, but only 28 percent are currently fully leveraging big data for security purposes.

I'd say: Multi-model database management has been around for decades. Marketers who say otherwise are being ridiculous. Thus, "multi-model"-centric marketing is the last refuge of…

Few businesses rely on big data when it comes to their marketing. But those who do rely on it to outsmart, outmatch, and outdo their competitors. Big data can transform any marketing strategy. It pinpoints customers needs, wants, and desires.

Over the last few months as I have transitioned into primarily supporting retail customers, I have been updating my knowledge of current trends in customer analytics, personalisation and data-driven marketing. Not surprisingly, this then led me to the topics of omni-channel and the connected consumer (which will no doubt give me plenty of ideas for future blogs).  An area of strength for Teradata…

Gunnar Wagenknecht introduces the Eclipse Bundle Recipes project, explaining how to turn a library from Maven into an OSGi bundle, and how to deploy recipes and build systems to a local environment.

Maximilian Koegel introduces Edapt, describing its basic features and demonstrating how it can be used for migrating models in real life applications

The term 'big data' has become ubiquitous as it has permeated nearly every industry over the last ten years. While definitions abound, its fundamental implications remain constant: it represents a cross-functional focus on leveraging exponentially growing volumes of data to increase operational performance and ROI. Though it hit the mainstream only recently, big data has […] The post Retailers Find Big Data Limiting for Promotions appeared first on Predictive Analytics Times.


Jose Valim introduces Elixir and some of the most important features: data types, modules, async, collections, parallelism, streams, etc.

Application building no longer means learning to code. Here are seven products and services that can help you develop apps without developing programming skills.

Who will be the next Amazon of health care? That's the question posed by PwC in a report that explored how the health care industry is innovating to meet the demands of the digital consumer, or in this case, patient. The general consensus from patients concluded that they wanted to "shop for their health care" […] The post Big Data is Driving Health Care appeared first on Predictive Analytics Times.

Professional services firm, which employs hundreds of thousands of workers in cities around the globe, has been quietly preparing for the "massive revolution" of getting rid of annual performance reviews and rankings, in its internal operations.

The Agile 2015 conference ran in Washington, DC earlier this month with over 2300 attendees from around the globe. This is a second article in a series looking at the sponsors who supported the conference. They were each asked to answer three questions — Who, What and Why. Who are they, What were they offering at the conference and Why did they choose to sponsor this event? By Craig Smith

David Nolen explores how ClojureScript makes UI programming a tangible process by demonstrating live application building for the web and iOS.

Big Design Upfront was considered so evil in the early days of Agile that it acquired its own acronym. But absent the careful thinking of good designers, our systems turn out to be mediocre collections of stuff that customers asked for or the data showed was desirable. It's time we relearned that great products start with asking the right questions.

Health insurer Humana is getting good results by making analytics the foundation of its clinical operations. Earlier this year a CDW survey revealed that analytics is a top priority for two thirds of decision-makers in the health care industry. Nearly 70 percent of respondents said they were planning for or already implementing analytics. Strong results […] The post Using Analytics to Improve Health Outcomes appeared first on Predictive Analytics Times.

It's a classic scenario. Two people meet at a party. They chat and then exchange information. However, they never speak or meet again. It is as though the contact information was never exchanged. So, what happened? Was there never intent to follow up? Or, did the information get lost, forgotten, or placed in a pile […] The post If I Text You, Will Your Analytics Text Me Back? appeared first on Predictive Analytics Times.

Four Spheres of Lean and Agile Transformation book by Thomas P. Wise and Reuben Daniel, is based on how management should create an organizational environment to implement Agile. They talk about the Agile readiness in the organization and how to begin a Lean or Agile implementation journey.

We take a look at Etsy's blameless postmortems, both in terms of philosophy, process and practical measures/guidance to avoid blame and better prepare for the next outage. Because failures are inevitable in complex socio-technical systems. It's the failure handling and resolution that can be improved by learning from postmortems.

In your business and every other, efficient management of assets matters. It matters to Colmobil, the #1 importer and distributor of cars, trucks and buses in Israel. Vehicles must go from Point A…

There is a bewildering array of big data platforms in the cloud today including multiple flavors of Hadoop and varied technologies from major vendors. The difficulty of selecting the "right cloud platform" is exacerbated by the fact that there are no established guidelines about how to configure the right cloud infrastructure to support a specific use case around a set of technologies and related workloads.

Data scientists who are interested in learning about the latest data enrichment tools and techniques may want to check out the Rich Data Summit, a one-day conference taking place in San Francisco this fall. It's the first event dedicated to the topic of data enrichment, according to event host CrowdFlower. Companies across all industries are jumping into data analytics and machine learning with the hope of harnessing big data for a competitive advantage. But before they can become data-driven, companies must first learn how to enrich their data.

Photographing an object at regular intervals over a long time period, and then stitching the results into a timelapse video, is a great way to "speed up time" and observe slow-motion changes (for…

MEATPACKING DISTRICT A walk through the meatpacking district in New York City is a lively affair. In the early 1900s, this part of town was known for precisely what its name implies:…

Data visualization techniques can give data scientists a vital tool for representing the data that analysts and line-of-business users need to make strategic decisions. Discover how a few simple considerations of a specific data set in a real-world use case enables data scientists to implement cost-effective data visualizations that can be applied in any industry.

Today's automobiles seem to be quickly morphing into connected devices on wheels. As the boundaries of the auto industry blur, and as new competitors get into the mix, see how traditional industry participants are utilizing real-time streaming data analysis and the Internet of Things to thrive amid this phenomenal technological disruption.

The ten-year anniversary of Hurricane Katrina and its impact on New Orleans compel Jen to reflect on how technology and data analytics have made great strides in enhancing governmental preparation and response to natural disasters.

We often get asked this question – Where do I begin? How are problems being solved using big-data analytics? To answer this question we need to take a step back and think in the context of the problem and a complete solution to the problem.

If you've thought about learning the R language but didn't know how to start, there's a new, free course on edX that starts you from the R basics and lets you learn R by trying R as you go. Presented…

University of Montana researchers maximized analytics to improve outcomes through highly effective and prompt brain trauma treatment and accelerated experimental research. Analytics also enabled quick identification of patients likely to develop post-traumatic epilepsy.

I love a good sale. It's hard to resist the temptation of getting a designer shirt for an eighth of the cost and filling your bag full of clothes for the same price you'd normally spend on a single dress. But I've learned the hard way, and heard my mother tell me a thousand times, that just because you buy something at a deep discount doesn't mean you need it or that it will look fabulous on you. The evidence is hanging in my closet with the tags still on, to boot.

Take a look at your historical forecasting performance data. (Every organization should be recording, at the very least, the statistical forecast (generated by the forecasting software) and the final forecast (after adjustments and management approval), to compare to the actual that occurred.

In the fourth article in the Conversation Patterns for Software Professionals series Michael focuses on asking the right questions. By Micha Bartyzel

Yakov Fain explains the state of Java, JavaScript, and web development today,explains reasons for choosing Dart or TypeScript, and why he's interested in web components and Polymer.

Four successive lightning strikes on a local utility grid in Europe caused a data loss at Google's Belgium data center. For Google, a company with a self-described "appetite for accuracy" in its data center operations, admitting an unrecoverable data loss as small as 0.000001% — as it did — likely came with a little bit of pain. The lightning strikes occurred Aug. 13 and the resulting storage system problems weren't fully resolved for five days. Google's post mortem found room for improvement in both hardware upgrades and in the engineering response to the problem.

Go 1.5 has a complete tool chain written in Go, a quicker garbage collector and runs a goroutine on each available CPU.

In a competitive business landscape like retail, access and usage of big data is a source of competitive advantage. In the past, entrepreneurs relied on intuition to make marketing decisions. Now, data rules how commerce is being done.

At the the Teradata Summit that was on in Sydney and Melbourne this week, Nikolaus Walkowsky, Global Head of Business Intelligence at DHL Express outlined some of the recent analytics innovations they have implemented. He started with an anecdote to demonstrate the central place which information holds at DHL. Way back in the 1960s, founders Hillblom, Dalsey and Lynn (DHL) saw that by flying bills of lading ahead of the goods being delivered they could expedite the customs process and deliver goods more quickly. Hence in some ways the information describing the goods was as central to the business as the goods themselves.

Peter Sommerlad covers compile-time computations available in C++14: constexpr functions and constants, literal types, variable templates, variadic templates and what can be expected in the future.

Matt Zimmer discusses architectural patterns including service decomposition, stateless application tiers, and polyglot persistence. He also presents migration strategies including shadow writes, KPI comparisons, feature dials, and fast rollbacks used by Netflix. By Matt Zimmer

At the MesosCon conference this week, Basho Technologies and Cisco are demonstrating the Riak key-value store database running atop Apache Mesos, the next-gen cluster manager developed by UC Berkeley's AMPLab. As Basho CEO Adam Wray tells Datanami, it's all about making data delivery more scalable. Basho develops a multi-modal NoSQL database that includes a key-value store and an object store, along with hooks into other big data tools like Apache Spark, Redis, and Solr€”with graph and columnar analytic capabilities possibly on the way.

More than 14 million people worldwide were diagnosed with cancer in 2012, according to the World Health Organization, and that number is expected to rise about 70 percent over the next 20 years. A sobering prediction magnified by the expense and time required to develop treatment plans.  Intel and the Knight Cancer Institute at Oregon Health & Science University (OHSU) hope to completely transform the way medical institutions treat cancer by leveraging big data analytics and open cloud technologies

More than 14 million people worldwide were diagnosed with cancer in 2012, according to the World Health Organization, and that number is expected to rise about 70 percent over the next 20 years. A sobering prediction magnified by the expense and time required to develop treatment plans.  Intel and the Knight Cancer Institute at Oregon Health & Science University (OHSU) hope to completely transform the way medical institutions treat cancer by leveraging big data analytics and open cloud technologies

Seven Unbeatable Workshops at PAW Boston The training workshops at PAW Business and PAW Healthcare in Boston are unbeatable. With training programs dedicated to topics covering R, predictive modeling, ensemble methods, healthcare and more, there is something for every level of predictive analytics practitioner. Take your understanding of predictive analytics up a notch by signing up for the workshops below and enjoy Data Science Central's discount code DSCPAW150.

NASCAR is seeking to engage with racing enthusiasts via a new analytics platform that seeks to leverage, among other things, unstructured text data collected directly from NASCAR fans. The motor racing organization began working with partner HP Enterprise Services in 2013 to develop the Fan and Media Engagement Center, or FMEC. The platform is designed to track social media and break down unstructured text data using real-time analytics.

The cloud is the answer. Now, what was the question? Depending on your point of view, this comment either makes you pump your fist in enthusiastic agreement — or shake your head in strong defiance. Folks tend to be either hardcore cloud "fans" or vehement cloud "detractors," and often there is not much middle ground between them. Having been around the cloud world for several years now,

Mention big-data tools like Spark and Kafka to most enterprise users, and the other big-data tool that comes to mind along with them is Hadoop. But does it need to? Mesosphere, corporate backers of the Apache Mesos cluster-management project, are ginning up a big-data stack that eschews Hadoop, but embraces Spark (and Kafka, and Cassandra, and the Akka event framework) for real-time processing. [ Download the InfoWorld quick guide: Learn to crunch big data with R. | Sign up for InfoWorld's Big Data Report to stay atop all the latest news and developments in the field. ] Mesosphere Infinity, as the new product is called, is "a turnkey, full-stack offering optimized for big data and IoT," and its main aim is to provide an easily erected stack for businesses for real-time data work. But it also stands as a recent example of how many of the technologies reflexively associated with the Hadoop stack don't require Hadoop to be useful.

Published Date: 2015-08-20 17:04:14 UTC Tags: Analytics, Data Science, Data Warehousing, Predictive Analytics, Social Collaboration Title: Analytics In Space: The Final Frontier Subtitle: What can data from the great unknown tell us?

Companies across nearly all industries are exploring how to use GPU-powered deep learning to extract insights from big data. From self-driving cars and voice-directed phones to disease-detecting mirrors and high-speed securities trading, the potential use cases for the technology are large and expanding by the day. Ever since computer scientist Geoff Hinton decided to try training a neural network on a GPU and essentially invented the field of deep neural networks several years back, researchers have been racing to apply the technique to tough modeling problems in the real world.

While real-time analytics is getting more affordable, it's still not right for everything. Here are 10 ways to get the most from real time, near real time, and batch use cases.

In a marketplace of rapid, unprecedented change and sustained volatility, traditional planning, budgeting and forecasting processes may not keep pace with shifting trends. However, new advanced planning and forecasting analytics software enables financial professionals to support an enterprise's goals regardless of business fluctuations.

Responding to an emergency is difficult. How can analytical data determine what to prioritize when managing an emergency?

Moviegoers aren't alone, analytics needs a superteam, too. Learn how four critical success factors come together to create more than the sum of their parts.

by Joseph Rickert One great beauty of the R ecosystem, and perhaps the primary reason for R's phenomenal growth, is the system for contributing new packages. This, coupled to the rock solid stability…

Inaccurate perceptions of predictive analytics are common in the business world. In reality, predictive analytics is straightforward to understand, can leverage existing skillsets in business and IT organizations, and can deliver value in most industries and lines of business. Getting started with the technology simply requires strong business problem-solving skills, a supportive IT support organization, and analysts who can ensure that the right predictive algorithms are being used to extract useful insights from data.

People love their smartphones, but they're not especially thrilled with their wireless providers. That's the key takeaway from the 2015 American Customer Satisfaction Index (ACSI) survey of telecommunications customers. The wireless industry was among the least favored groups in the study, with only social media and airlines earning lower levels of customer satisfaction. The survey results, released last month, show that customer satisfaction with wireless telephone service dropped 2.8 percent from the prior year. Ouch.

Customer relationship management data will offer a way for utilities to redefine their relationships with customers. Traditionally, the utilities sector has been somewhat shielded from economic struggles, but in 2015, customers have options like solar panels that are getting cheaper by the year and helping some go off the grid.

Web UI Framework Bootstrap has announced version 4 and released an alpha version. This is a major update that accepts more modern web techniques and drops support for old browsers.

"Lean Enterprise" book authors discuss how traditional management practices fail to balance innovation and product exploitation as they require very different sets of capabilities.

An investigation was done to find out which skills are desired by employers for project management positions and what it takes to be successful as a project manager. An interview about skills for managing projects, likability of bosses and leadership skills, why people feel miserable at work and what can be done to prevent it and the need for project managers when organizations migrate to agile.

Chris Richardson describes how to implement business logic using a domain model that is based on event sourcing. He compares and contrasts a hybrid OO/FP design with a purely functional approach. Chris also shows how Domain Driven Design concepts such as bounded contexts and aggregates fit in with event-driven microservices.

Vaclav Petricek discusses how to train models as well as architect and build a scalable system powered by Storm, Hadoop, Spark, Spring Boot and Vowpal Wabbit that meets SLAs measured in tens of milliseconds

This entry was posted in News. Bookmark the permalink.