Big Data News – 30 May 2016

Today's Infographic Link: The World’s Worst Engineering Mistakes

Featured Article
Telling a compelling story with your data helps you get your point across effectively. Here are four tips to keep your data from getting lost in translation.

Top Stories
With so many options on the table these days, navigating the power/cooling divide can be tricky.

Lattice Engines, the provider of predictive marketing and sales applications to the enterprise, launched the Lattice Predictive Insight Platform.

Five years ago, in a taxonomy of analytic business benefits, I wrote: A large fraction of all analytic efforts ultimately serve one or more of three purposes: Marketing Problem and anomaly detection…

Gives sales people instant access to key data from multiple sources to visually see the whole story of each customer's experience

In the past few years we've seen software giants like Alphabet's Google and Amazon.com penetrate more and more industry verticals. Google began as a search bar but now plays in the mobile (Android), media (YouTube), hardware (Motorola), transportation (Waze) and IoT (Nest) spaces as well. Likewise, Amazon.com was originally a site to sell books online, but now sells every product under the sun and operates the world's largest public cloud through Amazon Web Services (AWS).

The monetisation of personal data of the users by the companies in return for free services is at the root of the challenges

Microsoft's CEO Satya Nadella is visiting India, reflecting the growing importance of the country as a market for multinational technology companies. Nadella's visit follows the first visit to India by Apple CEO Tim Cook, who visited the country this month to drum up support for the company's plans to offer refurbished iPhones in the price-sensitive market as well as to get permission to set up its wholly-owned stores in the country.

GridGain Systems, provider of enterprise-grade In-Memory Data Fabric solutions based on Apache® Ignite™, announced that e-Therapeutics plc (LSE ETX), a U.K.-based drug discovery and development group, is using the GridGain In-Memory Data Fabric to run hundreds of thousands of computational analyses in minutes.

Sumo Logic, a leading cloud-native, machine data analytics service, announced the availability of its Sumo Logic App for Amazon Web Services (AWS) Lambda allowing DevOps and developers to monitor the health of all AWS Lambda functions and better manage the associated costs.

Monday newsletter published by Data Science Central. Previous editions can be found here.   Featured Resources and Technical Contributions Making data science accessible — Logistic Regression + Multi-Regression in R  Crime Analysis with Zeppelin, R & Spark  Identify, describe, plot, and remove outliers with R  13 Great Data Science Infographics  Data Science Central 'Challenge of the Week' 

LinkedIn contributes Kafka Monitor to open source and Apple prepares to open Siri to third-party developers. We have all this, plus news about Confluent and Tinkerpop, in our Big Data Roundup for the week ending May 29, 2016.

Informatica, the world's No.1 provider of data management solutions, is removing marketing's traditional blinders with an innovative new Marketing Data Lake product that gives marketers self-service access to all data relevant to marketing.

In this special guest feature, Nishant Vyas, Head of Product and Strategy at MariaDB, suggests that organizations "know" that the cloud can serve all aspects of an organization's infrastructure, and not just their Salesforce subscriptions. But a recent study from InfoSecBuddy shows that 90 percent of enterprises are still wary to fully embrace cloud computing, citing security and other fears. What gives?

The digital banking landscape has never been more dynamic than it is today.  The number of people going into branches to do their banking is falling dramatically. Customers are changing the way…

Infer Inc., a leading predictive sales and marketing platform that helps companies win more customers, announced a new sales and account intelligence application called Infer Glance.

Most of these infographics are tutorials covering various topics in big data, machine learning, visualization, data science, Hadoop, R or Python, typically intended for beginners. Some are cheat sheets and can be nice summaries for professionals with years of experience. Some, popular a while back (you will find one example here) were designed as periodic tables. For Geeks  Data Science Wars: R versus Python  Three periodic tables for data scientists   

Coho Data, a leading innovator in scale-out flash storage for the enterprise, announced DataStream 2.7 which further enhances its support for enterprise workloads.

In a recently accepted manuscript titled "Deep learning applications for predicting pharmacological properties of drugs and drug repurposing using transcriptomic data", scientists from Insilico Medicine, Inc located at the Emerging Technology Centers at Johns Hopkins University in collaboration with Datalytic Solutions and Mind Research Network presented a novel approach applying deep neural networks (DNNs) to predict pharmacologic properties of many drugs.

Big Data analytics and the Internet of Things are still relative newcomers. Given how quickly technology moves, though, it's not surprising that their next incarnations are already being talked about: cognitive analytics and a much-grown Internet of Things (IoT). We all know about the Internet of Things: it's data from a host of Web-connected devices that can be analyzed and studied, just like traditional data analytics. But what is cognitive analytics? Think back to old sci-fi shows. Remember the insanely smart, almost-human robot sidekick?

In a nut shell, scalebase is to mysql what greenplum db is to postgresql, it makes it possible to create an MPP database based on mysql. You can use it to scale out your existing "mysql applications" without changing your code and/or create new MPP databases for handling big data.

Machine learning and data mining techniques have been used in numerous real-world applications. An assumption of traditional machine learning methodologies is the training data and testing data are taken from …

To help improve overall cybersecurity practices, the first line of defense is often an educated user with strong password practices.

Alpine Data announced Chorus 6, a significant enhancement to its award-winning integrated analytics platform that brings machine learning, data, and people together to create operational solutions for business users.

The pursuit of data-driven decision making has put tracking, logging and monitoring at the forefront of the minds of product, sales and marketing teams.

Google enables all Android Play Store apps to work on Chromebooks, and Intel makes a move toward IoT and connected cars.

Clinton's policy violation brings to mind CEO behaviors in which the top person in a company thinks that rules don't apply to him or her.

In 1998, the European Union, United States, and Switzerland entered an agreement called Safe Harbor, which was enacted to assure that the data collected, stored, and used by businesses within and among these nations would be kept to a mutually-agreed upon standard of privacy. In October of 2015, a judge for the European Court of Justice struck down Safe Harbor, largely in response to the United States government's spying habits, made legal by the Patriot Act (enacted following the terrorist attacks on 9/11).

While an EHR is supposed to automate and streamline the clinician's workflow, most systems are not living up to the promise.

Performance reporting is vital in performance management, but quite often the finance department has conflicting objectives with the owners of plans, budgets and forecasts. In this concluding installment of a two-part series, see how automated internal performance reporting can deliver real business value at operational, tactical and strategic levels.

With edge computing, the enterprise can improve service to users and lower its wide area networking costs.

An entire new ecosystem of competing communications service providers (CSPs) has emerged from a rising tide of over-the-top services delivery. Listen to an audio presentation of a white paper that delves into revenue generation opportunities for CSPs created by the application of net promoter scores and customer satisfaction ratings.

At Hadoop Summit San Jose the goal of the Data Science, Analytics and Spark track is sure to be packed. Ram Sriharsha – Product Manager Apache Spark, Databricks generalizes the 16 sessions in the track as providing technical guidance around: Leveraging Hadoop for analytics is a key use case across industries and represents a critical value proposition for Hadoop. This track… The post Apache Spark Insights appeared first on Hortonworks.

Microsoft and Facebook will build MAREA, the fastest trans-Atlantic cable ever. But why is this HUGE news — is it because of the cable itself, or because of how it'll be used?  [Developing story. Updated 9:27 am PT with more comment] Obviously, laying an undersea cable over 4,100 miles is an expensive idea, so they're usually owned by telcos, who parcel out bandwidth to their customers. But in this case, it'll be directly owned by its two customers. And that isn't just a technicality.

SAPPHIRE NOW 2016 was one for the history books: record attendance, crazy social engagement, live streaming everything, and heavens that opened up. A lot. Can we get a little of that in California? And SAP BusinessObjects Analytics was in the thick of things throughout. A Rose by Any Other Name… SAP announced updated and streamlined…




One thing systemically changing businesses of all stripes today is data. While it may seem that many companies have gotten their arms around the complicated data situation, it's as clear as ever that is not the case. Although the marketing department is being tasked with providing a more enriched interaction between businesses and their respective customers, the reality is that they're still orienting themselves to the problem and current solutions.

This vendor-written tech primer has been edited by Network World to eliminate product promotion, but readers should note it will likely favor the submitter's approach. The forecast for unified communications (UC) in the cloud seems bright with clear skies ahead. According to Transparency Market Research, UCaaS (Unified Communications as a Service) is expected to reach nearly $38 billion by 2022 with an impressive compound annual growth rate of over 23%.

A CEO that I worked for once gave me the following feedback when I proudly sent him an Excel file containing the data that he had asked me for less than an hour prior:Never give me data. Only provide me with information.

Analysis: TCS' chief scientist explains to CBR how neural automation systems work and how they will change enterprise IT.

Intel, which recently announced that it would focus more on the Internet of Things and cloud computing, is acquiring Itseez, which specializes in IoT and self-driving vehicle technology.

I'm joined this week by Jon Morra, director of data science at eHarmony to discuss a variety of ways in which machine learning and data science are being applied to help connect people for successful long term relationships. Interesting open source projects mentioned in the interview include Face-parts, a web service for detecting faces and extracting a robust set of fiducial markers (features) from the image, and Aloha, a Scala based machine learning library. You can learn more about these and other interesting projects at the eHarmony github page. In the wrap up, Jon mentioned the LA Machine Learning meetup which he runs. This is a great resource for LA residents separate and complementary to datascience.la groups, so consider signing up for all of the above and I hope to see you there in the future.

Some of the biggest threats to cybersecurity involve controlling, damaging and interrupting systems,

No, it's not a Jaqen H'ghar quote. Recently, Hadley Wickham tweeted the following image: While this image isn't included in Hadley's Advanced R book, he does discuss many of the implications there….

Mendix sees this as an opportunity to give different classes of developers more access to a common pool of data in the cloud.

This entry was posted in News and tagged , , , , , , . Bookmark the permalink.