Big Data News – 03 Jun 2016

Today's Infographic Link: Vaccine-Preventable Outbreaks

Featured Article
The field of Data Science is booming, yet comparatively few women are entering it. Why? What are the obstacles and opportunities facing them if they do? The path to change is challenging, but as a woman who has happily worked in data science for many years, I can say with confidence that it's possible. Women… The post Women in Data Science appeared first on Predictive Analytics Times.

Top Stories
In anticipation of their upcoming conference co-presentation, Understanding our Customers' Customers' Customers' Needs — Text Analytics for B-to-B Businesses at Text Analytics World Chicago, June 21-22, 2016, we asked Michael Dessauer, Data Scientist at The Dow Chemical Company and Justin Kauhl, Computational Linguistics Expert at The Dow Chemical Company, a few questions about their work… The post Wise Practitioner – Text Analytics Interview Series: Michael Dessauer and Justin Kauhl at The Dow Chemical Company appeared first on Predictive Analytics Times.

Windows Holographic is potentially not only a new interface into computing, but a potentially a new interface to the world around us.

Qlik bowed to pressure to sell itself to private equity, announcing that it will be acquired by Thoma Bravo. Qlik was one of three companies ranked as Leaders in Gartner's most recent Magic Quadrant report for Business Intelligence and Analytics Platforms.

In this special guest feature, PK Agarwal, Regional Dean and CEO at Northeastern University-Silicon Valley, discusses the need to train new data scientists, and the value of continuing education for those already employed in the field.

Galactic Exchange, Inc. announced the availability of its Docker container powered clustering technology (ClusterGX™) as an on-premise managed service solution (ClusterGX™ Full-Service) for customers deploying Hadoop/Spark Big Data applications.

In this contributed article, Paul Pilotte, Technical Marketing Manager of MathWorks, highlights how the ability to create analytics that process massive amounts of business and engineering data is enabling designers in many industries to develop intelligent products and services, and how designers can use analytics to describe and predict a system's behavior.

SYS-CON Events announced today that FalconStor Software® Inc., a 15-year innovator of software-defined storage solutions, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. FalconStor Software®, Inc. (NASDAQ: FALC) is a leading software-defined storage company offering a converged, hardware-agnostic, software-defined storage and data services platform. Its flagship solution FreeStor®, utilizes a horizontal architecture that unlocks a new world of storage opportunities, allowing IT managers, MSPs, and CSPs to maximize efficiencies and lower costs, while taking advantage of public cloud, hybrid cloud, flash storage and software-defined storage.

SYS-CON Events announced today that Stratoscale, the software company developing the next generation data center operating system, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Stratoscale is revolutionizing the data center with a zero-to-cloud-in-minutes solution. With Stratoscale's hardware-agnostic, Software Defined Data Center (SDDC) solution to store everything, run anything and scale everywhere, IT is empowered to take control of their data centers. Stratoscale is offering a Hyperconverged cloud supporting OpenStack out of the box.

Webscale Networks' offering promises to enable IT organizations to recover from a disaster in less than hour with no more than 15 minutes of lost data.

A move to the cloud is underway but software remains the biggest part of SAP's business.

The new intelligent resource and workload management software, called IBM Spectrum Computing is designed to make it easier for organisations to extract full value from data to accelerate performance-intensive analytics or machine learning.

According to the World Bank, South Africans are the biggest borrowers in the world, with 86% of the population in debt.* And unfortunately, the National Credit Regulator, goes on to state that of the 20 million credit-active consumers in South Africa, 47% are in arrears on their accounts by three months or more, or had judgements against them, or had negative credit ratings on their credit record.

The Salesforce AppExchange is known to many as the number one application marketplace for businesses, but there is more to it than just apps. The AppExchange also includes the rapidly expanding Salesforce Consulting Partner ecosystem that I covered in my recent post, "The Salesforce ecosystem: A shift on the playing field."

South Africa's First National Bank (FNB) has been considered one of the world's most innovative financial institutions for years now. Voted the most innovative bank globally in 2012, the financial institution owns bragging rights as the first bank in South Africa to launch a mobile banking app in 2011 and second in line to provide fully-fledged web banking portal conveniences to its customers. For  those who can remember the days before feature-rich banking apps,

By Katharina Reichert, Finance Solutions at SAP Here's a challenge for any accounts receivable (A/R) professional:  Tell me how much in receivables you expect in 90 days –then show me the top 10 customers with the largest amounts of outstanding receivables driving that total number, then drill through to a specific customer line item–and do…

Farmers looking for ways to increase their crop output are using more technology and relying a little less on intuition. Farmers associated with Land O'Lakes, a dairy-focused, agricultural cooperative in Minnesota, are using online tools and apps to visualize their fields and to analyze the data in such areas as water management, seed placement and crop diseases. To do that, Land O'Lakes, known for producing the top butter brand in the U.S., has turned to Google's public cloud. The company is involved in a $3.5 million project with Google and expects to see a payback on it in two to three years.

SYS-CON Events announced today that Anexia will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Anexia offers high-quality customized managed hosting solutions for SaaS and IaaS companies.

The C-suite has become a vocal supporter of applications with embedded analytics capabilities such as dashboards, self-service reporting and interactive reports.

SYS-CON Events announced today that Column Technologies will exhibit at SYS-CON's @DevOpsSummit at Cloud Expo, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Established in 1998, Column Technologies is a global technology solutions provider with over 400 employees, headquartered in the United States with offices in Canada, India, and the United Kingdom. Column Technologies provides "Best of Breed" technology solutions that automate the key DevOps principals and help our customers meet today's DevOps and Digital Transformation challenges.

SYS-CON Events announced today that iDevices®, the preeminent brand in the connected home industry, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. iDevices, the preeminent brand in the connected home industry, has a growing line of HomeKit-enabled products available at the largest retailers worldwide. Through the "Designed with iDevices" co-development program and its custom-built IoT Cloud Infrastructure, iDevices works with highly recognized consumer brands to bring the power of connectivity to their customers. By designing and developing its technology in-house, the Company is able to seamlessly connect people to their worlds with responsive products that enhance everyday life.

SYS-CON Events announced today that Avere Systems, a leading provider of enterprise storage for the hybrid cloud, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Avere delivers a more modern architectural approach to storage that doesn't require the overprovisioning of storage capacity to achieve performance, overspending on expensive storage media for inactive data or the overbuilding of data centers to house increasing amounts of storage infrastructure.

SYS-CON Events announced today that Fusion, a leading provider of cloud services, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Fusion, a leading provider of integrated cloud solutions to small, medium and large businesses, is the industry's single source for the cloud. Fusion's advanced, proprietary cloud service platform enables the integration of leading edge solutions in the cloud, including cloud communications, cloud connectivity, and cloud computing. Fusion's innovative, yet proven cloud solutions lower customers' cost of ownership, and deliver new levels of security, flexibility, scalability, and speed of deployment.

While hardware vendors view it as another vehicle for selling more chips and servers that would allow enterprises to move processing power closer to data, the torrent of sensor and other information the Internet of Things (IoT) is expected to generate also is driving adoption of artificial intelligence technology, concludes a new survey of software developers. Evans Data Corp., the market intelligence specialist based on Santa Cruz, Calif., released a survey of developers this week that placed IoT at the top of a list of technologies propelling AI adoption.

Fred and Ginger, peanut butter and jelly, the Internet of Things (IoT) and… analytics?

The Internet of Things is no good without a way to act on the data it generates. A new partnership between two of the biggest IoT players promises to put smart collection and advanced analysis of data right where it's needed. IBM and Cisco Systems have worked out how to run components of IBM's Watson IoT analytics on Cisco edge devices. This will bring more intelligence closer to where the action is, helping enterprises run things like factories and oil rigs more efficiently. In 2014, Cisco unveiled small routers and switches that could be embedded in facilities and vehicles located far from any data center. The devices could take in data from local sensors and analyze it on site with a small, built-in Linux computer.

People are worried about security and the privacy of the IoT and IIoT. They have not ensured that infrastructures, which contain IoT/IIoT, are secure.

Oracle plans to sue whistleblower Svetlana Blackburn for malicious prosecution, the company said Thursday. On Wednesday, Blackburn — a senior finance manager in Oracle's cloud business — said in a lawsuit she was terminated from her job for refusing to go along with cloud-computing accounting principles she considered unlawful. Blackburn alleges that upper management was trying to fit "square data into round holes" in a bid to boost the financial reports for Oracle's cloud services business that would be "paraded" before company leaders and investors.

Many of the most important values of object storage are derived from the fact that it leverages the advantages of distributed, scale-out architecture, which expands capacity by simply adding nodes.

It might be a tough word to get out of your mouth, but "operationalization" remains a central element of the big data opportunity. And according to a new Capgemini and Informatica study titled "The Big Data Payoff: Turning Big Data into Business Value," companies are making slow but steady progress in the operationalization of big data projects. To operationalize something in the business world is to make it tangible, to make it real and repeatable and reliable. In the context of big data, to operationalize a big data science project is to take it from the planning stage to proof of concept stage to test stage and finally all the way into full production, with all the qualities that production-grade IT projects require, like SLA monitoring, high availability, governance, security, etc.

Big Data has grown tremendously in the past few years. Interestingly, the concept of Big Data is dated back to what was characterized as 3Vs about 15 years ago, way before NoSQL, MapReduce, Hadoop, Storm or Spark ever started life. Obviously this kind of 3Vs definition makes less and less sense in the new era. In fact, more tools and product have matured to process large data sets, making bigness no longer a pressing problem.

With today's launch of Amazon EMR release 4.7, you can now create clusters with Apache Phoenix 4.7.0 for low-latency SQL and OLTP workloads. Phoenix uses Apache HBase as its backing store (HBase 1.2.1 is included on Amazon EMR release 4.7.0), using HBase scan operations and coprocessors for fast performance. Additionally, you can map Phoenix tables and views to existing HBase tables, giving you SQL access over data already stored in HBase.

We have built a regulatory framework that is ineffective at recognizing, let alone combatting, cybercrime.

Machine learning is finding its way into a variety of applications. Discover an open source machine learning platform that combines the data processing power of Spark with powerful machine learning algorithms courtesy of the H2O platform to tackle challenges technologists face when applying machine learning to a wide range of applications.




IBM announced additions today to the infrastructure layer of its high performance data analytics software portfolio, including "cognitive features," such as scheduling and resource management, and capabilities aimed at easing adoption of Spark. The new software-defined infrastructure products — in which the data center is managed, provisioned and automated by software regardless of compute, storage or network components — called IBM Spectrum Computing, are intended to reduce the complexity of performance-intensive data analytics and machine learning implementations.

Bill Vorhies is Editorial Director for DataScienceCentral, and President and Chief Data Scientist at Data-Magnum, providing predictive analytics and big data infrastructure projects as a service. Bill has been an active commercial predictive modeler since 2001. Below, you will find a selection of his articles posted in the last two years. To check out his most recent posts in 2016, click here. For 2015 posts, click here. NoSQL and RDBMS are on a Collision Course.

IBM and Cisco today announced the formation of an alliance that promises to bring Watson analytics to the edge of the network.

Oracle plans to sue whistleblower Svetlana Blackburn for malicious prosecution, the company said Thursday. On Wednesday, Blackburn — a senior finance manager in Oracle's cloud business — said in a lawsuit she was terminated from her job for refusing to go along with cloud-computing accounting principles she considered unlawful.

This vendor-written tech primer has been edited by Network World to eliminate product promotion, but readers should note it will likely favor the submitter's approach. As organizations shift more workloads to the cloud they increasingly rely on networks and infrastructure they don't own or directly manage. Yet this infrastructure is just as critical as when applications and services were hosted in the data center.

Manufacturers are embracing the Industrial Internet the same way consumers are leveraging Fitbits — to improve overall health and wellness. Both can provide consistent measurement, visibility, and suggest performance improvements customized to help reach goals. Fitbit users can view real-time data and make adjustments to increase their activity.

Google CEO Sundar Pichai says he believes his company's artificial intelligence is smarter than Amazon's Alexa or Apple's Siri, but we're seeing only 1% of its potential.

With the growing volumes of diverse data being stored in the Data Lake, any breach of this enterprise-wide data can be catastrophic, from privacy violations and regulatory infractions to corporate image and long-term shareholder value. Seshu Adunuthula – Head of Analytics Infrastructure, eBay acting as Track Chair for Governance and Security for Hadoop Summit San Jose has…

Government IT leaders stress value in private-sector partnerships, repeatable contracting provisions as they look to incremental modernization strategies.

Business intelligence software maker Qlik Technologies (NASDAQ: QLIK) today announced that it has agreed to be acquired by the private equity firm Thoma Bravo for $30.50 per share, or about $3 billion. As one of the largest of a new wave of visual analytic firms, Qlik's acquisition could hasten consolidation in a crowded BI field. Founded in Sweden more than 20 years ago, Qlik created a niche for itself by developing an easy-to-use BI tool that users could navigate visually and without the extensive training required by the popular BI products of the day from Cognos,

Information security was hard enough when we had to lock down all of the servers, desktops and laptops. That seems like child's play compared to what we face now. You could pretty easily count your server infrastructure and PCs were simply a function of your company employee count. Mobile and Internet of Things (IoT) change that equation as we outlined earlier this week in our new White Paper, Take Back Control of Your Information Security.

by Joseph Rickert The model table on the caret package website lists more that 200 variations of predictive analytics models that are available withing the caret framework. All of these models may be prepared, tuned, fit and evaluated with a common set of caret functions. All on its own, the table is an impressive testament to the utility and scope of the R language as data science tool.  For the past year or so xgboost, the extreme gradient boosting algorithm, has been getting a lot of attention. The code below compares gbm with xgboost using the segmentationData set that comes with caret.

As part of the drumbeat for Spark Summit West in San Francisco (June 6-8),  learn how analyzing stats from professional sports leagues is an instructive use case for data analytics using Apache Spark with SQL. In the United States, many diehard sports fans morph into amateur statisticians to get an edge over the competition in their fantasy sports leagues.

3 Ways You Can Improve Your Lost Customer AnalysisPreventing Customer Churn with Text AnalyticsLapsed Customers, Customer Churn, Customer Attrition, Customer Defection, Lost Customers, Non-Renewals, whatever you call them this kind of customer research is becoming more relevant everywhere, and we are seeing more and more companies turning to text analytics in order to better answer how to retain more customers longer. Why are they turning to text analytics?

Dell claims this approach uses 97 percent less cooling power than the average air-cooled data center.

Through a project called Magenta, Google's machine learning researchers hope to understand whether computer-generated music can qualify as art.

This entry was posted in News and tagged , , , , , , , , , , , . Bookmark the permalink.