Big Data News – 26 Oct 2016

Today's Infographic Link: How to Run a Productive Office

Featured Article
New model-fitting technique is efficient even for data sets with hundreds of variables. Data analysis — and particularly big-data analysis — is often a matter of fitting data to some sort of mathematical model. The most familiar example of this might be linear regression, which finds a line that approximates a distribution of data points. But fitting data to probability distributions, such as the familiar bell curve, is just as common.

Top Stories
At the OpenStack Summit conference today, an initiative led by IBM to make good on the original promise of OpenStack interoperability delivered.

Be it on-premises, in the cloud or hosted on Hadoop, data warehousing and OLAP figure prominently in the headlines at the annual SQL Server-focused PASS Summit event.

Last week, DNS provider Dyn was unavailable for a good part of Friday. I blogged late that day about how a computer using OpenDNS was able to access a particular website, while another computer, a block away and using the same ISP, could not. The good computer was using OpenDNS while the problematic one was using DNS servers from our ISP. I asked OpenDNS about this and the explanation is both interesting and simple. Starting at the very beginning, computers on the Internet are identified with numbers, called IP addresses. The most common type of IP address is a 32-bit binary number, typically written as four decimal numbers separated by periods (e.g., 1.2.3.4). You can get to Google based on its IP address with
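The dotted notation described above can be illustrated with a minimal Python sketch. The helper names `dotted_to_int` and `int_to_dotted` are hypothetical and exist only for this illustration; the standard-library `ipaddress` module is used just to cross-check the arithmetic.

```python
import ipaddress


def dotted_to_int(address: str) -> int:
    """Pack four decimal octets into a single 32-bit integer."""
    octets = [int(part) for part in address.split(".")]
    if len(octets) != 4 or any(not 0 <= o <= 255 for o in octets):
        raise ValueError(f"not a dotted-decimal IPv4 address: {address!r}")
    value = 0
    for octet in octets:
        # Shift the running value left by one byte and append the next octet.
        value = (value << 8) | octet
    return value


def int_to_dotted(value: int) -> str:
    """Unpack a 32-bit integer back into dotted-decimal form."""
    return ".".join(str((value >> shift) & 0xFF) for shift in (24, 16, 8, 0))


example = "1.2.3.4"
as_int = dotted_to_int(example)
# Show the address, its integer value, and the 32 underlying bits.
print(f"{example} -> {as_int} -> {as_int:032b}")

# Cross-check against the standard library's own conversion.
assert as_int == int(ipaddress.IPv4Address(example))
assert int_to_dotted(as_int) == example
```

Each of the four decimal numbers is one byte of the address, which is why no part can exceed 255.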

Fast new solid-state storage disks now are U.S. government certified at highest level of hardware security.

Amazon is trying to simplify the process of moving legacy applications to the cloud with a new service that it quietly launched this week. The aptly named Server Migration Service is designed to help IT teams set up the incremental replication of virtual machines from their on-premises infrastructure to Amazon's cloud. More companies are adopting the public cloud to take advantage of performance benefits and cost savings. But getting legacy apps into the cloud can be a pain, especially for those applications that require high uptime but take time to migrate. Server Migration Service helps simplify that process and may lead to additional cloud adoption.

IBM is joining educators from around the globe in their quest to unleash a new generation of data scientists on the world. During the IBM Insight at World of Watson 2016 conference, discover how data science initiatives and offerings are empowering up-and-coming data scientists everywhere.

If simplicity can fundamentally accelerate focused action, then you can significantly boost speed, productivity and effectiveness in your enterprise. Take a look at this overview of key announcements unveiled on the first day of IBM Insight at World of Watson 2016.

The IoT is a massively insecure undertaking with literally countless opportunities for crackers to mount disabling attacks.

On Tuesday, IBM unveiled a cloud-based AI engine to help businesses harness machine learning. It aims to give everyone, from CEOs to developers, a simple platform to interpret and collaborate on data.

Last week, about a year after the merger of Dell and EMC into a company now called Dell Technologies (I graciously permitted them to use a branding element from my company, Endpoint Technologies), the combined entity held a conference called Dell EMC World. Where there had been two conferences, Dell World and EMC World, a single planet remained. The combination drew several thousand more people to Austin, Dell's traditional home, than Dell World alone had supported, and the city's infrastructure groaned under the effort. I can report that there were probably 1,000 people in the overflow room where I sat through several keynotes.

From self-service analytics to the cloud, chief data officers (CDOs) had a wealth of information at their fingertips on the first day of IBM Insight at World of Watson 2016. Catch the high points of some of Monday's most relevant sessions for CDOs in this quick recap.

Data analysis — and particularly big-data analysis — is often a matter of fitting data to some sort of mathematical model. The most familiar example of this might be linear regression, which finds a line that approximates a distribution of data points. But fitting data to probability distributions, such as the familiar bell curve, is just as common. If, however, a data set has just a few corrupted entries — say, outlandishly improbable measurements — standard data-fitting techniques can break down. This problem becomes much more acute with high-dimensional data, or data with many variables, which is ubiquitous in the digital age. Since the early 1960s, it's been known that there are algorithms for weeding corruptions out of high-dimensional data, but none of the algorithms proposed in the past 50 years are practical when the variable count gets above, say, 12. That's about to change.
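The way a single corrupted entry can break a standard fit is easy to see in one dimension. The sketch below is not the technique the article describes; it only contrasts the ordinary mean with the median, a classic robust estimator, on made-up readings. All data values are invented for illustration.

```python
from statistics import mean, median

# Hypothetical sensor readings clustered around 100 (invented for illustration).
clean = [98, 101, 100, 99, 102]

# The same readings with one outlandishly improbable measurement added.
corrupted = clean + [10000]


def summarize(values):
    """Return (mean, median) so the two estimators can be compared."""
    return mean(values), median(values)


clean_mean, clean_median = summarize(clean)
bad_mean, bad_median = summarize(corrupted)

print(f"clean:     mean={clean_mean}, median={clean_median}")
print(f"corrupted: mean={bad_mean}, median={bad_median}")
```

One bad entry out of six drags the mean from 100 to 1750, while the median barely moves (100 to 100.5). In one dimension the median is cheap to compute; the difficulty the article points to is that comparable robustness becomes computationally expensive as the number of variables grows.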

Led by FirstMark Capital to Empower Data Teams Across the Globe with an End-to-End Platform for both Clickers and Coders. New York, NY, October 25: Dataiku Inc., an emerging market leader in end-to-end advanced analytics and collaborative data science, today announced a $14 million Series A investment round, led by New York venture capital firm FirstMark Capital, with participation from all existing investors. Dataiku offers a unique collaborative tool that enables teams of data scientists and data analysts to quickly prototype and easily deploy scalable data-driven solutions in production, across the enterprise.

Experience shows that organizations that manage GRC as an integrated program are more successful in delivering value.

There is one development that is truly poised to upend the enterprise infrastructure industry as we know it: the software-defined data center.

Today's telecommunications providers find themselves facing a Gordian Knot when investing in improving the customer experience cost-effectively while revenue shrinks because of a variety of factors. See what some real-world telecommunications providers are doing to attempt to untie the Gordian Knot that represents the challenge of dwindling revenue while providing superior customer experience.

This table outlines the top needs of each stakeholder group that can help guide your conversations on priorities and needs for the GRC program.

OpenStack users running their workloads on IBM's SoftLayer public cloud infrastructure took it calmly when the company's object storage development lead, Brian Cline, announced that SoftLayer is going away. Cline opened his presentation with the news at the OpenStack Summit in Barcelona on Tuesday. But it's not as bad as it sounds. The same services will still be available from the same servers, managed through the same SoftLayer control portal: Only the brand is going away. IBM is going to replace the SoftLayer name with Bluemix, its broader cloud platform, making SoftLayer services just another page in the Bluemix catalog of infrastructure, platform and application services.

IBM Planning Analytics, now on premises and in the cloud, helps you move beyond keeping score to driving the business with greater speed, agility and foresight.

At the IBM World of Watson conference in Las Vegas, IBM announced the latest version of its DB2 database software, in an effort to empower better analytics.

I am super excited to be in my first month at Hortonworks, heading up the product and solutions marketing team. In addition to joining a superstar team, I am joining one of the leading innovators in the modern data landscape. My love affair with all things data started in the early 1990s when I… The post Joining Hortonworks: Big Data Done Right appeared first on Hortonworks.

Friday's mass DDoS attack against a DNS provider spotlights a long-standing weakness in how traffic moves across the internet.

By Graham Williams, Director of Data Science, Microsoft. Programming is an art and a way we express ourselves. As we write our programs we should keep in mind that someone else is very likely to be reading them. We can facilitate the accessibility of our programs through a clear presentation of the messages we are sharing. As data scientists we also practice this art of programming. Indeed, even more so, we aim to share the narrative of our discoveries through our living and breathing of data through programming over the data. Writing programs so that others understand why and how we analysed our data is crucial. Data science is so much more than simply building black box analyses and models, and we should be seeking to expose and share the process and particularly the knowledge that is discovered from the data. Style is important in making the code we share readily accessible.

On Tuesday, IBM announced it will be combining its enterprise iOS mobile apps with IBM Watson for better decision-making and productivity.

We're asking InformationWeek readers about their top IT spending priorities for the year ahead. Take our flash poll and tell us where your IT dollars are headed.

SQL Server Analysis Services, one of the key features of Microsoft's relational database enterprise offering, is going to the cloud. The company announced Tuesday that it's launching the public beta of Azure Analysis Services, which gives users cloud-based access to semantic data modeling tools. The news is part of a host of announcements the company is making at the Professional Association for SQL Server Summit in Seattle this week. On top of the new cloud service, Microsoft also released new tools for migrating to the latest version of SQL Server and an expanded free trial for Azure SQL Data Warehouse. On the hardware side, the company revealed new reference architecture for using SQL Server 2016 with active data sets of up to 145TB.

The two companies collaborated to standardize everything from application programming interfaces (APIs) to single sign-on capabilities.

A family of new tools helps solutions providers leverage positioning technologies to create customized, high-value solutions.

Industry analyst Claudia Imhoff has identified three critical skill sets that will help CIOs build a formidable data science infrastructure.

If the internet of things is to provide real value to users, design and big data must be redefined and merged into a single philosophy.

The Internet of Things (IoT) is defined as the increased interconnectivity between all devices capable of using wireless communication technology. It's estimated that by the year 2020, there will be between 50 and 75 billion devices connected to the internet worldwide. Where there's connectivity, there's information. And smart people know how to turn information into profit.

Solr 5 includes a completely re-written faceted search and analytics module with a structured JSON API to control the faceting and analytics commands. Here's how it works. Since I joined Cloudera a few years ago to help bring search-powered analytics to Cloudera's platform, I've been working actively upstream alongside the rest of the Solr community to develop new functionality that will drive more interesting applications on Cloudera Search (which is based on an integration of Solr with the Apache Hadoop ecosystem).
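As a rough illustration of what a structured JSON faceting request can look like, the sketch below builds a request body in Python. The field names `cat` and `price`, the facet labels, and the helper `build_terms_facet_request` are assumptions made for this example; the body would typically be POSTed to a collection's query handler, which is not shown here.

```python
import json


def build_terms_facet_request(query, facet_name, field, limit=5, stat=None):
    """Build a JSON request body with one terms facet.

    `stat` is an optional (label, expression) pair, e.g.
    ("avg_price", "avg(price)"), nested under each facet bucket.
    """
    facet = {"type": "terms", "field": field, "limit": limit}
    if stat is not None:
        label, expression = stat
        # Nested facets are computed per bucket of the parent facet.
        facet["facet"] = {label: expression}
    return {
        "query": query,
        "limit": 0,  # return facet counts only, no documents
        "facet": {facet_name: facet},
    }


# Hypothetical example: top categories with the average price in each.
body = build_terms_facet_request(
    "*:*", "categories", "cat", stat=("avg_price", "avg(price)")
)
print(json.dumps(body, indent=2))
```

Keeping the facet definition as nested JSON, rather than a flat list of query parameters, is what lets a statistic such as an average be attached to each bucket of a parent facet.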

The Intel Atom processor E3900 series employs 14-nanometer silicon to push processing horsepower out to the very edge of distributed IoT environments.

The economic impact of being behind your peer group in digital is huge.

For six years, Watchfinder, a U.K.-based global buyer and seller of pre-owned luxury watches, split the role of DevOps between application development and management of a virtual infrastructure environment. But the company's ambitious growth plans, which included expansion to the U.S. earlier this year and an expected doubling of monthly watch sales, required IT director Jonathan Gill to think differently.

FalconStor is all about automating the movement of data in storage and making it available when and where it's needed in any production environment.

This article discusses five reasons the lack of information flow stunts company growth, and the technology that will enable access to actionable information in real time.

Look for IBM to launch the on-premises version of its enterprise performance management solution, IBM Planning Analytics, at IBM Insight at World of Watson 2016. Check out the new features and business benefits of this new release of leading-edge financial and operational performance management software from IBM.

Collibra enables users to research and find data in the same way they shop for products on consumer sites, such as Amazon and iTunes, the company said.

A new client library makes it easier for Xamarin developers to add cloud storage capabilities to their mobile apps.

This industry customer panel was recorded live on Wall Street at the IBM Forum for Financial Services in New York City on 20 September 2016. It includes comments on the business benefits of IBM Customer Insight Solutions from Pershing, Southern Farm Bureau, USAA and US Bank. IBM Customer Insight solutions are based on industry-specific models and can effectively predict, personalize and address the changing needs of your clients by using a new era of customer segmentation and cognitive computing. These solutions empower companies by dynamically segmenting clients by their behaviors, anticipating life and financial events, foreseeing client attrition, identifying product opportunities and delivering tailored news and alerts.

A demonstration of the IBM Customer Insight solutions for banking, wealth management and insurance was recorded live on Wall Street at the IBM Forum for Financial Services in New York City on 20 September 2016. IBM Customer Insight solutions are based on industry-specific models and can effectively predict, personalize and address the changing needs of your clients by using a new era of customer segmentation and cognitive computing. These solutions empower companies by dynamically segmenting clients by their behaviors, anticipating life and financial events, foreseeing client attrition, identifying product opportunities and delivering tailored news and alerts. The demo is presented by Boxley Llewellyn, vice president, Banking Analytics Solutions, at IBM.

If you didn't have a chance to catch my presentation at the Machine Learning and Data Science Summit, I'll be reprising an updated version of the talk in a live webinar on Tuesday, November 1. I'll also be taking questions from the audience after the webinar. You can register here, and the details of the webinar are below. Changing Lives with Data Science and R at Microsoft: Whether it's called data science, machine learning, or predictive analytics, the combination of new data sources and statistical modeling has produced some truly revolutionary applications. Many of these applications incorporate open source technologies (including R) and research from academic institutions. In this talk, I'll share a few ways that Microsoft is improving the lives of people around the world by applying statistics, research and open-source software in applications and devices, and describe how Microsoft has integrated R into its data platforms. Register here to attend. I hope to see you there!

Open DCI makes use of standard VXLAN network virtualization software to create an SDN capable of spanning multiple data centers.
