Big Data News – 23 Jun 2016

Today's Infographic Link: Kowloon Walled City: A place of anarchy

Featured Article
The internet of things is gaining traction in a number of industries, but IoT investments in three areas will be particularly large, according to IDC.

Top Stories
Alphabet is boosting its Google Fiber team with Webpass, a San Francisco-based ISP. The idea is to bring ludicrously-fast Internet access to more people, more quickly. In IT Blogwatch, bloggers ponder symmetric gigabit for $50/month. Your humble blogwatcher curated these bloggy bits for your entertainment. Not to mention: Alien Selection… What's the craic? John Ribeiro–Google Fiber to add urban coverage and wireless:

Forward looking IT security pros need to better address known risks, monitor closely the value of shadow IT devices and solve the inherent weaknesses introduced by the internet of things, Gartner says. The consulting firm has taken a look at five key areas of security concern that businesses face this year and issued predictions on and recommendations about protecting networks and data from threats that will likely arise in each. The areas are threat and vulnerability management, application and data security, network and mobile security, identity and access management, and Internet of Things security. Gartner's findings were revealed at its recent Security and Risk Management Summit by analyst Earl Perkins.

What is time series forecasting? Time series data typically refers to historical data collected at regular time intervals over a period of time. For example, sales data of a store recorded on a daily basis or expense data recorded on a monthly basis are examples of times series data. Predictive forecasting is using machine learning…

    Applying analytics to sports is one of the fun part of my work.  I had a great opportunity last year to work as part of an IBM team to help ultra cyclist Dave Haase race across America.   Racing across America is quite a challenge: imagine a 3000+ miles, non stop, race across USA, with over 110,000 feet elevation (see pictures below).  Cyclists can race as they wish, rest only when they chose to.    

While mobility offers many advantages to the modern business, it brings new security challenges. How can your business protect your sensitive data in a mobile world? How can you maintain security, when you can't control every device in your organization? In this article, we explore 5 steps you must take to protect your business data in a mobile world.

Kubernetes is a new and revolutionary open-sourced system for managing containers across multiple hosts in a cluster. Ansible is a simple IT automation tool for just about any requirement for reproducible environments. In his session at @DevOpsSummit at 18th Cloud Expo, Patrick Galbraith, a principal engineer at HPE, discussed how to build a fully functional Kubernetes cluster on a number of virtual machines or bare-metal hosts. Also included will be a brief demonstration of running a Galera MySQL cluster as a Kubernetes application.

K.S. Viswanathan, Vice President, Nasscom, said the sector was expected to grow at CAGR (compounded annual growth rate) of 26 percent over next five years.

Stephan Aarstol is out to convince us that companies would prosper, and society would benefit, if we'd switch to a five-hour workday.

Budgeting is perhaps the most dreaded aspect of operating a business. Between the painstaking process of developing the budget, the wrangling over monetary allocation, and the challenges of accurately projecting future revenues and costs, budgeting is a tedious and consuming process that many businesses find exacerbating.Fortunately, your company doesn't have to suffer. There are some easy tactics to streamline the budget cycle, improving both the efficiency and accuracy of the process.Figure Out What's Driving Your Business

In this special industry research report, The Forrester Wave™: Enterprise Data Warehouse, Q4 2015 you'll learn about the 10 providers that matter most and how they stack up.

Trifacta, a leader in data wrangling, strengthened its partnership with Hortonworks, Inc., today, announcing deepened technical integration with the Hortonworks Data Platform (HDP).

For large-scale analytics, a distributed file system is kind of important. Even if you're using Spark you need to pull a lot of data into memory very quickly. Having a file system that supports high burst rates — up to network saturation — is a good thing. However, Hadoop's eponymous file system (Hadoop Distributed File System, aka HDFS) may not be all it's cracked up to be. What is a distributed file system? 

Dissertations Online Free When you found us on the internet, you probably know you will discover numerous of other producing expertise available on the market. You must be watchful! A great number of them fail to appeal your business perhaps up to Experts Essay does. They typically commitment exceptionally low prices and great quality. You comprehend from working experience we frequently get everything we pay money for — crafting companies are no diverse.

C-level briefing: Executives from Philips and Pegasystems talk to CBR on the future of healthcare and how to scale connected environments.

The idea has been around for a while but the technology has improved.

Visual, real-time provenance validates veracity of data It's been nine months since I first learned about Hortonworks DataFlow, which is powered by Apache NiFi, Kafka and Storm. Back then, I was immediately able to see the productivity benefits that the Apache NiFi aspect of HDF would have brought to my previous work in analyzing data… The post Prescient Transforms 48,000+ Data Sources in Real Time with Apache NiFi appeared first on Hortonworks.

StreamSets, the company that delivers performance management for data flows, today announced results from a global survey of more than 300 data management professionals conducted by independent research firm Dimensional Research®.

If you're embarking upon a big data project, then you're likely running into one or more data management challenges. The decisions you make regarding how you enforce data governance and how you control data flows can make or break your project. Here are five data governance mistakes you should avoid: 1. You Have No Data Governance Strategy If you said to yourself, "Huh, what's data governance?" then you're likely (Kues/Shutterstock) making this mistake. Data governance refers to an overarching strategy that defines how organizations ensure the data they use is clean, accurate, usable, and secure.

IBM Analytics Day at the U.S. Open was more than just an excellent venue for clients, it was a showcase of just how IBM's clients are leveraging insight from existing data to improve experiences for customers and end users, while doing so in a cost-effective, secure and scalable cloud or hybrid-cloud structure.

This excerpt focuses on defect management, including basic concepts of a defect, how to manage defects, and an analysis of the root causes of defects.

Big Data automation can mean writing dozens of scripts to process different input sources and aligning them in order to consolidate all this data and produce the required output. Why exactly do you need Big Data for your enterprise projects? Many industry observers have been noting that although a lot of enterprises like to claim that their big data projects are aimed at "deriving insights" that replace human intuition with data-driven alternatives, in reality though, the objective appears to be automation. They point out that the role of data scientists at a lot of organizations has got little to do with replacing human intuition with big data. Instead, it is about augmenting human experience by making it easier, faster and more efficient.

Remember the "Three Laws of Analytics," or simply remember that the answer should follow the analysis, not define it.

While the use of IoT tech enables retailers to better track their products in the supply chain, it introduces a variety of security and data risks.

This article was posted on the website Nesta. Nesta is an innovation charity with a mission to help people and organisations bring great ideas to life  As more public datasets are opened up, finding effective ways to communicate the messages within them becomes increasingly important. There is no shortage of data available to us, but it can be difficult to extract meaning from it in its raw state. That's where data visualisation comes in.

We live in a smart and connected world. From smartphones and watches to automobiles and appliances, everyday objects and devices are now connecting to the Internet to share and analyze data. This connected world is soon to get even smarter and even more connected. Gartner, Inc. forecasts that 6.4 billion connected things will be in use worldwide in 2016, up 30 percent from 20151, and IDC reports that connected devices will reach 50 billion by 2020.2 Gartner further reports that this year, in 2016, 5.5 million new things are connecting every day.[1] With the rise in connectivity, we are experiencing an explosion in data that can be used to generate business intelligence. This Internet of Things (IoT)-generated data can provide valuable insights to drive efficiencies in operations and fuel new data-driven business models.

Peter Dalgaard announced yesterday on behalf of the R core team that R 3.3.1, the latest update to the R language, is now available for download from your local CRAN mirror. As of this writing,…

Researchers at MIT's Computer Science and Artificial Intelligence Laboratory reported this week they have developed a deep learning algorithm that could help machines using predictive vision anticipate human interactions. The approach uses unlabeled YouTube videos as its source material to train deep networks to predict human interactions. In a paper titled, "Anticipating Visual Representations from Unlabeled Video," the researchers said they applied recognition algorithms on the trained network's prediction to forecast future actions.

There are countless "as-a-Service" offerings on the market and typically they live in the cloud. In 2014, startup BlueData blazed a different trail by launching its EPIC Enterprise big-data-as-a-service offering on-premises instead. On Wednesday, BlueData announced that the software can now run on Amazon Web Services (AWS) and other public clouds, making it the first BDaaS platform to work both ways, the company says. "The future of Big Data analytics will be neither 100 percent on-premises nor 100 percent in the cloud," said Kumar Sreekanti, CEO of BlueData. "We're seeing more multicloud and hybrid deployments, with data both on-prem and in the cloud.

A Geneva Convention on cyberwar: That's how a panel of experts proposes to deal with the growing threat to critical infrastructure posed by the possibility of cyberattack. With control systems in dams, hospitals, power grids and industrial systems increasingly exposed online, it's possible that nation states could seek to damage or disable them electronically. But building electronic defenses to prevent such attacks is expensive — and often ineffective, given the myriad ways in which they can fail or be breached. That's why the Global Commission on Internet Governance recommends that in any future cyberwar, governments should pledge to restrict the list of legitimate targets for cyberattacks, to not target critical infrastructure predominantly used by civilians, and to not to use cyberweapons against core Internet infrastructure.

Amazon has gradually rolled out parts of its IoT offerings, but these are just the tip of the iceberg. In addition to optimizing their backend AWS offerings, Amazon is laying the ground work to be a major force in IoT – especially in the connected home and office. In his session at @ThingsExpo, Chris Kocher, founder and managing director of Grey Heron, explained how Amazon is extending its reach to become a major force in IoT by building on its dominant cloud IoT platform, its Dash Button strategy, Replenishment Services, the Echo/Alexa voice recognition control platform, strategic investments of its Alexa Venture Fund, strategic partnerships with 30+ major consumer package goods companies and the 50-60 million current Prime customers.

There are countless "as-a-service" offerings on the market today, and typically they live in the cloud. Back in 2014, startup BlueData blazed a different trail by launching its EPIC Enterprise big-data-as-a-service offering on-premises instead. On Wednesday, BlueData announced that the software can now run on Amazon Web Services (AWS) and other public clouds, making it the first BDaaS platform to work both ways, the company says. "The future of big data analytics will be neither 100 percent on-premises nor 100 percent in the cloud," said Kumar Sreekanti, CEO of BlueData. "We're seeing more multicloud and hybrid deployments, with data both on-prem and in the cloud. BlueData provides the only solution that can meet the realities of these mixed environments in the enterprise."




One of the main goals of SDN (software-defined networking) is to make networks more agile to meet the changing demands of applications. A new Silicon Valley startup, Apstra, says it has an easier way to do the same thing. Rather than control the guts of individual network devices through software that makes them more programmable, Apstra says it can deal with those devices as they are and shape the network from a higher level. The result is a new approach that might let IT departments bypass some of the complex technologies and politics of SDN and still make their networks more responsive to users' needs. It's due to go on sale by August.

Of the three pillars of enterprise infrastructure — compute, storage and networking — storage remains the most complex.

A Geneva Convention on cyberwar: That's how a panel of experts proposes to deal with the growing threat to critical infrastructure posed by the possibility of cyberattack. With control systems in dams, hospitals, power grids and industrial systems increasingly exposed online, it's possible that nation states could seek to damage or disable them electronically. But building electronic defenses to prevent such attacks is expensive — and often ineffectual, given the myriad ways in which they can fail or be breached. That's why the Global Commission on Internet Governance recommends that in any future cyberwar, governments should pledge to restrict the list of legitimate targets for cyberattacks, to not target critical infrastructure predominantly used by civilians, and to not to use cyberweapons against core Internet infrastructure.

Water, water everywhere, Nor any drop to drink These lines from "The Rime of the Ancient Mariner," by Samuel Taylor Coleridge also accurately describe the companies that are trying to transform themselves into a data driven company. These organizations have astronomical volumes of raw data at their disposal but how do they find that proverbial… The post Business Catalog – Why Do You Need One for Hadoop? appeared first on Hortonworks.

Salesforce has introduced Cloud App Mobile, an app dev platform that unifies all its efforts to help non-coders develop business applications. The company has also released its Pardot Engagement Studio for marketing automation to general availability.

HDS clearly sees the rise of converged and hyperconverged systems as an opportunity to extend its reach into the midmarket.

Moataz Anany is a Solutions Architect with AWS Amazon EMR has added Apache Tez version 0.8.3 as a supported application in release 4.7.0. Tez is an extensible framework for building batch and interactive data processing applications on top of Hadoop YARN.

Businesses are transforming before us. With a strong focus on innovation and digital interfaces, winning companies are rethinking how humans and machines interact. The Cognitive Era is opening up many avenues for banks to outthink competition by adopting new strategies and leveraging technology in new, clever, pragmatic ways.

As businesses continue to innovate and explore more line of business applications that can help them speed up production and boost efficiency, we're swimming in data. Dashboards and graphics give us more insight into our business and productivity than ever before. From sales to customer service, we know exactly where we stand with a quick glance.

Note: This story is fiction, but it is based on experience with real clients. Any resemblance to people you know is incidental. You can read the prequel to this episode here. Michael's Tale, Continued TalkThree's new Analytics Director, Michael, has had a sobering month. What he had hoped would be his first major contribution to… The post Taking Action on Technical Success: A Fable of Data Science and Consequences appeared first on Predictive Analytics Times.

This entry was posted in News and tagged , , , , , , , , , , . Bookmark the permalink.