Big Data News – 15 Jun 2016

Today's Infographic Link: Tape Drives: The Do’s and Don’ts

Featured Article
If you think of LinkedIn as a social network or an online recruitment service, then you may well be scratching your head about why Microsoft would spend more on it than it has on any other acquisition. But consider that Microsoft has a graph that covers how you're connected to people by email, documents, messages, meetings and address books, while LinkedIn has a graph that covers jobs, skills, colleagues, and professional connections. That's two separate sets of information that would be much more useful together. Microsoft CEO Satya Nadella has already talked about how the Microsoft Graph and the Office Graph are some of the company's most valuable assets. Think about what you could get by combining those with the graph that represents the professional networks of your employees and partners and adding machine learning that can pick out who and what is actually relevant to you in the sea of all the people and resources you're connected to.

Top Stories
In anticipation of his upcoming Predictive Analytics World for Manufacturing conference presentation, Building a Predictive Analytics Organization, we interviewed Chris Labbe, Managing Technologist at Seagate Technology. View the Q-and-A below for a glimpse of what's in store at the PAW Manufacturing conference. Q: What are the challenges in translating the lessons of predictive analytics from other verticals into manufacturing?

We've been doing it for years, decades for some. How many websites have you created accounts on? Your bank, your credit card companies, social media sites, hotels and travel sites, online shopping sites, and that's just the start. We do it often without even thinking about it, quickly entering our personal information, our data, in a plethora of systems. Sometimes we're not even aware of the information we are providing. It could range from very personal information (think of the security questions you provide answers to for protecting your information on these systems) to information about your behavior and your interests (websites visited, internet search topics). While this was possibly concerning from a privacy perspective, you had some level of control. For the backend systems involved, the volume and variety of the data were manageable. The tradeoff between privacy and convenience is something we, for the most part, found acceptable.

Big Data Presents New Opportunities for Auditors: The role of auditors has been changing rapidly over the past decade. Big data is allowing them to make higher quality decisions. However, their job is also becoming more complicated, so future financial auditors will need a strong background in IT.

Habitat effectively takes over the management of an application in a way that abstracts the function from the underlying IT infrastructure.

To become leaders, developers need to stretch themselves, commit to something over time, and execute in the trenches.

In this special technology white paper, Inside the Snowflake Elastic Data Warehouse, you'll find out why today's premises-based data warehouses are based on technology that is two decades old. You'll also discover why data warehouses have to fundamentally change in order to meet today's demands and opportunities.

Today's managers depend a great deal on their data to make informed decisions. The analysis of data gathered from the different activities of a company's departments helps its managers make decisions that are designed to improve efficiency and deliver greater cost savings.

You might not have heard about Kast before; I certainly hadn't before its email arrived in my inbox. Apparently, Kast is a competitor to white-hot messaging platform Slack. Given that pretty much everyone in the technology industry has heard of Slack, and almost no one has heard of Kast, I'd suggest that Slack isn't exactly panicking about this competitor.

Business is ripe for a bot explosion. The foundational technologies are available, industry behemoths are fanning the fire, users are demanding better experiences, and companies are looking for new ways to optimize their financial performance. Are you ready?

2016 is the Year of the Monkey in the Chinese calendar. More specifically, it's the Year of the Fire Monkey. So, for those of you unfamiliar with Chinese and wider Asian culture, who or what is the Fire Monkey?

Analysis: Company eyes $1.7tr industrial internet market by expanding its digital focus on European soil.

Everyone seemingly is complaining about the spread of ransomware, and now somebody is trying to do something about it using machine learning-based behavioral analytics techniques to track suspicious behavior on company networks. As the scale of the ransomware threat grows, including ransom payments by hospitals and universities and growing fears that it will soon spread to other sectors, a Silicon Valley security intelligence firm has rolled out an approach for detecting ransomware via machine learning.

The growth of cloud services is, in essence, the great leveler between small- and medium-sized businesses (SMBs).

Thanks to continued improvements in the accuracy of voice recognition technology, we're closer than ever to having a natural conversation with computers, without the interaction devolving into a frustrating, phone-throwing experience. As Mary Meeker explains in her recent Internet Report, we could be near an inflection point for voice-based interactions. When Apple (NASDAQ: AAPL) first launched Siri on iPhones four years ago, the personal assistant was a bit of a novelty. It was fun to ask Siri off-the-wall questions and see what she would say.

Initial salary offerings are generally a starting point for discussion, so how you answer questions about salary is really important.

It is beginning to look as if net neutrality will be with us for a while. Certainly, it is embedded in our laws more firmly than it ever has been.

Techaisle Founder/CEO Anurag Agrawal discusses findings from research on why SMBs are using cloud: It addresses control over operations, costs.

The key to successful IT modernization includes proper planning, alignment, understanding vulnerabilities and finding the right partner.

Much has been made about security intelligence, but too often what is provided is data relating to a potential security breach that lacks any context.

Apple unveiled major updates to all four of its operating systems: tvOS, macOS, iOS, and watchOS. Some of them are incredibly exciting, while others seem awfully familiar. Is Apple pushing software forward or playing catchup?

Close proximity interactions between individuals influence how infections spread. Quantifying close contacts in developing world settings, where such data is sparse yet disease burden is high, can provide insi…

Now is the time for businesses to think about putting in place machine learning applications to benefit from this hot advanced analytics technology trend.

The applications for artificial intelligence and machine learning in self-driving vehicles range from infotainment systems to advanced driver assistance systems, according to a new report from IHS.

Social media data is inexpensive, fast, and effortless to access. Businesses are increasingly using data from social networking sites such as Twitter, Facebook, YouTube, and Google Search. For example, when compiling data for our Caltrain project, we found that sentiment on Twitter was highly correlated with train delays. As more value is being realized from online communities, graph-based approaches are becoming increasingly important for mining social networks. Below are some links to interesting work being done with social media data.

Barefoot Networks provides compilers and tools needed to program a switch capable of processing packets at 6.5 terabits per second, at no cost.

Nokia hopes it will have a hit on its hands with Impact, an all-encompassing new Internet-of-Things management platform that brings together several existing products. The Intelligent Management Platform for All Connected Things builds on the company's established CPE (customer premises equipment) management tool for network operators, Motive Connected Device Platform, so broadband and home routers will be among the first devices to feel its impact. It already supports 80,000 different device types, Nokia said Tuesday. No doubt its new Smart Home Gateway, also unveiled Tuesday, will soon be among them. It combines a fiber "modem" with a router, dual-band gigabit Wi-Fi — and Zigbee and Z-Wave radios for home automation. It's intended to be distributed by network operators.

C-level briefing: Transforming a hackaday project into the backbone of how journalists work in a data-driven organisation.

The financial industry has been one of the most reluctant to adopt IaaS public cloud computing services, but researchers at Deutsche Bank predict that big banks' use of cloud will ramp up "materially" in 2017. Regulatory barriers, questions about security and legacy IT installations are the leading reasons financial services companies have not used public cloud compute and storage services thus far, DB researchers say.

by Dmitry Pechyoni, Microsoft Data Scientist – The New York City taxi dataset is one of the largest publicly available datasets. It has about 1.1 billion taxi rides in New York City. Previously this dataset was explored and visualized in a number of blog posts, where the authors used various technologies (e.g., PostgreSQL and Elasticsearch). Moreover, in a recent blog post our colleagues showed how to build machine learning models over one year of this dataset using Microsoft R Server (MRS) running in a 4-node Hadoop cluster. In this blog post we will use a single commodity machine to show an end-to-end process of downloading and cleaning 4 years of data, as well as building and evaluating a machine learning model.

In this installment, we provide insight into how the Fair Scheduler works, and why it works the way it does. In Part 3 of this series, you got a quick introduction to Fair Scheduler, one of the scheduler choices in Apache Hadoop YARN (and the one recommended by Cloudera). In Part 4, we will cover most of the queue properties, some examples of their use, as well as their limitations. The post Untangling Apache Hadoop YARN, Part 4: Fair Scheduler Queue Basics appeared first on Cloudera Engineering Blog.

A renewed focus on users should pay off for Apple, says analyst Rob Enderle, but business features seem to be underplayed at this point.

What to Do When Your Backup Data is Destroyed: According to the book Disaster Recovery Planning: Managing Risk and Catastrophe in Information Systems, written by Jon William Toigo, "A company that experiences a computer outage lasting for more than 10 days will never fully recover financially and that 50 percent of companies suffering such a predicament will be out of business within 5 years."

While implementing Business Intelligence can drive revenue and save money, many businesses underestimate the deployment and ownership costs of BI. Besides the cost of software, they don't anticipate the fees that crop up during some BI implementations. In this article, we explore a few common "hidden" fees of BI that you cannot ignore.

Google Cloud Platform Review: Head In The Clouds. Over the last decade, cloud computing services have skyrocketed as people push them to their limits and beyond to find more advantageous uses for them. Providers such as Amazon Web Services (AWS), Microsoft Azure and VMware are constantly refining their own services and providing users with better means to their end. But in recent years, one service in particular has provided cloud computing that boasts both flexibility and performance to a high degree: the Google Cloud Platform (GCP).

In this contributed article, Kaushal Amin, Chief Technology Officer for KMS Technology, takes a close look at what you need to think about to successfully deploy your Big Data solution.

In this special guest feature, Neil Cawse, CEO of Geotab, looks at the big data industry from the lens of vehicular/transportation big data, a sprawling category that includes location, speed, drive train diagnostics, fuel economy and driving behavior.

I made a mistake about machine learning. Repeatedly. I said, and I wrote, that machine learning and predictive analytics were almost the same. To be more specific, my view was simple: analytics can be divided into four categories (see Analytics Landscape for details), and I put machine learning near predictive analytics in this 2D landscape. Of course, I also put optimization as the queen of all analytics technologies, as it yields the best business value. What else would you expect from someone who spent nearly three decades working in optimization?

Analysis: Microsoft needs to get integration right in order to unlock the potential of LinkedIn's big B2B data.

One of the great things about my job at SAP is the chance to work with my colleagues in GRC and our customers to develop exciting presentations and case studies highlighting innovations and best practices. The goal we set for ourselves was to work with our SAP presenters to make our SAP-led presentations more interesting…

I shiver just at the thought of shopping for necessities! Yet, I do it weekly and I am not alone. Van Rompaey[1] pointed out that 20% of Belgian citizens consider shopping for food a chore. According to the Daily Mail "a national survey" revealed that 62% of 2500 surveyed customers cannot get out of supermarkets quickly enough. The main reasons given in the survey for this aversion to supermarket shopping are as follows: (62%) customers — "Get me out of here!" (38%) customers — The unruly children of other customers.

One of the downsides of living and working in a data-rich environment is the desire to squirrel away every last bit and byte for future use. Thanks to cheap storage systems such as Amazon S3 and Hadoop, it's technically possible to store every piece of data you've collected. But going too far down that path can lead to a perilous condition known as data hoarding. While data hoarding may not be as great a threat as physically hoarding real-world items, there is a similar psychology at play.

Are you up for a challenge? Ever wondered what the most intense meeting of your life would look like? Immerse yourself in the high-stakes play of today's board meeting in SAP and Intel's exciting new online, interactive experience, Win the Board. Your mission? You've got all the tools you need – now you just need…
