Big Data News – 14 Jan 2016

Top Stories
Apple is reportedly moving to dump its in-house iAd sales team and making iAds solely a self-service platform.

While there are disagreements about what exactly constitutes a "data scientist," there's little doubt that one of the critical components involves statistical aptitude. And as the data science profession evolves, it's taking the field of statistics with it, an expert with the American Statistical Association says. Before the Harvard Business Review dubbed data scientist the sexiest job of the 21st century several years ago, few people outside of Silicon Valley had heard the term. But all of a sudden, the title was showing up in thousands of job descriptions, as companies sought the key personnel who could help them make sense of the big, messy, and fast-moving data they were collecting.

Cambridge Semantics, a provider of analytics and data management services, is acquiring the intellectual property portfolio of graph database specialist SPARQL City, whose executive team also will join the buyer. Financial details of the transaction were not disclosed. Boston-based Cambridge Semantics said Wednesday (Jan.13) the acquisition would allow it to expand its Anzo "smart data" platform by combining it with SPARQL City's in-memory graph query engine. The Anzo platform is based on semantic web technology designed to help customers develop interactive, real-time data analytics capabilities.

A prediction has been released on the cost of securing the Internet of Things (IoT). And, if that guess is correct, we may have a big problem.

A data-driven company tends to outperform its competitors, but the transition to being driven by data is rarely led by the asset protection department. Weis Markets is the exception.

When it comes to coaxing personal information out of Americans, a Pew Research Center report found certain factors, like safety, lead to greater acceptance than cost savings can.

MapR's Hadoop distribution will add a message system to feed a streaming data pipeline. It takes a cue from open-source Kafka technology.

It started with Web-based social media. We became enamored with sharing and connecting via Facebook, Twitter, even LinkedIn. Those platforms and others migrated as apps to our now-indispensable smartphones and tablets. This was the beginning of us becoming "sensor platforms" as the data being culled from the websites and apps became ever more sophisticated and deep. And we've all done so willingly, enthusiastically, in exchange for the reach and convenience that these systems provide us. The evolution is continuing with two key technological trends: the addition of physical-world sensors to smartphones, and specifically designed sensing devices, or wearables.

New years are new beginnings. In our personal lives, that usually means rethinking and recommitting to goals like physical fitness, financial prosperity, and personal betterment in the form of more education or smarter family decisions. In business, the new year means taking a look at things like budgets, expansion goals, and IT security. According to leading security experts, one of the biggest threats to watch out for in 2016 will be hactivism. Are you prepared?

I've heard that most people now have an attention span less than that of a goldfish. We're talking about 8 seconds. But I think it's less than that. I think it's probably more like 5 seconds. But great customer communications — individualized marketing — doesn't even need that long. If it's timely enough, relevant enough to delight customers where they stand right now, they'll gladly engage.

ServiceMax's new IoT application, delivered via the cloud, integrates data collected from embedded systems within a field service application.

Impetus Technologies, a big data thought leader, software, services and solutions company, announced plans to hire 150 big data architects, senior developers and data scientists to support the rapid growth of its business.

Cloudera continually strives to advance the state of the modern data warehouse. A environment that was often times defined by its constraints is being extended and complemented by new technologies that aim to extend analysis to data that was traditionally a poor fit for the data warehouse and doing it at a level of scale that wasn't a reality for most data warehouse environments. For a good portion of time, a data warehouse was an analysis environment that could only be afforded by large enterprises. Now companies large and small can reasonably store and analyze unlimited data with the introduction of solutions like Cloudera Enterprise.

It's the time of year again for predictions on all sorts of topics. Worthy, solid predictions are often based on the past and present trends and then projecting those trends into the coming year. Since I spend a lot of time studying trends of big data and analytics, I'm going to offer my predictions for the upcoming year.

TSO Logic's Cloud Enablement module provides more transparency into application workloads and what it would cost to run them in a public cloud.

So, what can merchants operating online do to prepare for the inevitable flood of shoppers, both good and bad?

Interview: Barclays Head of Information "we can take the best of what a fintech [bank] can offer…and do it at scale"

Chinese Web company Baidu is announcing today that it is releasing key artificial intelligence (AI) software under an open-source Apache license. The WARP-CTC C library and optional Torch bindings are now available on GitHub, by way of Baidu Research's Silicon Valley AI Lab (SVAIL). The connectionist temporal classification (CTC) approach dates back to 2006, when it was documented in a paper from the Swiss AI lab IDSIA.

Editor's note: This was originally posted on the O'Reilly Data blog as "When building an enterprise data strategy, consider "why?" before "how?" At the 2015 Strata + Hadoop World in New York, members from consulting firm Silicon Valley Data Science (SVDS) responded to audience questions and discussed best practices in creating and implementing data strategies in a session called, "Ask Me Anything: Developing a Modern Enterprise Data Strategy."

Have you considered an IoT security policy, separate from your BYOD policy? It may be time to evaluate your need for one.

This document makes recommendations for how users should select checklists from NIST, evaluate and test checklists, and apply them to IT products.

Startup Gennion uses sensors to provide retailers with useful information on customer store traffic. The Spain-based Internet-of-Things company processes about 16,000 events per minute from its sensors and is hoping to scale up to thousands per second soon. Like many young companies, Gennion launched in Amazon Web Services' cloud.

VB WEBINAR: Join this live webinar and hear from top marketers how marketing automation is ratcheting up demand generation, leads, conversions, and revenue. CMOs from Stonyfield Farm and Arizona's famous Casino Del Sol Resort will be on the panel to discuss real world applications and opportunities with VB Insight's Andrew Jones. Register here for free. In case you hadn't noticed, the Internet is a very busy place.

by Joseph Rickert Over the past few months, a number of new CRAN packages have appeared that make it easier for R users to gain access to curated data. Most of these provide interfaces to a RESTful…

The rapidly changing technology landscape makes it more important than ever for companies and their employees to be aware of what 2016 holds.

Machine learning is taking hold in all kinds of applications, from self-driving cars to image recognition to online recommendation engines. But unless you're a Google or a Facebook, it's hard to get your hands on the kind of massive, real world data sets required to test and validate machine learning programs. Yahoo has helped to rectifying that with the release Thursday of what it called the "largest ever" data set made available to machine learning scientists. It's a collection of anonymized user interactions with the news streams on sites like Yahoo News and Yahoo Sports. Yahoo says there are 110 billion events in the file — or 110 billion records of when a user clicked on a news story or took some other action in the feed — and it comprises 13.5TB of data, or 1.5TB compressed. That's more than 10 times the size of the previous largest dataset released, Yahoo says.

Now that big data initiatives are going mainstream in Fortune 1000 companies, CIOs and other C-level executives are targeting the next frontier — how to transform all that information into products and services — according to a new report.

Advanced Micro Devices has had a few tough years as it struggles to stay afloat financially and compete with Intel, the world's biggest chip maker. Yet Lisa Su, chief executive of the Sunnyvale, Calif.-based chip design company, remains an optimist. She told us in an interview that 2016 is going to be a good year for AMD, with products that are in the pipeline finally seeing the light of day. Among those is AMD's new ARM-based 64-bit processors for data centers, which are being launched in volume sales today.

More and more people are entering the data science field everyday, but how have methodologies changed over the past few years? The infographic below courtesy of Platfora provides several key points.

After a few years of effort, Advanced Micro Devices is finally shipping its ARM-based processor for data centers. The Sunnyvale, Calif.-based chip design company hopes to fight the dominance of Intel with the AMD Opteron A1100 processor, formerly code-named Seattle. The chip represents the start of AMD's effort to differentiate itself from Intel, which generates around $14 billion a year from the data-center server chip market, according to Gartner.

In this special guest feature, Moshe Kranc, CTO at the tech consulting firm Ness, discusses how big industry players will increasingly try to corner the market on high quality one-of-a-kind data that can yield actionable insights – unseating content as king.

Yahoo is today announcing the release of a large-scale data set that describes people's usage of news feeds on several of the company's web services, including Yahoo News and Yahoo Finance.

Machine learning is taking hold in all kinds of applications, from self-driving cars to image recognition to online recommendation engines. But unless you're a Google or a Facebook, it's hard to get your hands on the kind of massive, real world data sets required to test and validate machine learning programs. Yahoo has helped to rectifying that with the release Thursday of what it called the "largest ever" data set made available to machine learning scientists. It's a collection of anonymized user interactions with the news steams on sites like Yahoo News and Yahoo Sports.

A couple of months ago I joined Hortonworks. There was an undeniable pull to go into the fire of crazy fast innovation and growth. About four seconds in, I realized there was so much more than just the pace of execution and growth but rather a bigger opportunity to be a part of something game-changing. The opportunity to partake in trailblazing the world of data. The opportunity to offer a unique value proposition of truly 100% open technology.

Hortonworks today unveiled an overhauled business partner program that it says will make it easier for software vendors, consultants, and resellers to engage with the Hadoop distributor in the burgeoning market for big data solutions. With more than 1,600 partners around the world, Hortonworks (NASDAQ: HDP) has more partners than either employees or paying customers. But with demand for big data solutions expected to ramp up in the coming years, the Santa Clara, California company decided it was time to re-organize how its network of partners engages with customers and with itself.

I'm on two overlapping posting kicks, namely "lessons from the past" and "stuff I keep saying so I might as well also write down. My recent piece on Oracle as the new IBM is…

Internet of Things technology is a hot topic. You can't read a tech news site without coming across at least one mention of IoT. But if you're looking to take advantage of sensors, you will likely have to update your data store to handle the workload. Once you're set up data-wise, get ready to monitor everything from weather and the environment to overseas factory floors and even fleets of trucks.

Big data got off to a roaring start in 2016 with the release of Spark 1.6 last week. You can rely on the Spark team to deliver useful features in a point release, but Spark 1.6 goes a step further, offering a mini-milestone on the way to Spark 2.0. The new features and improvements In Spark 1.6 will make both developers and operators very happy. Let's take a look at some of the highlights. [ What you must know about Hadoop and Spark right now | Learn how to unlock the power of the Internet of things analytics with big data tools in InfoWorld's downloadable Deep Dive.

Amazon has promised to create a new AWS region based in Montreal, Canada, that will launch sometime in 2016, answering a competitive challenge there from Microsoft's Azure. It's unclear which AWS services will be supported in Canada at launch, or when they'll be available for use. Amazon hasn't given any timetable except to say that the region will go live sometime this year, along with four other new regions in Ohio, China, India and the United Kingdom. Once those are online, Amazon will have a total of 6 regions available in North America, and 17 regions worldwide. The expansion comes at a time when AWS faces increased competition from Microsoft Azure. Microsoft has announced five forthcoming regions around the world, including two Canadian regions: one in Toronto and one in Quebec City.




Seeking to address the growing shortage of data scientists as demand for those skills explodes, leading U.S. security specialist Booz Allen Hamilton released a data science platform aimed at "democratizing data" via a simplified analytics system. The military and intelligence contractor (NYSE: BAH) based in McLean, Va., unveiled its "Sailfish" data science platform this week designed to "lower the barrier to entry for data science." The platform consists of an "Exchange," or data library, an analytics tool called "Explore" and a support platform.

2016 is underway and the concept that still dominates the tech world is Big Data. The Economist Intelligence Unit surveyed executives and found that an estimated 60% of executives report generating revenue from their Big Data initiatives. It's clear that the concept of Big Data is here to stay, but we anticipate a few changes in… The post Why You Should Be Using Data-as-a-Service in 2016 appeared first on BrightPlanet.

IoT and big data are fast-growing areas of technology, and this month's Tech Pro Research survey dives into these pertinent topics.

As many companies consider security to be their biggest concern for 2016, it's important to be prepared and not get caught unawares.

Not enough attention is paid by devs to mobile app security. They may talk the talk about building secure apps, but don't appear to walk the walk.

For some, the downward trend of the term "big data" on Web word counters and disappearance off hype curves is an indication that the phenomenon has past, that big data is now neither a problem to be solved nor an opportunity to be taken advantage of. But some smart folks see the situation differently, including those at Deloitte Consulting, who say big data has become like the air we breathe. Deloitte uses the phrase "everywhere analytics" to describe how the rapid evolution of data and analytics is impacting our world.

Microsoft has acquired technology assets from Event Zero in an effort to improve management tools within Skype for Business.

ScribbleLive has figured out which people, brands, products, and categories commanded the most attention at the 2016 International CES, the big tech trade show in Las Vegas last week. The company analyzed data from more than 100 million articles from news, blog, and social media sources to identify what grabbed the most attention. The CES Influence Report said Apple, Samsung, and Blackberry gathered a lot of attention at the show. Gamers in particular were glued to news about the Oculus Rift virtual reality headset.

Google means business in virtual reality: That's the overwhelming message out of a VP shuffle that put Clay Bavor in charge only of VR. His other apps got given to Diane Greene. (Yes, Clay is now VP of VR. Sometimes this stuff simply writes itself.) But why? the crazy goldrush? Could it be, as one hilarious quote puts it, FOMO (fear of missing Oculus)? Well, certainly this Facebook-owned business could be a strategic threat to Google's huge monopoly^H dominance. In IT Blogwatch, bloggers pay the lawnmower man. [You're fired -Ed.] Not to mention: Autism Is a Term for Behavior, Not Disease…

This entry was posted in News and tagged , , , , , , , , , . Bookmark the permalink.