Big Data News – 22 Feb 2016

Today's Infographic Link: What Happens on YouTube in 24 Hours?

Top Stories
DCIM is a product category that sounds too good to be true, but cost and complexity prevent most organizations from seriously considering it.

Ah, the cloud — the very word makes the notion of remote-stored data sound so fluffy, light, and inviting. But the truth is that the cloud is only a foundation — fluffy, perhaps, but still a soil-like layer. That means if you've moved some or all of your files into a service like Dropbox, Google Drive, or OneDrive, it's up to you to plant feature-enhancing crops into your virtual space.

Last week at Spark Summit East, Databricks dropped a few hints about where in-memory data processing tool Spark is headed. The company is the primary commercial entity behind Spark and plays a leading roll in its evolution. Databricks' hosted Spark platform, Databricks Cloud, is available by subscription. To make it easier to get onboard with Spark in its cloud, Databricks announced a free tier, the Community Edition. It's available for now only as a beta invite, but general availability is planned for the middle of this year.

Cloud service providers are evolving their platforms to handle personal health information that's subject to HIPAA regulations, as seen in this case study about the state of Georgia's Google cloud use.

Ericsson and Amazon Web Services are partnering to help service providers use the AWS cloud to quickly roll out and expand services in areas like the Internet of Things. The partnership, announced Monday at Mobile World Congress, is part of Ericsson's efforts to help carriers evolve from traditional data-center infrastructure to Web-scale technologies. Like enterprises, fixed and mobile operators want to become more agile so they can meet growing and changing demands from users. AWS and Ericsson are starting out by working with Australian carrier Telstra at the Gurrowa Innovation Lab in Melbourne.

News: Company aims to use analytics to improve customers' experiences.

Salesforce has acquired PredictionIO, a startup with general-purpose open source machine learning software, Simon Chan, the startup's cofounder and chief executive, announced today. Terms of the deal weren't disclosed. "Over the past few years, we've been on a mission to scale and simplify development of machine learning technology," Chan wrote in a blog post. "What started as a passion project quickly grew into much more — now with a community of more than 8,000 developers powering more than 400 apps.

I play a fair bit of Destiny, a space-themed video game. Actually, a lot. (It's an amount my husband calls "too much".) I enjoy the game not just for its great story and space-age shooting gameplay, but also for the social interaction. It's a massively-multiplayer game: you're always online, and you'll continually run into other players in the game world. Some parts of the game even require teams of 3 or 6 people to complete. As a result, I often play with real-world friends, and my Xbox friends list is now full of people I've met online playing the game.

News this week included smartphones sales numbers, IIoT growth, a new Google Glass patent, and new chips for the IoT.

We've released a minor update to Microsoft R Open 3.2.3 to address issues that some people were experiencing. The update available now on MRAN fixes the following issues: The Windows R GUI (RGUI.exe) could crash when typing beyond the bounds of the visible window. The "R" command wasn't available from the Terminal in OS X El Capitan. TCL/TK wasn't working on some Linux platforms. Using MKL on Windows 7 resulted in a "0xc000007b error".

Because we live in an on-demand world, customers are demanding personalized experiences across all touchpoints — regardless of where they reside.

Today's blog post comes from our partners at Neustar and is by Jonjie Sena When it comes to mobile Caller ID, smartphone owners are feeling outnumbered. In the U.S. market, where Caller ID has the largest uptake, over ninety percent of landline phones feature Caller ID service, versus two percent for mobile phones. The remaining ninety-eight percent of mobile phones rely on data stored in their user's contact lists to identify incoming calls. In international markets, the uptake of Caller ID service is lower still, and in some cases not even available to mobile consumers.

List: Data analytics, Omni-channel shopping, mobile and cloud technology are just some of the technologies being used to disrupt the fashion industry.

Data is increasingly available, accessible and decipherable. You probably use it every day to help make personal decisions: tracking calories on your iPhone, browsing recommended movies based on what you've already viewed, seeing advertisements on your laptop and smartphone based on your browsing behavior and more. These insights are based on data collected from you — and they are used to predict actions you might take in the future.

The data science talent shortage has some companies thinking outside the box. Even if your company employs a formidable data science team, you would likely still benefit from third-party ideas or solutions. Data science competitions and other forms of crowdsourcing offer viable means of advancing the art of the possible relatively quickly and cost-effectively. We share some of the possibilities.

It's a mad, mad, mad digital world, with digital connections, purchases and reservations to be made around the clock and around the world. Deeper insights gleaned from a broad array of digital sources can help travel providers increase demand while managing customer expectations.

Cisco this week unveiled a Cisco Firepower Next Generation Firewall that better secures applications before attacks are ever launched.

The way we consume, process and act upon content varies wildly between our personal and public lives. In the workplace, reports are content–and many of them go unread or unacted upon. Tomorrow's cognitive businesses will have "thinking" computers that help them analyze–and take action–on the data.

In anticipation of his upcoming Predictive Analytics World for Workforce conference presentation, The State of Predictive Analytics for Workforce in Enterprise Applications, we interviewed Holger Mueller, Principal Analyst & Vice President at Constellation Research, Inc. View the Q-and-A below to see how Holger has incorporated predictive analytics into the workforce of Constellation Research, Inc. Also, glimpse what's in…

Accelerance has created a global network of software development teams to work with small and medium-sized businesses.

Facebook Messenger advertising is coming: Brace yourselves. This leaked document says brands can begin IM'ing ads to us soon–it mentions a way to prevent spam, but companies already seem to be tip-toeing their ways around it. The rumor seems to be somewhat confirmed. And it matches other statements that Facebook made recently. Mind you, it does contradict Mark Zuckerberg's quote about (errm) not advertising in messaging. So let's remove Messenger from our phones? In IT Blogwatch, bloggers never actually installed it in the first place. Your humble blogwatcher curated these bloggy bits for your entertainment. Not to mention: Walking in London…

News: Acquisition of assets from Roambi could revamp user interface.

A top business priority for banks today is to have a complete 360 degree view of customers through the collection and use of personal data. To effectively manage data, it helps to know how consumers expect their information to be used. Armed with this knowledge, banks can identify opportunities for new value-add services that drive revenue and competitive advantage.

News: IBM Watson Health will now hold the world's largest collection of health-related data.

VB WEBINAR: New tech trends have transformed the business of lending. Learn how leading lenders are using big data and predictive models to slash risk and boost revenue in our free webinar.  Register here for free. Lending in the personal loan space has traditionally been a very forward-looking leap of faith, where the lender does their best to create some sort of mathematical estimation of how likely it is that a borrower will pay them back, and then sets the price and hopes for the best. And yet hoping for the best is not exactly a sound financial decision.

IBM and the Internet of Things are changing how insurance companies do business. Insurers have the opportunity to improve claims management, crisis response, policyholder retention, risk management and more. Outthink limits and discover what your insurance company can do with the Internet of Things.

IBM and the Internet of Things are changing how insurance companies do business. Insurers have the opportunity to improve claims management, crisis response, policyholder retention, risk management and more. Outthink limits and discover what your insurance company can do with the Internet of Things.

You may be surprised to discover that the power of analytics can be successfully applied to games of chance and skill. See how two prominent businesses involved in casual online games and retail lotteries use analytics to score points with their customers.

There's nothing like raging success to bring on a pack of competitors, and that's exactly what's happening in the enterprise-collaboration arena. Ever since the public launch of its team-communication software two years ago, Slack has taken enterprises by storm, propelled by its freemium business model and plenty of investor enthusiasm. Today, it claims 2.3 million daily active users, more than 675,000 paid seats and over $64 million in annual recurring revenue. Other companies smell opportunity. Two new contenders entered the scene this week, both hoping to claim a piece of the pie for themselves. On Wednesday, it was SpotCues with what it calls the industry's first location-based contextual social network.

The goal of security staffs has gotten harder to fulfill: Protect the devices that hold critical data and offer ways into an organization.

Facebook today announced that it has released the data it used to train its artificial intelligence software to understand children's stories and predict the word that was missing from a given sentence in a story. The data set (.tgz) comes out to more than 1.6GB, and it's affiliated with a recently published academic paper called "The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations." Facebook chief executive Mark Zuckerberg provides a good overview of the research today in a Facebook post: Language is one of the most complex things for computers to understand.

Pinning down a comprehensive definition for patient engagement is no easy feat. Its definition is predicated on a wide range of characteristics and interactions between patients and institutions. Attendees of the Health Information and Management Systems Society (HIMSS16) conference can consider a number of questions vital to understanding the complexities of patient engagement.

No one can predict the future for certain, but generating a data forecast could be the next best thing. Register for IBM InterConnect 2016 to discover how IBM and the Internet of Things are changing the way that insurers do business.

AMC uses IBM Analytics solutions to extract insights into audience preferences and viewing patterns, helping to make smarter scheduling and data-driven marketing decisions.

Learn more about five essential self-service data prep capabilities every analyst must be able to access to make more meaningful and timely decisions.

There's nothing like raging success to bring on a pack of competitors, and that's exactly what's now happening in the enterprise-collaboration arena. Ever since the public launch of its team-communication software two years ago, Slack has taken enterprises by storm, propelled by its freemium business model and plenty of investor enthusiasm. Today, it claims 2.3 million daily active users, more than 675,000 paid seats and over US$64 million in annual recurring revenue.

Google on Thursday launched the open beta for its Cloud Vision service, giving developers a new way to make intelligent apps that use images.   Using Google Cloud Vision, developers can manipulate images in several ways, such as running optical character recognition to pull text out of images, or using the technology that powers Google's SafeSearch feature to detect inappropriate images. Google launched the service in private beta last year, and it is now available for public consumption.  In addition to making the service publicly available, Google also revealed the pricing. Developers will be able to run up to 1,000 images through Google services for free, and then pay a flat fee for each group of 1,000 images they upload after that.

Security experts weigh in on the FBI's battle with Apple.

Engineers from across the Apache Hadoop community are collaborating to establish Arrow as a de-facto standard for columnar in-memory processing and interchange. Here's how it works. Apache Arrow is an in-memory data structure specification for use by engineers building data systems. It has several key benefits: A columnar memory-layout permitting O(1) random access. The layout is highly cache-efficient in analytics workloads and permits SIMD optimizations with modern processors. The post Introducing Apache Arrow: A Fast, Interoperable In-Memory Columnar Data Structure Standard appeared first on Cloudera Engineering Blog.

The move to a cloud-based ERP system forced Skullcandy to rethink its global network, which ultimately led to the decision to migrate to an offering from Aryaka.  Network World Editor in Chief John Dix recently discussed the migration with Systems Manager Yohan Beghein. Skullcandy Systems Manager Yohan Beghein What WAN problem were you having that encouraged you to go looking for an alternative?

A few significant changes in the SMB landscape could change the way they operate and how quickly they grow in the near future.

IBM's Watson Health, the company's 10-month-old healthcare data crunching unit, plans to acquire Truven Health Analytics and its 215 million patient profiles, with the ultimate goal of helping health-care providers make better business and patient-care decisions. The $2.6 billion deal, announced Thursday, gives IBM Watson Health access to a huge new data set in the unit's efforts to mine data to improve health-care quality while controlling costs. Including data from Truven, Watson Health will have access to about 300 million patient data sets, IBM said.




IBM's Watson Health, the company's 10-month-old healthcare data crunching unit, plans to acquire Truven Health Analytics and its 215 million patient profiles, with the ultimate goal of helping health-care providers make better business and patient-care decisions. The $2.6 billion deal, announced Thursday, gives IBM Watson Health access to a huge new data set in the unit's efforts to mine data to improve health-care quality while controlling costs. Including data from Truven, Watson Health will have access to about 300 million patient data sets, IBM said. Using IBM's Watson Health Cloud, health-care organizations will be able to combine several data sets.

Google today announced that its Cloud Vision application programming interface (API), which can give applications new image recognition capabilities, is now available in beta for anyone to use. For the past two months it was only available in a limited preview. The API can carry out multiple functions, including optical character recognition (OCR), detection of faces in photos, recognition of the emotion on faces, detection of logos, detection of inappropriate content (think nudity), and, of course, object detection. It's "powered by the same technologies under Google Photos," as Google Cloud Platform product manager Ram Ramanathan wrote in a blog post today.

by Joseph Rickert Earlier this month the Bay Area useR Group (BARUG) held it annual lightning talk meeting. This is by far our most popular meeting format: eight, 15 minute talks (12 minutes speaking and 3 minutes Q & A while the next speaker is setting up) packed into a two hour time slot. The intensity seems to really energize the speakers and engaged the audience.  Bradley Shanrock-Solberg kicked off the event with delightful example of an R Monte Carlo simulation based on his wildpoker package that you can find on CRAN. I have never seen a more prepared lightning talk presenter: high energy, a royal flush presentation and a four color printed hand out just in case you have trouble keeping up with him for the 12 minutes.

This entry was posted in News and tagged , , , , , . Bookmark the permalink.