Big Data News – 13 Jul 2016

Today's Infographic Link: Planet Earth Control Deck

Featured Article
If you've heard about Data Science but don't really understand what it's all about, you might want to check out the 5-part video series Data Science for Beginners presented by my colleague Brandon Rohrer, senior data scientist at Microsoft. Each video is short (5-10 minutes) and explains an aspect of data science without any assumed knowledge or technical jargon.

Top Stories
Facebook and Microsoft have inked a deal to migrate the social media giant's 13,000 employees to Office 365. Facebook will continue to use its own collaborative tools instead of Skype for Business and Yammer.

Organizations can now efficiently and securely set up, deploy and manage a Hadoop-based data lake with the new version of Podium, the enterprise-ready management platform from Podium Data.

In anticipation of his upcoming conference presentation, The Revolution in Retail Customer Intelligence, at Predictive Analytics World New York, October 23-27, 2016, we asked Dean Abbott, Co-Founder and Chief Data Scientist of SmarterHQ, a few questions about his work in predictive analytics. Q: In your work with predictive analytics, what behavior do your models predict?… The post Wise Practitioner – Predictive Analytics Interview Series: Dean Abbott of SmarterHQ appeared first on Predictive Analytics Times.

The SeekingAlpha mobile application allows attackers to eavesdrop and steal sensitive data.

Redmond makes its data warehouse and embedded BI cloud services generally available as its data fever grows ever stronger.

Parse is Facebook's cloud mobile solution. Recently, the social networking behemoth announced that it would be discontinuing Parse from January 2017. This means app developers using Parse to develop applications will have to migrate to another platform. However, this is not as easy as it sounds. Many factors will have to be weighed before developers can narrow down their choice of a good platform-as-a-solution to replace Parse. In addition, the migration itself can be a worrying activity if not done with expert assistance.

You probably don't need to know that yet another report has been published about the benefits of big data. Tech giants like Google, Facebook and eBay are in on it, using a mix of bespoke, freeware and licensed technologies to monetize internal data assets by combining them with freely available big data. Even Dilbert has something to say about it!

The proliferation of cloud computing today enables businesses of all sizes to expand their workforce, and reduce the number of people using the office at the same time. Fortune 1000 companies claim their staff are away from their desks as much as 60 percent of the time, with between 80 and 90 percent of American workers saying the option to work remotely would be advantageous.

While at Hadoop Summit 2016, I had the opportunity to catch up with Jack Norris, Senior Vice President of Data and Applications at MapR, to discuss all the progress the company has made in the past year.

Hadoop – A key enabler for IoT: Apache Hadoop has rapidly evolved from just supporting simple, batch-processing jobs with limited flexibility to a full-blown ecosystem of projects that supports a wide range of use cases and analytic applications, including ingestion, storage, processing, serving and analytics for massive volumes of data-at-rest as well as data-in-motion. Given the data characteristics of IoT, Hadoop is quickly emerging as the data platform of choice for enabling a number of compelling IoT use cases today.

The Data Lab had a great experience working with e-Placement Scotland to organise the placements for The Data Lab MSc programme. As part of their course, students have the opportunity to undertake a paid placement in industry. Recognising e-Placement Scotland's expertise in engaging with employers and helping them to benefit from working with students and course leaders, The Data Lab turned to e-Placement Scotland to bring employers into the project.

If you've been hungering to make use of the advanced number-crunching technology in Microsoft's R Server product, but feared its price tag, Microsoft itself has a partial answer: Microsoft R Client. Free, but not open source, R Client is built with much of the same code as R Server. It even includes many of the same features, such as the "ScaleR" technology that allows R programs to benefit from multicore architectures, although they're only available here in a limited form.

This article was originally published in CIO Review. No one can dispute that data has significant value for organizations. We see it every day in how some companies are using data to successfully deliver better customer experiences. This can take many shapes, from better products and services that companies create based on collected and analyzed customer behavior, to personalizing customer experiences. Examples abound: we have all read how Disney is creating magical experiences by leveraging data collected through the magic bands.

Continuous testing helps bridge the gap between developing quickly and maintaining high quality products. But to implement continuous testing, CTOs must take a strategic approach to building a testing infrastructure and toolset that empowers their team to move fast. Download our guide to laying the groundwork for a scalable continuous testing strategy.

Guest blog post by Jeff Pettiross. For almost as long as we have been writing, we've been putting meaning into maps, charts, and graphs. Some 1,300 years ago, Chinese astronomers recorded the position of the stars and the shapes of the constellations. The Dunhuang star maps are the oldest preserved atlas of the sky. More than 500 years ago, the residents of the Marshall Islands learned to navigate the surrounding waters by canoe in the daytime, without the aid of stars. These master rowers learned to recognize the feel of the currents reflecting off the nearby islands.

Find out what CSV is.

Find out what R is.

Despite widespread corporate adoption of big data and analytics tools along with exponential advances in analytics technology, a new survey by a respected consulting firm finds that only one-third of CEOs polled have a high level of trust in the accuracy of their data or the analytics tools used to crunch it. In a recently released CEO study, KPMG International said 29 percent of the 400 U.S. chief executives it surveyed said they either have "limited distrust" or "actively distrust" their organization's use of big data and analytics.

I've recently taken on a position as an executive in a technical role at a big data startup headquartered in the heart of Silicon Valley. I see the growth of next-gen big data platforms as the most exciting and disruptive story of the 21st century. The entire computational landscape is going through a profound change; legacy technologies are in decline and a new vanguard of distributed data platforms is emerging as the next-gen infrastructure for enterprise and research applications. It's a great time to be a technologist. But it's hard not to notice that in the midst of all of this new growth and innovation, women have played a decidedly small role.

Hadoop Summit in San Jose wrapped up a few weeks ago. This was the ninth year and, wow, have we come a long way. It's been a decade for Apache Hadoop and five years for Hortonworks. Hadoop Summit is the leading conference for Hadoop and data management, and this year saw well over 4,000 attendees… The post Hadoop Summit 2016: The Growth Accelerates appeared first on Hortonworks.

AlphaGo's uncanny success at the game of Go was taken by many as a death knell for the dominance of the human intellect, but Google researcher David Silver doesn't see it that way. Instead, he sees a world of potential benefits. As one of the lead architects behind Google DeepMind's AlphaGo system, which defeated South Korean Go champion Lee Se-dol 4 games to 1 in March, Silver believes the technology's next role should be to help advance human health.

Many announcements have been made recently in regard to the IoT and security, suggesting the industry is working hard to put people's fears to rest.

We are in the midst of a fundamental shift in the way in which organizations protect themselves from the modern adversary. Traditional rules-based cybersecurity applications of the past are not able to protect organizations in the new mobile, social, and hyper-connected world they now operate within. However, the convergence of big data technology, analytic advancements, and a variety of other factors has sparked a cybersecurity renaissance that will forever change the way in which organizations protect themselves. Tomorrow join Rocky DeStefano, Cloudera's Cybersecurity subject matter expert, as he explores how modern organizations are protecting themselves from more frequent, sophisticated attacks.

In this podcast from ISC 2016 in Frankfurt, Steve Pawlowski from Micron discusses the latest memory technology trends for high performance computing.

The way individuals interact with an app is about to fundamentally change. Invoking apps via a traditional user interface will become the exception.

IBM and Cisco's recent studies show that over 90% of the data in the world was created in the last two years and that 80%+ of the data on the internet is visual. There is a trend to use infographics to communicate that huge amount of information to target audiences. Infographics are also a powerful way for…

Insights from CIOs can reveal a lot about the industries in which they operate, and hearing from IBM's CIO is no exception. Check out this recent podcast featuring Jeff Smith, CIO at IBM, who offers a glimpse at his idea of focusing on culture, a story of transformation, the CIO's role and expectations and more.

CEO briefing: Some of the key trends developing in the IT industry could be vital to adopt in order to stay ahead of the competition.

Azure Stack, Microsoft's system for building private clouds that run the same software it uses in its public cloud, has been delayed until the middle of 2017, the company announced Tuesday. Microsoft has changed its strategy and will provide Azure Stack through integrated systems built by hardware partners, rather than allowing companies to deploy the software on any compatible hardware they choose. Microsoft has been pushing Azure Stack as a differentiator of its cloud platform compared to many of its competitors in the public cloud arena — especially Amazon. The change came about because the company wants to give businesses prevalidated hardware to simplify deployments of Azure Stack, according to Mike Schutz, the general manager of product marketing for Microsoft's Server and Tools division. At launch, Microsoft will be working with Hewlett-Packard Enterprise, Dell, and Lenovo.

The cloud will require careful coordination of many moving parts on the physical, virtual and application layers.

by Sheri Gilley, Microsoft Senior Software Engineer. SQL Server 2016 recently reached general availability, and one of the top new capabilities it features is SQL Server R Services: advanced analytics in-database with the R language. R has also been integrated into Power BI, allowing you to create fully integrated visualizations with the power of the R language.

Chef to make it simpler to acquire a range of open source technologies designed to automate everything from machine configuration to the entire app.

Business intelligence platform provider Sisense is taking user self-service for analytics to the next level through integration with Amazon Echo and IoT devices.

By the end of some students' first two weeks in a college course, an analytical model can determine with a 75% accuracy rate how well they'll end up doing. But should it? For the 2,200 students at community colleges and historically black universities, the Open Academic Analytics Initiative in 2014, a program developed by Marist College and… The post Colleges Can Now Figure Out Which Students Will Be Successful — Even Before Classes Start appeared first on Predictive Analytics Times.

Cisco extended the reach of its integrated management framework into the realm of IT security.

Sisense is a business analytics vendor. It actually has a pretty cool product that offers analytics baked into a silicon chip — essentially the idea around Sisense is to make business intelligence (BI) quicker and easier, and to disrupt the traditional model of data warehousing, heavyweight extract, transform and load (ETL) tools, and the need for highly paid data scientists. Sisense combines the hardcore analytical tools with the visualization tools that organizations need to actually display data insights. So Sisense is a cool company.

The most recent top 25 list of IT service providers from outsourcing analyst firm HfS Research leads with a couple of the usual suspects, with IBM and Accenture in the No. 1 and 2 spots, with 7.8 percent and 5.1 percent market shares, respectively. But not far behind are India's Tata Consultancy Services (TCS), at No. 5, offshore-centric Cognizant in 8th, and as-a-service Amazon Web Services (AWS) already in the No. 12 spot. HfS is calling it a "full-scale assault" on the traditional providers.

Online Travel Agents (OTAs) are known as an effective distribution channel for hotel rooms, but more major hotel brands are pushing direct bookings in an effort to reduce commissions paid out to agents, which leaves OTAs searching for the next goldmine product to fuel their continued growth. For years, hotels have enjoyed a non-competitive landscape with their loyalty programs by essentially ring-fencing their guests into the scheme: elite benefits and points were not offered if the room was booked through a third-party distribution channel such as an online travel agent.

There is a clear sense in the marketplace today that for the internet of things (IoT) to realize its true potential as the next big thing, analytics is going to be critical. After all, what is the purpose of connecting all these devices and gathering the data if we are not going to do anything with it? Unfortunately,… The post Opportunities and Challenges: Predictive Analytics for IoT appeared first on Predictive Analytics Times.

When you think of creativity and community, the last thing that comes to mind is a Key Performance Indicator – just writing that out made me feel completely dead inside. They've been around since the stone age, yet they remain the same and we continue to bang on about how important and necessary they are every chance we get.

Salesforce.com is stepping up its efforts to woo security-conscious businesses by adding "bring your own key" encryption to its Salesforce Shield cloud services. Introduced a year ago, Shield offers encryption, auditing and event-monitoring functions to help companies build cloud apps that meet compliance or governance requirements. Encryption is based on keys generated by Salesforce using a combination of an organization-specific "tenant secret" and a Salesforce-maintained master one. Originally, secrets and keys in Shield were generated and managed through Salesforce's built-in key-management infrastructure, accessed through a point-and-click interface.

In this contributed article, Dr. Jans Aasman, CEO of Franz Inc., takes a look at a combination of advancements in various technologies (cognitive computing, graph databases, visualizations, and data discovery) that deliver analytics results in a fraction of the time that IT departments typically require for analysis on even proprietary, relational data.

Crunchy Data, a leading provider of trusted open source PostgreSQL and PostgreSQL technology, support and training, announced that it is open sourcing the Crunchy PostgreSQL Container Suite.




There are certain people who absolutely love New York, and would never live anywhere else, and other people who would never consider living there.

Hortonworks is looking a lot more like its rivals, Cloudera and MapR, in offering content that is vendor-specific. That's a good thing, especially if you're a customer looking to implement a data lake — and seeking assurance that your vendor has a sustainable business model.

As organizations look to stay competitive by expanding their use of real-time analytics, implementation becomes a challenge. Finding options to effectively serve your company over the long term is often more difficult than it appears. We've identified 12 common obstacles you'll want to avoid as your company pursues real-time analytics.

This year on July 28th we will once again host the Wrangle Conference, the definitive single-track conference by and for data scientists. Wrangle explores the principles, practice, and application of data science across many industries. This is an opportunity for you to hear directly from practitioners on how they worked to solve complex problems and about their harrowing victories, and of course there's lots of hay!

By becoming citizen data scientists, business users are more invested in the data discovery process as they're given the tools for performing deep analysis.

The next time you chat with someone on Cisco Systems' Spark messaging service, that someone may not be a co-worker — or even a person. Welcome to the world of bots. On Monday at the Cisco Live conference, Cisco said it's working with the messaging company Gupshup so more developers can bring their bots into Spark. It also introduced a partnership with Api.ai, a natural-language software company, in part to help developers build interfaces where users can just talk to bots. Bots aren't brand-new to Spark, but Gupshup prides itself on its chat-bot development platform, which is designed to make it easy to build bots and make them available through popular communications channels. The list already includes Facebook, Skype, Twitter and Slack.

Whether your IoT service is connecting cars, homes, appliances, wearables, cameras or other devices, one question hangs in the balance: how do you actually make money from this service? Turning your IoT service into profit requires a monetization strategy that is flexible, scalable and working for you in real time. It must be a transparent, smoothly implemented strategy that all stakeholders, from customers to the board, will be able to understand.

Tesla's recent bid to acquire SolarCity and in effect become a new kind of energy provider has generated a huge amount of copy in recent days. Some cite it as another key milestone on the way to a new renewable energy world. Some cite it as the beginning of the end of the traditional utility model. (Or perhaps another nail in the coffin, as the end of the traditional utility model has been nigh for some time now…) And yet others have used the speculation and hype about this bid to remind us of the technical and financial realities of moving to an entirely new energy landscape.

A major change is taking place in the way companies staff and organize for business intelligence success. Here are the new types of skills people need to be enablers rather than impediments of success.
