Big Data News – 23 Aug 2016

Today's Infographic Link: How to beat the dreaded jet lag

Featured Article
It has been more than five years since James Dixon of Pentaho coined the term "data lake." His original post suggests, "If you think of a data mart as a store of bottled water — cleansed and packaged and structured for easy consumption — the data lake is a large body of water in a more natural state." The analogy is a simple one, but in my experience talking with many end users there is still mystery surrounding the concept.

Top Stories
The provider of ETL tools is seeing a spike in demand for mainframe data integration expertise thanks mainly to the rise of Big Data applications.

Microsoft has agreed to acquire Genee, an AI-powered scheduling service, to drive intelligent experiences in Office 365.

Datrium announces it can now make 100TB of Flash storage available per server using Datrium DVX software running on top of a virtual machine.

Using new techniques of information modeling, indexing, and processing, new cloud-based systems can support cloud-based workloads previously not possible for high-throughput insurance, banking, and case-based applications. In his session at 18th Cloud Expo, John Newton, CTO, Founder and Chairman of Alfresco, described how to scale cloud-based content management repositories to store, manage, and retrieve billions of documents and related information with fast and linear scalability.

The wearable device can be activated to sound an alarm and send out a location-based text alert in the event of an attempted assault.

In this contributed article, Dan Turchin, BigPanda's VP of Growth Strategy, provides some insights into Big Data's growth around the transportation industry.

In this special guest feature, Prat Moghe, CEO of Cazena, highlights the important considerations for migrating data warehousing to the cloud.

News: Adam Selipsky will also be the president of Tableau while the current CEO and co-founder Christian Chabot will head the board of directors.

Market researchers now have a great variety of useful tools that are free and open to use. You can use such instruments to enhance your business and significantly improve the effectiveness of your work. Rather than leaving you to waste your precious time searching for the best instrument among the abundance of useful online tools, we have made a list of the Top 10 Market Research tools for your business.

In today's digital economy, companies are faced with a fast data challenge as well as a Big Data one. As a result they are under pressure to adapt their analytics processes and data flows at pace to move beyond traditional data warehouse silos. Big Data projects are either too big or too complex to handle the traditional way. That's why most projects by companies at the start of their Big Data initiative have no process at all.

"My role is working with customers, helping them go through this digital transformation. I spend a lot of time talking to banks, big industries, manufacturers working through how they are integrating and transforming their IT platforms and moving them forward," explained William Morrish, General Manager Product Sales at Interoute, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.

Data scientists use statistical analysis tools to find non-obvious patterns in deep data. But they know the universe is full of spurious correlations. Big data simply intensifies the problem because, as the range of sources and the diversity of predictors continue to grow, the number of relationships that can potentially be modeled begins to approach infinity. As David G. Young pointed out, "predictive variables sometimes aren't …. We've all seen variable interactions that change the significance, curvature, and even the sign of an important predictor."

News: Genee acts as a digital personal assistant to schedule and reschedule meetings.

While most big enterprises use some form of predictive analytics, many do not use anticipatory analytics, which identifies changes before they happen.

Akana has announced the availability of version 8 of its API Management solution. The Akana Platform provides an end-to-end API Management solution for designing, implementing, securing, managing, monitoring, and publishing APIs. It is available as a SaaS platform, on-premises, and as a hybrid deployment. Version 8 introduces a lot of new functionality, all aimed at offering customers the richest API Management capabilities in a way that is easier than ever for API and app developers to use.

Microsoft on Monday announced it bought a startup to boost its artificial intelligence capabilities, and rival Apple confirmed it has boosted its health focus with an acquisition of its own.

The Internet of Things, commonly known as IoT, is spreading at a much faster rate than I initially thought it would about a year ago. I have been curious to learn what IoT is all about and encountered a large community of contributors and followers in the two dominant open source forums, Arduino and RaspberryPi. Both are working hard to transform the lives of people, organisations and cities around the world with the power of digital.




WANdisco (LSE: WAND), a leader in Active Transactional Data Replication™ enabling global enterprises to meet today's data challenges of secure storage, scalability and availability both on-premise and in the cloud, announced the release of WANdisco Fusion™ 2.9.

Data visualization specialist Tableau Software is bringing in seasoned talent to help it compete in an increasingly cloud-based world. The company has hired longtime Amazon Web Services executive Adam Selipsky as its CEO, replacing cofounder Christian Chabot. Chabot, who has been CEO for 14 years, will continue to serve as chairman of Tableau's board of directors.

Online retailer eBay is attempting to extend its machine learning capabilities beyond automatic language translation to e-commerce uses designed to make product searches more relevant. As automation improves, the company said one goal is eliminating the search box. Meanwhile, development cycles have been reduced as more machine learning libraries are released to the open source community. That has stimulated innovation.

Every summer, technologists turn to Gartner's Hype Cycle for Emerging Technologies, which has become a barometer of sorts for gauging the state of various hardware and software innovations that are expected to impact business and society over the next decade. This year, Gartner analysts have their eye on all manner of artificial intelligence technologies, including "smart machines" that can learn by themselves.

Kubernetes, Docker and containers are changing the world, and how companies are deploying their software and running their infrastructure. With the shift in how applications are built and deployed, new challenges must be solved. In his session at DevOps Summit at 19th Cloud Expo, Sebastian Scheele, co-founder of Loodse, will discuss the implications of containerized applications/infrastructures and their impact on the enterprise. In a real world example based on Kubernetes, he will show how to migrate an existing application to Docker and Kubernetes, and what the benefits are.

If you've been shopping around for cloud services, especially for big data storage and analytics, you've likely stumbled across the term 'metal cloud' or 'bare metal cloud'. The bare metal cloud is a public cloud service offering that allows an organization to rent dedicated servers and hardware resources from a remote cloud service provider.

Sharon Machlis is a journalist with Computerworld, and to show other journalists how great R is for data visualization she shows them these five data visualizations, each of which can be created in 5 lines of R code or less.

The vast majority of analytics efforts are expended on problems that are tactical in nature. That's not necessarily wrong. Tactics sometimes get a bad rap, but the truth is that the vast majority of decisions we make in almost any context are tactical. The problem isn't that too much analytics is weighted toward tactical issues; it's that strategic decisions don't use analytics at all.

And they said resilience–continuous data access in the face of outages, failures and downtime–across distributed data sources is impossible. Yet the recent IBM BigInsights release offers this capability in its IBM Big Replicate technology. Get an inside look at resilience in an interview with Jim Campigli, cofounder of WANdisco and its chief product officer, who explains the active-active replication technology that makes resilience a reality.

You have a problem: Businesses all over the world are facing a serious issue. Employees are increasingly overworked, disengaged, and bogged down by inefficient processes. A Gallup poll of more than 80,000 workers found that just 31.5% described themselves as being engaged at work. The majority, 51%, say they're disengaged, and the final 17.5% reported actually being "actively disengaged." Legacy enterprise software is arguably one of the least engaging aspects of any job.

It's a tough time for CIOs. They're facing multiple challenges based on current technology trends, changing markets and shifting organizational culture. To see how CIOs around the world are dealing with these and other factors, join an upcoming CrowdChat that analyzes the results of a new CIO study.

Cloud Expo 2016 New York at the Javits Center New York was characterized by increased attendance and a new focus on operations. These were both encouraging signs for all involved in Cloud Computing and all that it touches. As Conference Chair, I work with the Cloud Expo team to structure three keynotes, numerous general sessions, and more than 150 breakout sessions along 10 tracks.

What would you do if nearly a third of your employees were making mistakes that could cause serious harm to the company?

The enterprise has quite a bit of legacy data infrastructure to support, and it can't scrap that investment just because something new has come along.

Growing up in Silicon Valley, the daughter of a mechanical engineer, I always wanted to work in technology. However, I never quite managed to get myself exposed to a tech field until I'd already developed a complex about my abilities in the STEM fields. I pivoted towards reading- and writing-oriented professions and…

Cloudera Enterprise 5.8 includes the latest release of Hue (3.10), the web UI that makes Apache Hadoop easier to use. As part of Cloudera's continuing investments in user experience and productivity, Cloudera Enterprise 5.8 includes a new release of Hue that makes several common tasks much easier.

Augé describes non-places as quantifiable and measurable physical space, but does not give an appropriate measure to do so. This contribution considers using crowd-harvested photo data, a specific kind of Big Data, to measure non-places in the context of tourism by giving a theory-based discussion of Augé’s non-places, photography as a key element of "doing" tourism, and the selection processes of taking and uploading photos. Solely using theoretical thoughts and propositional logic, this contribution indicates that it could be possible to do so.

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: "Pig Doing Yoga: How to Build Superflexible Pig Scripts" by gkeys. We know that parameter passing is valuable for pig script reuse. One lesser known understanding is…

To leverage Continuous Delivery, enterprises must consider impacts that span functional silos, as well as applications that touch older, slower moving components. Managing the many dependencies can cause slowdowns. See how to achieve continuous delivery in the enterprise.

While interest in IoT technologies continues to rise among businesses, many organizations are still unsure how it will directly benefit the bottom line. However, the IT department and the CIO are helping to lead the way for those companies that are adopting the technology.

By: Eric Siegel, Founder, Predictive Analytics World. In anticipation of his upcoming conference keynote presentation, "Fraud Screening for 2/3rds of All Card Transactions: A Consortium and Its Data," at Predictive Analytics World Financial in New York City, October 23-27, 2016, we asked Scott Zoldi, Chief Analytics Officer at FICO, a few questions about his work in predictive analytics.

The cannabis industry is growing up, and it would be tough to imagine more convincing proof than Microsoft's recent announcement that it's getting involved. Though the software giant will stay very much in the background — its role will focus primarily on providing Azure cloud services for a compliance-focused software push — the move is still widely viewed as a telling sign.




Lyft, the San Francisco ride-sharing company and Uber competitor, is thriving on AWS IT services and looking to ring up its first $1 billion. Here's what IT pros can learn from the example.

Originally published in The European Business Review. The NSA can leverage bulk data collection with predictive analytics to target law enforcement activities. But this little-known capability both intensifies and redefines the debate over how much data governments should be collecting. About this article: This article is excerpted from the Revised and Updated edition of Eric…

It's been a busy time for tech's ongoing infatuation with containers. Amazon just announced EC2 Container Registry to simplify container management. The new Azure container service taps into Microsoft's partnership with Docker and Mesosphere. You know when there's a standard for containers on the table there's money on the table, too.

PLUMgrid Networks delivers a CloudSecure module that makes it simpler to secure individual virtual network segments in an OpenStack environment.

Think of a network that’s over a decade old, and the first words that come to mind probably aren’t “flexible” or “scalable.”

Our director of engineering told me that she had a customer ask if we could do real-time data processing with Syncsort DMX-h. Knowing that real-time means different things to different people, the engineer asked what exactly the customer meant by real-time. He said, "We want to be able to move our data out of the database and into Hadoop in real-time every two hours."

We asked participants to tell us why they would like to take part in the course, and how they would use the data science Boot Camp training in their current roles. After going through all the participants' videos, and carefully scoring each criterion, we have a tie!
