Big Data News – 27 Jun 2016

Today's Infographic Link: Global Hospitality Index

Featured Article
In the wide world of racing, the goal is to field a car capable of zooming around the track at optimal speeds, avoiding a fiery crash into the wall or another car for 400-500 miles, and crossing the finish line. Sure, every driver wants to win the race, but keeping the car in one piece and on the track is actually more important, because race teams aren't just going for the race trophy; they're going for the championship. Staying in the race (even when you don't win) earns the points necessary to put the car and driver in the best possible position to take it all.

Top Stories
Are you attending the 2016 Esri User Conference? Consider yourself a mapping expert? Great news–we're launching a fun contest to put your skills to the test! Esri International User Conference attendees will get the chance to use their geo-location expertise to find awesome prizes. We're giving away one prize each day between Tuesday, June 28th…

Eight years ago today, on June 27, 2008, we filed the incorporation paperwork for Cloudera. We've come a long way since Jeff Hammerbacher, Amr Awadallah, Christophe Bisciglia and I huddled around a borrowed conference table at the Admob offices in San Mateo. The company has grown, of course; we are world-wide, with a big and fast-growing enterprise software business, great customers, outstanding partners and a world-class team of employees. The market has kept pace; when we started, only a small group of hard-core technical people knew what Apache Hadoop was.

Tips designed to give IT pros a way to ensure that their networks are operating smoothly and securely while they are out of the office.

Waterline Data, The Smart Data Catalog Company, today announces the integration of the company's Smart Data Catalog with Apache Atlas within Hortonworks Data Platform (HDP).

It may take a while for both programmers and the IT organizations they work for to get used to this new world order of application development.

The useR! 2016 conference, the annual gathering of R users from around the world, is already underway at Stanford University. Today is a day of interactive tutorials, and the presentation program…

While the market seems to be in flux, it seems certain that the physical attributes of hybrids are very well tailored to work environments.

Red Hat on Monday rolled out a major new release to its JBoss Enterprise Application Platform that's designed to offer better support for containers and cloud-native applications. It's been 10 years since Red Hat acquired JBoss, but much has changed in the technology world since then. Now, JBoss EAP 7 is optimized for cloud environments, Red Hat says. The platform combines Java EE 7 APIs (application programming interfaces) with key DevOps tools, including Red Hat's JBoss Developer Studio integrated development environment (IDE). Also included are Jenkins, Arquillian, Maven, and support for several Web and JavaScript frameworks.

In last week's post, I discussed the three phases that IoT initiatives go through as they enter the market. Today, I'll take a closer look at the first two phases, highlight four industries that have IoT initiatives, and give specific use cases for those industries. A little deep dive on the first two phases of…

In his general session at 18th Cloud Expo, Lee Atchison, Principal Cloud Architect and Advocate at New Relic, discussed cloud as a 'better data center' and how it adds new capacity (faster) and improves application availability (redundancy). The cloud is a 'Dynamic Tool for Dynamic Apps' and resource allocation is an integral part of your application architecture, so use only the resources you need and allocate/de-allocate resources on the fly.

It is one thing to build single industrial IoT applications, but what will it take to build the Smart Cities and truly society changing applications of the future? The technology won't be the problem, it will be the number of parties that need to work together and be aligned in their motivation to succeed. In his Day 2 Keynote at @ThingsExpo, Henrik Kenani Dahlgren, Portfolio Marketing Manager at Ericsson, discussed how to plan to cooperate, partner, and form lasting all-star teams to change the world and how it starts with business models and monetization strategies.

Digital initiatives create new ways of conducting business, which bring increasingly advanced security and regulatory compliance challenges with exponentially more damaging consequences. In the BMC and Forbes Insights Survey in 2016, 97% of executives said they expect a rise in data breach attempts in the next 12 months. Sixty percent said operations and security teams have only a general understanding of each other's requirements, resulting in a "SecOps gap" that leaves organizations unable to mobilize to protect themselves. The result: many enterprises face unnecessary risks of data loss and production downtime.

Storage is a good example of the catch-22 that enterprises find themselves in: The easier you make things for users, the tougher it gets for managers.

The enterprise's challenge will be integrating application servers written in Java with a new generation of so-called "cloud native" applications.

The other day I heard about an interesting experiment to deliver highly targeted advertising to Japanese consumers. Now, as we all know, Japan is a pretty consumer-driven society with a high uptake and appreciation for the latest and greatest in gadgetry — heck, this is the country that invented multi-function toilet seats which perform a plethora of functions in one. So given the sophisticated nature of the Japanese consumer, it is fair to expect that their advertising will be similarly sophisticated.

The latest version of a data integration tool released this week by management software specialist Talend offers "self-service" data preparation while helping shift more big data workloads to the cloud. Talend, Redwood City, Calif., on Monday (June 27) pitched its new data fabric platform as a way of achieving enterprise-scale support for self-service data preparation while leveraging corporate data lakes and other cloud resources via a cloud integration tool.

Full NFV adoption may take years to accomplish, but will enable network admins to work without purchasing dedicated hardware to build a service chain.

Here's a step-by-step guide on how to connect Power BI with SharePoint team sites without uploading files, allowing you to export and analyze data much more seamlessly.

In today's fast-paced and data-driven world, analysts need to be able to explore and visualize data very quickly. However, existing technologies do not make it easy to expose hidden data transformations, nor do they track the transformations when data moves. Your analytic insights take a backseat while the tedious refactoring process takes place. That's why we've created Anaconda Mosaic.

Pepperdata, the world's experts in the performance of distributed systems at scale, today announced the availability of Hadoop Health Check, a complimentary, expert assessment that evaluates and diagnoses Hadoop clusters of 100 nodes or more, and provides full visibility into current cluster conditions.

Data governance has emerged to the forefront of financial services ever since this vertical became inundated with increasing and evolving regulations, penalties, and regulatory entities at the end of the last decade. Stringent compliance requirements not only mandate that organizations provide accountability for data, but traceability, provenance, and auditability as well.

Cognitive systems will only eliminate a small portion of jobs in the near future, according to Forrester. But the impact will still be significant.

As global business increases competitive pressures, marketing departments face new challenges. They must anticipate and respond to frequently changing customer preferences and produce effective programs and campaigns to attract them. In the online world where customers can jump instantly from one company to another, Marketing must develop new ways to catch and hold their attention. Doing this well requires systematic, flexible planning that begins with the CMO and engages the entire department to utilize the full portfolio of resources and act as one to serve their mission.

When it comes to data recovery, the current best practice employed by any organization is based on two key factors: time and selectivity. In short, IT departments respond to a disaster recovery situation as swiftly as they can to restore the most business-critical operational data they can and, at present, this does not include big data.

Black hats are increasingly looking to sites like LinkedIn to harvest information on employees and their roles within a company.

"A lot of times people will come to us and have a very diverse set of requirements or very customized need and we'll help them to implement it in a fashion that you can't just buy off of the shelf," explained Nick Rose, CTO of Enzu, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.

The now mainstream platform changes stemming from the first Internet boom brought many changes but didn't really change the basic relationship between servers and the applications running on them. In fact, that was sort of the point. In his session at 18th Cloud Expo, Gordon Haff, senior cloud strategy marketing and evangelism manager at Red Hat, will discuss how today's workloads require a new model and a new platform for development and execution. The platform must handle a wide range of recent developments, including containers and Docker, distributed resource management, and DevOps tool chains and processes. The resulting infrastructure and management framework must be optimized for distributed and scalable applications, take advantage of innovation stemming from a wide variety of open source projects, span hybrid environments, and be adaptable to equally fundamental changes happening in hardware and elsewhere in the stack.

Accelerated Threat Triage and Expanded Deployment Options. Two months ago, the Metron Engineering and PM team released Technical Preview 1 of Apache Metron based on the 0.1 release. We shared our vision for an open, community-based cybersecurity solution that provides real-time, cross-referenced and contextualized big data to combat cyber threats. Apache Metron Reference Architecture. As the above… The post Apache Metron Tech Preview 2 Available Now! appeared first on Hortonworks.

In this slidecast, Marc Hamilton from Nvidia describes the latest updates to the company's Deep Learning Platform. "Great hardware needs great software. To help data scientists and developers make the most of the vast opportunities in deep learning, we're announcing today at the International Supercomputing show, ISC16, a trio of new capabilities for our deep learning software platform. The three — NVIDIA DIGITS 4, CUDA Deep Neural Network Library (cuDNN) 5.1 and the new GPU Inference Engine (GIE) — are powerful tools that make it even easier to create solutions on our platform."

In this special guest feature, Adam Mollenkopf, Real-Time GIS Capability Lead at Esri, discusses how geospatial technology helps companies make the most of real-time data.

As Apache Hadoop celebrates its 10th birthday this year, it has become the central component of the next generation data architecture. Many of the world's largest organizations have several production workloads running on Hadoop for new revenue generating applications, to stay competitive and relevant in their industry and to become more agile and efficient. As enterprise adoption grew, so did the requirements for security and compliance. Last year, Syncsort joined Cloudera to provide a unified foundation for open metadata and end-to-end visibility for governance. We helped our joint customers to secure and govern their data and meet regulatory compliance requirements with solutions leveraging Syncsort's big data integration product, Syncsort DMX-h, tightly integrated with Cloudera Enterprise Data Hub (EDH), Cloudera Manager, Apache Sentry, and Cloudera Navigator.

When cars left the starting line at Sonoma Raceway in California on Sunday for the start of the Toyota/Save Mart 350, they were taking part in the launch of a product designed to help NASCAR officials monitor and manage the 110-lap race. New race management software that NASCAR launched on Sunday is designed to give officials a single screen to watch where cars are on the racetrack, manage penalties and share information with racing teams about what's going on. It arose from a partnership between NASCAR and Microsoft that started in 2014. It began with a mobile inspection app that let race officials see whether cars were in compliance with all the rules about how they have to be constructed.

Guest blog post by Manish Bhoge. I am back! Yes, I am back on my learning track. Sometimes it is really necessary to take a break and reflect on why we learn before learning. It was a nine-month safe refuge to learn how Big Data & Analytics can contribute to a data product. Data strategy has always been expected to generate revenue. As Big Data and Hadoop enter the enterprise data strategy, big data infrastructure is also expected to add revenue. This is a really tough expectation of a new entrant (Hadoop) when the established candidate (data warehouse & BI) itself mostly struggles for its existence. So it is very pertinent for solution architects to ask WHERE and WHY to bring Big Data (obviously Hadoop) into the data strategy.

Guest blog post by Irina Papuc. Machine Learning (ML) is coming into its own, with a growing recognition that ML can play a key role in a wide range of critical applications, such as data mining, natural language processing, image recognition, and expert systems. ML provides potential solutions in all these domains and more, and is set to be a pillar of our future civilization. The supply of able ML designers has yet to catch up to this demand. A major reason for this is that ML is just plain tricky. This tutorial introduces the basics of Machine Learning theory, laying down the common themes and concepts, making it easy to follow the logic and get comfortable with the topic. What is Machine Learning? So what exactly is "machine learning" anyway?

In recent years, we've seen a quiet but steady movement to open up more datasets for use by researchers, local governments, product development teams, and just about anyone else. Today, this "open data" movement makes an enormous amount of data freely available. So what exactly is open data? The industry group Open Knowledge International offers this definition in its Open Data Handbook: "Open data is data that can be freely used, re-used and redistributed by anyone–subject only, at most, to the requirement to attribute and share alike." This is important in the development of new products or services: data sharing drives innovation and collaboration, enabling deeper insights that would not have been possible without open data sets.

The human auditory system gives us the extraordinary ability to converse above the chatter of a lively cocktail party. Selective listening in such conditions is an extremely challenging task for computers, and has been the holy grail of speech processing for more than 50 years. Previously, no practical method existed in the case of single channel mixtures of speech, especially when the speakers are unknown, but now Mitsubishi Electric Research Labs (MERL) are addressing the problem of acoustic source separation with a deep learning framework they call "deep clustering". 

From Python to R and SAS to Spark, bring all of your analytics packages together with one trusted platform for analytics governance.

There is little doubt that Big Data solutions will have an increasing role in the Enterprise IT mainstream over time. Big Data at Cloud Expo – to be held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA – has announced its Call for Papers is open. Cloud computing is being adopted in one form or another by 94% of enterprises today. Tens of billions of new devices are being connected to The Internet of Things. And Big Data is driving this bus. An exponential increase is expected in the amount of information being processed, managed, analyzed, and acted upon by enterprise IT. This amazing increase is not part of some distant future – it is happening today. One report shows a 650% increase in enterprise data by 2020. Other estimates are even higher. A recent Gartner report predicts that the volume of enterprise data overall will increase by a phenomenal 650% over the next five years. Help plant your flag in the fast-expanding business opportunity that is Big Data: submit your speaking proposal today!

BlueData embraces the enterprise with a big data hybrid service for public cloud. Couchbase updates server with features that bridge SQL and NoSQL. Samsung bets big on IoT. Dell sells Statistica. It's our Big Data Roundup for the week ending June 26, 2016.

Brexit — the vote by the UK to exit the European Union — will have far-reaching impact on the enterprise tech business, not only in Europe and the UK, but also in the US and Asia for companies doing business in the region.

Digital Transformation is happening everywhere you look. It is impacting businesses of any size, in any industry, any market and every geography. Many organizations recognize the importance of…

With a focus on driving business outcomes and customer experience by way of leveraging analytical services and products, the National Association of Software and Services Companies (Nasscom) inaugurated the fourth edition of its Big Data & Analytics Summit 2016 in Hyderabad.

Kyvos Insights, a big data analytics company, announced general availability of the next major version of Kyvos — a massively scalable, self-service analytics solution designed to make big data lakes ready for BI analysts.

A new study shows that most people prefer that self-driving cars be programmed to save the most people in the event of an accident, even if it kills the driver. Unless they are the drivers.

Social gaming company Playtika uses big-data analytics to deliver captivating user experiences and engagement. We'll learn how feedback from massive user action streams can be analyzed in bulk rapidly to improve the features and attractions of online games and so help Playtika react well in an agile market. To learn more about leveraging big data in the social casino industry, we're pleased to welcome Jack Gudenkauf, Vice President of Big Data at Playtika in Santa Monica, California. The discussion is moderated by me, Dana Gardner, Principal Analyst at Interarbor Solutions.

In this special guest feature, Errol Apostolopoulos, Senior Vice President of Product at Crimson Hexagon, makes the case that when data is democratized or distributed across the enterprise, greater transparency ensues, leading to better business decisions across the board.

Arguably the biggest driver of productivity and growth in the next decade, the Industrial Internet of Things will accelerate the reinvention of sectors that account for almost two-thirds of world output.

Authors: Vinay Shukla, Hortonworks, Moon So Lee, Apache Zeppelin PMC & NFLabs, Prabhjyot Singh, Apache Zeppelin PMC & Hortonworks. Recently the Apache Software Foundation (ASF) announced Apache Zeppelin as a top level project. This was a great milestone for both the Zeppelin and data science community. Since its incubation in ASF in December 2014, the… The post Apache Zeppelin: The Road Ahead appeared first on Hortonworks.

Britain votes to leave the EU, but what does this mean for the technology industry? Let's take a look at the situation in the United Kingdom of Great Britain and Northern Ireland. But let's keep this a politics-free zone, eh? In IT Blogwatch, British bloggers panic (or not). Your humble blogwatcher curated these bloggy bits for your entertainment. Not to mention: don't panic…

A real-time notifications system was a champ behind-the-scenes at The Championships, Wimbledon 2015 by enabling its digital and content team to break the news of a key tournament statistics milestone that scooped media organizations worldwide. See what value an extension to that system is adding to the 2016 event to engage fans through predictability and real-time insight.

What happens when you take the scripts from dozens of sci-fi movies and TV series, and feed them (along with a couple of seed prompts) into a long short-term memory recurrent neural network? You get this bizarre screenplay, with dialogue for three characters (named H, H, and C; one of the H's had to be renamed H2) and lines and stage directions like: C: "Well, I have to go to the skull. I don't know." "He picks up a light screen and fights the security force of the particles of a transmission on his face." Director Oscar Sharp assembled a cast and crew (including Silicon Valley's Thomas Middleditch) and produced the screenplay as a short film. The results are …

Consider the roads, highways, railways, ports, and airports around you. With immense variety in scale, function, and complexity, our planet's transportation infrastructure ranks high among mankind's most impressive achievements. It owes much of its existence to geospatial technologies. Geospatial information is a key tool for processes by which transportation projects are conceived, planned and constructed. Once an infrastructure project is in operation, geospatial data enables owners and maintenance teams to keep the infrastructure in good condition and adapt to changes in demand and public needs. There are uncounted examples that illustrate how modern geospatial solutions have increased efficiency and reduced costs in building and operating transportation infrastructure. However, the benefits are not limited to the infrastructure itself.

Just a cursory look around the world of cyber security is enough to deduce: we aren't doing a good enough job. The cyber criminal organization Carbanak whipped up a concoction of malware that caused ATMs around the world (25 countries total, mostly in Russia) to spew cash at random customers. The total loss is estimated to be about $1 billion. Another group of cyber criminals, called Equation, has developed what security experts are calling the nastiest piece of malware ever discovered.

As it edges away from oblivion, BlackBerry has simultaneously edged away from being a device company.

BlueData, provider of a leading Big-Data-as-a-Service software platform, announced the new summer release for the enterprise edition of its BlueData EPIC software.

Artificial intelligence is still surrounded by an aura of mystery, and it would be tough to find a better illustration than the story in the news last week about a British grandmother who includes "please" and "thank you" in all her Google searches. "Please translate these roman numerals mcmxcviii thank you," read the search request from May Ashworth that ultimately went viral when her grandson tweeted it on Twitter. "I thought, well somebody's put it in, so you're thanking them," Ashworth reportedly explained. "I don't know how it works, to be honest. It's all a mystery to me."
