Big Data News – 28 Jun 2016

Today's Infographic Link: Modern Science Map

Featured Article
At Hadoop Summit in San Jose Tuesday Hadoop distribution vendor MapR Technologies took the wraps off a new initiative intended to maximize the productivity of users and administrators of big data deployments. "This is putting some focus on the user and operator side," says Dale Kim, senior director of Product Marketing at MapR. "It provides deep visibility and full control." MapR calls it the Spyglass Initiative — a comprehensive, open and extensible approach to centralized monitoring of big data deployments.

Top Stories
European Union rules for electronic signatures change on Friday to make a clear distinction between the identity of the person signing, and that of the authority guaranteeing the integrity of the data, but the technology needs to be still simpler, vendors say. The new rules are intended to simplify the process of electronically signing contracts between businesses, or between businesses and persons, and across international borders where different and often incompatible electronic signature rules apply today.

Document databases are popular because they provide a more natural construct for building apps in a database that humans more naturally understand.

Early this year, we announced our partnership with Pivotal and Syncsort,  incorporating key technologies from the ecosystem to optimize the value from Hortonworks Connected Data Platforms. Today, I am very excited to announce an addition with our partnership to provide global access to and resell AtScale. Customers are constantly asking us to find simpler, faster… The post Partnering With AtScale to Bridge the Gap Between Business Users and Hadoop appeared first on Hortonworks.

SANTA CLARA and SAN JOSE, Calif., (Hadoop Summit) — June 28, 2016 — Hortonworks, Inc.® (NASDAQ: HDP), a leading innovator of open and connected data platforms, today announced it will resell technology from AtScale, the first company to provide business users with a fast and secure self-service BI platform for Hadoop.  

Hadoop Summit San Jose is here once again and with it comes a reminder of the power of the Open Source Community and the tremendous innovation which continues to occur within the Apache Hadoop ecosystem.  At Hortonworks, we get the opportunity to engage with this vibrant, creative, and talented group of engineers all year round,… The post Latest innovation within Hortonworks Data Platform (HDP) 2.5 unveiled appeared first on Hortonworks.

European Union rules for electronic signatures change on Friday to make a clear distinction between the identity of the person signing, and that of the authority guaranteeing the integrity of the data, but the technology needs to be still simpler, vendors say. The new rules are intended to simplify the process of electronically signing contracts between businesses, or between businesses and persons, and across international borders where different and often incompatible electronic signature rules apply today. But while the new rules will simplify the legal environment, today's technical environment makes it too difficult to create and securely manage digital identities, according to the Cloud Signature Consortium.

NoSQL database MongoDB is adding a new cloud-based elastic SaaS version to its delivery options. The company is also announcing a native connector to Apache Spark during its MongoDB World event.

Enterprises are inundated with data from social, mobile, IoT and other technologies. The pace of the data flow is only accelerating. Over 300 hours of videos are uploaded on YouTube every minute, and this has grown from 100 hours every minute in 2013. Facebook users watch the equivalent of 750 years of video every day…. The post The Big Picture: Today's Data Analytics Stack appeared first on Predictive Analytics Times.

Over the past century, IBM has reinvented itself again and again. This evolution is in stark contrast to one-trick ponies in the information technology business. Companies like LinkedIn do one thing, maybe quite well, until they reach the end of their run, in this case a purchase by Microsoft. IBM, on the other hand, has been forced to remake its business a number of times — moving from typewriters to mainframes, from mainframes to PCs, from PCs to infrastructure, and now, building on that last round, to cloud computing, analytics and infrastructure for the cognitive era. Each time, the change has been painful but effective. Thousands of people lose their jobs in the older businesses, and thousands more are hired to staff the new ones. The company is such a veteran of this type of move that it has even developed a template for self-regeneration. Few other companies have that.

At Hadoop Summit in San Jose Tuesday Hortonworks announced a new release of its Hortonworks Data Platform (HDP) Hadoop distribution, an expansion of its Partnerworks program and a new partnership with AtScale, provider of a self-service Business Intelligence (BI) platform for Hadoop. "There has been a marked change around the business conversations that are being held around Hadoop and big data," says Matt Morgan, vice president of product and alliance marketing at Hortonworks. "The conversations have completely shifted. It's about transformational use cases."

In the digital era where consumers are just as likely to purchase online as in a brick-and-mortar location, delivering a seamless channel experience has become the new competitive imperative. Retailers are ramping up their investments in omnichannel and multichannel strategies to deliver exceptional experiences, wherever and whenever today's consumers choose to interact with brands. What is omni-channel retail? Omnichannel is an approach to marketing and retail that utilizes multiple communication channels to reach customers.

Online backup company Backblaze made waves last year when it announced a beta test last year of B2, a new public cloud storage service that would be cheaper than competing offerings from Amazon and Microsoft. B2 has exited beta with some new features that may make it more appealing to business users. It now has a service level agreement guaranteeing 99.9 percent uptime for all data stored within it, matching the baseline offerings from Amazon Web Services' S3 and Microsoft Azure's Blob Storage service.

Users of Outlook can now pull data directly from the Salesforce CRM into Outlook via a Lightning for Outlook plugin.

The number of developers worldwide who are turning their attention to the Internet of Things has increased 34% in the past 12 months, according to Evans Data.

Dataguise, a technology leader in secure business execution, today announced the general availability of Dataguise DgSecure® 6.0, the most recent release of the company's data-centric security platform.

First came the data lake, then the data hub. This week at its annual Hadoop Summit conference, Hortonworks (NASDAQ: HDP) will be sharing its vision of a connected data plane that presents a single interface for analyzing and managing data wherever it is-at rest in Hadoop, in motion on Kafka, in the data center, or in the cloud.

The majority of sales generated in the 4.5 trillion dollar US retail market is in-store and the volume of transaction data collected at various points in the trading process is immense. This data is a treasure trove of customer insight as well as product performance. While many retailers mine this data to gain specific insights into understanding their shoppers better, the possibilities that such data analysis opens up is largely untapped.

Despite an increasing focus on data storage and access standards across organizations, security has remained a cat and mouse game where hackers continue to win on a fairly routine basis.

Microsoft's Outlook.com is used by some 400 million users around the world, so it's only natural that Salesforce wants its own software to play nicely with it. On Tuesday, the CRM giant announced a big step in that direction. The latest in a series of integrations resulting from the two-year-old partnership between Salesforce and Microsoft, Lightning for Outlook is an add-in that promises to let salespeople tailor their inboxes with smooth access to customer relationship management (CRM) data whenever they need it.

As Hadoop settles into the enterprise, deployments are getting bigger and more complex. To help its customers stay on top of things, MapR Technologies today unveiled Spyglass, a new operations console that's customizable. The company also rolled out the first deployment pack that separates the core components of the distro from fast-moving projects like Spark and Drill. MapR started the Spyglass initiative in response to input from MapR customers who were watching the complexity level increase in their MapR deployments.

Microsoft's Outlook.com is used by some 400 million users around the world, so it's only natural that Salesforce wants its own software to play nicely with it. On Tuesday, the CRM giant announced a big step in that direction. The latest in a series of integrations resulting from the two-year-old partnership between Salesforce and Microsoft, Lightning for Outlook is an add-in that promises to let salespeople tailor their inboxes with smooth access to customer relationship management (CRM) data whenever they need it.

ODPi, a nonprofit organization accelerating the open ecosystem of big data solutions, announced that the Apache Hadoop® distributions products from Altiscale, ArenaData, Hortonworks, IBM, and Infosys are now ODPi Runtime Compliant.

Online backup company Backblaze made waves last year when it announced a beta test last year of B2, a new public cloud storage service that would be cheaper than competing offerings from Amazon and Microsoft. B2 has now exited beta with some new features that may make it more appealing to business users. It now has a service level agreement guaranteeing 99.9 percent uptime for all data stored within it, matching the baseline offerings from Amazon Web Services' S3 and Microsoft Azure's Blob Storage service. In addition, users can now purchase expanded support from Backblaze, which will give them guaranteed rapid responses to support questions and — at the highest level — access to a phone number for around-the-clock support.

In his keynote at 18th Cloud Expo, Andrew Keys, Co-Founder of ConsenSys Enterprise, provided an overview of the evolution of the Internet and the Database and the future of their combination — the Blockchain. Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life settlement products to hedge funds and investment banks. After, he co-founded a revenue cycle management company where he learned about Bitcoin and eventually Ethereum.

Northeastern's ALIGN program draws upon undergraduates with degrees in other disciplines, from English and political science to business and biotech.

As the banking industry pursues improved customer engagement, unlocking the value of data becomes critical in designing a successful loyalty programme. The balance of power in banking has changed. What customers expect, how they want to be serviced, what information they are prepared to share, and how loyal they are prepared to be, have all changed radically. According to leading industry analysts, Forrester Research, we are in the age of the customer, in which the only sustainable competitive advantage is knowledge of and engagement with customers.

Somewhere between blind faith and skepticism is the world of prescriptive analytics. Here, machine-generated action items and potential outcomes meet human decision-making. Finding the right balance between algorithms and common sense can be tricky, so consider these tips.

First, a confession. When David and I started writing this series, we envisioned it would be completed after the third part. It appears we were a bit short-sighted. An astute reader challenged us to help address the concepts of preserving queue message data integrity. With that, we welcome you to part 4. In parts 1, 2 and 3 of this series we wrote at length about the possible compliance and security issues associated with the queueing of cardholder data (CHD) within a PCI compliant cardholder data environment (CDE).

First, a confession. When David and I started writing this series, we envisioned it would be completed after the third part. It appears we were a bit short-sighted. An astute reader challenged us to help address the concepts of preserving queue message data integrity. With that, we welcome you to part 4. In parts 1, 2 and 3 of this series we wrote at length about the possible compliance and security issues associated with the queueing of cardholder data (CHD) within a PCI compliant cardholder data environment (CDE).

Bigstep, the only provider of a full-stack bare-metal cloud built for big data, announced its launch in the U.S. From its new Chicago-based data center and U.S. headquarters, Bigstep provides companies with the security and performance of a private bare-metal infrastructure, the self-service and on-demand scalability of the public cloud, and the versatility and convenience of an integrated stack of big data applications. "

Here's this week's news in Data Science and Big Data. Don't forget to subscribe if you find this useful! Interesting Data Science Articles and News Columbus Wins U.S. DOT's Smart City Challenge — Columbus, Ohio will receive money to implement automated driving, connected vehicles, and sensor technology in their transportation systems. What's Really Warming the World? — See how different factors contribute to global warming in this data visualization based on finings from NASA's Goddard Institute for Space Studies.




Recently, I developed a handful of demos using open source technologies for detecting and alerting fraudulent events, incidence of poor customer experience and arrival of target subjects in geo-fenced locations for marketing purposes. The use cases required detection of individual events from streaming data sources and processing complex set of rules for identifying events of interest to create alerts for enabling data-driven insights and actions. I selected the Apache Hadoop technologies, namely, Kafka, Storm, HDFS and HBase as they were found to be the best fit for these use cases and the tools had been deployed in large scale operation by reputed multinational organisations.

For companies looking to reduce the cost and complexity of virtual desktop infrastructure (VDI), the attraction of Desktop-as-a-Service (DaaS) is that you can greatly reduce up-front investment. "It's pay as you go and you only pay for what you need," says Mark Lockwood, research director at Gartner.

NASCAR is bringing modern technology to race day, using a new system to tie together data on each race so officials can make decisions fast. We got to see it in action at the Sonoma Raceway in California.

Teradata (NYSE: TDC), the big data analytics and marketing applications company, today announced the certification of multiple business intelligence (BI) and visualization solutions on the Teradata Distribution of Presto.

In this infographic, discover how you can use weather analytics to help your policyholders stay ahead of the storm by protecting property when adverse weather events are still on the horizon.

Watch this two-minute demo to learn how you can better understand your clients and better manage their portfolio. Clients want to receive personalized advice and tailored portfolio recommendations. How can wealth management firms leverage data to not only enhance the client experience, but to also drive profitability? The IBM Client Insight for Wealth Management Powered by Watson solution can effectively predict, personalize and address the changing needs of the current and next generation of clients, helping financial advisors deliver customized advice and service more clients.

Summary:  In this Lesson 2 we continue to provide a complete foundation and broad understanding of the technical issues surrounding an IoT or streaming system so that the reader can make intelligent decisions and ask informed questions when planning their IoT system.  In Lesson 1 In This Article In Lesson 3 Is it IoT or Streaming Stream Processing

This entry was posted in News and tagged , , , , , , , , , , , . Bookmark the permalink.