Big Data News – 31 May 2016

Featured Article
It's not your average company that can trace its origins back to a nineteenth-century Russian tsar, but then, Sberbank is no average financial institution. Established through a decree by Emperor Nikolai I in 1841, Sberbank is Russia's oldest bank and has played a long and storied role in the nation's history. Today, with more than 16,000 branches in all 83 constituent entities of the Russian Federation — traversing 11 time zones — it serves roughly 70 percent of the Russian population. Therein lie the roots of the bank's very modern challenge.

Top Stories
The issue is thorny because it involves spectrum being agilely shared between commercial and non-commercial users.

Predictive analytics shows how a consumer will interact with content, but it has limited insight into why. Uplift modeling predicts what will influence a consumer to take the action you want. This free webinar from the Predictive Analytics World conference series gives an introduction into this rapidly growing area of data modeling.

Microsoft has repositioned Microsoft Ventures, its internal startup accelerator, to focus on investments in early-stage businesses.

I recently bought a smoker box for my BBQ from the local hardware store. I go in there all the time, often with my two children. The store owner Jeff recognized me, and asked "are you starting to… …

Guidance Software CEO discusses the relationship of private versus public sectors, particularly when it comes to the jurisdiction of security events.

If you're embarking upon a big data analytics project, you're likely considering some sort of self-service data preparation tool to help you cleanse, transform, and standardize your data. And if you aren't, you probably should be. From a product standpoint, self-service data preparation tools represent one of the most remarkable success stories of the big data revolution. Sure, platforms like Hadoop and Spark have removed technological tethers and turbo-charged our capability to store and process massive amounts of data, in an affordable and reliable manner.

The European Union has told social platforms such as Facebook to do something about hate speech. And, yes, this is indeed something — something they're already doing. And does it surprise you to learn that this "code of conduct" is being justified in the name of combating terrorism? In IT Blogwatch, bloggers are ever so glad they won't be subjected to hate speech any longer. Your humble blogwatcher curated these bloggy bits for your entertainment. To read this article in full or to leave a comment, please click here

Information Security has always been a tough job. All defenses have cracks and attackers know more about you than you do about them. Everyone recognizes the challenge is increasing each year. There are three trends driving this situation and causing information security professionals to feel like they have less control than ever before: A rapidly expanding attack surface to defend The continued rise in both attack volume and sophistication.

No longer is simply keeping up to date with the latest tools and systems an appropriate driver for new technology deployments.

The world's top authorities on Apache Hadoop convene at Hadoop Summit San Jose and one of the top questions that will be answered will be around the future and direction of Hadoop. Sanjay Radia – Founder and Architect, Hortonworks lead the track which selected 13 sessions around this topic. I asked Sanjay what he hoped would… The post The Future of Apache Hadoop appeared first on Hortonworks.

CDH 5.7 includes a lot of changes (more than 1,500) to Hue, the Web UI that makes Apache Hadoop easier to use. In this new release, the emphasis on performance and security carries over from 5.5. The overall improvement in the SQL user experience is also considerable. In this post, we'll cover some highlights. New Hive Metastore Interface This app is now on a single page, The post New in CDH 5.7: Improved Performance, Security, and SQL Experience in Hue appeared first on Cloudera Engineering Blog.

SYS-CON Events announced today that IBM Cloud Data Services has been named "Bronze Sponsor" of SYS-CON's 18th Cloud Expo, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. IBM Cloud Data Services offers a portfolio of integrated, best-of-breed cloud data services for developers focused on mobile computing and analytics use cases.

by John Mount Ph. D. Data Scientist at Win-Vector LLC In her series on principal components analysis for regression in R, Win-Vector LLC's Dr. Nina Zumel broke the demonstration down into the… …

Workato CEO Vijay Tella: Rival vendors deliberately downplaying capabilities of cloud-based integration services to protect more expensive platforms.

Data is the Next Currency Make the most of the data you own Data plays a vital role in today's digital economy. Data sources such as customer insights and product performance can help realize additional revenue opportunities and drive performance. TCS Analytics & Insights practice enables organizations to capitalize on opportunities through insightful and precise data analysis.

In today's digital age, there can be a contradiction between the need for more speed and real-time transacting on one hand, and the necessity to mitigate the risks of conducting business on-line. These risks include potential fraud, exposure to unsafe business partners (be they customers, vendors, service providers), failures to comply to industry or financial (more…)

SYS-CON Events announced today that Super Micro Computer, Inc., a global leader in Embedded and IoT solutions, will exhibit at SYS-CON's 18th International Cloud Expo®, which will take place on June 7-9, 2016, at the Javits Center in New York City, NY. Supermicro (NASDAQ: SMCI), the leading innovator in high-performance, high-efficiency server technology, is a premier provider of advanced server Building Block Solutions® for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, HPC and Embedded Systems worldwide.

18th Cloud Expo, taking place June 7-9, 2016, at the Javits Center in New York City, NY, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterprises are using some form of XaaS — software, platform, and infrastructure as a service.

@DevOpsSummit taking place June 7-9, 2016 at Javits Center, New York City, and Nov 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with the 18th International @CloudExpo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world.

By: Eric Siegel, Founder, Predictive Analytics World In anticipation of his upcoming conference presentation, Deep Learning in Cloud Based Applications at Predictive Analytics World Chicago, June 20-23, 2016, we asked Tanay Chowdhury, Associate Data Scientist at Zurich North America, a few questions about his work in predictive analytics.

The future of data centers looks very exciting. Data Centers are turning out to be increasingly important in today's connected world as we can store and process a lot of consumer and commercial data. Data centers seem to be either becoming huge or very small. In general, marketing cloud storage and cloud computing services are likely to increase the profit margins and favor the large players. There is an increasing need for superior Data center competence as we are more dependent on cloud-based applications and huge Data center services.

One important rule in data governance is to take your time to make sure you do everything right. It might call for patience and consume a lot of time but a phase-based approach will do the magic for your business!

Dr. Angel Diaz of IBM co-chaired and co-authored the first XML standard in 1998. Here, he shares what he's learned about open source in the past two decades, and tells us why it's crucial for any company looking to attract and retain top dev talent in today's competitive marketplace.

As the need to pull together the growing volumes of data that's generated and collected by organizations has increased, several types of data integration software have emerged to help IT teams simplify and manage the process. But with so many products to choose from, what's the best approach to selecting the right data integration tool for your enterprise?

As the latest wave of high-profile breaches shows, all the sensitive information law firms handle makes them attractive cyberattack targets. Here's what can happen and what you should do about it.

People often get stuck when they are asked to improve the performance of existing predictive models. What usually they do is try different algorithms and check their results. But often they end up not improving the model. Here are some of the steps you can take to boost your existing models. Add more data:… The post Boosting Performance of Machine Learning Models appeared first on Predictive Analytics Times.

In this special guest feature, Viral Chawda, Managing Director of Data Science & Innovation at KPMG, discusses how data and analytics is fueling the Internet of Things.

Above the Trend Line: machine learning industry rumor central, is a recurring feature of insideBIGDATA. In this column, we present a variety of short time-critical news items such as people movements, funding news, financial results, industry alignments, rumors and general scuttlebutt floating around the big data, data science and machine learning industries including behind-the-scenes anecdotes and curious buzz.

BlueData is making the case for creating a software-defined data center environment that is truly heterogeneous.

As virtualization advancements such as container technologies emerge, PaaS is hitting its stride. While you may have heard of a couple of the PaaS providers on our list, we're betting there are a few more here that aren't on your radar yet. Here's why these seven firms are worth your attention.

I love EMC World (though I can't say the same about Las Vega$). I get an opportunity to talk to customers who are at the dirty and grimy frontline of trying to derive value from all of this Big Data hoopla. They teach me tons! One theme that came up several times in our conversations was the following: "I can't get the Business to engage in an envisioning type of engagement. We have lost their trust. So we are forced to start our Big Data journey from the technology."

Dinah Washington sang "what a difference a day makes" and having lived in London for a year this month, I'm feeling that multiplied by 365! And what a year it has been…. I joined Hortonworks back in 2012 when the company was barely 8 months old and moved to be part of the International team… The post Hortonworks' Customers in Europe Power the Future of Data appeared first on Hortonworks.

Data Security, it's been a hot topic for many years with so many stories of breaches occurring from the small end of town right up to the big end of town. And yet we read of so many security products and approaches out there that if you were lost for several years and came back, you'd think as IT professionals we'd have this security game all done and dusted. Right??…wrong. In 2015 the financial services and retail industry reported a total of 1,165 security incidents of which 441 were breaches.

To ensure records and data are properly managed, companies need to know the differences between data governance, information governance and records management strategies. Keep on reading: Distinguishing the Difference: Data Governance, Information Governance and Records Management

What is the single greatest factor that affects business performance today? Do you think it's new technologies? Maybe it's the economy. Perhaps, it's the labor force, or even government regulations? No, no, and no. The single greatest factor that affects business performance is: the weather.




Modern data science can unlock new innovation in healthcare, bioinformatics, genetic research, and other related fields. New personalized medicine programs, for instance, can identify previously unrecognized disease risk factors by applying analytics to vast amounts of genomic and clinical data. Hospitals can pore through EMR and operational data to pinpoint sources of infection. Public health agencies can use longitudinal population data to more accurately inform policy.

Thanks to mobile technology, wearable devices, social media and the general pervasiveness of the internet, an abundance of new customer information is now available to marketers. This data, if leveraged optimally, can create opportunities for companies to better align their products and services to the fluctuating needs of a demanding market space.

News today from a justifiably proud Promise technology that Apple (yes, THAT Apple) is going to be selling Promise's Apollo "personal cloud appliance" at its stores globally. Any time Apple does anything it's a big deal so it is worth having a bit of background about Promise. The company is a 25-year veteran of the storage industry. Traditionally active in the enterprise arena, it produces its own enterprise storage hardware and vertical offerings tailored to the video, rich media and other industries. It seems the enterprise wasn't quite enough for Promise and hence they've decided to enter the consumer market.

Nvidia has staked a big chunk of its future on supplying powerful graphics chips used for artificial intelligence, so it wasn't a great day for the company when Google announced two weeks ago that it had built its own AI chip for use in its data centers. Google's Tensor Processing Unit, or TPU, was built specifically for deep learning, a branch of A.I. through which software trains itself to get better at deciphering the world around it, so it can recognize objects or understand spoken language, for example.

Few companies take full advantage of all the data that's made available to them, and most business users spend more time gathering data than analyzing it.

Forbes once reported that even the slightest increase of investments in Big Data related projects (~ 10%) improves average net income by $65 million for all typical Fortune 1000 companies. Knowledge is power. Fortunately, the amount of power in this world is immense. All that's left for you to do is to find efficient ways of using that power. A typical business acquires lots of information and is obligated process it correctly. Data volumes skyrocket, hence more analytical power is required. You simply can't afford to lose your grip of adequate analysis, because you'll miss out on profits.

This entry was posted in News and tagged , , , , , . Bookmark the permalink.