Big Data News – 08 Nov 2016

Today's Infographic Link: History of The Porsche 911

Featured Article
There's nothing quite like an empty stomach to get the old creative juices flowing, as participants at last week's Babson College Rocket Pitch event explained during their 3-minute pitches before peers, investors and assorted people like myself.

Top Stories
Apache Spark and a Tale of APIs, by Ali Zaidi, Data Scientist at Microsoft. Spark is an exceptionally popular processing engine for distributed data. Dealing with data in distributed storage and programming with concurrent systems often requires learning complicated new paradigms and techniques. Statisticians and data scientists familiar with R are unlikely to have much experience with such systems.

SAP HANA 2 embeds a broader range of analytics, algorithms and application programming interfaces (APIs) into the core in-memory computing platform.

Data science is having its 15 minutes of fame.

The next release of SAP's in-memory HANA database plugs a few gaps for enterprise high availability features. As the in-memory database market commoditizes, SAP differentiates with analytic capabilities for extended data types.

RPA tools are software robots that use business rules logic to execute specifically defined and repeatable functions in the same way that a person would.

SAP wants businesses struggling to keep up with the pace of innovation in its HANA in-memory database to relax as it readies a new version, to be known as HANA 2. Since introducing HANA in 2010, SAP has been releasing updates twice a year, providing customers with new capabilities but also pushing them to keep their software current to benefit from continuing support. The new version gives businesses two reasons to relax, according to Marie Goodell, vice president of product marketing at SAP. HANA 2 is designed to simplify things for the IT department, reducing the effort it takes to keep the lights on so that businesses can spend more time working on new, next-generation applications that take advantage of new types of data, she said. Even if they choose to keep upgrading, that should involve less work going forward.

Facebook users will be able to record smartphone videos that ape the style of famous artworks with a new feature unveiled Tuesday. Using a technique called style transfer, the feature takes live video and turns it into something that resembles the work of Van Gogh, Picasso and other artists. That effect is probably familiar to people who have used the app Prisma, which uses similar techniques to change the look of photos. Prisma's app can't perform live filtering, and some filters require a connection to the internet. Facebook's system can work offline and render live.

In an interview with Salvi Mittal, Dinesh Aggarwal explains how he is leveraging analytics at IndiaMart.

In 2017, edge analytics will find itself in the limelight for companies of all sizes in a wide array of industries. What's driving this trend and how can organizations prepare for the change? Keep on reading: 2017 will be the launchpad for IoT edge analytics

This opinion piece shares insights into U.S./China market dynamics for companies trying to navigate this challenging path. Keep on reading: The IoT demonstrates the interdependence of Chinese and U.S. tech companies

Apple's success with the iPod and iPhone didn't really lead to another digital revolution with the iPad.

Business, from factory floor to training room and the showroom, is likely to be the biggest beneficiary of the fascinating technology of VR and AR.

Cyber crime is no longer a mere nuisance but is quickly becoming a huge problem. Just recently a cyber criminal was charged with wire fraud and computer fraud as he tried to steal more than $1.5 million. Cyber criminals are becoming increasingly brazen as they exploit vulnerabilities in new technologies. According to Robert L. Capers, US Attorney for the Eastern District of New York, cyber criminals roam the Internet for information they can steal.

Do you find yourself increasingly having to make decisions amid uncertain conditions? The advanced capabilities offered by IBM SPSS Statistics aim to make Monte Carlo simulation a part of your risk analysis by bringing these two worlds together in a single software solution.

Voters in Cobb County, Georgia, got more accurate and timely updates on wait times at polling locations open for early voting during this year's US presidential election. Here's how IT made it happen.

A critical mass of enterprises has developed private cloud architectures and is thinking of tying them to public resources to create the hybrid cloud.

"We need to innovate" is a common refrain. And in the world of information, there is often a mistaken belief that innovation today is purely about analytics. However, the reality is that sort of fashionista-based innovation rarely delivers sustainable results. Back in 2010, MIT and Capgemini looked at what it really took for companies to be successful with digital. Their report had a very interesting finding: Companies that take a more governed and managed approach to digital deliver more sustained benefits on their journey. Those that pursue technology- and point-driven approaches deliver less value.

The Infoblox NIOS software update makes it possible to dynamically scale DNS query requests across local appliances and cloud resources.

In some of the more recent literature, discussion has ensued about the use of pure random or noise variables that end up as key variables in predictive models. In our big data environment with millions of records and thousands of variables, intuitively one might think that random or spurious variables might be a normal outcome… The post Are Random Variables a Fact of Life in Predictive Models? appeared first on Predictive Analytics Times.

Since the last blog post announcing the release of YCSB 0.6.0 in Cloudera Labs, users of Cloudera CDH and EDH will have noticed regular updates to the Labs version, keeping it in lockstep with the upstream release. This should help assure users of a consistent and easy mechanism to deploy the current version of YCSB (which at the moment is v.0.10.0 in CLABS) to evaluate the performance of the NoSQL stores employed within their clusters such as HBase… The post YCSB 0.10.0 Now in Cloudera Labs appeared first on Cloudera Engineering Blog.

Containers have become a major fixture of modern application development. Companies are turning to the technology because it helps them more easily create portable, scalable applications that can work in a wide variety of environments, including the public cloud. Microsoft is aiming to further capitalize on that trend with a set of updates announced Monday for its public cloud platform. Azure Container Service, Microsoft's cloud-optimized container-hosting offering, now supports orchestration using the Kubernetes open-source platform. On top of that, Microsoft has upgraded the service to use DC/OS 1.8.4, which brings improvements like built-in job scheduling.

Steve Ballmer, former Microsoft CEO and he who had a penchant for screaming during keynotes, once stated that "Linux is a cancer." Anyone who was around back when Ballmer made that declaration would be pretty surprised by just how far Microsoft has come in its support and acceptance of open source projects as a valid part of the technology industry. From open sourcing many of its own technologies (.NET and Windows Live Writer being but two examples) to jumping in to be a part of newer open source initiatives, Microsoft is an important player in the open source world.

Tech budgets are moving out of IT control and are expected to remain flat, but CIOs are gaining more influence across the broader enterprise, according to CEB Global's 2017 IT Budget Survey.

SAP HCP is now ready to function as both an iPaaS environment and an aPaaS environment.

Spread betting company Sporting Index says more bets were placed on Clinton in the past 24 hours than in all of October. Here's a look at how the numbers have been rolling in.

Attacks show that the IoT problem has reached a point where we do have to worry about larger implications.

When RStudio first came on the scene in February 2011, there wasn't much in the way of tools for developers using the R language. The R GUI on Windows and Mac had a basic code-editing window, and there were a couple of menu-based GUIs like Rattle and R Commander that were aimed more at those who would prefer not to program in the R language. But at the time, the only real integrated development environments (IDEs) for R were ESS for Emacs and the Revolution R IDE (which has since been superseded by R Tools for Visual Studio). While RStudio has been an enormously useful IDE for R since day 1, it's officially been in "beta" status all of this time.

IT transitions will accelerate in 2017 as CIOs look to deliver business outcomes, develop technologies that serve customers, and embrace new technologies such as AI and AR, according to Forrester Research. Here's a look at the research firm's predictions for the new year.

The three presidential debates during the 2016 election year were of historical importance. This election campaign is likely to shape the political discourse for the country by unprecedented proportions. The stakes are high and candidates unique. For the very first time in US history, a major party nominated a woman as its presidential candidate. A woman… The post Using Analytics to Explain the Presidential Debates appeared first on Predictive Analytics Times.

Evelyn joined researchers at Nanyang Technological University in Singapore on 27-28 October to discuss the issues, challenges and meanings of Big Data in Asian societies. She delivered a keynote on 'Data Politics' and participated in discussions with researchers from India, Taiwan, Hong Kong, Singapore, and the United States.

Data virtualization capabilities continue to expand. Here's how it will be leveraged in 2017, including its role in streaming technologies, and what developments are anticipated. Keep on reading: What lies in store for data virtualization in 2017?

Data and analytics are increasingly becoming central to business decision-making, especially in areas such as driving customer growth, improving productivity and managing risk. But even as organizations push to make their decision-making more data-driven, business leaders accustomed to making decisions based on gut instinct and experience are having trouble trusting insights from data and analytics (D&A). Forrester Consulting, commissioned by the Data and Analytics Global team at professional services firm KPMG, recently surveyed 2,165 data and analytics decision-makers from a range of industries in Australia, Brazil, Canada, China, France, Germany, India, the U.K. and the U.S.

Cloud computing is throwing more variables into network performance, but it's also helping to make network management smarter. Nyansa, a startup that launched earlier this year with a cloud-based service to pinpoint the sources of enterprise network problems, is now adding the ability to analyze how individual applications are performing. The company's original Voyance service uses a software crawler to capture data about how traffic is flowing over the network and how each piece of infrastructure — such as a router or a Wi-Fi access point — affects that performance.

With 2017 fast approaching, technology trends that will keep gathering steam in the new year range from augmented and virtual reality to machine intelligence, Docker, and microservices, according to technology consulting firm ThoughtWorks. In its semiannual Technology Radar report published Monday, ThoughtWorks calls out four IT themes growing in prominence: virtual reality (VR) and its cousin, augmented reality (AR); Docker as process, PaaS as machine, microservices architecture as programming model; intelligent empowerment; and the holistic effect of team structure. The data is based on reports ThoughtWorks' consultants are seeing out in the field.

In an interview with ETCIO.COM, Nilesh Sangoi, CTO, Meru Cabs shared the technology roadmap of the taxi operator company.

The startup has developed a data analytics platform for oncology, Parsight, that aggregates information on cancer patients for research.

By Ate Poorthuis. In our most recent special issue, Agnieszka Leszczynski and Jeremy Crampton have drawn together an engaging series of articles on Spatial Big Data and Everyday Life. The theme explores what it means to encounter, experience and study the spatial dimension of big data. The issue looks at what it actually means to explicitly think about spatial big data and then examines what the effects of this type of data are on the everyday lives of people. In their introductory article, Leszczynski and Crampton make clear that 'spatial' should be thought of as more than just the geographical reference itself (e.g. coordinates or addresses) that plays such a dominant role in (geographic) academic research.

Is the database world's hostility between old and new, SQL and NoSQL, incumbent and startup finally subsiding? Signs point to yes, and that points to good things for the industry.

I've never really been a sports fan before, but living in Chicago the enthusiasm for the Cubs is infectious, and so I've been a big fan this season. I'm also a huge Eddie Vedder fan, and seeing Pearl Jam play at Wrigley Field was also an amazing experience. So with the World Series win, I'm just going to leave this here. Congratulations Cubs and Cubs fans, and have a great weekend all! We'll be back on Monday.

This vendor-written tech primer has been edited by Network World to eliminate product promotion, but readers should note it will likely favor the submitter's approach. In the quest for securing the cloud, one key aspect is often left out of the discussion: the security impact of the cloud endpoint — most notably the imperiled browser. As enterprises and individuals increasingly move computing to the cloud, security at the endpoint has been an escalating concern. Taking matters into their own hands, many enterprise consumers are going "direct to cloud" — avoiding enterprise IT practices that would otherwise protect endpoints, connectivity and data. Meanwhile, IT executives that once viewed cloud-based shared computing and storage infrastructure as their least trustworthy option now see the cloud as the safest choice.

Ransomware is a legitimate threat, with estimates showing that over 4,000 of these attacks have occurred every day since the beginning of the year.

CenturyLink will hold a $150 million stake in the entity formed by BC Partners and Medina Capital.

The same week that NetApp officials unveil new SaaS storage offerings for hybrid cloud environments, they say they will lay off 6 percent of workers.

Although vendor-written, this contributed piece does not promote a product or service and has been edited and approved by Network World editors. When I talk to data practitioners, something that comes up quite often is how their work with data has become an exact science. But I've heard repeatedly that inspiring non-technical staff to understand what to do with data insights can be a slow and painful process.

Spiceworks turned its attention to networking issues that are getting more complex in the age of the cloud, adding several free tools.

Trifacta is known for doing one thing, and doing it well: data wrangling. Because of this, the company has an informed, data-driven view on the big data and not-so-big data market. Trifacta's insights have driven its latest product release, but are also relevant to draw a big picture of big data.

In a recent report, Gartner looks at the growing field of disaster recovery as a service and examines how this cloud-based approach differs from the more traditional approach to DR.
