SAN FRANCISCO--(BUSINESS WIRE)--H2O announced today the introduction of Sparkling Water, the latest innovation to combine two best-of-breed open source technologies, Apache Spark and H2O. Sparkling Water is the newest application on the Apache Spark in-memory platform, extending machine learning for better predictions and faster deployment of models into production. H2O is proud to partner with Cloudera and Databricks to bring this capability to a wide audience.
"One of the major strengths of Spark is its ability to provide a unified platform for building end-to-end data pipelines, and as such become a natural platform for next generation applications," said Ion Stoica, CEO of Databricks. "We're thrilled to have H2O bring their machine learning know-how to Apache Spark in the form of Sparkling Water, and look forward to more future collaboration."
For data scientists moving between environments, Sparkling Water removes the friction caused by differing data formats and structures. In a typical data science workflow, data parsing, transformation, and variable creation take advantage of Apache Spark, while feature selection, modeling, and scoring can leverage H2O.
“Sparkling Water enables data scientists to take advantage of high fidelity data stored in an enterprise data hub to build sophisticated machine learning applications. By marrying the power of Apache Spark in CDH with H2O, applications can leverage scalable and fast machine learning on Hadoop.” – Jairam Ranganathan, senior director, Product Strategy, Cloudera
As the black box of predictive analytics opens up to a larger community, H2O is laser-focused on quickly scaling cutting-edge machine learning algorithms to the demands of the enterprise to build the next generation of smart applications. With this latest innovation, the Apache Spark community can now apply Deep Learning to solve complex classification problems. Additionally, data scientists may rejoice, as Sparkling Water is supported in the most cutting-edge languages, including R, Python, Scala, and Java. Finally, with Sparkling Water the promise of predictive analytics is realized, as H2O also has a robust REST API and a NanoFast™ Scoring Engine to power smart business applications.
"All of the Internet is going to be rewired with Intelligent Applications. Sparkling Water is the convergence of elegant APIs, fast machine learning and in-memory predictive analytics. A unified user & developer experience for building smarter applications will transform enterprises and accelerate big data adoption," said SriSatish Ambati, CEO and Co-Founder of H2O. "We are excited to team up the Communities of Data Science and Application Developers. Sparkling Water is the middleware for big data."
H2O is for data scientists and business analysts who need scalable and fast machine learning. H2O is an open source predictive analytics platform. Unlike traditional analytics tools, H2O provides a combination of extraordinary math and high performance parallel processing with unrivaled ease of use. H2O speaks the language of data science with support for R, Python, Scala, Java and a robust REST API. Smart business applications are powered by H2O’s NanoFast™ Scoring Engine. Learn more by going to http://www.h2o.ai and contact us for more information.