IBM launches cloud development environment for Spark

New system will enable near real-time, high-performance analytics, says Big Blue’s Bob Picciano.

Jun 09 162 min read

Contributing writer, Information Management

IBM has launched a development environment for Apache Spark, which it’s delivering in the cloud for rapid adoption.

The company says it’s the first cloud-based development environment for near real-time, high performance analytics, and it will give data scientists the ability to access and ingest data and deliver models to developers.

Available on the IBM Cloud Bluemix platform, the Data Science Experience provides 250 curated data sets, open source tools and a collaborative workspace designed to help data scientists uncover and share meaningful insights with developers. That will make it easier to quickly develop applications.

Building on its $300 million investment in developing Apache Spark as a type of “analytics operating system,” IBM says it created the Data Science Experience to extend the speed and agility of Spark to more than 2 million members of the R community through new contributions to SparkR, SparkSQL and Apache SparkML.

As a result, data scientists who work in R will have faster access to more data and more insights delivered from the IBM Cloud.

The Data Science Experience’s open and collaborative environment will enabled data scientists to accelerate and simplify data ingestion, curation and analysis by bringing together the content, data, models and open source resources from IBM and others, the company says.

“With Apache Spark, we see an opportunity to significantly transform the role of the data scientist by providing access to curated data sets, open source tools and a collaborative platform to accelerate innovation,” said Bob Picciano, senior vice president of IBM Analytics.

More for you

Loading data for hdm_tax_topic #better-outcomes...