[Bessig] Galvanize Denver on Apache Spark, 4/26
Anne Wilson
Anne.Wilson at lasp.colorado.edu
Thu Apr 21 14:06:40 EDT 2016
Hi BESSIG,
I am forwarding information about this Apache Spark event on behalf of Tim Fries.
Anne
From: Timothy P Fries <timfries at us.ibm.com>
Date: Thursday, April 21, 2016 11:57 AM
To: Anne Wilson <Anne.Wilson at lasp.colorado.edu>
Cc: Frank Stein <fstein at us.ibm.com>, Kelly Foreman <foremank at us.ibm.com>
Subject: Re: Do you think CDI members would be interested in learning more about Apache Spark?
Here you are, Anne!
---------------------------------------------------
Hello, BESSIG members, I'm writing to bring a no-cost education event to your attention. This session is focused on open source Apache Spark, which IBM describes as "the operating system for analytics."
If you are looking for a faster and better way to analyze big data, this 3-hour meetup next Tuesday, April 26th, at Galvanize Denver is for you. It is a chance to network with colleagues and understand this game-changing technology.
Spark is a crossover, next-generation analytics technology that can be applied successfully to multiple types of scientific and non-scientific processing while improving throughput and lowering costs versus technologies like MapReduce.
Spark has come a long way since this article (https://www.constellationr.com/content/spark-fire-why-all-hype) on the strengths of Spark was written almost a year ago - this is your chance to experience and discuss the latest Spark technologies.
To illustrate Spark in a scientific example, the authors of this paper (http://www.istc-cc.cmu.edu/publications/papers/2015/adam.pdf) describe ADAM, an example genomics pipeline that leverages the open-source Apache Spark and Parquet systems to achieve a 28x speed-up over current genomics pipelines while reducing cost by 63%. They go on to distill a set of techniques for implementing scientific analyses efficiently using commodity "big data" systems. To demonstrate the generality of their architecture, they implemented a scalable astronomy image processing system that achieves a 2.8x to 8.9x improvement over the state-of-the-art Message Passing Interface (MPI)-based system.
To RSVP: To be confirmed for this no-cost meetup, please register by sending an email to ascdc at us.ibm.com with the following subject line: "Please confirm me for the April 26th Galvanize Denver Spark Meetup Registration, <your name>, <your affiliation>, <your title>".
More information about the meetup and how to register is below. Hope to see you there!
Title: "What is all the Hype about w/ Apache Spark"
Date: April 26th, 6:00 - 9:00pm
Location: Galvanize Denver - 1644 Platte St, Denver, CO 80202
Description: Come learn more about open source Apache Spark. Topics covered include Data Wrangling, Spark SQL, Data Visualization, Jupyter Notebooks, and Machine Learning, all with Apache Spark. See a live demo in a Jupyter notebook that analyzes the effect of weather on professional sports.
IBM's Spark Evangelist Carlo Appugliese (https://www.linkedin.com/in/carloappugliese) and members of his team have presented these topics and others to large Federal audiences in DC recently to rave reviews.
Sponsored by: IBM
Pizza and Refreshments provided by IBM
--------------------------------------------------------
Regards,
Tim Fries
Software Client Architect - SWG US Federal CTO, US Citizen Support Program
The Open Group and IBM Certified IT Architect
Cell: 303-870-9416 (Please note new cell number!)
Internet - timfries at us.ibm.com
AIM and Yahoo IM: timfries943