[ESIP-all] AGU Session IN016: Data Prospecting, Exploration and Mining – "Big Data" Exploitation Challenges and Applications in Earth Science

Glenn Rutledge glenn.rutledge at noaa.gov
Thu Aug 2 13:31:04 EDT 2012


Sorry for any cross postings.

Please consider submitting an abstract to what is sure to be a dynamic and
lively session! The deadline for abstracts is August 8th.

*Session Overview:*
IN016: Data Prospecting, Exploration and Mining – "Big Data" Exploitation
Challenges and Applications in Earth Science

There are typically two types of data analysis: exploration and mining.
Data exploration focuses on manual methods, whereas data mining uses
automated algorithms to extract useful information. A new approach for
exploiting "big data" is now possible with the availability of
high-performance computing and the advent of new techniques for efficient
distributed file access. This new approach, termed "data prospecting,"
combines methods from both data exploration and mining.

*This session invites talks focusing on applications, tools and challenges
of exploiting "big data" using different approaches*

*Link:*
http://fallmeeting.agu.org/2012/session-search/single/data-prospecting-exploration-and-mining-big-data-exploitation-challenges-and-applications-in-earth-science/

*Session Details:*
Data Prospecting, Exploration and Mining – “Big Data” Exploitation
Challenges and Applications in Earth Science

There are typically two categories of data analysis, namely, data
exploration and data mining. Data exploration focuses on manual methods
brought to bear on data analysis, such as standard statistical analysis and
visualization. Data exploration usually requires small datasets. Data
mining, on the other hand, is defined as "the nontrivial extraction of
implicit, previously unknown, and potentially useful information from data"
(Fayyad et al., 2008). Data mining uses automated algorithms to extract
useful information. Humans guide these automated algorithms and specify
algorithm parameters (training samples, clustering size, etc.). Large
datasets typically require data mining.

A new approach for exploiting "big data" is now possible with the
availability of high-performance computing and the advent of new techniques
for efficient distributed file access. This new approach, termed “data
prospecting,” combines methods from both data exploration and mining. Just
as prospecting focuses on locating a site within a vast land and
determining the type of deposit located at that site, data prospecting
focuses on finding the right subset of data among all the data files and
determining the value of the information contained within that subset.
Papers on Web-initiated, high-volume, computationally intensive data
analysis capabilities that extract information at the source from
distributed petascale data archives are also being sought. Such papers may
include the use of non-linear dynamics to search for signals within climate
archives.

This session invites talks focusing on applications and challenges of
exploiting “big data” using different data exploration, prospecting and
mining approaches. Talks on tools addressing any of these topics are also
welcome.

Name/Contacts - Rahul Ramachandran (rahul.ramachandran at uah.edu), Sara
Graves, Glenn K. Rutledge (glenn.rutledge at noaa.gov), and Kwo-sen Kuo

-- 
Glenn K. Rutledge
Meteorologist/Physical Scientist
NOMADS Team Leader
National Climatic Data Center
Asheville, NC 28801
(828) 271-4097
nomads.ncdc.noaa.gov