[ESIP-all] NSF data science webinar series | Data Commons | Semantic web

Brian Wee bwee at neoninc.org
Thu Feb 18 09:36:55 EST 2016

Hi, this webinar<http://www.nsf.gov/events/event_summ.jsp?cntn_id=137718&WT.mc_id=USNSF_13&WT.mc_ev=click> on 2016-03-03, 1230hrs EST, will probably be of interest to the ESIP community.  From the bio, the speaker "is the creator of widely used web standards such as RSS, RDF and Schema.org. He is also responsible for products such as Google Custom Search."

Replicated from: http://www.nsf.gov/events/event_summ.jsp?cntn_id=137718&WT.mc_id=USNSF_13&WT.mc_ev=click

Empirical Modeling of Complex Systems

Data Science Webinar Series - Ramanathan Guha - March 3, 2016 - 12:30pm- Room 110
March 3, 2016 12:30 PM  to
March 3, 2016 1:30 PM
NSF Room 110
Engineering is about building models of phenomena. Traditionally, these models are built  using 'foundational' equations such as those of motion, continuum mechanics and electromagnetism, that capture the core causal relationships of the domain. Unfortunately, we do not have such equations for heterogeneous complex systems that we find in biological, environmental and behavioral sciences. Recently, exploiting large amounts of data and compute resources, we have started using machine learning to build empirical models of such systems. This technique is behind the success of many widely used products such as Google search and advertising. However, a number of obstacles need to be overcome before empirical modeling  becomes more widespread. In this talk, we discuss two of these problems along with their possible solutions.

Data drives empirical modeling and in order to get an adequate data set, we often need to merge data from different sources. Aligning schemas and resolving references to entities that appear in different sets with ambiguous names is expensive and error prone. In this talk, we look at how human communication deals with similar issues to show how these techniques may be adapted to allow very large scale data sharing.

High infrastructure setup costs have severely restricted the number of researchers experimenting with large datasets. We present the concept of Data Commons, a cloud offering that aggregates multiple datasets and makes them available to users of the cloud. In this model, data is part of the cloud infrastructure, like storage or networking. We discuss the potential impact of Data Commons and report on first steps.

Guha is the creator of widely used web standards such as RSS, RDF and Schema.org. He is also responsible for products such as Google Custom Search. He was a co-founder of Epinions.com and Alpiri.  Until recently, he was a Google Fellow and a vice president in research at Google. He has a Ph.D. in computer science from Stanford University and B.Tech in mechanical engineering from IIT Chennai.
To Join the Webinar:

Please register at:

by 11:59pm EST on Wednesday, March 2, 2016.

After your registration is accepted, you will receive an email with a URL to join the meeting. Please be sure to join a few minutes before the start of the webinar. This system does not establish a voice connection on your computer; instead, your acceptance message will have a toll-free phone number that you will be prompted to call after joining. If you are international, please email kgeary at nsf.gov to obtain the appropriate dial in number.  Please note that this registration is a manual process; therefore, do not expect an immediate acceptance. In the event the number of requests exceeds the capacity, some requests may have to be denied.

This event is part of Webinars/Webcasts<http://www.nsf.gov/events/event_group.jsp?group_id=20018&org=NSF>.
Meeting Type
Lida Beninson, (703) 292-9262, lidbenin at nsf.gov<mailto:lidbenin at nsf.gov>
NSF Related Organizations
Directorate for Computer & Information Science & Engineering

Brian Wee, Ph.D.<http://orcid.org/0000-0002-0038-9381>
Strategic Alliances
National Ecological Observatory Network (NEON), Inc.<http://www.neonscience.org/>
c/o Smithsonian Institution
1100 Jefferson Drive SW, Suite 3123, MRC 705
Washington, D. C. 20560-0001
Phone: 202-370-7891<tel:202-370-7891>
Fax: 202-204-0128<tel:202-204-0128>
Skype: bwee.in.dc
ORCID: 0000-0002-0038-9381

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.lists.esipfed.org/mailman/private/esip-all/attachments/20160218/ce8e2ced/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.jpg
Type: image/jpeg
Size: 559 bytes
Desc: image001.jpg
URL: <http://www.lists.esipfed.org/mailman/private/esip-all/attachments/20160218/ce8e2ced/attachment-0001.jpg>

More information about the ESIP-all mailing list