[Esip-discovery] barebones search solution needed

Mattmann, Chris A (388J) chris.a.mattmann at jpl.nasa.gov
Sat Nov 10 16:23:21 EST 2012

Hi Greg,

I don't know about the relational database part, but I don't know how to set up a search faster than simply
crawling it with Apache Nutch: http://nutch.apache.org/.

Yes, I'm biased, but I think it's a great way of making your data searchable. However, Nutch is really good for
data when it's available via URLs, and your URLs will be database oriented. One thing you could consider also
is using the Apache Solr (http://lucene.apache.org/solr/) Data Import Handler (DIH), and then importing your 
data into Solr and then searching it that way. Solr doesn't provide an open search handler out of the box, but
you could always write your own response writer for it. Solr *does* however provide geospatial search out of
the box and you could try that way.

Another option is to look at the Quad Tree available in Apache SIS (http://sis.apache.org/).

Yes I'm involved in all of these projects, and yes I believe in all of them (and more so the communities behind 


On Nov 10, 2012, at 11:04 AM, Greg Janée wrote:

> Hi all, I'm new to this list so apologies if this has been answered before.  I have a legacy geospatial dataset of ~15K granules that needs to be online and searchable.  "Searchable" here means searchable by a handful of criteria, including spatial search.  The kicker is that there is zero funding for maintaining this dataset.  And the usage is extremely low (but not non-zero), meaning it's hard to justify spending any money or time on the dataset, but at the same time the data can't just disappear.  So the successful solution is something that will not only get the job done, but is so simple that it will require virtually no maintenance going forward, and any maintenance that is required can be covered by IT folks who aren't necessarily data specialists.
> I already have a relational database with the necessary metadata.  Are there simple opensearch packages that I can put on top?  Or, even if I build my own opensearch service (in my spare time, naturally ;) ) , are there opensearch clients that support graphical spatial search?
> (I almost feel like I should post this to the preservation & stewardship list, as this seems like such a common pattern: "I have some data to preserve, but no resources...")
> Thanks in advance,
> -Greg
> +--------------------
> | Greg Janée -- gjanee at eri.ucsb.edu
> | Earth Research Institute
> | University of California at Santa Barbara
> | Santa Barbara, CA 93106-3060
> | +1 (805) 893-8453
> _______________________________________________
> Esip-discovery mailing list
> Esip-discovery at lists.esipfed.org
> http://www.lists.esipfed.org/mailman/listinfo/esip-discovery

Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann at nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
Phone: +1 (818) 354-8810
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA

More information about the Esip-discovery mailing list