[Esip-discovery] barebones search solution needed

Mattmann, Chris A (388J) chris.a.mattmann at jpl.nasa.gov
Wed Nov 14 07:02:24 EST 2012


Hey Chris,

On Nov 14, 2012, at 6:50 AM, Lynnes, Christopher S. (GSFC-6102) wrote:

> 
> On Nov 10, 2012, at 4:23 PM, "Mattmann, Chris A (388J)" <chris.a.mattmann at jpl.nasa.gov> wrote:
> 
>> Hi Greg,
>> 
>> I don't know about the relational database part, but I don't know how to set up a search faster than simply
>> crawling it with Apache Nutch: http://nutch.apache.org/.
> 
> Does Nutch offer an OpenSearch interface, i.e., is its (presumably) REST interface simple enough one could author an OpenSearch Description Document for it?

Yep sure does:

http://svn.apache.org/repos/asf/nutch/tags/release-1.2/src/java/org/apache/nutch/searcher/OpenSearchServlet.java

Note in newer versions of Nutch (1.3+) I believe this functionality may have gone away but it shouldn't be too hard to 
build upon the code above to provide such a capability.

Cheers,
Chris

> 
>> 
>> Yes, I'm biased, but I think it's a great way of making your data searchable. However, Nutch is really good for
>> data when it's available via URLs, and your URLs will be database oriented. One thing you could consider also
>> is using the Apache Solr (http://lucene.apache.org/solr/) Data Import Handler (DIH), and then importing your
>> data into Solr and then searching it that way. Solr doesn't provide an open search handler out of the box, but
>> you could always write your own response writer for it. Solr *does* however provide geospatial search out of
>> the box and you could try that way.
>> 
>> Another option is to look at the Quad Tree available in Apache SIS (http://sis.apache.org/).
>> 
>> Yes I'm involved in all of these projects, and yes I believe in all of them (and more so the communities behind
>> them).
>> 
>> Cheers,
>> Chris
>> 
>> On Nov 10, 2012, at 11:04 AM, Greg Janée wrote:
>> 
>>> Hi all, I'm new to this list so apologies if this has been answered before.  I have a legacy geospatial dataset of ~15K granules that needs to be online and searchable.  "Searchable" here means searchable by a handful of criteria, including spatial search.  The kicker is that there is zero funding for maintaining this dataset.  And the usage is extremely low (but not non-zero), meaning it's hard to justify spending any money or time on the dataset, but at the same time the data can't just disappear.  So the successful solution is something that will not only get the job done, but is so simple that it will require virtually no maintenance going forward, and any maintenance that is required can be covered by IT folks who aren't necessarily data specialists.
>>> 
>>> I already have a relational database with the necessary metadata.  Are there simple opensearch packages that I can put on top?  Or, even if I build my own opensearch service (in my spare time, naturally ;) ) , are there opensearch clients that support graphical spatial search?
>>> 
>>> (I almost feel like I should post this to the preservation & stewardship list, as this seems like such a common pattern: "I have some data to preserve, but no resources...")
>>> 
>>> Thanks in advance,
>>> -Greg
>>> 
>>> 
>>> 
>>> 
>>> +--------------------
>>> | Greg Janée -- gjanee at eri.ucsb.edu
>>> | Earth Research Institute
>>> | University of California at Santa Barbara
>>> | Santa Barbara, CA 93106-3060
>>> | +1 (805) 893-8453
>>> 
>>> _______________________________________________
>>> Esip-discovery mailing list
>>> Esip-discovery at lists.esipfed.org
>>> http://www.lists.esipfed.org/mailman/listinfo/esip-discovery
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann at nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> Phone: +1 (818) 354-8810
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
>> _______________________________________________
>> Esip-discovery mailing list
>> Esip-discovery at lists.esipfed.org
>> http://www.lists.esipfed.org/mailman/listinfo/esip-discovery
> 
> --
> Dr. Christopher Lynnes, NASA/GSFC, ph: 301-614-5185
> 
> 
> 



More information about the Esip-discovery mailing list