NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.
All the Perl that's Practical to Extract and Report
Stories, comments, journals, and other submissions on use Perl; are Copyright 1998-2006, their respective owners.
Swish-e (Score:1)
Re:Swish-e (Score:2)
Re:Swish-e (Score:1)
swish-e -S prog.Re:Swish-e (Score:2)
Last time I used Swish-E I was indexing files, but they included things like MS Word and PDF documents so we used an indexing script to filter the files through X_to_text programs and feed the results to the indexer. There's no reason why your indexing script couldn't get its data from DBI or similar rather than files.
The other thing I liked about the Swish-E indexing process was that you could feed arbitrary metadata fields to the indexer. This allowed you to get things like author name, publication da
Re:Swish-e (Score:2)
Re:Swish-e (Score:1)
Is Swish-E working for you with Unicode? We've found it unsatisfactory once non-Ascii characters are being used in the data being searched (and the search terms and results).
Smylers
lucene-ws.net and KinoSearch (Score:1)
We liked what Lucene had to offer, but Plucene left much to be desired. So, we ended up creating a java servlet so we could use Lucene proper as a web service ( lucene-ws.net [lucene-ws.net]).
There's a Perl client in the SVN repository, though it requires an as-yet-unreleased version of WWW::OpenSearch. Indexing is a bit slow mostly due to the HTTP overhead, but searching is pretty slick and it now includes search suggestions.
We'd like to replace it, eventually, with something more native to Perl. KinoSearch [rectangular.com] is relatively
HyperEstraier seems very interesting! (Score:1)
I started using search engines with swish-e (which I still use quite a bit), but threre is also another very interesting project: KinoSearch [cpan.org] which looks very promising full control from perl is required (it somewhat reminds me of WAIT which powered CPAN).