Posts List

Apache Solr Webcamp talk slides and links

This week, on april 26th, I gave a talk about Solr basics on WebCamp Ljubljana. Here I’m listing the slides, tips and the relevant links for anyone starting up with Solr. Here I’m listing relevant links from the slide-deck which are good starting point for Solr deployment.

Project spotlight: PySolarized Solr library

There are numerous Python/Solr libraries out there, each having a different subset of functionality. Obviously, as per Murphy’s law, none of them had a set of features I required. So I rolled my own - PySolarized! I wrote PySolarized because I needed a Solr connector which would dispatch and query documents to multiple cores.

Solr slovenian lemmatizer updated with easier installation

I’ve just uploaded 1.1 update for Lemmagen lemmatizer for Solr, which is now a pure Java .JAR library and does not require installation of any additional files on your server. New version also updates package name and configuration attribute to be more consistent.

Slovene lemmatization in Solr

Apache Solr is a popular full-text search engine with RESTful interface, which makes it perfect search engine with most type of web sites. However, the quality of search results is dependent on language filters, with a good lemmatizer being the most essential. That’s why I’ve created a Solr module, which uses JSIs LemmaGen lemmatizer for Solr index building and search queries. The source to the lemmatizer and module is available on Bitbucket slovene_lemmatizer repository.