Apache Jackrabbit Oak is the next generation content repository based on the JCR specification, designed to be scalable for high read/write throughput, huge number of nodes in the repository and highly concurrent operations. In this presentation Tommaso Teofili will describe the flexible and pluggable search architecture of Oak which allows to define multiple indices to address specific types of queries with specific constraints for performant indexing and searching. A deeper focus on the Apache Lucene and Apache Solr based index implementations will be given, showing some insights on how they have been integrated to address hierarchical content search together with some performance benchmarks and real life use cases.
Open source enthusiast and member at the Apache Software Foundation, working as a software engineer for Adobe Systems on data replication and search. Passionate about natural language processing and machine learning.
Wednesday November 19, 2014 11:40am - 12:30pm CET
Elod/Ond