Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Exploiting Concurrency to Lucene Indexing

Simon Willnauer
May 17, 2011
150

Exploiting Concurrency to Lucene Indexing

A lightning talk about Documents Writer Per Thread the new Lucene 4 IndexWriter internals given at Lucene Revolution 2011

Simon Willnauer

May 17, 2011
Tweet

Transcript

  1. Simon Willnauer @ Lucene Revolution 2011 PMC Member & Core

    Comitter Apache Lucene [email protected] / [email protected] Exploiting Concurrency to Lucene Indexing
  2. IndexWriter in 3.x 2 d d d d d do

    d d d d d do d d d d d do d d d d d do d d d d d do Thread State DocumentsWriter IndexWriter Thread State Thread State Thread State Thread State do do do do do doc merge segments in memory Flush to Disk Merge on flush Multi-Threaded Single-Threaded Directory
  3. Lucene 4 with DocumentsWriterPerThread 4 d d d d d

    do d d d d d do d d d d d do d d d d d do d d d d d do DWPT DocumentsWriter IndexWriter DWPT DWPT DWPT DWPT Flush to Disk Multi-Threaded Directory