Adrien Grand

@jpountz

Software engineer at @elastic, Lucene/Solr committer

Caen, France

Joined April 2009

610Following

2KFollowers

Pinned

Adrien Grand Retweeted

Luca Cavanna@lucacavanna · Jun 18

I enjoyed Berlin Buzzwords very much, always a pleasure to catch up with everyone in person. Here is my talk with @jpountz on shipping Lucene 10: youtu.be/GRhzgCEL_ac?si… .

865

Pinned

Adrien Grand@jpountz · Jun 17

Someone asked me for my opinion on the Vespa vs. Elasticsearch performance comparison today at Berlin Buzzwords, so I gave it a try: jpountz.github.io/2025/06/17/ana…

2.0K

Adrien Grand@jpountz · 2 h

I spent some time looking at the Vespa source code to see how it compares with Lucene jpountz.github.io/2025/07/25/mor…

966

Adrien Grand@jpountz · Jul 18

Lucene may soon store HNSW's connections using group varint on deltas between consecutive node IDs: github.com/apache/lucene/…. This seems to give a small but consistent speedup vs. vints which are currently used.

jpountz's tweet card. Description For HNSW Graphs, the alternate encoding I implemented was GroupVarInt encoding, which in theory should be less costly both in space and runtime. The pros of this encoding would be that ...

282

Adrien Grand@jpountz · Jul 12

Awesome read on Lucene's implementation of ACORN-1🔥🔥 Filtered vector search is everywhere! Efficient, general-purpose (predicate-agnostic) indices that can support those use cases are super, super powerful!! Try it out & check out our original paper dl.acm.org/doi/10.1145/36…

DDoug Turnbull@softwaredoug · Apr 14

Elasticsearch / Lucene adopts ACORN-1, which expands the exploration of nodes to ensure enough candidates that meet the filter By @benwtrent elastic.co/search-labs/bl…

5.0K

Adrien Grand@jpountz · Jun 26

Lucene is getting an increasing number of high-quality contributions from ByteDance employees, especially around performance. Good to see that this project keeps attracting contributors from all around the world.

3.0K

Adrien Grand@jpountz · Jun 25

Another common point I did not expect: Vespa's strict vs. unstrict iterators is quite similar to Lucene's two-phase iteration. And both projects use this feature to effectively combine dynamic pruning with filtering (a hard and underappreciated problem IMO).

735

Adrien Grand@jpountz · Jun 16

.@_andreidan kindly captured pictures of @lucacavanna and I telling the story of how the Lucene 10 release went

408

Adrien Grand@jpountz · Jun 16

Via @rcmuir: Linux 6.15 introduced a big speedup for Lucene on AMD processors benchmarks.mikemccandless.com/FilteredOrHigh… (last data point, not annotated yet) thanks to faster TLB invalidation phoronix.com/review/amd-inv…

jpountz's tweet card. Last weekend a Meta engineer posted Linux kernel patches to make use of the AMD INVLPGB instruction for broadcast TLB invalidation.

280

Adrien Grand@jpountz · Jun 6

Lucene is getting faster at deep search by switching to a more efficient heap implementation to collect top hits. github.com/apache/lucene/…

jpountz's tweet card. This tries to encode ScoreDoc#score and ScoreDoc#doc to a comparable long and use a LongHeap instead of HitQueue. This seems to help apparently when i increase topN = 1000 (mikemccand/luceneutil#35...

562

Adrien Grand@jpountz · Jun 4

A nice optimization landed on the hash table that Lucene uses to build inverted indexes: github.com/apache/lucene/…. Some previously unused bits are now used to cache hash codes, effectively making collisions cheaper to resolve.

jpountz's tweet card. Description This PR tries to utilize the unused part of the id to cache the high-order bits of the hashcode to speed up BytesRefHash. I used 1 million 16-byte UUIDs to benchmark this change, and t...

372

Adrien Grand Retweeted

Doug Turnbull@softwaredoug · May 28

Several weeks ago, I put the R-in RAG with @HamelHusain by discussing hybrid search best practices. Next up we put the F(ilter) in HNSW to build hybrid search. Which doesn't quite fit... but @benwtrent and I are not intimidated by such trivialities maven.com/p/430592/hybri…

2.0K

Adrien Grand@jpountz · May 19

There has been a big regression in Lucene's nightly benchmarks recently after a kernel upgrade. @mikemccand and @rcmuir found that it was caused by a change in the Linux scheduler configuration. github.com/apache/lucene/…

jpountz's tweet card. Description I'm seeing a big performance change (mostly regression) on 2025.05.01 benchmark, without an annotation. There are many commits diff for this run, i have not managed to identify but ...

578