Compare elastic with Okapi BM25
Read this in "about 1 minute".
Okapi BM25
data:image/s3,"s3://crabby-images/18e90/18e903b94cef7b53c814c33f57926d49a87a163d" alt="1806232.png 1806232.png"
data:image/s3,"s3://crabby-images/f6450/f64502cec5b24cd4a429facf5c26854b1e41a745" alt="1806233.png 1806233.png"
N — Size of the judged sample
ni — Number of documents in the judged sample containing ti
R — Relevant set size (i.e., number of documents judged relevant)
ri — Number of judged relevant docs containing ti
When there is relevance feedback, it is added to the w.
Okapi BM25F
BM25F considers weighted streams for structured documents.
data:image/s3,"s3://crabby-images/74076/7407657b25ed0b9e91a5566b43f59a5cecf9073f" alt="1806234.png 1806234.png"
Compared with Elastic
data:image/s3,"s3://crabby-images/63a20/63a209df6cf63aa792419224ebc44f20fdc66e89" alt="1806235.png 1806235.png"
Improvements
Considering the theory above, relevance feedback and stream weights can be added to the practical elastic scoring function.
The End!
Reference: Stephen Robertson,Hugo Zaragoza.The Probabilistic Relevance Framework: BM25 and Beyond