Compact query term selection using topically related text.
Read this in "about 1 minute".
Introduction
This paper illustrate a method called PhRank, selecting terms from retrieved documents by a query.The effects follows:
Methods
Graph construction Input:query Q = {w1,…wn}
N = {d0 , ….dk } ,Q+the top k documents retrieved
It considers three relationships between terms: co-occurrence, stemming and association.
Here is an example:
Edge weights p(dk|Q) : the probability of the document in which the stems i and j co-occur given Q cijW2 and cijW10: the counts of stem co-occurrence in windows of size 2 and 10inN λ is set to 0.6
data:image/s3,"s3://crabby-images/2f484/2f484c741f9c9b50a93b1721b92199a0973466d5" alt="1806238.png 1806238.png"
- Random Walk The probability hij = lij if vi and vj are connected, and hij = 0 otherwise. πjt be the affinity score associated with vj at time t.
data:image/s3,"s3://crabby-images/bf3e3/bf3e375a7054975ffb3ad4b457d65acc4a8bacd0" alt="1806239.png 1806239.png"
- Vertex weights wnfavg is the frequency of a word wn in N.
data:image/s3,"s3://crabby-images/2a96d/2a96dc59888dbc17c120fb18aeb62f509f6ed2f6" alt="18062310.png 18062310.png"
- Term ranking
data:image/s3,"s3://crabby-images/10d0d/10d0da914308d9b063078dde21f79b25bdebb96f" alt="18062311.png 18062311.png"
Comparison
data:image/s3,"s3://crabby-images/4cb61/4cb611e1e5e40bb82df6ac8f814a178cfe3ec89e" alt="18062312.png 18062312.png"
The End!
Reference: K.Tamsin Maxwell, W.Bruce Croft:SIGIR 2013: 583-592.Compact query term selection using topically related text.