Dated: 31st July, 2013
Jaypee Institute of Information Technology, Noida Sector-128 Department of CSE & IT Information Retrieval and Data Mining (10B1NCI736)
Assignment-I
Q.1 The following list of Rs and Ns represents relevant (R) and nonrelevant (N) returned documents in a ranked list of 20 documents retrieved in response to a query from a collection of 10,000 documents. The top of the ranked list (the document the system thinks is most likely to be relevant) is on the left of the list. This list shows 6 relevant documents. Assume that there are 8 relevant documents in total in the collection. R R N NNNNN R N R N NN R N NNN R a. What is the precision of the system on the top 20? b. What is the F1 on the top 20? c. What is the un-interpolated precision of the system at 25% recall? d. What is the interpolated precision at 33% recall? e. Assume that these 20 documents are the complete result set of the system. What is the MAP for the query? Assume, now, instead, that the system returned the entire 10,000 documents in a ranked list, and these are the first 20 results returned. f. What is the largest possible MAP that this system could have? g. What is the smallest possible MAP that this system could have? h. In a set of experiments, only the top 20 results are evaluated by hand. The result in (e) is used to approximate the range (f)(g). For this example, how large (in absolute terms) can the error for the MAP be by calculating (e) instead of (f) and (g) for this query? (Answer: (a.) Precision = 0.3, (b.) F-measure = 0.43 (c.) 1, 2/3, 2/4, 2/5, 2/6, 2/7, (d.) 4/11 = 0.364 (e.) MAP=0.555 (f.) MAP largest =0.503 MAP largest = 0.417 (h) Ehe error is in [ 0.052, 0.138].
Q.2 Below is a table showing how two human judges rated the relevance of a set of 12 documents to a particular information need (0 = nonrelevant, 1 = relevant). Let us assume that youve written an IR system that for this query returns the set of documents{4, 5, 6, 7, 8}.
docID 1 2 3 4 5 6 7 8 9 10 11 12
Judge1 0 0 1 1 1 1 1 1 0 0 0 0
Judge2 0 0 1 1 0 0 0 0 1 1 1 1
a. Calculate the kappa measure between the two judges. b. Calculate precision, recall, and F1 of your system if a document is considered relevant only if the two judges agree. c. Calculate precision, recall, and F1 of your system if a document is considered relevant if either judge thinks it is relevant. (Answer: (a.) Kappa = -1/3 (b.) P =0.2, R= 0.5 , and F-measure(F1) = 0.286 (c.) P= 1, R=0.5, and F1=0.667) Q.3 Consider the following two queries (Query 1 and Query 2). Calculate the MAP for these results.
(Answer: MAP= 0.594)
Q.4 Consider the table of term frequencies for 3 documents denoted Doc1, Doc2, Doc3 in Figure 6.9. Compute the tf-idf weights for the terms car, auto, insurance, best, for each document, using the idf values from Figure 6.8
(Answer: car auto insurance best Doc1 44.55 6.24 0 21 Doc2 6.6 68.64 53.46 0 Doc3 39.6 0 46.98 25.5
Q. 5 How does the base of the logarithm in computation of IDF affect the score calculation in Q. 4? How does the base of the logarithm affect the relative scores of two documents on a given query? Q. 6 Use the the tf-idf weights computed in Q. 4. Compute the Euclidean normalized document vectors for each of the documents, where each vector has four components, one for each of the four terms. (Answer: Doc1 = [0.8974, 0.1257, 0, 0.4230], Doc2 =[0.0756, 0.7867, 0.6127, 0], and Doc3= [0.5953, 0, 0.7062, 0.3833] )
Q. 7 Use the term weights as computed in Q. 6, rank the three documents by computed score for the query car insurance, for each of the following cases of term weighting in the query: (a.) The weight of a term is 1 if present in the query, 0 otherwise. (b. ) Euclidean normalized idf. (Answer: (a.) Ranking is Doc3, Doc1, Doc2 as score (q, doc1) =0.8974, score (q, doc2) =0.6883, score (q, doc3) =1.3015 (b.) Ranking is Doc2, Doc3, Doc1 as score (q, doc1) =0.6883, score (q, doc2) =0.7975, score (q, doc3) =0.7823) Q. 8 Compute the vector space similarity between the query digital cameras and the document digital cameras and video cameras by filling out the empty columns in Table 6.1. Assume N = 10,000,000, logarithmic term weighting (wf columns) for query and document, idf weighting for the query only and cosine normalization for the document only. Treat and as a stop word. Enter term counts in the tf columns. What is the final similarity score? Table 6.1
(Answer: Similarity score is 3.12) Q.9 Refer to the tf and idf values for four terms and three documents in Q. 4. Compute the two top scoring documents on the query best car insurance for each of the following weighing schemes: (i) ntc.atc. (ii.) lnc.ltn (iii.) lnc.ltc (Answer: Top two scoring documents are doc3, doc1 as score (q, doc1) =0.762, score (q, doc2) =0.657, score (q, doc3) =0.916 )
Note: Last date of submission is 30th August, 2013 by 5:00 p.m.)