摘要: 3.4.1 D'Amore and Mah Initial information retrieval research focused on n-grams as presented in[D'Amore and Mah, 1985]. The motivation behind their work was the fact thatit is difficult to develop mathematical models for terms since the potential fora term that has not been seen before is infinite. With n-grams, only a fixednumber of n-grams can exist for a given value of n. A mathematical modelwas developed to estimate the noise in indexing and to determine appropriatedocument similarity measures. D'Amore and Mah's method replaces terms with n-grams in the vector spacemodel. The only remaining issue is computing the weights for each n-gram.Instead of simply using n-gram frequencies, a scaling method&nbs ... 目录: 1. INTRODUCTION 2. RETRIEVAL STRATEGIES 2.1 Vector Space Model 2.2 Probabilistic Retrieval Strategies 2.3 Language Models 2.4 Inference Networks 2.5 Extended Boolean Retrieval 2.6 Latent Semantic Indexing 2.7 Neural Networks 2.8 Genetic Algorithms 2.9 Fuzzy Set Retrieval 2.10 Summary 2.11 Exercises 3. RETRIEVAL UTILITIES 3.1 Relevance Feedback 3.2 Clustering 3.3 Passage-based Retrieval
以下为对购买帮助不大的评价