Bag-of-words representation of text: measure of document similarityReturning to the bag-of-words example, we can use the notion of angle to measure how two different documents are close to each other. Given two documents, and a pre-defined list of words appearing in the documents (the dictionary), we can compute the vectors of frequencies See also: |