[prev in list] [next in list] [prev in thread] [next in thread]
List: r-help
Subject: [R] cosine similarity tf-idf
From: "Indhira, Anusha" <Anusha.Indhira () controlsdata ! com>
Date: 2016-10-28 10:21:08
Message-ID: CE1899A9C6A8D64099DFB344ECF0F7548DB4 () DERCORCEXH02 ! ds-s ! com
[Download RAW message or body]
Hi,
To find similar documents in a Corpus using cosine similarity, Is it necessary to \
calculate tf-idf weights while creating term document matrix or just term frequency \
is fine? Can anyone let me know what are advantages and disadvantages for both ways?
Thanks,
Anusha
This e-mail (including attachments) contains contents owned by Rolls-Royce plc and \
its subsidiaries, affiliated companies or customers and covered by the laws of \
England and Wales, Brazil, US, or Canada (federal, state or provincial). The \
information is intended to be confidential and may be legally privileged. If you are \
not the intended recipient, you are hereby notified that any retention, \
dissemination, distribution, interception or copying of this communication is \
strictly prohibited and may subject you to further legal action. Reply to the sender \
if you received this email by accident, and then delete the email and any \
attachments.
[[alternative HTML version deleted]]
______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic