[prev in list] [next in list] [prev in thread] [next in thread] 

List:       r-help
Subject:    [R] cosine similarity tf-idf
From:       "Indhira, Anusha" <Anusha.Indhira () controlsdata ! com>
Date:       2016-10-28 10:21:08
Message-ID: CE1899A9C6A8D64099DFB344ECF0F7548DB4 () DERCORCEXH02 ! ds-s ! com
[Download RAW message or body]

Hi,

To find similar documents in a Corpus using cosine similarity, Is it necessary to \
calculate tf-idf weights while creating term document matrix or just term frequency \
is fine? Can anyone let me know what are advantages and disadvantages for both ways?

Thanks,
Anusha

This e-mail (including attachments) contains contents owned by Rolls-Royce plc and \
its subsidiaries, affiliated companies or customers and covered by the laws of \
England and Wales, Brazil, US, or Canada (federal, state or provincial). The \
information is intended to be confidential and may be legally privileged. If you are \
not the intended recipient, you are hereby notified that any retention, \
dissemination, distribution, interception or copying of this communication is \
strictly prohibited and may subject you to further legal action. Reply to the sender \
if you received this email by accident, and then delete the email and any \
attachments.

	[[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic