'lucene search: workaround for site.pdf/html'

List:       forrest-user
Subject:    lucene search: workaround for site.pdf/html
From:       Johannes Schaefer <johannes.schaefer () uidesign ! de>
Date:       2004-08-30 9:38:33
Message-ID: 4132F599.1050507 () uidesign ! de

Hi!

Lucene search doesn't work if site.xml contains entries
for site.pdf or site.html (the <all> section). As a
workaround, we moved these two links out of site.xml into
a separate document (we call it "Printversion"):

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE document PUBLIC "-//APACHE//DTD Documentation V1.2//EN"
"http://apache.org/forrest/dtd/document-v12.dtd">
<document>
   <header><title>Printversion</title></header>
   <body>
     <p>
       ... <link href="site.html">Full HTML</link> ...
       ... <link href="site.pdf">Full PDF</link> ...
     </p>
   </body>
</document>

This works fine and gives us some room to explain what these
two links are for. Lucene doesn't follow the links in this
file, so it can build the index without problems.
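
On the site.xml side, the change could look roughly like the
fragment below. This is only a sketch: the element names, the
labels and the file name printversion.html are made up for
illustration and are not copied from our real site.xml.

<site label="MyProject" href=""
      xmlns="http://apache.org/forrest/linkmap/1.0">
  <about label="About">
    <index label="Index" href="index.html"/>
    <!-- hypothetical entry for the separate "Printversion" document -->
    <printversion label="Printversion" href="printversion.html"/>
  </about>
  <!-- removed: the <all> section that pointed at site.html and site.pdf
  <all label="All">
    <full_html label="Full HTML" href="site.html"/>
    <full_pdf label="Full PDF" href="site.pdf"/>
  </all>
  -->
</site>

With this layout the menu only references the Printversion page,
and the two full-site links live inside that page.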

Just one question: which is better to use as the link target
in the file, "site:html" or "site:full_html"?

Cheers
Johannes


-- 
User Interface Design GmbH * Teinacher Str. 38 * D-71634 Ludwigsburg
Fon +49 (0)7141 377 000 * Fax  +49 (0)7141 377 00-99
Geschäftsstelle: User Interface Design GmbH * Lehrer-Götz-Weg 11 * 
D-81825 München
www.uidesign.de
