PhpDig.net

Old 11-26-2003, 02:50 PM   #1
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15
Can't index a table construct

Hello,

I'm using:
PhpDig Version 1.6.4
Php Version 4.3.2
Apache Version 2.0.46
Linux


I'm having the problem that PhpDig can't find words which are inside of a table construct.

For example:
The html code is like this:
<table><tr>
<td>word</td>
<td second word</td>
</tr></table>

If I let PhpDig search for "word" or "second", I get the message "no results found".

I have 3 or 4 pages that contain tables. How can I get PhpDig to index them correctly and find the words inside the tables?


Thank you for your answers

yours
RedThypoon
Old 11-26-2003, 03:26 PM   #2
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Code:
<table><tr>
<td>word</td>
<td second word</td>
</tr></table>
Hi. Maybe just a typo but can you post the HTML here for a look? Also, does 'word' happen to be in the common words file?
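One way to rule out the common words file is to check a search term against its contents directly. The following is only a minimal sketch, assuming the file holds one word per line; the helper name `is_common_word` is hypothetical and not part of PhpDig, and the sample list stands in for the real file.

```php
<?php
// Hypothetical helper, not part of PhpDig: returns true when $word matches
// one of the given stop-word lines, ignoring ASCII case and surrounding
// whitespace. Non-ASCII letters are compared byte for byte.
function is_common_word(string $word, array $lines): bool
{
    $needle = strtolower(trim($word));
    foreach ($lines as $line) {
        if (strtolower(trim($line)) === $needle) {
            return true;
        }
    }
    return false;
}

// Inline sample data standing in for the real file contents (assumption).
$sample = ["the", "and", "word"];
var_dump(is_common_word("Word", $sample));
var_dump(is_common_word("Rivera", $sample));
```

To check a real file, pass `file($path, FILE_IGNORE_NEW_LINES)` as the second argument, where `$path` points at the common words file of your own installation.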
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Old 11-26-2003, 03:43 PM   #3
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15

"word" is just a placeholder for some text.

I can't post the whole code because it is too long,
but the whole code validates at the W3C.
So this would be the code, showing only the table.

If you would like to see the whole page, visit
http://www.redthypoon.de/walrus
and choose "On Stage" from the main menu and then "Marktplatz" from the menu in the window.

here's the code:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" lang="de" xml:lang="de">

<head>
<title>Walrus Kultur e. V.</title>
<meta http-equiv="Content-Style-Type" content="text/css" />
<link rel="stylesheet" type="text/css" href="includes/style.css" />
<link href="http://www.walrus-kultur-ev.de/favicon.ico" rel="SHORTCUT ICON" />

</head>

<body>
<div id="body" style="color:#ff0000; background:url(<?php echo $pfad; ?>images/onstage.jpg);">
<div class="titel">
Marktplatz</div>
<div id="inhalt">
<div style="margin:0px 20px 0px 0px; text-align:right;">Stand: 30.09.2003</div>
<div style="margin:10px 0px 0px 0px; color:white;">
<table border="0" cellspacing="3">
<colgroup>
<col width="90" />
<col width="92" />
<col width="200" />
<col width="295" />
<col width="112" />
</colgroup>
<tr>
<td style="color:#ffff00; background:#ff0000; font-size:1.3em;" colspan="5">gesucht wird</td>
</tr>

<tr style="background:#808080; font-weight:bold">
<td><b>Chiffré-Nr.</b></td>
<td><b>Datum</b></td>
<td><b>Bezeichnung</b></td>
<td><b>Beschreibung</b></td>
<td><b>Kontakt</b></td>
</tr>
<tr style="vertical-align:top;">
<td>2-b-ons</td>
<td></td>
<td>Rivera Gitarrenverstärker</td>
<td>im SKS-Case. Einbau in 19"-Rack möglich. Edler 100 Watt Gitarrenverstärker aus den USA ------- VHB 900,- &euro;</td>
<td><a href="mailto:bla@bla.de">bla@bla.de</a></td>
</tr>
</table>
</div>
</div>
</div>
</body>
</html>


thank you for your help

yours
RedThypoon

Last edited by RedThypon; 11-26-2003 at 04:03 PM.
Old 11-26-2003, 04:02 PM   #4
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. Yes, I understand.

Maybe there is a typo in the HTML that is causing that block to be ignored. Can you post the HTML?
Old 11-26-2003, 04:05 PM   #5
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15
Sorry, I forgot.
I have edited my post.
Old 11-26-2003, 04:22 PM   #6
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. I just indexed http://www.redthypoon.de/walrus/index.php?mnuid=198 at one level and then searched for the word 'Kontakt' and obtained 14 results. What word(s) do not show up in your search?
Old 11-27-2003, 05:28 AM   #7
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15
PhpDig doesn't find these words:
Rivera
Gitarre
Gitarrenverstärker

Another page with this problem is:
http://www.redthypoon.de/walrus/index.php?mnuid=189

There it doesn't find the names of the people or their roles, for example:
Sascha Schabacker
Vorsitzender
Kasse


thank you

yours
RedThypoon
Old 11-27-2003, 06:57 AM   #8
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. I crawled the link in your last post and can find, for example, Vorsitzender but I cannot find Litfaßsäule when I do a 'words begin' or 'exact words' search. However, when I do an 'any words part' search for Litfaßsäule, I get Litfaßsäule in the results. Please apply the patch in this thread to fix the highlighting issue, but this does seem like a character encoding problem. I'll need to do more checking on this issue. Thanks for bringing it to my attention.
Old 11-27-2003, 07:25 AM   #9
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. I figured out the Litfaßsäule issue. The character ß was not allowed in searches. My bad! As a temporary fix, do the following. I'll come up with something better in the next release.

In search_function.php find:
PHP Code:
if (eregi("[^[:alnum:]^ +^-]+",$query_to_parse)) {
    $query_to_parse = eregi_replace("[^[:alnum:]^ ]+"," ",$query_to_parse);
}
and replace with:
PHP Code:
if (eregi("[^[:alnum:]^ +^-^ß]+",$query_to_parse)) {
    $query_to_parse = eregi_replace("[^[:alnum:]^ ]+"," ",$query_to_parse);
}
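A note for readers on newer PHP versions: the ereg family of functions was deprecated in PHP 5.3 and removed in PHP 7.0, so the check above will not run there. The following is only a rough sketch of the same idea with preg_replace, assuming the query string is valid UTF-8. The function name `sanitize_query` is hypothetical and not part of PhpDig, and unlike the original replacement this version keeps `+` and `-` in the query.

```php
<?php
// Hypothetical helper, not part of PhpDig: replaces every run of characters
// that is not a letter, digit, space, plus or minus with a single space.
// Assumes the query is valid UTF-8 so that "ß" is treated as a letter.
function sanitize_query(string $query): string
{
    // \p{L} matches any Unicode letter (ß, ä, ö, ü included),
    // \p{N} matches any Unicode digit; the /u flag enables UTF-8 mode.
    $clean = preg_replace('/[^\p{L}\p{N} +\-]+/u', ' ', $query);

    // preg_replace() returns null on failure, e.g. for invalid UTF-8 input.
    return $clean === null ? '' : trim($clean);
}

echo sanitize_query('Litfaßsäule!?'), "\n";
```

Matching on Unicode letter properties avoids listing each special character by hand, which is what made ß slip through in the first place.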
This still doesn't answer why Vorsitzender shows in searches for me but not for you. Now I'm thinking this is not a character encoding issue, but rather something to do with stored keywords.

When you run the below query what do you get?
Code:
SELECT * FROM keywords WHERE keyword like 'vo%';
Old 11-27-2003, 07:36 AM   #10
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15
Hi, thanks for the solutions with the ß.

You can find Vorsitzender because it appears on two pages.
The word Vorsitzender is also on this page:
http://www.redthypoon.de/walrus/index.php?mnuid=189

And this is the problem I mentioned first: PhpDig can't find this page. It finds only the second page. I suppose this is because Vorsitzender is inside a table construct on the page it can't find.

When I run the SQL-Code I get this:
key_id  twoletters  keyword
3577    vo          voices
3545    vo          volker
3298    vo          voll
3643    vo          vordergrund
3538    vo          vorerst
3121    vo          vorname
3045    vo          vorsitzender
3037    vo          vorstand


Thank you for your help

yours
RedThypoon

Last edited by RedThypon; 11-27-2003 at 07:43 AM.
Old 11-27-2003, 07:42 AM   #11
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. I am able to find Schabacker so I don't think it's the table-construct. Hmm, I wonder what's different.
Old 11-27-2003, 07:45 AM   #12
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15
Sorry, you are too fast for me, or I don't think before I write.

Please reread my post above your last one; I edited it.

Never mind the word Schabacker. It is on the same pages as Vorsitzender, so it is the same problem.

thanks
Old 11-27-2003, 07:57 AM   #13
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. Can you make a page like so and then crawl it?
Code:
<html>
<body>
Rivera Gitarre Gitarrenverstärker Sascha Schabacker Vorsitzender Kasse
</body>
</html>
Do you get search results with this simple page?
Old 11-27-2003, 08:18 AM   #14
RedThypon
Green Mole
 
Join Date: Nov 2003
Location: Darmstadt - Germany
Posts: 15
Yes, on this simple page it finds the words.
Old 11-27-2003, 08:52 AM   #15
Charter
Head Mole
 
Join Date: May 2003
Posts: 2,539
Hi. Attached is a screenshot of the http://www.redthypoon.de/walrus/index.php?mnuid=189 page. Does the page look the same as it does in your browser?

When you crawl this site, do you get any 'duplicate' page notices?
Attached Images
File Type: gif screenshot.gif (33.2 KB, 5 views)