|
02-08-2005, 08:58 AM | #1 |
Green Mole
Join Date: Jan 2005
Location: New Jersey
Posts: 11
|
Spaces (%20) in URLs
I found a minor problem with phpDig but I haven't found where yet to fix it. If the HTML file has a space in the name such as "http://my product.html", the spider only sees "http://my" and consequently, doesn't index "http://my product.html" and the pages that are linked from it.
If I replace the %20 with a _, everything works great, but my designer being a Windows user (although there's nothing wrong with that :-) has put spaces in alot of URLs. Is there a way to tell phpDig to honor the spaces in the filenames/URLs? |
02-08-2005, 09:31 AM | #2 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Look for $allowed_link_chars in the config file and add a space to the character class.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
02-08-2005, 10:02 AM | #3 |
Green Mole
Join Date: Jan 2005
Location: New Jersey
Posts: 11
|
Turns out that adding a space to the existing class didn't work (I assume it was because I had place the space at the end of the class) but uncommenting out the line above it worked great! Thanks!
Now, back to figuring out why the dashes don't get indexed... |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Plus character(+) converted to (%20) in urls | raymerica | Troubleshooting | 2 | 05-31-2006 12:19 PM |
Break the depth limit of 20? | WebSpider | How-to Forum | 9 | 02-09-2005 02:21 PM |
Nor spaces nor accent | pepevilluela | Troubleshooting | 2 | 05-06-2004 04:55 PM |
Problem spidering sites at in .txt over 20 address | joshuag200 | Troubleshooting | 3 | 01-30-2004 08:13 PM |