mmaattttt
08-26-2004, 06:06 PM
Hi everyone,
I'm trying to index a server that has a bunch of MSWord documents on it. The indexing works fine except until it reaches a file or directory name with single quotes in it. For example, when trying to index http://www.domain.com/pathtodocs/John's Resume/Resume.doc it would fail and return a 404 error because it tries to find http://www.domain.com/pathtodocs/Johns Resume/Resume.doc (Checked through observing Apache logs)
Obviously quotes etc are being stripped out for sanity, but has anyone encountered this scenario before and what would be the best way to work around it? I'm happy to modify PHP code if someone can point me in the right direction but convincing 150 people to avoid using quotes and to find/change filenames etc containing single quotes would be an unlikely task.
Look forward to any good suggestions!
Cheers,
Matt
I'm trying to index a server that has a bunch of MSWord documents on it. The indexing works fine except until it reaches a file or directory name with single quotes in it. For example, when trying to index http://www.domain.com/pathtodocs/John's Resume/Resume.doc it would fail and return a 404 error because it tries to find http://www.domain.com/pathtodocs/Johns Resume/Resume.doc (Checked through observing Apache logs)
Obviously quotes etc are being stripped out for sanity, but has anyone encountered this scenario before and what would be the best way to work around it? I'm happy to modify PHP code if someone can point me in the right direction but convincing 150 people to avoid using quotes and to find/change filenames etc containing single quotes would be an unlikely task.
Look forward to any good suggestions!
Cheers,
Matt