Author: timwi
Description:
BUG MIGRATED FROM SOURCEFORGE
http://sourceforge.net/tracker/index.php?func=detail&aid=957818&group_id=34373&atid=411192
Originally submitted by Roger Persson (rogper) 2004-05-21 07:00
Of a coincident I noticed that greater-than (>) char in URLs is
rendered wrongly IF it occures as last character in URL.
Example:
Check this extra semicolon http://sample.link/<hello> in the
end
Check this http://sample.link/<hello> strange thing
Result:
http://sample.link/<hello>;
http://sample.link/<hello>
- Additional comments ------------------------
Date: 2004-05-28 09:35
Sender: SF user vibber
The HTML output is:
http://
sample.link/<hello>;
It looks like the HTML stripping is being done before external
links, so
the have become "<" and ">". Semicolons are
actually
legal in links; the _final_ punctuation (not followed by linkable
chars) is
stripped, but the bits in the middle are considered fair game
for
belonging to a link so it extends up to the ">" but not
including
the final ";" (or the other ";" that follows, which
is extraneous).
Correct behavior would be to have the link cover
"http://sample.link/",
then cut off at the <. This will require parsing for external
links before
stripping HTML; perhaps another placeholder step would be useful
here (might also help the longstanding URL-within-URL bug).
Bug is present in both 1.2 and current 1.3.
Version: 1.4.x
Severity: normal