Uploaded image for project: 'Maven Doxia'
  1. Maven Doxia
  2. DOXIA-239

Handle non-ASCII characters in anchors and id's

    XMLWordPrintableJSON

Details

    Description

      From DOXIA-236:

      The javadoc for the method HtmlTools.encodeId() mentions the pattern [A-Za-z][A-Za-z0-9:_.-]* for its output. To me, this looks like the term "letter" in meant to refer to ASCII characters in this context. However, the employed method Character.isLetter() will classify characters according to the Unicode data file. For instance, the characters "ä" and "ß" are letters in the Unicode sense. encodeId() will pass these through to its output, violating the ASCII-only pattern stated in its javadoc.

      Attachments

        Activity

          People

            ltheussl Lukas Theussl
            ltheussl Lukas Theussl
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: