  1. Commons Codec
  2. CODEC-132

BeiderMorseEncoder OOM issues


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version: 1.6
    • Fix Version: 1.7
    • None

    Description

      In Lucene/Solr, we integrated this encoder into the latest release.

      Our tests use a variety of random strings, and recent Jenkins runs have failed
      on some input streams (of length <= 10) that use huge amounts of memory (e.g. > 64MB),
      resulting in an OOM.

      I've created a test case (length is 30 here) that will OOM with -Xmx256M.

      I haven't dug into what's causing it, but I suspect a bug involving certain
      punctuation characters: we didn't see this happening until we beefed up our
      random string generation to start producing "HTML-like" strings.
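
      The kind of randomized input described above can be sketched roughly as
      follows. This is not the attached test case: the class name, the alphabet
      of markup-style punctuation, and the stand-in encoder are all hypothetical.
      With Commons Codec on the classpath, the stand-in would be replaced by a
      wrapper around BeiderMorseEncoder.encode(String), which throws a checked
      EncoderException. Output length is used here only as a rough proxy for
      memory growth, which is an assumption rather than something the report
      states.

```java
import java.util.Locale;
import java.util.Random;
import java.util.function.Function;

// Hypothetical harness: feeds random "HTML-like" strings to an encoder.
final class HtmlishStrings {

    // Assumed alphabet: lowercase letters plus punctuation common in markup.
    private static final String ALPHABET =
            "abcdefghijklmnopqrstuvwxyz <>/=\"'&;-!";

    // Returns a random string of length 0..maxLength drawn from ALPHABET.
    static String random(Random rnd, int maxLength) {
        int len = rnd.nextInt(maxLength + 1);
        StringBuilder sb = new StringBuilder(len);
        for (int i = 0; i < len; i++) {
            sb.append(ALPHABET.charAt(rnd.nextInt(ALPHABET.length())));
        }
        return sb.toString();
    }

    // Encodes `iterations` random inputs and returns the longest output seen.
    // A fixed seed makes any failing input reproducible.
    static int longestOutput(Function<String, String> encoder,
                             long seed, int iterations, int maxLength) {
        Random rnd = new Random(seed);
        int longest = 0;
        for (int i = 0; i < iterations; i++) {
            String out = encoder.apply(random(rnd, maxLength));
            longest = Math.max(longest, out.length());
        }
        return longest;
    }

    public static void main(String[] args) {
        // Stand-in encoder (assumption); swap in a BeiderMorseEncoder wrapper.
        Function<String, String> standIn = s -> s.toUpperCase(Locale.ROOT);
        System.out.println(longestOutput(standIn, 42L, 1000, 30));
    }
}
```

      Running the real encoder under a small heap (for example the -Xmx256M
      setting mentioned above) while logging the seed would make any input that
      triggers the OOM easy to replay.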

      Attachments

        1. CODEC-132_test.patch
          1.0 kB
          Robert Muir
        2. CODEC-132.patch
          15 kB
          Thomas Neidhart

            People

              Assignee: Unassigned
              Reporter: Robert Muir (rcmuir)