Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1124

PDFBox Is Throwing Exception in extraction in case of few pdf in .NET 3.5

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 0.7.3
    • None
    • .NET
    • None

    Description

      i am using the pdfbox libraries in C# with Framework 3.5.

      my code is :-

      PDDocument doc = PDDocument.load("pdf file path");
      PDFTextStripper stripper = new PDFTextStripper();
      Result = stripper.getText(doc);

      this code is running and working fine. but in case of few pdf(specially when it has images)
      It was giving the error of missing assembly bcprov-jdk14-132. i add the reference of this assembly.
      now it is again giving the following error on the third line of fetching text.

      <b>The type initializer for 'gnu.java.util.regex.RESyntax' threw an exception.</b>

      Please solve the problem. it is very urgent.

      I have also attached the pdf which is giving error in extracting the text

      thanks in advance

      Attachments

        1. QT_Install_Guide.pdf
          1.31 MB
          gagan deep sharma

        Activity

          People

            Unassigned Unassigned
            gagsha gagan deep sharma
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 72h
                72h
                Remaining:
                Remaining Estimate - 72h
                72h
                Logged:
                Time Spent - Not Specified
                Not Specified