Hive / HIVE-8746

ORC timestamp columns are sensitive to daylight savings time

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.11.0, 0.12.0, 0.13.0, 0.14.0, 1.0.0, 1.1.0, 1.2.0
    • Fix Version/s: 1.2.0
    • None
    • Release Note: Fixed ORC timestamp columns for daylight savings changes.

    Description

      Hive uses Java's Timestamp class to manipulate timestamp columns. Unfortunately, the textual parsing in Timestamp is done in local time, while the internal storage is in UTC.
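
      To make the local-time parsing concrete, here is a minimal sketch (not taken from the Hive code base). The class and method names are hypothetical, and it assumes the JVM default timezone is what java.sql.Timestamp.valueOf consults when parsing text:

```java
import java.sql.Timestamp;
import java.util.TimeZone;

// Hypothetical helper class, for illustration only.
class TimestampParsingSketch {
    // Parse the text while zoneId is the JVM default zone and
    // return the resulting instant as UTC epoch milliseconds.
    static long parseIn(String text, String zoneId) {
        TimeZone saved = TimeZone.getDefault();
        try {
            TimeZone.setDefault(TimeZone.getTimeZone(zoneId));
            return Timestamp.valueOf(text).getTime();
        } finally {
            // Always restore the original default zone.
            TimeZone.setDefault(saved);
        }
    }

    public static void main(String[] args) {
        long la = parseIn("2014-06-06 12:34:56", "America/Los_Angeles");
        long utc = parseIn("2014-06-06 12:34:56", "UTC");
        // Same text, different instants: the gap is the local UTC offset in hours.
        System.out.println((la - utc) / 3_600_000L);
    }
}
```

      The same string maps to different internal values depending on where it is parsed, which is why a file written in one timezone and read in another needs some form of compensation.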

      ORC mostly sidesteps this issue by computing the difference between the timestamp and a base time, both interpreted in local time, and storing that difference in the file. Reading the file in a different timezone will mostly work correctly: "2014-01-01 12:34:56" will read correctly in every timezone.

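      However, moving between timezones with different daylight saving rules causes trouble, because the writer's and reader's offsets from UTC then differ by a different amount at the timestamp than at the base time. In particular, a file written on a computer in PST and read on one in UTC will read "2014-06-06 12:34:56" as "2014-06-06 11:34:56".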
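
      The arithmetic behind the one-hour shift can be sketched as follows. This is an illustration of the scheme described above, not the ORC reader/writer code; the class, the method names, and the base time of 2015-01-01 00:00:00 are assumptions (the description only says "a base time"). Any base that falls in winter gives the same result.

```java
import java.time.Instant;
import java.time.LocalDateTime;
import java.time.ZoneId;

// Hypothetical sketch of "store the difference from a local base time".
class OrcTimestampDstSketch {
    // Assumed base wall-clock time; interpreted in whatever zone is in use.
    static final LocalDateTime BASE = LocalDateTime.of(2015, 1, 1, 0, 0, 0);

    // Writer side: seconds between the value and the base,
    // both interpreted in the writer's zone.
    static long encode(LocalDateTime value, ZoneId writerZone) {
        long valueEpoch = value.atZone(writerZone).toEpochSecond();
        long baseEpoch = BASE.atZone(writerZone).toEpochSecond();
        return valueEpoch - baseEpoch;
    }

    // Reader side: add the stored difference to the base
    // interpreted in the reader's zone, then render as local time.
    static LocalDateTime decode(long diffSeconds, ZoneId readerZone) {
        long baseEpoch = BASE.atZone(readerZone).toEpochSecond();
        return Instant.ofEpochSecond(baseEpoch + diffSeconds)
                .atZone(readerZone)
                .toLocalDateTime();
    }

    public static void main(String[] args) {
        ZoneId writer = ZoneId.of("America/Los_Angeles");
        ZoneId reader = ZoneId.of("UTC");

        // Winter value: writer is at UTC-8 for both value and base, so it round-trips.
        LocalDateTime winter = LocalDateTime.of(2014, 1, 1, 12, 34, 56);
        System.out.println(decode(encode(winter, writer), reader)); // 2014-01-01T12:34:56

        // Summer value: writer is at UTC-7 for the value but UTC-8 for the base,
        // so the stored difference is one hour short from the reader's view.
        LocalDateTime summer = LocalDateTime.of(2014, 6, 6, 12, 34, 56);
        System.out.println(decode(encode(summer, writer), reader)); // 2014-06-06T11:34:56
    }
}
```

      Reading in the writer's own zone round-trips both values; the error only appears when the reader's daylight saving rules differ from the writer's.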

      Attachments

        1. HIVE-8746.4.patch
          264 kB
          Prasanth Jayachandran
        2. HIVE-8746.3.patch
          275 kB
          Prasanth Jayachandran
        3. HIVE-8746.2.patch
          274 kB
          Prasanth Jayachandran
        4. HIVE-8746.1.patch
          54 kB
          Prasanth Jayachandran

            People

              Assignee: Prasanth Jayachandran (prasanth_j)
              Reporter: Owen O'Malley (omalley)
              Votes: 0
              Watchers: 9