Uploaded image for project: 'Calcite'
  1. Calcite
  2. CALCITE-5668

When parsing SQL in PostgreSQL dialect, allow unquoted table names to contain dollar sign, letters with diacritical marks and non-Latin letters

    XMLWordPrintableJSON

Details

    Description

      According PostgreSQL documentation [1][2]:
      SQL identifiers and key words must begin with a letter (a-z, but also letters with diacritical marks and non-Latin letters) or an underscore (_). Subsequent characters in an identifier or key word can be letters, underscores, digits (0-9), or dollar signs ($).

      Letters with diacritical marks and non-Latin letters are extended ascii letters (character code 128-255 or in octal \200-\377)[3].

      [1] https://www.postgresql.org/docs/15/sql-syntax-lexical.html#SQL-SYNTAX-IDENTIFIERS
      [2] https://github.com/postgres/postgres/blob/master/src/backend/parser/scan.l
      [3] https://learn.microsoft.com/zh-cn/office/vba/language/reference/user-interface-help/character-set-128255

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              dmsysolyatin Dmitry Sysolyatin
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: