[SPARK-26215] define reserved keywords after SQL standard - ASF JIRA

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.4.0
Fix Version/s: 3.0.0
Component/s: SQL
Labels:
None

Target Version/s:

3.0.0

Description

There are 2 kinds of SQL keywords: reserved and non-reserved. Reserved keywords can't be used as identifiers.

In Spark SQL, we are too tolerant about non-reserved keywors. A lot of keywords are non-reserved and sometimes it cause ambiguity (IIRC we hit a problem when improving the INTERVAL syntax).

I think it will be better to just follow other databases or SQL standard to define reserved keywords, so that we don't need to think very hard about how to avoid ambiguity.

For reference: https://www.postgresql.org/docs/8.1/sql-keywords-appendix.html

Attachments

Issue Links

contains

SPARK-22553 Drop FROM in nonReserved

Resolved

is related to

SPARK-20964 Make some keywords reserved along with the ANSI/SQL standard

Resolved

SPARK-27060 DDL Commands are accepting Keywords like create, drop as tableName

Resolved

relates to

SPARK-26905 Revisit reserved/non-reserved keywords based on the ANSI SQL standard

Resolved

SPARK-20964 Make some keywords reserved along with the ANSI/SQL standard

Resolved

links to

[Github] Pull Request #23259 (maropu)

GitHub Pull Request #23259

GitHub Pull Request #23897

(4 links to)

Activity

Ascending order - Click to sort in descending order

Wenchen Fan added a comment - 29/Nov/18 11:04

cc maropu LI,Xiao viirya mgaido

Wenchen Fan added a comment - 29/Nov/18 11:04 cc maropu LI,Xiao viirya mgaido

L. C. Hsieh added a comment - 29/Nov/18 11:11

Thanks for pinging me.

Is "In Spark SQL, we are too tolerant about non-reserved keywords" meaning that we have too many non-reserved keywords which should be defined as reserved keywords?

L. C. Hsieh added a comment - 29/Nov/18 11:11 Thanks for pinging me. Is "In Spark SQL, we are too tolerant about non-reserved keywords" meaning that we have too many non-reserved keywords which should be defined as reserved keywords?

Marco Gaido added a comment - 29/Nov/18 11:16

cloud_fan thanks for pinging me. I agree on putting a rule. And I think if we want to do this, since it is a breaking change, 3.0 is the right version to do that. I am wondering if we should create an umbrella JIRA for SQL standard compliance in 3.0: I have also some PRs which we can now revisit (eg. failing on overflow) in order to achieve full (or at least better) SQL standard compliance. What do you think? Moreover, I think we should also decide which SQL standard we want to use: SQL2011 maybe?

Marco Gaido added a comment - 29/Nov/18 11:16 cloud_fan thanks for pinging me. I agree on putting a rule. And I think if we want to do this, since it is a breaking change, 3.0 is the right version to do that. I am wondering if we should create an umbrella JIRA for SQL standard compliance in 3.0: I have also some PRs which we can now revisit (eg. failing on overflow) in order to achieve full (or at least better) SQL standard compliance. What do you think? Moreover, I think we should also decide which SQL standard we want to use: SQL2011 maybe?

Wenchen Fan added a comment - 29/Nov/18 14:19

> Is "In Spark SQL, we are too tolerant about non-reserved keywords" meaning that we have too many non-reserved keywords which should be defined as reserved keywords?

Yes

> I am wondering if we should create an umbrella JIRA for SQL standard compliance in 3.0

sure, feel free to create one. BTW maybe SQL2003 is good enough, but we should follow the latest standard if there is a conflict: e.g. 2003 says a keyword is non-reserved, but 2011 says it's not, we should follow 2011.

Wenchen Fan added a comment - 29/Nov/18 14:19 > Is "In Spark SQL, we are too tolerant about non-reserved keywords" meaning that we have too many non-reserved keywords which should be defined as reserved keywords? Yes > I am wondering if we should create an umbrella JIRA for SQL standard compliance in 3.0 sure, feel free to create one. BTW maybe SQL2003 is good enough, but we should follow the latest standard if there is a conflict: e.g. 2003 says a keyword is non-reserved, but 2011 says it's not, we should follow 2011.

Takeshi Yamamuro added a comment - 03/Dec/18 12:09

These reserved words should be handled inside SqlBase.g4? It seems postgresql do so https://github.com/postgres/postgres/blob/ee2b37ae044f34851baba69e9ba737077326414e/src/backend/parser/gram.y#L15366

Takeshi Yamamuro added a comment - 03/Dec/18 12:09 These reserved words should be handled inside SqlBase.g4? It seems postgresql do so https://github.com/postgres/postgres/blob/ee2b37ae044f34851baba69e9ba737077326414e/src/backend/parser/gram.y#L15366

Takeshi Yamamuro added a comment - 03/Dec/18 12:16

I found some useful documents about the reserved words;
https://developer.mimer.com/mimer-sql-standard-compliance/
https://developer.mimer.com/wp-content/uploads/2018/05/Standard-SQL-Reserved-Words-Summary.pdf

Takeshi Yamamuro added a comment - 03/Dec/18 12:16 I found some useful documents about the reserved words; https://developer.mimer.com/mimer-sql-standard-compliance/ https://developer.mimer.com/wp-content/uploads/2018/05/Standard-SQL-Reserved-Words-Summary.pdf

Apache Spark added a comment - 08/Dec/18 05:11

User 'maropu' has created a pull request for this issue:
https://github.com/apache/spark/pull/23259

Apache Spark added a comment - 08/Dec/18 05:11 User 'maropu' has created a pull request for this issue: https://github.com/apache/spark/pull/23259

Apache Spark added a comment - 08/Dec/18 05:12

User 'maropu' has created a pull request for this issue:
https://github.com/apache/spark/pull/23259

Apache Spark added a comment - 08/Dec/18 05:12 User 'maropu' has created a pull request for this issue: https://github.com/apache/spark/pull/23259

Takeshi Yamamuro added a comment - 22/Feb/19 23:41

Resolved by https://github.com/apache/spark/pull/23259

Takeshi Yamamuro added a comment - 22/Feb/19 23:41 Resolved by https://github.com/apache/spark/pull/23259

People

Assignee:: Takeshi Yamamuro

Reporter:: Wenchen Fan

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 29/Nov/18 11:03

Updated:: 05/Feb/20 06:16

Resolved:: 22/Feb/19 23:41