Apache CarbonData Dev Mailing List archive › Apache CarbonData JIRA issues

[jira] [Created] (CARBONDATA-3373) Optimize scenes with in numbers in SQL

Classic

List

Threaded

1 message

Akash R Nilugal (Jira)

[jira] [Created] (CARBONDATA-3373) Optimize scenes with in numbers in SQL

zhxiaoping created CARBONDATA-3373:
--------------------------------------

Summary: Optimize scenes with in numbers in SQL
Key: CARBONDATA-3373
URL: https://issues.apache.org/jira/browse/CARBONDATA-3373
Project: CarbonData
Issue Type: Improvement
Components: core
Reporter: zhxiaoping

when sql with 'in numbers' and spark.sql.codegen.wholeStage is false，the query is slow,

the reason is that canbonscan row level filter's time complexity is O(n^2), we can replace list with hashset to improve query performance

sql example: select * from xx where filed in (1,2,3,4,5,6)

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)