Apache CarbonData Dev Mailing List archive › Apache CarbonData JIRA issues

[jira] [Assigned] (CARBONDATA-3373) Optimize scenes with in numbers in SQL

Classic

List

Threaded

1 message

Akash R Nilugal (Jira)

[jira] [Assigned] (CARBONDATA-3373) Optimize scenes with in numbers in SQL

[ https://issues.apache.org/jira/browse/CARBONDATA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

kumar vishal reassigned CARBONDATA-3373:
----------------------------------------

Assignee: kumar vishal

> Optimize scenes with in numbers in SQL
> --------------------------------------
>
> Key: CARBONDATA-3373
> URL: https://issues.apache.org/jira/browse/CARBONDATA-3373
> Project: CarbonData
> Issue Type: Improvement
> Components: core
> Reporter: zhxiaoping
> Assignee: kumar vishal
> Priority: Critical
> Time Spent: 14h 50m
> Remaining Estimate: 0h
>
> when sql with 'in numbers' and spark.sql.codegen.wholeStage is false，the query is slow,
> the reason is that canbonscan row level filter's time complexity is O(n^2), we can replace list with hashset to improve query performance
> sql example: select * from xx where filed in (1,2,3,4,5,6)
>

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)