[jira] [Updated] (CARBONDATA-3010) Performance improvements for Fileformat and Presto

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (CARBONDATA-3010) Performance improvements for Fileformat and Presto

Akash R Nilugal (Jira)

     [ https://issues.apache.org/jira/browse/CARBONDATA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravindra Pesala updated CARBONDATA-3010:
----------------------------------------
    Issue Type: Improvement  (was: Bug)

> Performance improvements for Fileformat and Presto
> --------------------------------------------------
>
>                 Key: CARBONDATA-3010
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3010
>             Project: CarbonData
>          Issue Type: Improvement
>            Reporter: Ravindra Pesala
>            Priority: Major
>
> When querying data using Spark or Presto while filling the vector, carbondata is not well optimized. The major issues are as follows.
>  # CarbonData has long method stack for reading and filling out the data to vector.
>  # Many conditions and checks before filling out the data to vector.
>  # Maintaining intermediate copies of data leads to more CPU utilization.
>  # Filtering of data twice when using spark's fileformat vector flow and presto flow



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)