[jira] [Updated] (CARBONDATA-3010) Performance improvements for Fileformat and Presto

Akash R Nilugal (Jira)

[ https://issues.apache.org/jira/browse/CARBONDATA-3010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ravindra Pesala updated CARBONDATA-3010:
----------------------------------------
Issue Type: Improvement (was: Bug)

> Performance improvements for Fileformat and Presto
> --------------------------------------------------
>
> Key: CARBONDATA-3010
> URL: https://issues.apache.org/jira/browse/CARBONDATA-3010
> Project: CarbonData
> Issue Type: Improvement
> Reporter: Ravindra Pesala
> Priority: Major
>
> CarbonData is not well optimized when filling the vector during queries from Spark or Presto. The major issues are as follows.
> # CarbonData has a long method stack for reading the data and filling it into the vector.
> # Many conditions and checks are evaluated before the data is filled into the vector.
> # Intermediate copies of the data are maintained, which increases CPU utilization.
> # Data is filtered twice in Spark's fileformat vector flow and in the Presto flow.
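
As a rough illustration of the third and fourth points, the sketch below contrasts a fill path that keeps intermediate copies with one that filters and fills the vector in a single pass. It is not CarbonData code: `VectorFillSketch`, `IntVector`, `fillWithCopies`, `fillDirect`, and the threshold filter are all hypothetical names and assumptions chosen for the example.

```java
import java.util.Arrays;

public class VectorFillSketch {

    /** Hypothetical stand-in for an engine-side column vector (not a CarbonData, Spark, or Presto class). */
    static final class IntVector {
        private final int[] data;
        private int size;

        IntVector(int capacity) {
            data = new int[capacity];
        }

        void put(int value) {
            data[size++] = value;
        }

        int[] toArray() {
            return Arrays.copyOf(data, size);
        }
    }

    /** Copy-heavy path: decode into one array, filter into a second, then copy into the vector. */
    static IntVector fillWithCopies(int[] page, int threshold) {
        int[] decoded = Arrays.copyOf(page, page.length); // intermediate copy 1
        int[] filtered = new int[decoded.length];         // intermediate copy 2
        int kept = 0;
        for (int value : decoded) {
            if (value >= threshold) {
                filtered[kept++] = value;
            }
        }
        IntVector vector = new IntVector(kept);
        for (int i = 0; i < kept; i++) {
            vector.put(filtered[i]);
        }
        return vector;
    }

    /** Direct path: filter once and write each surviving value straight into the vector. */
    static IntVector fillDirect(int[] page, int threshold) {
        IntVector vector = new IntVector(page.length);
        for (int value : page) {
            if (value >= threshold) {
                vector.put(value);
            }
        }
        return vector;
    }

    public static void main(String[] args) {
        int[] page = {5, 12, 7, 30, 1};
        // Both paths keep the values >= 7; only the amount of copying differs.
        System.out.println(Arrays.toString(fillWithCopies(page, 7).toArray()));
        System.out.println(Arrays.toString(fillDirect(page, 7).toArray()));
    }
}
```

Both methods produce the same vector contents; the direct path simply avoids the two temporary arrays and the extra pass over the data, which is the kind of saving the issue appears to be aiming for.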

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)