
[jira] [Created] (CARBONDATA-3010) Performance improvements for Fileformat and Presto


Akash R Nilugal (Jira)

Ravindra Pesala created CARBONDATA-3010:
-------------------------------------------

Summary: Performance improvements for Fileformat and Presto
Key: CARBONDATA-3010
URL: https://issues.apache.org/jira/browse/CARBONDATA-3010
Project: CarbonData
Issue Type: Bug
Reporter: Ravindra Pesala

When data is queried through Spark or Presto, the CarbonData code path that fills the result vector is not well optimized. The major issues are as follows.
# CarbonData has a long method call stack for reading data and filling it into the vector.
# Many conditions and checks are evaluated before data is filled into the vector.
# Intermediate copies of the data are maintained, which increases CPU utilization.
# Data is filtered twice in Spark's fileformat vector flow and in the Presto flow.
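
As a rough illustration of the last two points, here is a minimal Python sketch. It is not CarbonData code: `ColumnVector`, `fill_with_copies`, and `fill_direct` are hypothetical names made up for this example. It contrasts a fill path that keeps intermediate copies and applies the filter twice with one that writes matching values straight into the vector in a single pass.

```python
from typing import Callable, List


class ColumnVector:
    """Hypothetical stand-in for an engine's column vector (not a real CarbonData/Spark/Presto class)."""

    def __init__(self, capacity: int) -> None:
        self.values: List[int] = [0] * capacity
        self.size = 0

    def put(self, value: int) -> None:
        # Append one value at the next free slot.
        self.values[self.size] = value
        self.size += 1

    def to_list(self) -> List[int]:
        return self.values[: self.size]


def fill_with_copies(page: List[int], predicate: Callable[[int], bool],
                     vector: ColumnVector) -> int:
    """Fill via intermediate copies and a redundant second filter pass.

    Returns the number of predicate evaluations performed.
    """
    decoded = list(page)                          # intermediate copy 1
    kept = [v for v in decoded if predicate(v)]   # intermediate copy 2, first filter pass
    evaluations = len(decoded)
    for v in kept:
        evaluations += 1
        if predicate(v):                          # second filter pass, always true here
            vector.put(v)
    return evaluations


def fill_direct(page: List[int], predicate: Callable[[int], bool],
                vector: ColumnVector) -> int:
    """Fill the vector in one pass: no temporary lists, one filter evaluation per row."""
    evaluations = 0
    for v in page:
        evaluations += 1
        if predicate(v):
            vector.put(v)
    return evaluations


page = [5, 12, 7, 20, 3]
at_least_seven = lambda v: v >= 7

slow_vector = ColumnVector(len(page))
fast_vector = ColumnVector(len(page))
slow_evals = fill_with_copies(page, at_least_seven, slow_vector)
fast_evals = fill_direct(page, at_least_seven, fast_vector)
```

Both paths produce the same vector contents, but the direct path avoids the two temporary lists and evaluates the filter once per row instead of once per row plus once per surviving row.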

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)