[jira] [Created] (CARBONDATA-3010) Performance improvements for Fileformat and Presto

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (CARBONDATA-3010) Performance improvements for Fileformat and Presto

Akash R Nilugal (Jira)
Ravindra Pesala created CARBONDATA-3010:
-------------------------------------------

             Summary: Performance improvements for Fileformat and Presto
                 Key: CARBONDATA-3010
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3010
             Project: CarbonData
          Issue Type: Bug
            Reporter: Ravindra Pesala


When querying data using Spark or Presto while filling the vector, carbondata is not well optimized. The major issues are as follows.
 # CarbonData has long method stack for reading and filling out the data to vector.
 # Many conditions and checks before filling out the data to vector.
 # Maintaining intermediate copies of data leads to more CPU utilization.
 # Filtering of data twice when using spark's fileformat vector flow and presto flow



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)