[jira] [Created] (CARBONDATA-3148) Support Compaction No-Sort

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (CARBONDATA-3148) Support Compaction No-Sort

Akash R Nilugal (Jira)
Praveen M P created CARBONDATA-3148:
---------------------------------------

             Summary: Support Compaction No-Sort
                 Key: CARBONDATA-3148
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3148
             Project: CarbonData
          Issue Type: Improvement
            Reporter: Praveen M P
            Assignee: Praveen M P


Currently when the data load is done with sort_scope as NO_SORT, then when 
those segments are compacted, data is still not sorted and it will hit 
query performance. 

The above problem can be solved by sorting the data during compaction and 
this helps in query performance. 

During busy hours if customer loads data and by default we do sorting , the 
loading will be slow. Instead if user makes sort scope as NO_SORT and loads 
data, dataloading will be faster. Then when compaction is triggered all the 
data will be sorted and written to compacted segment. This will help in 
query but compaction performance will degrade and this should be 
compromised. 

We can expose a property and by default current flow is taken, and if we 
configure property, data will be sorted and compacted segment is written. 

performance will be hit for compaction.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)