[jira] [Updated] (CARBONDATA-4054) Size control of minor compaction


Akash R Nilugal (Jira)

     [ https://issues.apache.org/jira/browse/CARBONDATA-4054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ZHANGSHUNYU updated CARBONDATA-4054:
------------------------------------
    Description:
Currently, minor compaction considers only the number of segments, and major
compaction considers only the total size of segments. Consider a scenario in
which the user wants to trigger minor compaction by the number of segments but
does not want to merge any segment whose data size exceeds a threshold, for
example 2 GB, because merging such a large segment is unnecessary and
time-consuming.
We therefore need a parameter that sets the size threshold for segments
included in minor compaction, so that the user can exclude a segment from
minor compaction once its data size exceeds the threshold. The parameter must,
of course, have a default value.

  was:
h1. Currently, minor compaction considers only the number of segments, and major
compaction considers only the total size of segments. Consider a scenario in
which the user wants to trigger minor compaction by the number of segments but
does not want to merge any segment whose data size exceeds a threshold, for
example 2 GB, because merging such a large segment is unnecessary and
time-consuming.
We therefore need a parameter that sets the size threshold for segments
included in minor compaction, so that the user can exclude a segment from
minor compaction once its data size exceeds the threshold. The parameter must,
of course, have a default value.


> Size control of minor compaction
> --------------------------------
>
>                 Key: CARBONDATA-4054
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-4054
>             Project: CarbonData
>          Issue Type: Improvement
>            Reporter: ZHANGSHUNYU
>            Priority: Major
>
> Currently, minor compaction considers only the number of segments, and major
> compaction considers only the total size of segments. Consider a scenario in
> which the user wants to trigger minor compaction by the number of segments but
> does not want to merge any segment whose data size exceeds a threshold, for
> example 2 GB, because merging such a large segment is unnecessary and
> time-consuming.
> We therefore need a parameter that sets the size threshold for segments
> included in minor compaction, so that the user can exclude a segment from
> minor compaction once its data size exceeds the threshold. The parameter must,
> of course, have a default value.
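
The selection rule proposed in the description can be illustrated with a small, self-contained Java sketch. It is not CarbonData code: the class MinorCompactionSizeFilter, the Segment type, the method names, and the convention that a non-positive threshold means "no size limit" are all assumptions made for illustration. The issue only asks that such a threshold exist and have some default value.

```java
import java.util.ArrayList;
import java.util.List;

public class MinorCompactionSizeFilter {

    /** Hypothetical stand-in for a segment: an id and its data size in MB. */
    static final class Segment {
        final String id;
        final long sizeMb;

        Segment(String id, long sizeMb) {
            this.id = id;
            this.sizeMb = sizeMb;
        }
    }

    /** Assumed default: a non-positive threshold disables the size check. */
    static final long NO_SIZE_LIMIT = -1L;

    /**
     * Returns the segments that may take part in minor compaction:
     * every segment whose size does not exceed thresholdMb.
     */
    static List<Segment> eligibleSegments(List<Segment> segments, long thresholdMb) {
        List<Segment> eligible = new ArrayList<>();
        for (Segment s : segments) {
            if (thresholdMb <= 0 || s.sizeMb <= thresholdMb) {
                eligible.add(s);
            }
        }
        return eligible;
    }

    /** Minor compaction still triggers on segment count, after size filtering. */
    static boolean shouldCompact(List<Segment> eligible, int minSegmentCount) {
        return eligible.size() >= minSegmentCount;
    }

    public static void main(String[] args) {
        List<Segment> segments = new ArrayList<>();
        segments.add(new Segment("0", 500));
        segments.add(new Segment("1", 3000)); // larger than 2 GB, so skipped
        segments.add(new Segment("2", 800));
        segments.add(new Segment("3", 1200));

        List<Segment> eligible = eligibleSegments(segments, 2048);
        for (Segment s : eligible) {
            System.out.println("eligible segment " + s.id + " (" + s.sizeMb + " MB)");
        }
        System.out.println("compact: " + shouldCompact(eligible, 3));
    }
}
```

Filtering by size first and counting afterwards keeps the existing count-based trigger unchanged while leaving oversized segments out of the merge.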



--
This message was sent by Atlassian Jira
(v8.3.4#803005)