[jira] [Updated] (CARBONDATA-3617) loadDataUsingGlobalSort should based on SortColumns Instead Of Whole CarbonRow

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (CARBONDATA-3617) loadDataUsingGlobalSort should based on SortColumns Instead Of Whole CarbonRow

Akash R Nilugal (Jira)

     [ https://issues.apache.org/jira/browse/CARBONDATA-3617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kunal Kapoor updated CARBONDATA-3617:
-------------------------------------
    Fix Version/s:     (was: 2.0.0)
                       (was: 1.6.1)
                   2.1.0

> loadDataUsingGlobalSort should based on SortColumns Instead Of Whole CarbonRow
> ------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-3617
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3617
>             Project: CarbonData
>          Issue Type: Improvement
>          Components: data-load
>    Affects Versions: 1.6.1, 2.0.0
>            Reporter: Xingjun Hao
>            Priority: Minor
>             Fix For: 2.1.0
>
>          Time Spent: 7h 50m
>  Remaining Estimate: 0h
>
> During loading Data usesing globalsort, the sortby processing is based the whole carbon row, the overhead of gc is huge when there are many columns. Theoretically, the sortby processing can works well just based on the sort columns, which will brings less time overhead and gc overhead.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)