[jira] [Commented] (CARBONDATA-253) Duplicate block loading when distribution is based on blocklet

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (CARBONDATA-253) Duplicate block loading when distribution is based on blocklet

Akash R Nilugal (Jira)

    [ https://issues.apache.org/jira/browse/CARBONDATA-253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15499715#comment-15499715 ]

ASF GitHub Bot commented on CARBONDATA-253:
-------------------------------------------

GitHub user kumarvishal09 opened a pull request:

    https://github.com/apache/incubator-carbondata/pull/170

    [CARBONDATA-253]OOM issue if distribution is based on blocklet duing query execution

    Problem:In case of query execution when distribution is based on blocklet same blocks are getting loaded multiple times this is because hash code and equals method contract is not same, this is can cause OOM issue if distribution is based on blocklet
    Solution: As same class will be used to identify unique blocks while distribution and while loading so creating a wrapper class and implementing hash code and equals method based on filepath, offset and length, this will remove duplicate blocks and only one block's metadata will be loaded in memory

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/kumarvishal09/incubator-carbondata equalsAndHashCodeIssue

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-carbondata/pull/170.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #170
   
----
commit 49a76db0ff8e63fc95a309984d215086d888fc7b
Author: kumarvishal <[hidden email]>
Date:   2016-09-17T13:25:36Z

    equalsAndHashCodeIssue

----


> Duplicate block loading when distribution is based on blocklet
> --------------------------------------------------------------
>
>                 Key: CARBONDATA-253
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-253
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: kumar vishal
>            Assignee: kumar vishal
>
> In case of query execution when distribution is based on blocklet same blocks are getting loaded multiple times this is because hash code and equals method contract is not same



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)