[jira] [Commented] (CARBONDATA-308) Use CarbonInputFormat in CarbonScanRDD compute

Akash R Nilugal (Jira)

    [ https://issues.apache.org/jira/browse/CARBONDATA-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15630016#comment-15630016 ]

ASF GitHub Bot commented on CARBONDATA-308:
-------------------------------------------

Github user ravipesala commented on a diff in the pull request:

    https://github.com/apache/incubator-carbondata/pull/262#discussion_r86214621
 
    --- Diff: integration/spark/src/main/java/org/apache/carbondata/spark/load/CarbonLoaderUtil.java ---
    @@ -953,66 +959,6 @@ public static void checkAndCreateCarbonDataLocation(String carbonStorePath, Stri
       }
     
       /**
    -   * method to distribute the blocklets of a block in multiple blocks
    --- End diff --
   
    Maybe we should take a call on removing blocklet distribution. For filter queries that have only a small number of blocks to scan, blocklet distribution is very helpful because it lets them be processed faster.


> Use CarbonInputFormat in CarbonScanRDD compute
> ----------------------------------------------
>
>                 Key: CARBONDATA-308
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-308
>             Project: CarbonData
>          Issue Type: Sub-task
>          Components: spark-integration
>            Reporter: Jacky Li
>            Assignee: Jacky Li
>             Fix For: 0.2.0-incubating
>
>
> Take CarbonScanRDD as the target RDD and modify it as follows:
> 1. On the driver side, only getSplit is required, which needs only the filter condition, not a full QueryModel object. Creation of the QueryModel can therefore move from the driver side to the executor side.
> 2. Use CarbonInputFormat.createRecordReader in CarbonScanRDD.compute instead of using QueryExecutor directly.
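
The two steps in the issue description can be illustrated with a minimal, self-contained sketch. Every type and method below (ScanSketch, Split, QueryModel, getSplits, compute, runQuery) is a hypothetical stand-in written for this illustration, not the CarbonData or Hadoop API, and the filter is assumed to be a simple "value >= lowerBound" condition. The sketch only shows the division of work: the driver prunes splits from the filter condition alone, and each executor task builds its own query model before reading a split.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

// Hypothetical stand-ins only; none of these are CarbonData or Hadoop classes.
final class ScanSketch {

  /** Stand-in for an input split: one block plus min/max of the filter column. */
  static final class Split {
    final int[] rows;
    final int min;
    final int max;

    Split(int[] rows) {
      this.rows = rows;
      int lo = Integer.MAX_VALUE;
      int hi = Integer.MIN_VALUE;
      for (int r : rows) {
        lo = Math.min(lo, r);
        hi = Math.max(hi, r);
      }
      this.min = lo;
      this.max = hi;
    }
  }

  /** Stand-in for the full query model; created only on the executor side. */
  static final class QueryModel {
    final IntPredicate filter;

    QueryModel(int lowerBound) {
      this.filter = v -> v >= lowerBound;
    }
  }

  /** Driver side: prune splits using only the filter condition, no QueryModel. */
  static List<Split> getSplits(List<Split> all, int lowerBound) {
    List<Split> kept = new ArrayList<>();
    for (Split s : all) {
      if (s.max >= lowerBound) { // a block whose max is below the bound cannot match
        kept.add(s);
      }
    }
    return kept;
  }

  /** Executor side: build the QueryModel locally, then read the split's rows. */
  static List<Integer> compute(Split split, int lowerBound) {
    QueryModel model = new QueryModel(lowerBound);
    List<Integer> out = new ArrayList<>();
    for (int v : split.rows) {
      if (model.filter.test(v)) {
        out.add(v);
      }
    }
    return out;
  }

  /** Runs the driver step, then the executor step once per surviving split. */
  static List<Integer> runQuery(List<Split> all, int lowerBound) {
    List<Integer> result = new ArrayList<>();
    for (Split s : getSplits(all, lowerBound)) {
      result.addAll(compute(s, lowerBound));
    }
    return result;
  }
}
```

In this sketch only the lower bound travels from the driver to the tasks, which mirrors the point of step 1: the driver never needs the heavier query model object.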



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)