[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Failed with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10409/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2368/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user kumarvishal09 commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    retest this please


---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/2166/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2380/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10422/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/2191/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2407/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10447/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2428/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/2210/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10466/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2435/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10470/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/2215/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.3.2, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/10473/



---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user ravipesala commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    LGTM


---

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user NamanRastogi commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/3029#discussion_r246329142
 
    --- Diff: processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionExecutor.java ---
    @@ -105,10 +105,15 @@ public CarbonCompactionExecutor(Map<String, TaskBlockInfo> segmentMapping,
        *
        * @return List of Carbon iterators
    --- End diff --
   
    Done


---

[GitHub] carbondata pull request #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user NamanRastogi commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/3029#discussion_r246329602
 
    --- Diff: processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionUtil.java ---
    @@ -400,24 +417,53 @@ private static int getDimensionDefaultCardinality(CarbonDimension dimension) {
        * @param tableLastUpdatedTime
        * @return
        */
    -  public static boolean checkIfAnyRestructuredBlockExists(Map<String, TaskBlockInfo> segmentMapping,
    -      Map<String, List<DataFileFooter>> dataFileMetadataSegMapping, long tableLastUpdatedTime) {
    -    boolean restructuredBlockExists = false;
    -    for (Map.Entry<String, TaskBlockInfo> taskMap : segmentMapping.entrySet()) {
    -      String segmentId = taskMap.getKey();
    +  public static boolean checkIfAnyRestructuredBlockExists(
    +      Map<String, TaskBlockInfo> segmentMapping,
    +      Map<String, List<DataFileFooter>> dataFileMetadataSegMapping,
    +      long tableLastUpdatedTime) {
    +
    +    for (Map.Entry<String, TaskBlockInfo> segmentEntry : segmentMapping.entrySet()) {
    +      String segmentId = segmentEntry.getKey();
           List<DataFileFooter> listMetadata = dataFileMetadataSegMapping.get(segmentId);
    -      for (DataFileFooter dataFileFooter : listMetadata) {
    -        // if schema modified timestamp is greater than footer stored schema timestamp,
    -        // it indicates it is a restructured block
    -        if (tableLastUpdatedTime > dataFileFooter.getSchemaUpdatedTimeStamp()) {
    -          restructuredBlockExists = true;
    -          break;
    -        }
    +
    +      if (isRestructured(listMetadata, tableLastUpdatedTime)) {
    +        return true;
           }
    -      if (restructuredBlockExists) {
    -        break;
    +    }
    +
    +    return false;
    +  }
    +
    +  public static boolean isRestructured(List<DataFileFooter> listMetadata,
    +      long tableLastUpdatedTime) {
    +    /*
    +     * TODO: only in case of add and drop this variable should be true
    +     */
    +    for (DataFileFooter dataFileFooter : listMetadata) {
    +      // if schema modified timestamp is greater than footer stored schema timestamp,
    +      // it indicates it is a restructured block
    +      if (tableLastUpdatedTime > dataFileFooter.getSchemaUpdatedTimeStamp()) {
    +        return true;
           }
         }
    -    return restructuredBlockExists;
    +    return false;
       }
    +
    +  public static boolean isSorted(TaskBlockInfo taskBlockInfo) throws IOException {
    +    String filePath =
    +        taskBlockInfo.getAllTableBlockInfoList().iterator().next().get(0).getFilePath();
    +    long fileSize =
    +        FileFactory.getCarbonFile(filePath, FileFactory.getFileType(filePath)).getSize();
    +
    +    FileReader fileReader = FileFactory.getFileHolder(FileFactory.getFileType(filePath));
    +    ByteBuffer buffer =
    +        fileReader.readByteBuffer(FileFactory.getUpdatedFilePath(filePath), fileSize - 8, 8);
    +    fileReader.finish();
    +
    +    CarbonFooterReaderV3 footerReader = new CarbonFooterReaderV3(filePath, buffer.getLong());
    +    FileFooter3 footer = footerReader.readFooterVersion3();
    +
    +    return footer.isIs_sort();
    --- End diff --
   
    Done.
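
For readers following the `isSorted` hunk above: it reads the last 8 bytes of the data file (`fileSize - 8, 8`) and passes `buffer.getLong()` to the footer reader as an offset. The sketch below illustrates only that "trailing 8-byte offset" step using plain `java.nio`. It is not CarbonData code: the class name `TrailingOffsetReader` and method `readTrailingOffset` are hypothetical, and it assumes the trailer is a big-endian long, which is the default byte order of `ByteBuffer`.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical helper, not part of CarbonData.
public class TrailingOffsetReader {

  /** Reads the final 8 bytes of the file as a big-endian long. */
  public static long readTrailingOffset(Path file) throws IOException {
    // try-with-resources closes the channel even if a read fails
    try (FileChannel channel = FileChannel.open(file, StandardOpenOption.READ)) {
      long size = channel.size();
      if (size < Long.BYTES) {
        throw new IOException("File too small to hold an 8-byte trailer: " + file);
      }
      ByteBuffer buffer = ByteBuffer.allocate(Long.BYTES);
      long start = size - Long.BYTES;
      // A positional read may return fewer bytes than requested, so loop
      while (buffer.hasRemaining()) {
        int read = channel.read(buffer, start + buffer.position());
        if (read < 0) {
          throw new IOException("Unexpected end of file: " + file);
        }
      }
      buffer.flip();
      return buffer.getLong();
    }
  }
}
```

One design point the sketch makes explicit: the reader is released in a try-with-resources block, so it is closed even when the read throws. In the hunk above, `fileReader.finish()` is called only on the success path; whether that matters depends on how `FileReader` manages its handles, which the diff does not show.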


---

[GitHub] carbondata issue #3029: [CARBONDATA-3200] No-Sort compaction

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/3029
 
    Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/2457/



---