[GitHub] carbondata pull request #2829: [CARBONDATA-3025]add more metadata in carbon ...

classic Classic list List threaded Threaded
70 messages Options
1234
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #2829: [CARBONDATA-3025]add more metadata in carbon ...

qiuchenjian-2
Github user xuchuanyin commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/2829#discussion_r226546096
 
    --- Diff: format/src/main/thrift/carbondata.thrift ---
    @@ -206,6 +206,7 @@ struct FileFooter3{
         4: optional list<BlockletInfo3> blocklet_info_list3; // Information about blocklets of all columns in this file for V3 format
         5: optional dictionary.ColumnDictionaryChunk dictionary; // Blocklet local dictionary
         6: optional bool is_sort; // True if the data is sorted in this file, it is used for compaction to decide whether to use merge sort or not
    +    7: optional map<string, string> extra_info; // written by is used to write who wrote the file, it can be Aplication name, or SDK etc and version in which this carbondata file is written etc
    --- End diff --
   
    Since this is optional and we will set many extra information in the footer, I think we can provide a general interface to set and get this info, which means that we do not need to provide 'writtenBy' and 'setVersion' interface. Because following this pattern, the interfaces will become more and more.
   
    In my opinion, we can only provide one interface setExtraInfo/getExtraInfo and it accepts/returns a map.
    Moreover, this extraInfo is optional, which means you do not need to set it in all the tes tcases, you just need to focus your test case to avoid too many changes.


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #2829: [CARBONDATA-3025]add more metadata in carbon ...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user akashrn5 commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/2829#discussion_r226722751
 
    --- Diff: format/src/main/thrift/carbondata.thrift ---
    @@ -206,6 +206,7 @@ struct FileFooter3{
         4: optional list<BlockletInfo3> blocklet_info_list3; // Information about blocklets of all columns in this file for V3 format
         5: optional dictionary.ColumnDictionaryChunk dictionary; // Blocklet local dictionary
         6: optional bool is_sort; // True if the data is sorted in this file, it is used for compaction to decide whether to use merge sort or not
    +    7: optional map<string, string> extra_info; // written by is used to write who wrote the file, it can be Aplication name, or SDK etc and version in which this carbondata file is written etc
    --- End diff --
   
    for all the extra info, create map, it didnt get much of it, currently, it is map, and this suits for adding extra meta, and about changing test case, since those are the api level changes, we need to change those test cases.


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/1108/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/908/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/915/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/1115/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user akashrn5 commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    retest this please


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed  with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/9181/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/1123/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user kumarvishal09 commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    @akashrn5 Instead of changing many classes to pass writtenBy and appName can't we set to CarbonProperties and in writer step we can get from the same and write to thrift??


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/925/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #2829: [CARBONDATA-3025]add more metadata in carbon ...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user kunal642 commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/2829#discussion_r226979070
 
    --- Diff: store/sdk/src/main/java/org/apache/carbondata/sdk/file/CarbonSchemaReader.java ---
    @@ -76,6 +81,19 @@ public static Schema readSchemaInDataFile(String dataFilePath) throws IOExceptio
         return new Schema(columnSchemaList);
       }
     
    +  public static String getVersionDetails(String dataFilePath) throws IOException {
    +    long fileSize =
    +        FileFactory.getCarbonFile(dataFilePath, FileFactory.getFileType(dataFilePath)).getSize();
    +    FileReader fileReader = FileFactory.getFileHolder(FileFactory.getFileType(dataFilePath));
    +    ByteBuffer buffer =
    +        fileReader.readByteBuffer(FileFactory.getUpdatedFilePath(dataFilePath), fileSize - 8, 8);
    +    CarbonFooterReaderV3 footerReader = new CarbonFooterReaderV3(dataFilePath, buffer.getLong());
    +    FileFooter3 footer = footerReader.readFooterVersion3();
    +    return footer.getExtra_info().get(CarbonCommonConstants.CARBON_WRITTEN_BY_FOOTER_INFO)
    --- End diff --
   
    getExtra_info().get(CarbonCommonConstants.CARBON_WRITTEN_BY_FOOTER_INFO) will give null for old store tables. Same goes for  .getExtra_info().get(CarbonCommonConstants.CARBON_VERSION_FOOTER_INFO).
   
    Check for null and return empty string


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user kumarvishal09 commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    @akashrn5 Instead of passing writtenBy and appName to all the classes can't we set to CarbonProperties and in writer step we can get from the same and write to footer??


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed  with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/9187/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/932/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user akashrn5 commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    retest this please


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/1133/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed  with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/9194/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/936/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #2829: [CARBONDATA-3025]add more metadata in carbon file fo...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/2829
 
    Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/1138/



---
1234