[jira] [Commented] (CARBONDATA-4055) Empty segment created and unnecessary entry to table status in update

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (CARBONDATA-4055) Empty segment created and unnecessary entry to table status in update

Akash R Nilugal (Jira)

    [ https://issues.apache.org/jira/browse/CARBONDATA-4055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17356414#comment-17356414 ]

Akash R Nilugal commented on CARBONDATA-4055:
---------------------------------------------

df.write.format("hudi").
      option(COMBINE_BEFORE_UPSERT_PROP, "false")
      option(PRECOMBINE_FIELD_OPT_KEY, "customerId").
      option(RECORDKEY_FIELD_OPT_KEY, "str_uuid").
      option(PARTITIONPATH_FIELD_OPT_KEY, "").
      option(DataSourceWriteOptions.OPERATION_OPT_KEY, "insert").
      option(DataSourceWriteOptions.HIVE_SYNC_ENABLED_OPT_KEY, "true").
      option(DataSourceWriteOptions.HIVE_PARTITION_FIELDS_OPT_KEY, "").
      option(DataSourceWriteOptions.HIVE_PARTITION_EXTRACTOR_CLASS_OPT_KEY, "org.apache.hudi.hive.NonPartitionedExtractor").
          option(DataSourceWriteOptions.KEYGENERATOR_CLASS_OPT_KEY, "org.apache.hudi.keygen.NonpartitionedKeyGenerator").
      option(DataSourceWriteOptions.HIVE_DATABASE_OPT_KEY, db).
      option(DataSourceWriteOptions.HIVE_TABLE_OPT_KEY, tableName).
      option(TABLE_NAME, tableName).mode(Append).save(s"/hudicow6/${tableName}")

> Empty segment created and unnecessary entry to table status in update
> ---------------------------------------------------------------------
>
>                 Key: CARBONDATA-4055
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-4055
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: Akash R Nilugal
>            Assignee: Akash R Nilugal
>            Priority: Major
>             Fix For: 2.1.1
>
>          Time Spent: 5.5h
>  Remaining Estimate: 0h
>
> When the update command is executed and no data is updated, empty segment directories are created and an in progress stale entry added to table status, and even segment dirs are not cleaned during clean files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)