maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r386905844 ########## File path: integration/spark/src/test/scala/org/apache/carbondata/spark/testsuite/iud/DeleteCarbonTableTestCase.scala ########## @@ -40,17 +40,20 @@ class DeleteCarbonTableTestCase extends QueryTest with BeforeAndAfterAll { sql("drop database if exists iud_db cascade") sql("create database iud_db") - sql("""create table iud_db.source2 (c11 string,c22 int,c33 string,c55 string, c66 int) STORED AS carbondata""") + sql( + """create table iud_db.source2 (c11 string,c22 int,c33 string,c55 string, c66 int) STORED + |AS carbondata""".stripMargin) sql(s"""LOAD DATA LOCAL INPATH '$resourcesPath/IUD/source2.csv' INTO table iud_db.source2""") sql("use iud_db") } + test("delete data from carbon table with alias [where clause ]") { sql("""create table iud_db.dest (c1 string,c2 int,c3 string,c5 string) STORED AS carbondata""") sql(s"""LOAD DATA LOCAL INPATH '$resourcesPath/IUD/dest.csv' INTO table iud_db.dest""") sql("""delete from iud_db.dest d where d.c1 = 'a'""").show checkAnswer( sql("""select c2 from iud_db.dest"""), - Seq(Row(2), Row(3),Row(4), Row(5)) + Seq(Row(2), Row(3), Row(4), Row(5)) Review comment: Space is not followed after each parameter. So corrected it as part of code formatting ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r386906020 ########## File path: integration/spark/src/test/scala/org/apache/carbondata/spark/testsuite/iud/DeleteCarbonTableTestCase.scala ########## @@ -70,7 +73,7 @@ class DeleteCarbonTableTestCase extends QueryTest with BeforeAndAfterAll { sql("""delete from dest where c1 IN ('d', 'e')""").show checkAnswer( sql("""select c1 from dest"""), - Seq(Row("a"), Row("b"),Row("c")) + Seq(Row("a"), Row("b"), Row("c")) Review comment: Space is not followed after each parameter. So corrected it as part of code formatting ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r386906068 ########## File path: integration/spark/src/test/scala/org/apache/carbondata/spark/testsuite/iud/DeleteCarbonTableTestCase.scala ########## @@ -119,18 +122,22 @@ class DeleteCarbonTableTestCase extends QueryTest with BeforeAndAfterAll { test("partition delete data from carbon table with alias [where clause ]") { sql("drop table if exists iud_db.dest") - sql("""create table iud_db.dest (c1 string,c2 int,c5 string) PARTITIONED BY(c3 string) STORED AS carbondata""") + sql( + """create table iud_db.dest (c1 string,c2 int,c5 string) PARTITIONED BY(c3 string) STORED AS + |carbondata""".stripMargin) sql(s"""LOAD DATA LOCAL INPATH '$resourcesPath/IUD/dest.csv' INTO table iud_db.dest""") sql("""delete from iud_db.dest d where d.c1 = 'a'""").show checkAnswer( sql("""select c2 from iud_db.dest"""), - Seq(Row(2), Row(3),Row(4), Row(5)) + Seq(Row(2), Row(3), Row(4), Row(5)) Review comment: Space is not followed after each parameter. So corrected it as part of code formatting ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r386975490 ########## File path: core/src/main/java/org/apache/carbondata/core/mutate/data/BlockMappingVO.java ########## @@ -30,6 +30,8 @@ private Map<String, RowCountDetailsVO> completeBlockRowDetailVO; + private Map<String, String> blockToSegmentMapping; Review comment: This map will help us find the segment id from the block path. I will add proper comments to this map in the code. Can you suggest an alternative way to get the segment id? Thanks ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r386983083 ########## File path: hadoop/src/main/java/org/apache/carbondata/hadoop/api/CarbonTableInputFormat.java ########## @@ -188,7 +189,10 @@ List<Segment> segmentToAccess = getFilteredSegment(job, validAndInProgressSegments, false, readCommittedScope); - + String segmentFileName = job.getConfiguration().get("current.segmentfile"); Review comment: Fixed ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#issuecomment-593947932 Build Success with Spark 2.4.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.4/584/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#issuecomment-593985123 Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/2290/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387450095 ########## File path: integration/spark/src/main/scala/org/apache/spark/sql/secondaryindex/rdd/CarbonSecondaryIndexRDD.scala ########## @@ -190,24 +191,23 @@ class CarbonSecondaryIndexRDD[K, V]( val startTime = System.currentTimeMillis() val absoluteTableIdentifier: AbsoluteTableIdentifier = AbsoluteTableIdentifier.from( carbonStoreLocation, databaseName, factTableName, tableId) - val updateStatusManager: SegmentUpdateStatusManager = new SegmentUpdateStatusManager( - carbonLoadModel.getCarbonDataLoadSchema.getCarbonTable) val jobConf: JobConf = new JobConf(hadoopConf) SparkHadoopUtil.get.addCredentials(jobConf) val job: Job = new Job(jobConf) - val format = CarbonInputFormatUtil.createCarbonInputFormat(absoluteTableIdentifier, job) + + if (carbonLoadModel.getCarbonDataLoadSchema.getCarbonTable.isHivePartitionTable) { + job.getConfiguration Review comment: OK ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387454934 ########## File path: core/src/main/java/org/apache/carbondata/core/mutate/CarbonUpdateUtil.java ########## @@ -58,6 +58,14 @@ private static final Logger LOGGER = LogServiceFactory.getLogService(CarbonUpdateUtil.class.getName()); + /** + * returns required filed from tuple id + * + */ + public static String getRequiredFieldFromTID(String Tid, int index) { + return Tid.split("/")[index]; Review comment: ```suggestion return Tid.split(CarbonCommonConstants.FILE_SEPARATOR)[index]; ``` ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387467200 ########## File path: index/secondary-index/src/test/scala/org/apache/carbondata/spark/testsuite/secondaryindex/TestSIWithPartition.scala ########## @@ -0,0 +1,379 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. 
+ */ +package org.apache.carbondata.spark.testsuite.secondaryindex + +import org.apache.spark.sql.Row +import org.apache.spark.sql.execution.SparkPlan +import org.apache.spark.sql.secondaryindex.joins.BroadCastSIFilterPushJoin +import org.apache.spark.sql.test.util.QueryTest +import org.scalatest.{BeforeAndAfterAll, Ignore} + +class TestSIWithPartition extends QueryTest with BeforeAndAfterAll { + + override protected def beforeAll(): Unit = { + sql("drop table if exists uniqdata1") + sql( + "CREATE TABLE uniqdata1 (CUST_ID INT,CUST_NAME STRING,DOB timestamp,DOJ timestamp," + + "BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 DECIMAL(30, 10)," + + "DECIMAL_COLUMN2 DECIMAL(36, 10),Double_COLUMN1 double, Double_COLUMN2 double," + + "INTEGER_COLUMN1 int) PARTITIONED BY(ACTIVE_EMUI_VERSION string) STORED AS carbondata " + + "TBLPROPERTIES('TABLE_BLOCKSIZE'='256 MB')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + 
"CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + } + + test("Testing SI on partition column") { + sql("drop index if exists indextable1 on uniqdata1") + intercept[UnsupportedOperationException] { + sql("create index indextable1 on table uniqdata1 (ACTIVE_EMUI_VERSION) AS 'carbondata'") + } + } + + test("Testing SI without partition column") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql("select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108')") + .collect().toSeq + + checkAnswer(sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'"), + withoutIndex) + + val df = sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI with partition column[where clause]") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and 
ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with OR condition") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + 
"ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + 
if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with major compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'major'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with minor compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'minor'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with delete") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(4))) + + sql("delete from uniqdata1 
where CUST_NAME='CUST_NAME_00108'").show() + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(0))) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with update") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_ID='9000' and ACTIVE_EMUI_VERSION = 'abc'"), + Seq(Row(4))) + intercept[RuntimeException] { + sql("update uniqdata1 d set (d.CUST_ID) = ('8000') where d.CUST_ID = '9000'").show() + } + } + + test("Testing SI on partition table with rename") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 change CUST_NAME test string") + + checkAnswer(sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + override protected def afterAll(): Unit = { + sql("drop index if exists indextable1 on uniqdata1") + sql("drop table if exists uniqdata1") + } + + /** + * Method to check whether the filter is push down to SI table or not + * + * @param sparkPlan + * @return + */ + private def isFilterPushedDownToSI(sparkPlan: SparkPlan): Boolean = { Review comment: Please 
reuse the method from TestSecondaryIndexForORFilterPushDown class ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387472308 ########## File path: hadoop/src/main/java/org/apache/carbondata/hadoop/api/CarbonTableInputFormat.java ########## @@ -188,7 +189,10 @@ List<Segment> segmentToAccess = getFilteredSegment(job, validAndInProgressSegments, false, readCommittedScope); - String segmentFileName = job.getConfiguration().get(CarbonCommonConstants.CURRENT_SEGMENTFILE); + if (segmentFileName != null) { + segmentToAccess.get(0).setSegmentFileName(segmentFileName + CarbonTablePath.SEGMENT_EXT); Review comment: Why are you setting segmentFileName only on the first segment of the segmentToAccess list? Can you please add comments? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387473487 ########## File path: core/src/main/java/org/apache/carbondata/core/scan/scanner/impl/BlockletFilterScanner.java ########## @@ -203,8 +203,8 @@ private BlockletScannedResult executeFilter(RawBlockletColumnChunks rawBlockletC BlockletScannedResult scannedResult = new FilterQueryScannedResult(blockExecutionInfo, queryStatisticsModel); scannedResult.setBlockletId( - blockExecutionInfo.getBlockIdString() + CarbonCommonConstants.FILE_SEPARATOR + - rawBlockletColumnChunks.getDataBlock().blockletIndex()); + blockExecutionInfo.getBlockIdString(), + "" + rawBlockletColumnChunks.getDataBlock().blockletIndex()); Review comment: Remove the empty string `""` if it is not required. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387473487 ########## File path: core/src/main/java/org/apache/carbondata/core/scan/scanner/impl/BlockletFilterScanner.java ########## @@ -203,8 +203,8 @@ private BlockletScannedResult executeFilter(RawBlockletColumnChunks rawBlockletC BlockletScannedResult scannedResult = new FilterQueryScannedResult(blockExecutionInfo, queryStatisticsModel); scannedResult.setBlockletId( - blockExecutionInfo.getBlockIdString() + CarbonCommonConstants.FILE_SEPARATOR + - rawBlockletColumnChunks.getDataBlock().blockletIndex()); + blockExecutionInfo.getBlockIdString(), + "" + rawBlockletColumnChunks.getDataBlock().blockletIndex()); Review comment: remove "" if not required. Instead can use String.valueOf ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387477530 ########## File path: core/src/main/java/org/apache/carbondata/core/indexstore/blockletindex/BlockDataMap.java ########## @@ -788,7 +788,13 @@ private boolean addBlockBasedOnMinMaxValue(FilterExecuter filterExecuter, byte[] byte[][] minValue, boolean[] minMaxFlag, String filePath, int blockletId) { BitSet bitSet = null; if (filterExecuter instanceof ImplicitColumnFilterExecutor) { - String uniqueBlockPath = filePath.substring(filePath.lastIndexOf("/Part") + 1); + String uniqueBlockPath; + if (segmentPropertiesWrapper.getCarbonTable().isHivePartitionTable()) { Review comment: In init method, blockletDataMapInfo already has carbonTable. Can use it, instead of getting from segmentPropertiesWrapper ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387522724 ########## File path: core/src/main/java/org/apache/carbondata/core/mutate/CarbonUpdateUtil.java ########## @@ -58,6 +58,14 @@ private static final Logger LOGGER = LogServiceFactory.getLogService(CarbonUpdateUtil.class.getName()); + /** + * returns required filed from tuple id + * + */ + public static String getRequiredFieldFromTID(String Tid, int index) { + return Tid.split("/")[index]; Review comment: OK ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387529136 ########## File path: index/secondary-index/src/test/scala/org/apache/carbondata/spark/testsuite/secondaryindex/TestSIWithPartition.scala ########## @@ -0,0 +1,379 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. 
+ */ +package org.apache.carbondata.spark.testsuite.secondaryindex + +import org.apache.spark.sql.Row +import org.apache.spark.sql.execution.SparkPlan +import org.apache.spark.sql.secondaryindex.joins.BroadCastSIFilterPushJoin +import org.apache.spark.sql.test.util.QueryTest +import org.scalatest.{BeforeAndAfterAll, Ignore} + +class TestSIWithPartition extends QueryTest with BeforeAndAfterAll { + + override protected def beforeAll(): Unit = { + sql("drop table if exists uniqdata1") + sql( + "CREATE TABLE uniqdata1 (CUST_ID INT,CUST_NAME STRING,DOB timestamp,DOJ timestamp," + + "BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 DECIMAL(30, 10)," + + "DECIMAL_COLUMN2 DECIMAL(36, 10),Double_COLUMN1 double, Double_COLUMN2 double," + + "INTEGER_COLUMN1 int) PARTITIONED BY(ACTIVE_EMUI_VERSION string) STORED AS carbondata " + + "TBLPROPERTIES('TABLE_BLOCKSIZE'='256 MB')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + 
"CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + } + + test("Testing SI on partition column") { + sql("drop index if exists indextable1 on uniqdata1") + intercept[UnsupportedOperationException] { + sql("create index indextable1 on table uniqdata1 (ACTIVE_EMUI_VERSION) AS 'carbondata'") + } + } + + test("Testing SI without partition column") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql("select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108')") + .collect().toSeq + + checkAnswer(sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'"), + withoutIndex) + + val df = sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI with partition column[where clause]") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and 
ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with OR condition") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + 
"ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + 
if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with major compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'major'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with minor compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'minor'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with delete") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(4))) + + sql("delete from uniqdata1 
where CUST_NAME='CUST_NAME_00108'").show() + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(0))) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with update") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_ID='9000' and ACTIVE_EMUI_VERSION = 'abc'"), + Seq(Row(4))) + intercept[RuntimeException] { + sql("update uniqdata1 d set (d.CUST_ID) = ('8000') where d.CUST_ID = '9000'").show() + } + } + + test("Testing SI on partition table with rename") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 change CUST_NAME test string") + + checkAnswer(sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + override protected def afterAll(): Unit = { + sql("drop index if exists indextable1 on uniqdata1") + sql("drop table if exists uniqdata1") + } + + /** + * Method to check whether the filter is push down to SI table or not + * + * @param sparkPlan + * @return + */ + private def isFilterPushedDownToSI(sparkPlan: SparkPlan): Boolean = { Review comment: Better 
all test cases should be independent of each other. TestSIWithPartition.scala is specific to SI on partitioned tables. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
maheshrajus commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387530524 ########## File path: core/src/main/java/org/apache/carbondata/core/scan/scanner/impl/BlockletFilterScanner.java ########## @@ -203,8 +203,8 @@ private BlockletScannedResult executeFilter(RawBlockletColumnChunks rawBlockletC BlockletScannedResult scannedResult = new FilterQueryScannedResult(blockExecutionInfo, queryStatisticsModel); scannedResult.setBlockletId( - blockExecutionInfo.getBlockIdString() + CarbonCommonConstants.FILE_SEPARATOR + - rawBlockletColumnChunks.getDataBlock().blockletIndex()); + blockExecutionInfo.getBlockIdString(), + "" + rawBlockletColumnChunks.getDataBlock().blockletIndex()); Review comment: OK ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387548712 ########## File path: index/secondary-index/src/test/scala/org/apache/carbondata/spark/testsuite/secondaryindex/TestSIWithPartition.scala ########## @@ -0,0 +1,379 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. 
+ */ +package org.apache.carbondata.spark.testsuite.secondaryindex + +import org.apache.spark.sql.Row +import org.apache.spark.sql.execution.SparkPlan +import org.apache.spark.sql.secondaryindex.joins.BroadCastSIFilterPushJoin +import org.apache.spark.sql.test.util.QueryTest +import org.scalatest.{BeforeAndAfterAll, Ignore} + +class TestSIWithPartition extends QueryTest with BeforeAndAfterAll { + + override protected def beforeAll(): Unit = { + sql("drop table if exists uniqdata1") + sql( + "CREATE TABLE uniqdata1 (CUST_ID INT,CUST_NAME STRING,DOB timestamp,DOJ timestamp," + + "BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 DECIMAL(30, 10)," + + "DECIMAL_COLUMN2 DECIMAL(36, 10),Double_COLUMN1 double, Double_COLUMN2 double," + + "INTEGER_COLUMN1 int) PARTITIONED BY(ACTIVE_EMUI_VERSION string) STORED AS carbondata " + + "TBLPROPERTIES('TABLE_BLOCKSIZE'='256 MB')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + 
"CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + } + + test("Testing SI on partition column") { + sql("drop index if exists indextable1 on uniqdata1") + intercept[UnsupportedOperationException] { + sql("create index indextable1 on table uniqdata1 (ACTIVE_EMUI_VERSION) AS 'carbondata'") + } + } + + test("Testing SI without partition column") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql("select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108')") + .collect().toSeq + + checkAnswer(sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'"), + withoutIndex) + + val df = sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI with partition column[where clause]") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and 
ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with OR condition") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + 
"ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + 
if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with major compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'major'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with minor compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'minor'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with delete") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(4))) + + sql("delete from uniqdata1 
where CUST_NAME='CUST_NAME_00108'").show() + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(0))) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with update") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_ID='9000' and ACTIVE_EMUI_VERSION = 'abc'"), + Seq(Row(4))) + intercept[RuntimeException] { + sql("update uniqdata1 d set (d.CUST_ID) = ('8000') where d.CUST_ID = '9000'").show() + } + } + + test("Testing SI on partition table with rename") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 change CUST_NAME test string") + + checkAnswer(sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + override protected def afterAll(): Unit = { + sql("drop index if exists indextable1 on uniqdata1") + sql("drop table if exists uniqdata1") + } + + /** + * Method to check whether the filter is push down to SI table or not + * + * @param sparkPlan + * @return + */ + private def isFilterPushedDownToSI(sparkPlan: SparkPlan): Boolean = { Review comment: But 
isFilterPushedDownToSI is common to all test classes. We can move this method to a utility class and call it from all test classes. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387548712 ########## File path: index/secondary-index/src/test/scala/org/apache/carbondata/spark/testsuite/secondaryindex/TestSIWithPartition.scala ########## @@ -0,0 +1,379 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. 
+ */ +package org.apache.carbondata.spark.testsuite.secondaryindex + +import org.apache.spark.sql.Row +import org.apache.spark.sql.execution.SparkPlan +import org.apache.spark.sql.secondaryindex.joins.BroadCastSIFilterPushJoin +import org.apache.spark.sql.test.util.QueryTest +import org.scalatest.{BeforeAndAfterAll, Ignore} + +class TestSIWithPartition extends QueryTest with BeforeAndAfterAll { + + override protected def beforeAll(): Unit = { + sql("drop table if exists uniqdata1") + sql( + "CREATE TABLE uniqdata1 (CUST_ID INT,CUST_NAME STRING,DOB timestamp,DOJ timestamp," + + "BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 DECIMAL(30, 10)," + + "DECIMAL_COLUMN2 DECIMAL(36, 10),Double_COLUMN1 double, Double_COLUMN2 double," + + "INTEGER_COLUMN1 int) PARTITIONED BY(ACTIVE_EMUI_VERSION string) STORED AS carbondata " + + "TBLPROPERTIES('TABLE_BLOCKSIZE'='256 MB')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + 
"CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + sql(s"LOAD DATA LOCAL INPATH '$resourcesPath/restructure/data_2000.csv' INTO " + + "TABLE uniqdata1 partition(ACTIVE_EMUI_VERSION='abc') OPTIONS('DELIMITER'=',', " + + "'BAD_RECORDS_LOGGER_ENABLE'='FALSE', 'BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='CUST_ID," + + "CUST_NAME,ACTIVE_EMUI_VERSION,DOB,DOJ,BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1," + + "DECIMAL_COLUMN2,Double_COLUMN1,Double_COLUMN2,INTEGER_COLUMN1')") + } + + test("Testing SI on partition column") { + sql("drop index if exists indextable1 on uniqdata1") + intercept[UnsupportedOperationException] { + sql("create index indextable1 on table uniqdata1 (ACTIVE_EMUI_VERSION) AS 'carbondata'") + } + } + + test("Testing SI without partition column") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql("select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108')") + .collect().toSeq + + checkAnswer(sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'"), + withoutIndex) + + val df = sql("select * from uniqdata1 where CUST_NAME='CUST_NAME_00108'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI with partition column[where clause]") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and 
ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with OR condition") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of OR AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + 
"ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' OR CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND OR") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' OR " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(true) + } else { + assert(false) + } + } + + test("Testing SI on partition table with combination of AND AND") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where ni(CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = " + + "'abc')") + .collect().toSeq + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' AND CUST_ID='9000' AND " + + "ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + 
if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with major compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'major'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with minor compaction") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 compact 'minor'") + + checkAnswer(sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with delete") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(4))) + + sql("delete from uniqdata1 
where CUST_NAME='CUST_NAME_00108'").show() + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION =" + + " 'abc'"), + Seq(Row(0))) + + val df = sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + test("Testing SI on partition table with update") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + checkAnswer(sql( + "select count(*) from uniqdata1 where CUST_ID='9000' and ACTIVE_EMUI_VERSION = 'abc'"), + Seq(Row(4))) + intercept[RuntimeException] { + sql("update uniqdata1 d set (d.CUST_ID) = ('8000') where d.CUST_ID = '9000'").show() + } + } + + test("Testing SI on partition table with rename") { + sql("drop index if exists indextable1 on uniqdata1") + sql("create index indextable1 on table uniqdata1 (DOB, CUST_NAME) AS 'carbondata'") + + val withoutIndex = + sql( + "select * from uniqdata1 where CUST_NAME='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = " + + "'abc'") + .collect().toSeq + + sql("alter table uniqdata1 change CUST_NAME test string") + + checkAnswer(sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'"), + withoutIndex) + + val df = sql( + "select * from uniqdata1 where test='CUST_NAME_00108' and ACTIVE_EMUI_VERSION = 'abc'") + .queryExecution + .sparkPlan + if (!isFilterPushedDownToSI(df)) { + assert(false) + } else { + assert(true) + } + } + + override protected def afterAll(): Unit = { + sql("drop index if exists indextable1 on uniqdata1") + sql("drop table if exists uniqdata1") + } + + /** + * Method to check whether the filter is push down to SI table or not + * + * @param sparkPlan + * @return + */ + private def isFilterPushedDownToSI(sparkPlan: SparkPlan): Boolean = { Review comment: But 
isFilterPushedDownToSI is common to all test classes. This method can be moved to a utility class and called from all test classes. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3639: [CARBONDATA-3724] Secondary Index enable on partition Table
URL: https://github.com/apache/carbondata/pull/3639#discussion_r387477530 ########## File path: core/src/main/java/org/apache/carbondata/core/indexstore/blockletindex/BlockDataMap.java ########## @@ -788,7 +788,13 @@ private boolean addBlockBasedOnMinMaxValue(FilterExecuter filterExecuter, byte[] byte[][] minValue, boolean[] minMaxFlag, String filePath, int blockletId) { BitSet bitSet = null; if (filterExecuter instanceof ImplicitColumnFilterExecutor) { - String uniqueBlockPath = filePath.substring(filePath.lastIndexOf("/Part") + 1); + String uniqueBlockPath; + if (segmentPropertiesWrapper.getCarbonTable().isHivePartitionTable()) { Review comment: In init method, blockletDataMapInfo already has carbonTable. Can use it, instead of getting from segmentPropertiesWrapper and remove getCarbonTable method from SegmentPropertiesWrapper ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] With regards, Apache Git Services |
Free forum by Nabble | Edit this page |