[ https://issues.apache.org/jira/browse/CARBONDATA-2544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16524445#comment-16524445 ]

xubo245 commented on CARBONDATA-2544:
-------------------------------------

It works fine on the cluster:

{code:java}
0: jdbc:hive2://hadoop1:10000> select country,sum(salary) from test20 group by country;
+----------+--------------+--+
| country  | sum(salary)  |
+----------+--------------+--+
| USA      | 23           |
+----------+--------------+--+
1 row selected (1.226 seconds)
0: jdbc:hive2://hadoop1:10000> select country,sum(salary) from test20 where country='USA' group by country;
+----------+--------------+--+
| country  | sum(salary)  |
+----------+--------------+--+
| USA      | 23           |
+----------+--------------+--+
1 row selected (1.655 seconds)
0: jdbc:hive2://hadoop1:10000> select country,sum(salary) from test20 where country='USA' group by country;
+----------+--------------+--+
| country  | sum(salary)  |
+----------+--------------+--+
| USA      | 23           |
+----------+--------------+--+
1 row selected (0.92 seconds)
0: jdbc:hive2://hadoop1:10000> explain select country,sum(salary) from test20 where country='USA' group by country;
+--------------------------------------------------------------------------------+--+
| plan                                                                           |
+--------------------------------------------------------------------------------+--+
| == CarbonData Profiler ==
Table Scan on datamv20_table
 - total blocklets: 1
 - filter: (test20_country <> null and test20_country = USA)
 - pruned by Main DataMap
    - skipped blocklets: 0
 |
| == Physical Plan ==
*HashAggregate(keys=[country#223], functions=[sum(sum(salary)#224L)])
+- Exchange hashpartitioning(country#223, 200)
   +- *HashAggregate(keys=[country#223], functions=[partial_sum(sum(salary)#224L)])
      +- *HashAggregate(keys=[test20_country#103], functions=[sum(sum_salary#104L)])
         +- Exchange hashpartitioning(test20_country#103, 200)
            +- *HashAggregate(keys=[test20_country#103], functions=[partial_sum(sum_salary#104L)])
               +- *BatchedScan CarbonDatasourceHadoopRelation [ Database name :default, Table name :datamv20_table, Schema :Some(StructType(StructField(test20_country,StringType,true), StructField(sum_salary,LongType,true))) ] default.datamv20_table[test20_country#103,sum_salary#104L] PushedFilters: [IsNotNull(test20_country), EqualTo(test20_country,USA)]
 |
+--------------------------------------------------------------------------------+--+
2 rows selected (0.318 seconds)
{code}

> [MV] Wrong data displayed with Filter
> --------------------------------------
>
>                 Key: CARBONDATA-2544
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2544
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: Babulal
>            Assignee: xubo245
>            Priority: Major
>
> spark.sql("drop table if exists test1")
> spark.sql("create table test1( name string,country string,age int,salary int) stored by 'carbondata' ")
> spark.sql("insert into test1 select 'name1','USA',12,23")
> spark.sql("create datamap datamv2 using 'mv' as select country,sum(salary) from test1 group by country").show()
> spark.sql("rebuild datamap datamv2")
> spark.sql("select country,sum(salary) from test1 group by country").show(200,false)
> +--------+----------+
> |country|sum(salary)|
> +--------+----------+
> |USA|23|
> +--------+----------+
>
> spark.sql("select country,sum(salary) from test1 where country='USA' group by country").show(200,false)
> +--------+----------+
> |country|sum(salary)|
> +--------+----------+
> +--------+----------+
>
> This is because the rewritten select query is formed incorrectly: the filter value is changed to lowercase.
> 2018-05-27 00:20:16 INFO CarbonSparkSqlParser:54 - Parsing command: select preAGG() as preAgg, gen_subsumer_0.`country`, gen_subsumer_0.`sum(salary)` as `sum(salary)`
> from
> (select test1.`country`, sum(cast(test1.`salary` as bigint)) as `sum(salary)`
> from
> test1
> group by test1.`country`) gen_subsumer_0
> where
> (gen_subsumer_0.`country` = 'usa')
>
>

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
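
For readers following the thread, the effect described in the issue (a filter literal rewritten from 'USA' to 'usa') can be illustrated without Spark. The sketch below is only a stand-in under stated assumptions: `MvRow`, `LowercasedFilterSketch` and `filterByCountry` are hypothetical names invented for illustration and are not CarbonData or Spark APIs. It assumes string equality is case-sensitive, and it does not reproduce the MV rewrite itself.

```scala
// Hypothetical stand-in for one row of the MV table; not a CarbonData type.
final case class MvRow(country: String, sumSalary: Long)

object LowercasedFilterSketch {
  // The single row from the reproduction in the issue: ('USA', 23).
  val rows: Seq[MvRow] = Seq(MvRow("USA", 23L))

  // Case-sensitive equality filter, standing in for `where country = <literal>`.
  def filterByCountry(data: Seq[MvRow], literal: String): Seq[MvRow] =
    data.filter(_.country == literal)

  def main(args: Array[String]): Unit = {
    // Literal preserved as written by the user: the row matches.
    println(filterByCountry(rows, "USA"))
    // Literal lowercased, as in the logged rewritten query: nothing matches.
    println(filterByCountry(rows, "USA".toLowerCase))
  }
}
```

With the literal preserved the one stored row is returned; once it is lowercased the same filter returns an empty result, which matches the empty table shown in the issue description.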