Login  Register

Re: carbon data performance doubts

Posted by simafengyun on Jul 23, 2017; 3:33am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/carbon-data-performance-doubts-tp18438p18689.html

Good Suggestion!
Currently you can refer to the below code for sort_columns use cases.
https://github.com/apache/carbondata/blob/master/integration/spark-common-test/src/test/scala/org/apache/carbondata/spark/testsuite/sortcolumns/TestSortColumns.scala



thanks
Jack


> 在 2017年7月23日,上午10:55,Swapnil Shinde <[hidden email]> 写道:
>
> Thank you, Liang. I couldn't find this property "sort_columns" in
> documentation. It will be good to have it there.
>
> -
> Swapnil
>
> On Fri, Jul 21, 2017 at 9:31 PM, Liang Chen <[hidden email]> wrote:
>
>>
>> Hi
>>
>> Some more info :
>> In release 1.1.1, there was a good improvement "measure filter
>> optimization",  system will use minmax index to do filter for measure
>> column filter.
>>
>> So for INT column to get good filter: one way you can add the INT column to
>> sort_columns, another way, system will automatically use the INT column's
>> minmax index to do get good filter.
>>
>> Regards
>> Liang
>>
>>
>>
>> --
>> View this message in context: http://apache-carbondata-dev-
>> mailing-list-archive.1130556.n5.nabble.com/carbon-data-performance-doubts-
>> tp18438p18662.html
>> Sent from the Apache CarbonData Dev Mailing List archive mailing list
>> archive at Nabble.com.
>>